Project: 2665 (Run 1, Clone 775, Gen 151)

Posted: Fri Nov 27, 2009 7:43 pm
by SKeptical_Thinker
This WU has died twice in the last 24 hours. The log claims the shutdown was requested by the user, but I did not kill it. Both times it picked up where it left off when I restarted the client. I will keep checking its progress in an attempt to complete it.

Code:

[21:02:32] + Core successfully engaged
[21:02:37] 
[21:02:37] + Processing work unit
[21:02:37] Core required: FahCore_a1.exe
[21:02:37] Core found.
[21:02:37] Working on Unit 04 [November 26 21:02:37]
[21:02:37] + Working ...
[21:02:37] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a1.exe -dir work/ -suffix 04 -checkpoint 15 -verbose -lifeline 2453 -version 602'

[21:02:37] 
[21:02:37] *------------------------------*
[21:02:37] Folding@Home Gromacs SMP Core
[21:02:37] Version 1.74 (November 27, 2006)
[21:02:37] 
[21:02:37] Preparing to commence simulation
[21:02:37] - Ensuring status. Please wait.
[21:02:38] - Starting from initial work packet
[21:02:38] 
[21:02:38] Project: 2665 (Run 1, Clone 775, Gen 151)
[21:02:38] 
[21:02:38] Assembly optimizations on if available.
[21:02:38] Entering M.D.
[21:02:55] percent)
[21:02:56] - Starting from initial work packet
[21:02:56] 
[21:02:56] Project: 2665 (Run 1, Clone 775, Gen 151)
[21:02:56] 
[21:02:56] Entering M.D.
[21:03:02] Rejecting checkpoint
[21:03:03] 
[21:03:03] Writing local files
[21:03:03] 
[21:03:03] Writing local files
[21:03:04] Extra SSE boost OK.
[21:03:05] Writing local files
[21:03:05] Completed 0 out of 250000 steps  (0 percent)
[21:18:06] Timered checkpoint triggered.
[21:30:04] Writing local files
[21:30:05] Completed 2500 out of 250000 steps  (1 percent)
[21:45:06] Timered checkpoint triggered.
[21:57:04] Writing local files
[21:57:04] Completed 5000 out of 250000 steps  (2 percent)
[22:12:05] Timered checkpoint triggered.
[22:24:04] Writing local files
[22:24:04] Completed 7500 out of 250000 steps  (3 percent)
[22:39:05] Timered checkpoint triggered.
[22:51:02] Writing local files
[22:51:02] Completed 10000 out of 250000 steps  (4 percent)
[23:06:04] Timered checkpoint triggered.
[23:17:59] Writing local files
[23:18:00] Completed 12500 out of 250000 steps  (5 percent)
[23:33:00] Timered checkpoint triggered.
[23:44:57] Writing local files
[23:44:57] Completed 15000 out of 250000 steps  (6 percent)
[23:59:58] Timered checkpoint triggered.
[00:11:53] Writing local files
[00:11:53] Completed 17500 out of 250000 steps  (7 percent)
[00:17:31] - Autosending finished units...
[00:17:31] Trying to send all finished work units
[00:17:31] + No unsent completed units remaining.
[00:17:31] - Autosend completed
[00:26:55] Timered checkpoint triggered.
[00:38:51] Writing local files
[00:38:51] Completed 20000 out of 250000 steps  (8 percent)
[00:53:52] Timered checkpoint triggered.
[01:05:48] Writing local files
[01:05:48] Completed 22500 out of 250000 steps  (9 percent)
[01:20:49] Timered checkpoint triggered.
[01:32:47] Writing local files
[01:32:47] Completed 25000 out of 250000 steps  (10 percent)
[01:47:48] Timered checkpoint triggered.
[01:59:45] Writing local files
[01:59:45] Completed 27500 out of 250000 steps  (11 percent)
[02:14:46] Timered checkpoint triggered.
[02:26:42] Writing local files
[02:26:42] Completed 30000 out of 250000 steps  (12 percent)
[02:41:43] Timered checkpoint triggered.
[02:53:39] Writing local files
[02:53:39] Completed 32500 out of 250000 steps  (13 percent)
[03:08:40] Timered checkpoint triggered.
[03:20:38] Writing local files
[03:20:39] Completed 35000 out of 250000 steps  (14 percent)
[03:35:39] Timered checkpoint triggered.
[03:47:35] Writing local files
[03:47:35] Completed 37500 out of 250000 steps  (15 percent)
[04:02:36] Timered checkpoint triggered.
[04:14:34] Writing local files
[04:14:34] Completed 40000 out of 250000 steps  (16 percent)
[04:29:34] Timered checkpoint triggered.
[04:41:31] Writing local files
[04:41:32] Completed 42500 out of 250000 steps  (17 percent)
[04:56:32] Timered checkpoint triggered.
[05:08:28] Writing local files
[05:08:28] Completed 45000 out of 250000 steps  (18 percent)
[05:23:29] Timered checkpoint triggered.
[05:35:27] Writing local files
[05:35:27] Completed 47500 out of 250000 steps  (19 percent)
[05:50:28] Timered checkpoint triggered.
[06:02:26] Writing local files
[06:02:26] Completed 50000 out of 250000 steps  (20 percent)
[06:17:28] Timered checkpoint triggered.
[06:17:31] - Autosending finished units...
[06:17:31] Trying to send all finished work units
[06:17:31] + No unsent completed units remaining.
[06:17:31] - Autosend completed
[06:29:22] Writing local files
[06:29:23] Completed 52500 out of 250000 steps  (21 percent)
[06:44:23] Timered checkpoint triggered.
[06:56:19] Writing local files
[06:56:20] Completed 55000 out of 250000 steps  (22 percent)
[07:11:20] Timered checkpoint triggered.
[07:23:16] Writing local files
[07:23:16] Completed 57500 out of 250000 steps  (23 percent)
[07:38:17] Timered checkpoint triggered.
[07:50:12] Writing local files
[07:50:12] Completed 60000 out of 250000 steps  (24 percent)
[08:05:13] Timered checkpoint triggered.
[08:17:08] Writing local files
[08:17:08] Completed 62500 out of 250000 steps  (25 percent)
[08:32:09] Timered checkpoint triggered.
[08:44:03] Writing local files
[08:44:03] Completed 65000 out of 250000 steps  (26 percent)
[08:58:47] 
[08:58:47] Folding@home Core Shutdown: INTERRUPTED
[08:58:52] CoreStatus = 66 (102)
[08:58:52] + Shutdown requested by user. Exiting.***** Got a SIGTERM signal (15)
[08:58:52] Killing all core threads

Folding@Home Client Shutdown.

Code:


--- Opening Log file [November 27 12:47:19] 


# SMP Client ##################################################################
###############################################################################

                       Folding@Home Client Version 6.02

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /home/mlm/FAH/SMP-WORK
Executable: ./fah6
Arguments: -smp -verbosity 9 

[12:47:19] - Ask before connecting: No
[12:47:19] - User name: SKeptical_Thinker (Team 31574)
[12:47:19] - User ID: 40702AC168F0DD5B
[12:47:19] - Machine ID: 1
[12:47:19] 
[12:47:20] Loaded queue successfully.
[12:47:20] - Autosending finished units...
[12:47:20] Trying to send all finished work units
[12:47:20] + No unsent completed units remaining.
[12:47:20] - Autosend completed
[12:47:20] 
[12:47:20] + Processing work unit
[12:47:20] Core required: FahCore_a1.exe
[12:47:20] Core found.
[12:47:20] Working on Unit 04 [November 27 12:47:20]
[12:47:20] + Working ...
[12:47:20] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a1.exe -dir work/ -suffix 04 -checkpoint 15 -verbose -lifeline 5188 -version 602'

[12:47:20] 
[12:47:20] *------------------------------*
[12:47:20] Folding@Home Gromacs SMP Core
[12:47:20] Version 1.74 (November 27, 2006)
[12:47:20] 
[12:47:20] Preparing to commence simulation
[12:47:20] - Ensuring status. Please wait.
[12:47:21] 
[12:47:21] Project: 2665 (Run 1, Clone 775, Gen 151)
[12:47:21] 
[12:47:21] Assembly optimizations on if available.
[12:47:21] Entering M.D.
[12:47:38] decompressed 517.2 percent)
[12:47:38] 
[12:47:38] Project: 2665 (Run 1, Clone 775, Gen 151)
[12:47:38] 
[12:47:38] Entering M.D.
[12:47:39] e 775, Gen 151)
[12:47:39] 
[12:47:39] Entering M.D.
[12:47:46] Protein: IBX in water
[12:47:46] Writing local files
[12:47:46] 
[12:47:46] ompleted 65000 out of 250000 steps  (26 percent)
[12:47:46] ed 65000 out of 250000 steps  (26 percent)
[12:47:47] Extra SSE boost OK.
[13:02:49] Timered checkpoint triggered.
[13:14:49] Writing local files
[13:14:49] Completed 67500 out of 250000 steps  (27 percent)
[13:29:49] Timered checkpoint triggered.
[13:44:50] Timered checkpoint triggered.
[13:50:09] Writing local files
[13:50:09] Completed 70000 out of 250000 steps  (28 percent)
[14:05:10] Timered checkpoint triggered.
[14:20:11] Timered checkpoint triggered.
[14:22:27] Writing local files
[14:22:27] Completed 72500 out of 250000 steps  (29 percent)
[14:37:28] Timered checkpoint triggered.
[14:49:19] Writing local files
[14:49:19] Completed 75000 out of 250000 steps  (30 percent)
[15:04:19] Timered checkpoint triggered.
[15:16:11] Writing local files
[15:16:11] Completed 77500 out of 250000 steps  (31 percent)
[15:31:12] Timered checkpoint triggered.
[15:43:03] Writing local files
[15:43:03] Completed 80000 out of 250000 steps  (32 percent)
[15:58:04] Timered checkpoint triggered.
[16:13:05] Timered checkpoint triggered.
[16:16:54] Writing local files
[16:16:54] Completed 82500 out of 250000 steps  (33 percent)
[16:31:55] Timered checkpoint triggered.
[16:43:51] Writing local files
[16:43:51] Completed 85000 out of 250000 steps  (34 percent)
[16:58:52] Timered checkpoint triggered.
[17:10:43] Writing local files
[17:10:43] Completed 87500 out of 250000 steps  (35 percent)
[17:25:42] Timered checkpoint triggered.
[17:37:32] Writing local files
[17:37:33] Completed 90000 out of 250000 steps  (36 percent)
[17:52:32] Timered checkpoint triggered.
[18:04:23] Writing local files
[18:04:23] Completed 92500 out of 250000 steps  (37 percent)
[18:19:24] Timered checkpoint triggered.
[18:31:13] Writing local files
[18:31:13] Completed 95000 out of 250000 steps  (38 percent)
[18:46:14] Timered checkpoint triggered.
[18:47:20] - Autosending finished units...
[18:47:20] Trying to send all finished work units
[18:47:20] + No unsent completed units remaining.
[18:47:20] - Autosend completed
[18:58:05] Writing local files
[18:58:06] Completed 97500 out of 250000 steps  (39 percent)
[19:13:06] Timered checkpoint triggered.
[19:22:27] 
[19:22:27] Folding@home Core Shutdown: INTERRUPTED
[19:22:31] CoreStatus = 66 (102)
[19:22:31] + Shutdown requested by user. Exiting.***** Got a SIGTERM signal (15)
[19:22:31] Killing all core threads

Folding@Home Client Shutdown.
Edit: Make that three times:

Code:



--- Opening Log file [November 27 19:37:52] 


# SMP Client ##################################################################
###############################################################################

                       Folding@Home Client Version 6.02

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /home/mlm/FAH/SMP-WORK
Executable: ./fah6
Arguments: -smp -verbosity 9 

[19:37:52] - Ask before connecting: No
[19:37:52] - User name: SKeptical_Thinker (Team 31574)
[19:37:52] - User ID: 40702AC168F0DD5B
[19:37:52] - Machine ID: 1
[19:37:52] 
[19:37:52] Loaded queue successfully.
[19:37:52] - Autosending finished units...
[19:37:52] Trying to send all finished work units
[19:37:52] + No unsent completed units remaining.
[19:37:52] - Autosend completed
[19:37:52] 
[19:37:52] + Processing work unit
[19:37:52] Core required: FahCore_a1.exe
[19:37:52] Core found.
[19:37:52] Working on Unit 04 [November 27 19:37:52]
[19:37:52] + Working ...
[19:37:52] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a1.exe -dir work/ -suffix 04 -checkpoint 15 -verbose -lifeline 7332 -version 602'

[19:37:53] 
[19:37:53] *------------------------------*
[19:37:53] Folding@Home Gromacs SMP Core
[19:37:53] Version 1.74 (November 27, 2006)
[19:37:53] 
[19:37:53] Preparing to commence simulation
[19:37:53] - Ensuring status. Please wait.
[19:37:54] 
[19:37:54] Project: 2665 (Run 1, Clone 775, Gen 151)
[19:37:54] 
[19:37:54] Assembly optimizations on if available.
[19:37:54] Entering M.D.
[19:38:12]  Expanded 4660947 -> 24111057 (decompressed 517.2 percent)
[19:38:12] 
[19:38:12] Project: 2665 (Run 1, Clone 775, Gen 151)
[19:38:12] 
[19:38:13] Entering M.D.
[19:38:20] Protein: IBX in water
[19:38:20] Writing local files
[19:38:21] rcent)
[19:38:21] 39 percent)
[19:38:21]  of 250000 steps  (39 percent)
[19:38:21] Extra SSE boost OK.
[19:41:23] Finalizing output
[19:41:27] CoreStatus = 66 (102)
[19:41:27] + Shutdown requested by user. Exiting.***** Got a SIGTERM signal (15)
[19:41:27] Killing all core threads

Folding@Home Client Shutdown.

Re: Project: 2665 (Run 1, Clone 775, Gen 151)

Posted: Fri Nov 27, 2009 10:19 pm
by Pick2
Which operating system are you using?

Code:

                       Folding@Home Client Version 6.02

[19:37:53] Folding@Home Gromacs SMP Core
[19:37:53] Version 1.74 (November 27, 2006)
You should probably delete the core after this WU finishes (or fails to) and let the client download a new one.
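A minimal sketch of that step, assuming the core binary sits in the client's launch directory under the name shown in the log (FahCore_a1.exe). The helper name retire_core and the ".old" suffix are hypothetical, and it renames the file rather than deleting it so the old core can be put back. Stop the client before running anything like this.

```python
from pathlib import Path
from typing import Optional


def retire_core(launch_dir: str, core_name: str = "FahCore_a1.exe") -> Optional[Path]:
    """Move the core binary aside so the client no longer finds it.

    Returns the backup path, or None if no core file was present.
    (Hypothetical helper; the file name comes from the log above.)
    """
    core = Path(launch_dir) / core_name
    if not core.is_file():
        return None
    backup = core.with_name(core_name + ".old")
    core.rename(backup)  # keep the old core instead of deleting it
    return backup
```

For the setup in this thread the call would be retire_core("/home/mlm/FAH/SMP-WORK"). Whether the client then fetches a newer core on its next start is the behaviour suggested above, not something the sketch checks.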

Re: Project: 2665 (Run 1, Clone 775, Gen 151)

Posted: Sat Nov 28, 2009 1:08 am
by bruce
Pick2 wrote: Which operating system are you using?
He's running Linux.
SKeptical_Thinker wrote:

Code:

Launch directory: /home/mlm/FAH/SMP-WORK
Executable: ./fah6

Re: Project: 2665 (Run 1, Clone 775, Gen 151)

Posted: Sat Nov 28, 2009 1:53 pm
by SKeptical_Thinker
It died:

Code:

# SMP Client ##################################################################
###############################################################################

                       Folding@Home Client Version 6.02

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /home/mlm/FAH/SMP-WORK
Executable: ./fah6
Arguments: -smp -verbosity 9 

[12:30:15] - Ask before connecting: No
[12:30:15] - User name: SKeptical_Thinker (Team 31574)
[12:30:15] - User ID: 40702AC168F0DD5B
[12:30:15] - Machine ID: 1
[12:30:15] 
[12:30:15] Loaded queue successfully.
[12:30:15] - Autosending finished units...
[12:30:15] Trying to send all finished work units
[12:30:15] + No unsent completed units remaining.
[12:30:15] - Autosend completed
[12:30:15] 
[12:30:15] + Processing work unit
[12:30:15] Core required: FahCore_a1.exe
[12:30:15] Core found.
[12:30:15] Working on Unit 04 [November 28 12:30:15]
[12:30:15] + Working ...
[12:30:15] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a1.exe -dir work/ -suffix 04 -checkpoint 15 -verbose -lifeline 2550 -version 602'

[12:30:15] 
[12:30:15] *------------------------------*
[12:30:15] Folding@Home Gromacs SMP Core
[12:30:15] Version 1.74 (November 27, 2006)
[12:30:15] 
[12:30:15] Preparing to commence simulation
[12:30:15] - Ensuring status. Please wait.
[12:30:32] - Looking at optimizations...
[12:30:32] - Working with standard loops on this execution.
[12:30:32] - Previous termination of core was improper.
[12:30:32] - Going to use standard loops.
[12:30:32] - Files status OK
[12:30:34] - Expanded 4660947 -> 24111057 (decompressed 517.2 percent)
[12:30:35] 
[12:30:35] Project: 2665 (Run 1, Clone 775, Gen 151)
[12:30:35] 
[12:30:35] Entering M.D.
[12:30:42] Calling FAH init
[12:30:42] Read topology
[12:30:42] (Starting from checkpoint)
[12:30:43] Read checkpoint
[12:30:43] teps  (74 percent)
[12:30:43] er
[12:30:43] Writing local files
[12:30:43] Completed 185000 out of 250000 steps  (74 percent)
[12:30:44] Extra SSE boost OK.
[12:45:46] Timered checkpoint triggered.
[13:00:48] Timered checkpoint triggered.
[13:07:43] Writing local files
[13:07:43] Completed 187500 out of 250000 steps  (75 percent)
[13:12:02] ess can be made.
[13:12:02] This may be the correct result of the simulation, however if you
[13:12:02]   often see other project units terminating early like this
[13:12:02]   too, you may wish to check the stability of your computer (issues
[13:12:02]   such as high temperature, overclocking, etc.).
[13:12:02] Going to send back what have done.
[13:12:02] logfile size: 184017
[13:12:02] tability of your computer (issues
[13:12:02]   such as high temperature, overclocking, etc.).
[13:12:02] Going to send back what have done.
[13:12:02] logfile size: 184017
[13:12:02] - Writing 184567 bytes of core data to disk...
[13:12:02]   ... Done.
[13:12:04] 
[13:12:04] Folding@home Core Shutdown: EARLY_UNIT_END
[13:12:13] CoreStatus = 72 (114)
[13:12:13] Sending work to server


[13:12:13] + Attempting to send results
[13:12:13] - Reading file work/wuresults_04.dat from core
[13:12:13]   (Read 184567 bytes from disk)
[13:12:13] Connecting to http://171.64.65.64:8080/
[13:12:15] Posted data.
[13:12:15] Initial: 0000; - Uploaded at ~90 kB/s
[13:12:15] - Averaged speed for that direction ~118 kB/s
[13:12:15] + Results successfully sent
[13:12:15] Thank you for your contribution to Folding@Home.
[13:16:19] - Warning: Could not delete all work unit files (4): Core returned invalid code
I had been folding on Ubuntu 9.10 for several WUs before this one failed.
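For anyone comparing logs like the ones in this thread, here is a rough sketch that lists each core shutdown together with the last progress line seen before it. It assumes only the two line formats visible in the logs above ("Completed N out of M steps" and "Folding@home Core Shutdown: STATUS"). The function name summarize_log is made up for illustration, and truncated progress lines (such as the "ompleted 65000 ..." fragments) are deliberately not matched.

```python
import re

# Patterns taken from the log lines quoted in this thread.
PROGRESS = re.compile(r"Completed (\d+) out of (\d+) steps")
SHUTDOWN = re.compile(r"Folding@home Core Shutdown: (\w+)")


def summarize_log(text):
    """Return a list of (shutdown_status, last_progress) tuples.

    last_progress is (steps_done, steps_total) from the most recent
    intact "Completed ..." line, or None if none was seen yet.
    """
    last_progress = None
    shutdowns = []
    for line in text.splitlines():
        match = PROGRESS.search(line)
        if match:
            last_progress = (int(match.group(1)), int(match.group(2)))
        match = SHUTDOWN.search(line)
        if match:
            shutdowns.append((match.group(1), last_progress))
    return shutdowns
```

Run over the logs posted here, this would pair each INTERRUPTED shutdown and the final EARLY_UNIT_END with the step count reached just before it. The third restart would not appear, because that log has no "Core Shutdown:" line.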