Page 1 of 1
Project: 6013 (Run 0, Clone 95, Gen 85)
Posted: Tue May 18, 2010 10:37 pm
by EvilAlchemist
WU Never gets past 0% , just stalls out.
Code: Select all
13:47:07] *------------------------------*
[13:47:07] Folding@Home Gromacs SMP Core
[13:47:07] Version 2.19 (Mar 12, 2010)
[13:47:07]
[13:47:07] Preparing to commence simulation
[13:47:07] - Looking at optimizations...
[13:47:07] - Created dyn
[13:47:07] - Files status OK
[13:47:08] - Expanded 979419 -> 10427873 (decompressed 1064.6 percent)
[13:47:08] Called DecompressByteArray: compressed_data_size=979419 data_size=10427873, decompressed_data_size=10427873 diff=0
[13:47:08] - Digital signature verified
[13:47:08]
[13:47:08] Project: 6013 (Run 0, Clone 95, Gen 85)
[13:47:08]
[13:47:08] Assembly optimizations on if available.
[13:47:08] Entering M.D.
[13:51:30] Completed 0 out of 250000 steps (0%)
[19:42:45] - Autosending finished units... [May 18 19:42:45 UTC]
[19:42:45] Trying to send all finished work units
[19:42:45] + No unsent completed units remaining.
[19:42:45] - Autosend completed
[22:26:03] Killing all core threads
[22:26:03] Could not get process id information. Please kill core process manually
Folding@Home Client Shutdown at user request.
[22:26:03] ***** Got a SIGTERM signal (2)
[22:26:03] Killing all core threads
[22:26:03] Could not get process id information. Please kill core process manually
Folding@Home Client Shutdown.
Re: Project: 6013 (Run 0, Clone 95, Gen 85)
Posted: Tue May 18, 2010 10:59 pm
by toTOW
There's no data for this WU in the DB yet ... if it doesn't in a couple of days, I'll mark it as bad.
Project 6013 (Run 0, Clone 95, Gen 85)
Posted: Wed Jun 16, 2010 6:34 pm
by Bob8421
I was assigned that work unit this morning. After more than an hour, the console was still showing the 0% message. Since project 6013 on that system typically takes 7.5 minutes per step, I assumed that the client had died. I stopped and restarted it, but it still had not reached the 1% mark. Going from 1% to 2% took 1:15:10, making it about 125 hours to completion, which is way beyond the 72 hour deadline.
I was not fond of project 6013 before this, but now I absolutely HATE it!
Re: Project 6013 (Run 0, Clone 95, Gen 85)
Posted: Wed Jun 16, 2010 7:09 pm
by hootis
Post your log file plz
Re: Project 6013 (Run 0, Clone 95, Gen 85)
Posted: Wed Jun 16, 2010 10:32 pm
by Bob8421
When I restarted the client to finish the first step (from 0% to 1%) it gave the following messages:
- Looking at optimizations...
- Working with standard loops on this execution.
- Previous termination of core was improper.
That would explain why the next steps (from 1% to 4%) took so long, but it doesn't explain why the first step, when optimizations were on, also took the exact same length.
I also don't understand what it means by improper termination since I terminated the client the same way I always have (the X at top right of the window).
Restarting the client again gave me the same messages, so on the last restart I used the -forceasm flag, but the following step (from 4% to 5%) also took 1.25 hours to complete.
Code: Select all
[15:13:42] Completed 480000 out of 500000 steps (96%)
[15:21:22] Completed 485000 out of 500000 steps (97%)
[15:29:02] Completed 490000 out of 500000 steps (98%)
[15:36:42] Completed 495000 out of 500000 steps (99%)
[15:44:22] Completed 500000 out of 500000 steps (100%)
[15:44:23] DynamicWrapper: Finished Work Unit: sleep=10000
[15:44:33]
[15:44:33] Finished Work Unit:
[15:44:33] - Reading up to 20457096 from "work/wudata_03.trr": Read 20457096
[15:44:33] trr file hash check passed.
[15:44:33] edr file hash check passed.
[15:44:33] logfile size: 58833
[15:44:33] Leaving Run
[15:44:34] - Writing 20551489 bytes of core data to disk...
[15:44:35] ... Done.
[15:44:38] - Shutting down core
[15:44:38]
[15:44:38] Folding@home Core Shutdown: FINISHED_UNIT
[15:44:41] CoreStatus = 64 (100)
[15:44:41] Unit 3 finished with 91 percent of time to deadline remaining.
[15:44:41] Updated performance fraction: 0.884096
[15:44:41] Sending work to server
[15:44:41] Project: 6014 (Run 3, Clone 180, Gen 45)
[15:44:41] + Attempting to send results [June 16 15:44:41 UTC]
[15:44:41] - Reading file work/wuresults_03.dat from core
[15:44:41] (Read 20551489 bytes from disk)
[15:44:41] Connecting to http://130.237.232.140:8080/
[15:47:09] Posted data.
[15:47:09] Initial: 0000; - Uploaded at ~134 kB/s
[15:47:10] - Averaged speed for that direction ~123 kB/s
[15:47:10] + Results successfully sent
[15:47:10] Thank you for your contribution to Folding@Home.
[15:47:10] + Number of Units Completed: 53
[15:47:15] Trying to send all finished work units
[15:47:15] + No unsent completed units remaining.
[15:47:15] - Preparing to get new work unit...
[15:47:15] Cleaning up work directory
[15:47:15] + Attempting to get work packet
[15:47:15] Passkey found
[15:47:15] - Will indicate memory of 3582 MB
[15:47:15] - Connecting to assignment server
[15:47:15] Connecting to http://assign.stanford.edu:8080/
[15:47:16] Posted data.
[15:47:16] Initial: ED82; - Successful: assigned to (130.237.232.140).
[15:47:16] + News From Folding@Home: Welcome to Folding@Home
[15:47:16] Loaded queue successfully.
[15:47:16] Connecting to http://130.237.232.140:8080/
[15:47:20] Posted data.
[15:47:20] Initial: 0000; - Receiving payload (expected size: 979931)
[15:47:24] - Downloaded at ~239 kB/s
[15:47:24] - Averaged speed for that direction ~309 kB/s
[15:47:24] + Received work.
[15:47:24] Trying to send all finished work units
[15:47:24] + No unsent completed units remaining.
[15:47:24] + Closed connections
[15:47:24]
[15:47:24] + Processing work unit
[15:47:24] Core required: FahCore_a3.exe
[15:47:24] Core found.
[15:47:24] Working on queue slot 04 [June 16 15:47:24 UTC]
[15:47:24] + Working ...
[15:47:24] - Calling '.\FahCore_a3.exe -dir work/ -nice 19 -suffix 04 -np 4 -checkpoint 10 -verbose -lifeline 3660 -version 629'
[15:47:24]
[15:47:24] *------------------------------*
[15:47:24] Folding@Home Gromacs SMP Core
[15:47:24] Version 2.19 (Mar 12, 2010)
[15:47:24]
[15:47:24] Preparing to commence simulation
[15:47:24] - Looking at optimizations...
[15:47:24] - Created dyn
[15:47:24] - Files status OK
[15:47:26] - Expanded 979419 -> 10427873 (decompressed 1064.6 percent)
[15:47:26] Called DecompressByteArray: compressed_data_size=979419 data_size=10427873, decompressed_data_size=10427873 diff=0
[15:47:26] - Digital signature verified
[15:47:26]
[15:47:26] Project: 6013 (Run 0, Clone 95, Gen 85)
[15:47:26]
[15:47:26] Assembly optimizations on if available.
[15:47:26] Entering M.D.
[15:47:51] Completed 0 out of 250000 steps (0%)
[16:58:55] Killing all core threads
[16:58:55] Could not get process id information. Please kill core process manually
Folding@Home Client Shutdown at user request.
[16:58:55] ***** Got a SIGTERM signal (2)
[16:58:55] Killing all core threads
[16:58:55] Could not get process id information. Please kill core process manually
Folding@Home Client Shutdown.
--- Opening Log file [June 16 16:58:58 UTC]
# Windows SMP Console Edition #################################################
###############################################################################
Folding@Home Client Version 6.29
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: F:\Folding@Home SMP2 Console 2
Executable: F:\Folding@Home SMP2 Console 2\fah6.exe
Arguments: -smp -verbosity 9
[16:58:58] - Ask before connecting: No
[16:58:58] - User name: Bob8421 (Team 11314)
[16:58:58] - User ID: 2991D43E138E9B64
[16:58:58] - Machine ID: 6
[16:58:58]
[16:58:59] Loaded queue successfully.
[16:58:59]
[16:58:59] - Autosending finished units... [June 16 16:58:59 UTC]
[16:58:59] + Processing work unit
[16:58:59] Trying to send all finished work units
[16:58:59] Core required: FahCore_a3.exe
[16:58:59] + No unsent completed units remaining.
[16:58:59] - Autosend completed
[16:58:59] Core found.
[16:58:59] Working on queue slot 04 [June 16 16:58:59 UTC]
[16:58:59] + Working ...
[16:58:59] - Calling '.\FahCore_a3.exe -dir work/ -nice 19 -suffix 04 -np 4 -checkpoint 10 -verbose -lifeline 708 -version 629'
[16:58:59]
[16:58:59] *------------------------------*
[16:58:59] Folding@Home Gromacs SMP Core
[16:58:59] Version 2.19 (Mar 12, 2010)
[16:58:59]
[16:58:59] Preparing to commence simulation
[16:58:59] - Ensuring status. Please wait.
[16:59:08] - Looking at optimizations...
[16:59:08] - Working with standard loops on this execution.
[16:59:08] - Previous termination of core was improper.
[16:59:08] - Files status OK
[16:59:10] - Expanded 979419 -> 10427873 (decompressed 1064.6 percent)
[16:59:10] Called DecompressByteArray: compressed_data_size=979419 data_size=10427873, decompressed_data_size=10427873 diff=0
[16:59:10] - Digital signature verified
[16:59:10]
[16:59:10] Project: 6013 (Run 0, Clone 95, Gen 85)
[16:59:10]
[16:59:10] Entering M.D.
[16:59:16] Using Gromacs checkpoints
[16:59:18] Resuming from checkpoint
[16:59:18] Verified work/wudata_04.log
[16:59:18] Verified work/wudata_04.trr
[16:59:18] Verified work/wudata_04.xtc
[16:59:18] Verified work/wudata_04.edr
[16:59:36] Completed 2338 out of 250000 steps (0%)
[17:04:24] Completed 2500 out of 250000 steps (1%)
[18:19:34] Completed 5000 out of 250000 steps (2%)
[19:34:43] Completed 7500 out of 250000 steps (3%)
[20:49:51] Completed 10000 out of 250000 steps (4%)
[20:52:12] Killing all core threads
[20:52:12] Could not get process id information. Please kill core process manually
Folding@Home Client Shutdown at user request.
[20:52:12] ***** Got a SIGTERM signal (2)
[20:52:12] Killing all core threads
[20:52:12] Could not get process id information. Please kill core process manually
Folding@Home Client Shutdown.
--- Opening Log file [June 16 20:53:17 UTC]
# Windows SMP Console Edition #################################################
###############################################################################
Folding@Home Client Version 6.29
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: F:\Folding@Home SMP2 Console 2
Executable: F:\Folding@Home SMP2 Console 2\fah6.exe
Arguments: -smp -verbosity 9
[20:53:17] - Ask before connecting: No
[20:53:17] - User name: Bob8421 (Team 11314)
[20:53:17] - User ID: 2991D43E138E9B64
[20:53:17] - Machine ID: 6
[20:53:17]
[20:53:18] Loaded queue successfully.
[20:53:18]
[20:53:18] - Autosending finished units... [June 16 20:53:18 UTC]
[20:53:18] + Processing work unit
[20:53:18] Trying to send all finished work units
[20:53:18] Core required: FahCore_a3.exe
[20:53:18] + No unsent completed units remaining.
[20:53:18] - Autosend completed
[20:53:18] Core found.
[20:53:18] Working on queue slot 04 [June 16 20:53:18 UTC]
[20:53:18] + Working ...
[20:53:18] - Calling '.\FahCore_a3.exe -dir work/ -nice 19 -suffix 04 -np 4 -checkpoint 10 -verbose -lifeline 316 -version 629'
[20:53:18]
[20:53:18] *------------------------------*
[20:53:18] Folding@Home Gromacs SMP Core
[20:53:18] Version 2.19 (Mar 12, 2010)
[20:53:18]
[20:53:18] Preparing to commence simulation
[20:53:18] - Ensuring status. Please wait.
[20:53:27] - Looking at optimizations...
[20:53:27] - Working with standard loops on this execution.
[20:53:27] - Previous termination of core was improper.
[20:53:27] - Going to use standard loops.
[20:53:27] - Files status OK
[20:53:29] - Expanded 979419 -> 10427873 (decompressed 1064.6 percent)
[20:53:29] Called DecompressByteArray: compressed_data_size=979419 data_size=10427873, decompressed_data_size=10427873 diff=0
[20:53:29] - Digital signature verified
[20:53:29]
[20:53:29] Project: 6013 (Run 0, Clone 95, Gen 85)
[20:53:29]
[20:53:29] Entering M.D.
[20:53:35] Using Gromacs checkpoints
[20:53:37] Resuming from checkpoint
[20:53:37] Verified work/wudata_04.log
[20:53:37] Verified work/wudata_04.trr
[20:53:37] Verified work/wudata_04.xtc
[20:53:37] Verified work/wudata_04.edr
[20:53:55] Completed 9998 out of 250000 steps (3%)
[20:53:55] Completed 10000 out of 250000 steps (4%)
[20:54:10] Killing all core threads
[20:54:10] Could not get process id information. Please kill core process manually
Folding@Home Client Shutdown at user request.
[20:54:10] ***** Got a SIGTERM signal (2)
[20:54:10] Killing all core threads
[20:54:10] Could not get process id information. Please kill core process manually
Folding@Home Client Shutdown.
--- Opening Log file [June 16 20:55:32 UTC]
# Windows SMP Console Edition #################################################
###############################################################################
Folding@Home Client Version 6.29
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: F:\Folding@Home SMP2 Console 2
Executable: fah6.exe
Arguments: -smp -verbosity 9 -forceasm -oneunit -smp -verbosity 9
[20:55:32] - Ask before connecting: No
[20:55:32] - User name: Bob8421 (Team 11314)
[20:55:32] - User ID: 2991D43E138E9B64
[20:55:32] - Machine ID: 6
[20:55:32]
[20:55:32] Loaded queue successfully.
[20:55:32]
[20:55:32] - Autosending finished units... [June 16 20:55:32 UTC]
[20:55:32] + Processing work unit
[20:55:32] Trying to send all finished work units
[20:55:32] Core required: FahCore_a3.exe
[20:55:32] + No unsent completed units remaining.
[20:55:32] - Autosend completed
[20:55:32] Core found.
[20:55:32] Working on queue slot 04 [June 16 20:55:32 UTC]
[20:55:32] + Working ...
[20:55:32] - Calling '.\FahCore_a3.exe -dir work/ -nice 19 -suffix 04 -np 4 -checkpoint 10 -forceasm -verbose -lifeline 588 -version 629'
[20:55:32]
[20:55:32] *------------------------------*
[20:55:32] Folding@Home Gromacs SMP Core
[20:55:32] Version 2.19 (Mar 12, 2010)
[20:55:32]
[20:55:32] Preparing to commence simulation
[20:55:32] - Ensuring status. Please wait.
[20:55:42] - Assembly optimizations manually forced on.
[20:55:42] - Not checking prior termination.
[20:55:43] - Expanded 979419 -> 10427873 (decompressed 1064.6 percent)
[20:55:43] Called DecompressByteArray: compressed_data_size=979419 data_size=10427873, decompressed_data_size=10427873 diff=0
[20:55:43] - Digital signature verified
[20:55:43]
[20:55:43] Project: 6013 (Run 0, Clone 95, Gen 85)
[20:55:43]
[20:55:43] Assembly optimizations on if available.
[20:55:43] Entering M.D.
[20:55:49] Using Gromacs checkpoints
[20:55:52] Resuming from checkpoint
[20:55:52] Verified work/wudata_04.log
[20:55:52] Verified work/wudata_04.trr
[20:55:52] Verified work/wudata_04.xtc
[20:55:52] Verified work/wudata_04.edr
[20:56:09] Completed 9998 out of 250000 steps (3%)
[20:56:09] Completed 10000 out of 250000 steps (4%)
[22:11:13] Completed 12500 out of 250000 steps (5%)
[22:11:26] Killing all core threads
[22:11:26] Could not get process id information. Please kill core process manually
Folding@Home Client Shutdown at user request.
[22:11:26] ***** Got a SIGTERM signal (2)
[22:11:26] Killing all core threads
[22:11:26] Could not get process id information. Please kill core process manually
Folding@Home Client Shutdown.
Re: Project 6013 (Run 0, Clone 95, Gen 85)
Posted: Wed Jun 16, 2010 10:46 pm
by glussier
I had a few of those, my q9650@4ghz can barely make the deadline on these workunits. I decided to stop this machine until I can get something else. Usually, this computer can do 9.5k ppd with the 6013, but for the past few days, my 9650 will only get the base points.
Folding should be something we set an forget. If Stanford can't get 6 months without any problem, I think I'll do something else with my computers, I don't have time to keep babysitting these computers.
Re: Project 6013 (Run 0, Clone 95, Gen 85)
Posted: Wed Jun 16, 2010 10:52 pm
by hootis
you have to Ctrl-c to close properly
Re: Project 6013 (Run 0, Clone 95, Gen 85)
Posted: Thu Jun 17, 2010 12:05 am
by Bob8421
hootis wrote:you have to Ctrl-c to close properly
I've never done that before, whether with the console client or the SMP client, and I never had a problem until now.
And isn't Ctrl-C the Copy command???
Re: Project 6013 (Run 0, Clone 95, Gen 85)
Posted: Thu Jun 17, 2010 12:07 am
by Bob8421
glussier wrote:I don't have time to keep babysitting these computers.
I kind of have the same feeling, but by choosing a beta client we are agreeing to accept a certain amount of babysitting.
Re: Project 6013 (Run 0, Clone 95, Gen 85)
Posted: Thu Jun 17, 2010 12:39 am
by PantherX
Bob8421 wrote:hootis wrote:you have to Ctrl-c to close properly
I've never done that before, whether with the console client or the SMP client, and I never had a problem until now.
And isn't Ctrl-C the Copy command???
Not when you are using it on a Command Line Interface. I have also read somewhere in this forum that the message about improper shutdown is a cosmetic one and doesn't effect the SMP2 Clients. The advance features (-forceasm) are hardcoded in the Core itself thus there isn't any need to use it. You can also use the X to Close. I too have seen this messages in my FAHLog and my system runs F@H smoothly.
Re: Project 6013 (Run 0, Clone 95, Gen 85)
Posted: Thu Jun 17, 2010 2:54 am
by 7im
PantherX wrote:...
Not when you are using it on a Command Line Interface. ... You can also use the X to Close...
You can use a lot of things to end the CLI client, but we only recommend the Ctrl+C as the correct way to close the command line client gracefully. X, ending the task, alt+F4, holding in the power button on the PC for 5 seconds, and a 12 gauge shotgun will all shut down the client too, just not as risk free as ctrl+c.
ctrl+c is the best answer.
Re: Project 6013 (Run 0, Clone 95, Gen 85)
Posted: Thu Jun 17, 2010 3:31 am
by stevehat1
I too have had issues with 6013's in the last two days, both units have caused an "A3 core exe shutdown" message to appear in Vista 32. This happens almost instantly as there is no progress shown in the console. The machine that this is happening on is probably a 99%+ effective machine and is a designated folder with 2 instances of GPU3 and 1 instance SMP2 running.
Both of these events were followed with 6701 WU's upon restarting the client and they finished just fine (kinda like adding insult to injury, 12 hours of down from bad WU's and then being rewarded with the crappiest A3's to date)
Re: Project 6013 (Run 0, Clone 95, Gen 85)
Posted: Sun Jun 20, 2010 1:51 pm
by 58Enfield
I received a P6013 R0 C95 G85 also, and it displays the same slow behavior as shown in the log.
Code: Select all
[06:12:55] + Number of Units Completed: 184
[06:12:56] Trying to send all finished work units
[06:12:56] + No unsent completed units remaining.
[06:12:56] - Preparing to get new work unit...
[06:12:56] Cleaning up work directory
[06:12:56] + Attempting to get work packet
[06:12:56] Passkey found
[06:12:56] - Will indicate memory of 1956 MB
[06:12:56] - Connecting to assignment server
[06:12:56] Connecting to http://assign.stanford.edu:8080/
[06:12:57] Posted data.
[06:12:57] Initial: ED82; - Successful: assigned to (130.237.232.140).
[06:12:57] + News From Folding@Home: Welcome to Folding@Home
[06:12:57] Loaded queue successfully.
[06:12:57] Connecting to http://130.237.232.140:8080/
[06:13:01] Posted data.
[06:13:01] Initial: 0000; - Receiving payload (expected size: 979931)
[06:13:04] - Downloaded at ~318 kB/s
[06:13:04] - Averaged speed for that direction ~636 kB/s
[06:13:04] + Received work.
[06:13:04] Trying to send all finished work units
[06:13:04] + No unsent completed units remaining.
[06:13:04] + Closed connections
[06:13:04]
[06:13:04] + Processing work unit
[06:13:04] Core required: FahCore_a3.exe
[06:13:04] Core found.
[06:13:04] Working on queue slot 00 [June 20 06:13:04 UTC]
[06:13:04] + Working ...
[06:13:04] - Calling './FahCore_a3.exe -dir work/ -nice 19 -suffix 00 -np 4 -checkpoint 15 -verbose -lifeline 2430 -version 629'
[06:13:04]
[06:13:04] *------------------------------*
[06:13:04] Folding@Home Gromacs SMP Core
[06:13:04] Version 2.22 (June 10, 2010)
[06:13:04]
[06:13:04] Preparing to commence simulation
[06:13:04] - Looking at optimizations...
[06:13:04] - Created dyn
[06:13:04] - Files status OK
[06:13:05] - Expanded 979419 -> 10427873 (decompressed 1064.6 percent)
[06:13:05] Called DecompressByteArray: compressed_data_size=979419 data_size=10427873, decompressed_data_size=10427873 diff=0
[06:13:05] - Digital signature verified
[06:13:05]
[06:13:05] Project: 6013 (Run 0, Clone 95, Gen 85)
[06:13:05]
[06:13:05] Assembly optimizations on if available.
[06:13:05] Entering M.D.
Starting 4 threads
NNODES=4, MYRANK=2, HOSTNAME=thread #2
NNODES=4, MYRANK=3, HOSTNAME=thread #3
NNODES=4, MYRANK=1, HOSTNAME=thread #1
NNODES=4, MYRANK=0, HOSTNAME=thread #0
Reading file work/wudata_00.tpr, VERSION 4.0.99_development_20090605 (single precision)
Note: tpx file_version 68, software version 70
Making 1D domain decomposition 1 x 1 x 4
starting mdrun 'IBX in water'
21500002 steps, 43000.0 ps (continuing from step 21250002, 42500.0 ps).
[06:13:23] Completed 0 out of 250000 steps (0%)
[07:04:27] Completed 2500 out of 250000 steps (1%)
[07:55:30] Completed 5000 out of 250000 steps (2%)
[08:46:34] Completed 7500 out of 250000 steps (3%)
[09:37:37] Completed 10000 out of 250000 steps (4%)
[10:28:41] Completed 12500 out of 250000 steps (5%)
[11:19:44] Completed 15000 out of 250000 steps (6%)
[11:54:23] - Autosending finished units... [June 20 11:54:23 UTC]
[11:54:23] Trying to send all finished work units
[11:54:23] + No unsent completed units remaining.
[11:54:23] - Autosend completed
[12:10:48] Completed 17500 out of 250000 steps (7%)
51:03 tpf is
not normal for that machine as QD shows.....
Code: Select all
Index 7: finished 921.00 pts (47.591 pt/hr, 1141.83 ppd) 7.44 X min speed
bonus pts: 5086.26 (262.743 pt/hr, 6305.83 ppd); bonus factor: 5.52; kfactor: 4.10
server: 171.64.65.56:8080; project: 6701
Folding: run 68, clone 26, generation 3; benchmark 0; misc: 500, 200, 12 (le)
issue: Fri Jun 18 10:26:41 2010; begin: Fri Jun 18 10:27:02 2010
end: Sat Jun 19 05:48:11 2010; due: Thu Jun 24 10:27:02 2010 (6 days)
preferred: Mon Jun 21 15:15:02 2010 (3 days)
user: 58Enfield; team: 131; ID: XXXXXXXXXXXXXXXX; mach ID: 1
(switched to new version Fahcore_a3.exe)
Index 8: finished 470.00 pts (49.703 pt/hr, 1191.93 ppd) 15.2 X min speed
bonus pts: 2592.78 (273.974 pt/hr, 6575.37 ppd); bonus factor: 5.52; kfactor: 2.00
server: 130.237.232.140:8080; project: 6012
Folding: run 1, clone 302, generation 75; benchmark 0; misc: 500, 600, 12 (le)
issue: Sat Jun 19 05:55:50 2010; begin: Sat Jun 19 05:56:17 2010
end: Sat Jun 19 15:23:39 2010; due: Fri Jun 25 05:56:17 2010 (6 days)
preferred: Tue Jun 22 05:56:17 2010 (3 days)
user: 58Enfield; team: 131; ID: XXXXXXXXXXXXXXXX; mach ID: 1
Index 9: finished 380.00 pts (49.520 pt/hr, 1187.03 ppd) 9.38 X min speed
bonus pts: 2132.32 (277.535 pt/hr, 6660.85 ppd); bonus factor: 5.61; kfactor: 3.36
server: 130.237.232.140:8080; project: 6013
Folding: run 0, clone 84, generation 184; benchmark 0; misc: 500, 600, 12 (le)
issue: Sat Jun 19 15:27:06 2010; begin: Sat Jun 19 15:27:40 2010
end: Sat Jun 19 23:08:05 2010; due: Tue Jun 22 15:27:40 2010 (3 days)
preferred: Tue Jun 22 15:27:40 2010 (3 days)
user: 58Enfield; team: 131; ID: XXXXXXXXXXXXXXXXX; mach ID: 1
Index 0: folding now 380.00 pts (4.461 pt/hr, 107.06 ppd) 0.845 X min speed; 7% complete
server: 130.237.232.140:8080; project: 6013
Folding: run 0, clone 95, generation 85; benchmark 0; misc: 500, 600, 12 (le)
issue: Sat Jun 19 23:12:31 2010; begin: Sat Jun 19 23:13:04 2010
expect: Wed Jun 23 12:23:32 2010; due: Tue Jun 22 23:13:04 2010 (3 days)
preferred: Tue Jun 22 23:13:04 2010 (3 days)
user: 58Enfield; team: 131; ID: XXXXXXXXXXXXXXXXX; mach ID: 1
Average download rate 652.035 KB/s (u=4); upload rate 85.111 KB/s (u=4)
Performance fraction 0.904590 (u=4)
Average pph: 45.692, ppd: 1096.60, ppw: 7676.2, ppy: 400523
Average bonus pph: 261.897, ppd: 6285.53, ppw: 43998.7, ppy: 2295726
Average alternate pph: 30.159, ppd: 723.82, ppw: 5066.7, ppy: 264367
Average alternate bonus pph: 261.897, ppd: 6285.53, ppw: 43998.7, ppy: 2295726
Given the information on the other 6013 thread about getting the same defective work unit back over and over, I have already renamed the folding directory and setup a new folder on that machine.
The only wrinkles are that I did upgrade to the new core version three work units back, and all work units have been running faster except this one (other machines included). This machine was getting marginal on heat under the old core (59-61C @ 30C ambient)...and went to 63-64C under the new core. No complaints...it is also working harder as the QD log shows. It is going to be offline today while I re-validate it at a lower overclock (and hopefully lower heat).
Old specs: 3.4 gh Q6600 dedicated Kubuntu 8.04.4 2.6.24-28 generic
Re: Project: 6013 (Run 0, Clone 95, Gen 85)
Posted: Mon Jun 28, 2010 5:48 pm
by bollix47
Apparently this one is still being assigned.
Code: Select all
[17:10:23] - Calling './FahCore_a3.exe -dir work/ -nice 19 -suffix 04 -np 8 -checkpoint 30 -verbose -lifeline 3975 -version 629'
[17:10:24]
[17:10:24] *------------------------------*
[17:10:24] Folding@Home Gromacs SMP Core
[17:10:24] Version 2.22 (June 10, 2010)
[17:10:24]
[17:10:24] Preparing to commence simulation
[17:10:24] - Looking at optimizations...
[17:10:24] - Created dyn
[17:10:24] - Files status OK
[17:10:24] - Expanded 979419 -> 10427873 (decompressed 1064.6 percent)
[17:10:24] Called DecompressByteArray: compressed_data_size=979419 data_size=10427873, decompressed_data_size=10427873 diff=0
[17:10:24] - Digital signature verified
[17:10:24]
[17:10:24] Project: 6013 (Run 0, Clone 95, Gen 85)
[17:10:24]
[17:10:24] Assembly optimizations on if available.
[17:10:24] Entering M.D.
Note: tpx file_version 68, software version 70
Making 3D domain decomposition 2 x 2 x 2
starting mdrun 'IBX in water'
21500002 steps, 43000.0 ps (continuing from step 21250002, 42500.0 ps).
[17:10:51] Completed 0 out of 250000 steps (0%)
tMPI error: Invalid buffer (null pointer in send or receive buffer) (in valid comm)
tMPI error: Invalid buffer (null pointer in send or receive buffer) (in valid comm)
tMPI error: Invalid buffer (null pointer in send or receive buffer) (in valid comm)
Aborted
[17:40:55] CoreStatus = 86 (134)
[17:40:55] Client-core communications error: ERROR 0x86
[17:40:55] Deleting current work unit & continuing...
[17:41:05] Trying to send all finished work units
[17:41:05] + No unsent completed units remaining.
[17:41:05] - Preparing to get new work unit...
[17:41:05] Cleaning up work directory
[17:41:05] + Attempting to get work packet
Re: Project 6013 (Run 0, Clone 95, Gen 85)
Posted: Wed Jun 30, 2010 7:12 am
by bruce
p6013 has been suspended.