Project 6013 (Run 0, Clone 95, Gen 85)

Moderators: Site Moderators, FAHC Science Team

Post Reply
EvilAlchemist
Posts: 53
Joined: Fri Feb 08, 2008 4:24 pm
Hardware configuration: 2 x X5550 Xeons - SuperMicro MBD-X8DAi-O
Server 2008 R2 x64 - 12GB Crucial DDR3 ECC Ram
PCP&C 910 Silencer - 1 x HIS 4850 ICEQ Turbo Edition

6 x E5530 Xeons (3 Systems) - SUPERMICRO MBD-X8DTL-i-O
Server 2008 RS x64 - 8GB DDR3 GSkill Non-ECC Ram
Seasonic 80+ Bronze 380w PSU

2 x E5504 - SUPERMICRO MBD-X8DTL-i-O
Server 2008 R2 x64 - 6GB DDR3 GSkill Non-ECC Ram
2.3 TB Raid 5 Array - Corsair 520 Power Supply

E5504 - EVGA X58 ATX Motherboard
Windows 7 x64 - 6GB DDR3 GSkill Non-ECC Ram
Seasonic 300 Power Supply

Intel X5550 CPU - EVGA X58 Micro ATX Motherboard
Windows 7 x64 - 3GB Corsair DDR3-1600
Corsair 550 Power Supply - ATI 4350

Dell Vostro 1500 Laptop - Intel T9300 C2D CPU
Windows 7 x64 - 4 GB DDR2-6400 - nVidia 8400m GS

Xeon 3075 C2D - Intel P35 Motherboard - 4GB DDR2 Non-ECC Ram
Server 2008 R2 x64- Seasonic 300 Power Supply
Location: Columbia, Tennessee
Contact:

Project: 6013 (Run 0, Clone 95, Gen 85)

Post by EvilAlchemist »

WU Never gets past 0% , just stalls out.

Code: Select all

13:47:07] *------------------------------*
[13:47:07] Folding@Home Gromacs SMP Core
[13:47:07] Version 2.19 (Mar 12, 2010)
[13:47:07] 
[13:47:07] Preparing to commence simulation
[13:47:07] - Looking at optimizations...
[13:47:07] - Created dyn
[13:47:07] - Files status OK
[13:47:08] - Expanded 979419 -> 10427873 (decompressed 1064.6 percent)
[13:47:08] Called DecompressByteArray: compressed_data_size=979419 data_size=10427873, decompressed_data_size=10427873 diff=0
[13:47:08] - Digital signature verified
[13:47:08] 
[13:47:08] Project: 6013 (Run 0, Clone 95, Gen 85)
[13:47:08] 
[13:47:08] Assembly optimizations on if available.
[13:47:08] Entering M.D.
[13:51:30] Completed 0 out of 250000 steps  (0%)
[19:42:45] - Autosending finished units... [May 18 19:42:45 UTC]
[19:42:45] Trying to send all finished work units
[19:42:45] + No unsent completed units remaining.
[19:42:45] - Autosend completed
[22:26:03] Killing all core threads
[22:26:03] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown at user request.
[22:26:03] ***** Got a SIGTERM signal (2)
[22:26:03] Killing all core threads
[22:26:03] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown.
toTOW
Site Moderator
Posts: 6359
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France
Contact:

Re: Project: 6013 (Run 0, Clone 95, Gen 85)

Post by toTOW »

There's no data for this WU in the DB yet ... if it doesn't in a couple of days, I'll mark it as bad.
Image

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
Bob8421
Posts: 53
Joined: Tue Dec 22, 2009 5:16 pm

Project 6013 (Run 0, Clone 95, Gen 85)

Post by Bob8421 »

I was assigned that work unit this morning. After more than an hour, the console was still showing the 0% message. Since project 6013 on that system typically takes 7.5 minutes per step, I assumed that the client had died. I stopped and restarted it, but it still had not reached the 1% mark. Going from 1% to 2% took 1:15:10, making it about 125 hours to completion, which is way beyond the 72 hour deadline.

I was not fond of project 6013 before this, but now I absolutely HATE it!
hootis
Posts: 70
Joined: Fri Nov 27, 2009 3:34 am
Hardware configuration: AMD Phenom II 955 @ 3.8 +ATi HD 5870
AMD Athlon II 610e @ 2.5 + 2x Nvidia 560Ti @ 1ghz
AMD Athlon II 605e @ 2.4 +Nvidia 550Ti + Nvidia GT 240
AMD Athlon II 620 @ 3.3ghz

Re: Project 6013 (Run 0, Clone 95, Gen 85)

Post by hootis »

Post your log file plz
Things yet to materialize.
Bob8421
Posts: 53
Joined: Tue Dec 22, 2009 5:16 pm

Re: Project 6013 (Run 0, Clone 95, Gen 85)

Post by Bob8421 »

When I restarted the client to finish the first step (from 0% to 1%) it gave the following messages:
- Looking at optimizations...
- Working with standard loops on this execution.
- Previous termination of core was improper.
That would explain why the next steps (from 1% to 4%) took so long, but it doesn't explain why the first step, when optimizations were on, also took the exact same length.

I also don't understand what it means by improper termination since I terminated the client the same way I always have (the X at top right of the window).

Restarting the client again gave me the same messages, so on the last restart I used the -forceasm flag, but the following step (from 4% to 5%) also took 1.25 hours to complete.

Code: Select all

[15:13:42] Completed 480000 out of 500000 steps  (96%)
[15:21:22] Completed 485000 out of 500000 steps  (97%)
[15:29:02] Completed 490000 out of 500000 steps  (98%)
[15:36:42] Completed 495000 out of 500000 steps  (99%)
[15:44:22] Completed 500000 out of 500000 steps  (100%)
[15:44:23] DynamicWrapper: Finished Work Unit: sleep=10000
[15:44:33] 
[15:44:33] Finished Work Unit:
[15:44:33] - Reading up to 20457096 from "work/wudata_03.trr": Read 20457096
[15:44:33] trr file hash check passed.
[15:44:33] edr file hash check passed.
[15:44:33] logfile size: 58833
[15:44:33] Leaving Run
[15:44:34] - Writing 20551489 bytes of core data to disk...
[15:44:35]   ... Done.
[15:44:38] - Shutting down core
[15:44:38] 
[15:44:38] Folding@home Core Shutdown: FINISHED_UNIT
[15:44:41] CoreStatus = 64 (100)
[15:44:41] Unit 3 finished with 91 percent of time to deadline remaining.
[15:44:41] Updated performance fraction: 0.884096
[15:44:41] Sending work to server
[15:44:41] Project: 6014 (Run 3, Clone 180, Gen 45)
[15:44:41] + Attempting to send results [June 16 15:44:41 UTC]
[15:44:41] - Reading file work/wuresults_03.dat from core
[15:44:41]   (Read 20551489 bytes from disk)
[15:44:41] Connecting to http://130.237.232.140:8080/
[15:47:09] Posted data.
[15:47:09] Initial: 0000; - Uploaded at ~134 kB/s
[15:47:10] - Averaged speed for that direction ~123 kB/s
[15:47:10] + Results successfully sent
[15:47:10] Thank you for your contribution to Folding@Home.
[15:47:10] + Number of Units Completed: 53

[15:47:15] Trying to send all finished work units
[15:47:15] + No unsent completed units remaining.
[15:47:15] - Preparing to get new work unit...
[15:47:15] Cleaning up work directory
[15:47:15] + Attempting to get work packet
[15:47:15] Passkey found
[15:47:15] - Will indicate memory of 3582 MB
[15:47:15] - Connecting to assignment server
[15:47:15] Connecting to http://assign.stanford.edu:8080/
[15:47:16] Posted data.
[15:47:16] Initial: ED82; - Successful: assigned to (130.237.232.140).
[15:47:16] + News From Folding@Home: Welcome to Folding@Home
[15:47:16] Loaded queue successfully.
[15:47:16] Connecting to http://130.237.232.140:8080/
[15:47:20] Posted data.
[15:47:20] Initial: 0000; - Receiving payload (expected size: 979931)
[15:47:24] - Downloaded at ~239 kB/s
[15:47:24] - Averaged speed for that direction ~309 kB/s
[15:47:24] + Received work.
[15:47:24] Trying to send all finished work units
[15:47:24] + No unsent completed units remaining.
[15:47:24] + Closed connections
[15:47:24] 
[15:47:24] + Processing work unit
[15:47:24] Core required: FahCore_a3.exe
[15:47:24] Core found.
[15:47:24] Working on queue slot 04 [June 16 15:47:24 UTC]
[15:47:24] + Working ...
[15:47:24] - Calling '.\FahCore_a3.exe -dir work/ -nice 19 -suffix 04 -np 4 -checkpoint 10 -verbose -lifeline 3660 -version 629'

[15:47:24] 
[15:47:24] *------------------------------*
[15:47:24] Folding@Home Gromacs SMP Core
[15:47:24] Version 2.19 (Mar 12, 2010)
[15:47:24] 
[15:47:24] Preparing to commence simulation
[15:47:24] - Looking at optimizations...
[15:47:24] - Created dyn
[15:47:24] - Files status OK
[15:47:26] - Expanded 979419 -> 10427873 (decompressed 1064.6 percent)
[15:47:26] Called DecompressByteArray: compressed_data_size=979419 data_size=10427873, decompressed_data_size=10427873 diff=0
[15:47:26] - Digital signature verified
[15:47:26] 
[15:47:26] Project: 6013 (Run 0, Clone 95, Gen 85)
[15:47:26] 
[15:47:26] Assembly optimizations on if available.
[15:47:26] Entering M.D.
[15:47:51] Completed 0 out of 250000 steps  (0%)
[16:58:55] Killing all core threads
[16:58:55] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown at user request.
[16:58:55] ***** Got a SIGTERM signal (2)
[16:58:55] Killing all core threads
[16:58:55] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown.




--- Opening Log file [June 16 16:58:58 UTC] 


# Windows SMP Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.29

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: F:\Folding@Home SMP2 Console 2
Executable: F:\Folding@Home SMP2 Console 2\fah6.exe
Arguments: -smp -verbosity 9 

[16:58:58] - Ask before connecting: No
[16:58:58] - User name: Bob8421 (Team 11314)
[16:58:58] - User ID: 2991D43E138E9B64
[16:58:58] - Machine ID: 6
[16:58:58] 
[16:58:59] Loaded queue successfully.
[16:58:59] 
[16:58:59] - Autosending finished units... [June 16 16:58:59 UTC]
[16:58:59] + Processing work unit
[16:58:59] Trying to send all finished work units
[16:58:59] Core required: FahCore_a3.exe
[16:58:59] + No unsent completed units remaining.
[16:58:59] - Autosend completed
[16:58:59] Core found.
[16:58:59] Working on queue slot 04 [June 16 16:58:59 UTC]
[16:58:59] + Working ...
[16:58:59] - Calling '.\FahCore_a3.exe -dir work/ -nice 19 -suffix 04 -np 4 -checkpoint 10 -verbose -lifeline 708 -version 629'

[16:58:59] 
[16:58:59] *------------------------------*
[16:58:59] Folding@Home Gromacs SMP Core
[16:58:59] Version 2.19 (Mar 12, 2010)
[16:58:59] 
[16:58:59] Preparing to commence simulation
[16:58:59] - Ensuring status. Please wait.
[16:59:08] - Looking at optimizations...
[16:59:08] - Working with standard loops on this execution.
[16:59:08] - Previous termination of core was improper.
[16:59:08] - Files status OK
[16:59:10] - Expanded 979419 -> 10427873 (decompressed 1064.6 percent)
[16:59:10] Called DecompressByteArray: compressed_data_size=979419 data_size=10427873, decompressed_data_size=10427873 diff=0
[16:59:10] - Digital signature verified
[16:59:10] 
[16:59:10] Project: 6013 (Run 0, Clone 95, Gen 85)
[16:59:10] 
[16:59:10] Entering M.D.
[16:59:16] Using Gromacs checkpoints
[16:59:18] Resuming from checkpoint
[16:59:18] Verified work/wudata_04.log
[16:59:18] Verified work/wudata_04.trr
[16:59:18] Verified work/wudata_04.xtc
[16:59:18] Verified work/wudata_04.edr
[16:59:36] Completed 2338 out of 250000 steps  (0%)
[17:04:24] Completed 2500 out of 250000 steps  (1%)
[18:19:34] Completed 5000 out of 250000 steps  (2%)
[19:34:43] Completed 7500 out of 250000 steps  (3%)
[20:49:51] Completed 10000 out of 250000 steps  (4%)
[20:52:12] Killing all core threads
[20:52:12] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown at user request.
[20:52:12] ***** Got a SIGTERM signal (2)
[20:52:12] Killing all core threads
[20:52:12] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown.


--- Opening Log file [June 16 20:53:17 UTC] 


# Windows SMP Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.29

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: F:\Folding@Home SMP2 Console 2
Executable: F:\Folding@Home SMP2 Console 2\fah6.exe
Arguments: -smp -verbosity 9 

[20:53:17] - Ask before connecting: No
[20:53:17] - User name: Bob8421 (Team 11314)
[20:53:17] - User ID: 2991D43E138E9B64
[20:53:17] - Machine ID: 6
[20:53:17] 
[20:53:18] Loaded queue successfully.
[20:53:18] 
[20:53:18] - Autosending finished units... [June 16 20:53:18 UTC]
[20:53:18] + Processing work unit
[20:53:18] Trying to send all finished work units
[20:53:18] Core required: FahCore_a3.exe
[20:53:18] + No unsent completed units remaining.
[20:53:18] - Autosend completed
[20:53:18] Core found.
[20:53:18] Working on queue slot 04 [June 16 20:53:18 UTC]
[20:53:18] + Working ...
[20:53:18] - Calling '.\FahCore_a3.exe -dir work/ -nice 19 -suffix 04 -np 4 -checkpoint 10 -verbose -lifeline 316 -version 629'

[20:53:18] 
[20:53:18] *------------------------------*
[20:53:18] Folding@Home Gromacs SMP Core
[20:53:18] Version 2.19 (Mar 12, 2010)
[20:53:18] 
[20:53:18] Preparing to commence simulation
[20:53:18] - Ensuring status. Please wait.
[20:53:27] - Looking at optimizations...
[20:53:27] - Working with standard loops on this execution.
[20:53:27] - Previous termination of core was improper.
[20:53:27] - Going to use standard loops.
[20:53:27] - Files status OK
[20:53:29] - Expanded 979419 -> 10427873 (decompressed 1064.6 percent)
[20:53:29] Called DecompressByteArray: compressed_data_size=979419 data_size=10427873, decompressed_data_size=10427873 diff=0
[20:53:29] - Digital signature verified
[20:53:29] 
[20:53:29] Project: 6013 (Run 0, Clone 95, Gen 85)
[20:53:29] 
[20:53:29] Entering M.D.
[20:53:35] Using Gromacs checkpoints
[20:53:37] Resuming from checkpoint
[20:53:37] Verified work/wudata_04.log
[20:53:37] Verified work/wudata_04.trr
[20:53:37] Verified work/wudata_04.xtc
[20:53:37] Verified work/wudata_04.edr
[20:53:55] Completed 9998 out of 250000 steps  (3%)
[20:53:55] Completed 10000 out of 250000 steps  (4%)
[20:54:10] Killing all core threads
[20:54:10] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown at user request.
[20:54:10] ***** Got a SIGTERM signal (2)
[20:54:10] Killing all core threads
[20:54:10] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown.


--- Opening Log file [June 16 20:55:32 UTC] 


# Windows SMP Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.29

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: F:\Folding@Home SMP2 Console 2
Executable: fah6.exe
Arguments: -smp -verbosity 9 -forceasm -oneunit -smp -verbosity 9 

[20:55:32] - Ask before connecting: No
[20:55:32] - User name: Bob8421 (Team 11314)
[20:55:32] - User ID: 2991D43E138E9B64
[20:55:32] - Machine ID: 6
[20:55:32] 
[20:55:32] Loaded queue successfully.
[20:55:32] 
[20:55:32] - Autosending finished units... [June 16 20:55:32 UTC]
[20:55:32] + Processing work unit
[20:55:32] Trying to send all finished work units
[20:55:32] Core required: FahCore_a3.exe
[20:55:32] + No unsent completed units remaining.
[20:55:32] - Autosend completed
[20:55:32] Core found.
[20:55:32] Working on queue slot 04 [June 16 20:55:32 UTC]
[20:55:32] + Working ...
[20:55:32] - Calling '.\FahCore_a3.exe -dir work/ -nice 19 -suffix 04 -np 4 -checkpoint 10 -forceasm -verbose -lifeline 588 -version 629'

[20:55:32] 
[20:55:32] *------------------------------*
[20:55:32] Folding@Home Gromacs SMP Core
[20:55:32] Version 2.19 (Mar 12, 2010)
[20:55:32] 
[20:55:32] Preparing to commence simulation
[20:55:32] - Ensuring status. Please wait.
[20:55:42] - Assembly optimizations manually forced on.
[20:55:42] - Not checking prior termination.
[20:55:43] - Expanded 979419 -> 10427873 (decompressed 1064.6 percent)
[20:55:43] Called DecompressByteArray: compressed_data_size=979419 data_size=10427873, decompressed_data_size=10427873 diff=0
[20:55:43] - Digital signature verified
[20:55:43] 
[20:55:43] Project: 6013 (Run 0, Clone 95, Gen 85)
[20:55:43] 
[20:55:43] Assembly optimizations on if available.
[20:55:43] Entering M.D.
[20:55:49] Using Gromacs checkpoints
[20:55:52] Resuming from checkpoint
[20:55:52] Verified work/wudata_04.log
[20:55:52] Verified work/wudata_04.trr
[20:55:52] Verified work/wudata_04.xtc
[20:55:52] Verified work/wudata_04.edr
[20:56:09] Completed 9998 out of 250000 steps  (3%)
[20:56:09] Completed 10000 out of 250000 steps  (4%)
[22:11:13] Completed 12500 out of 250000 steps  (5%)
[22:11:26] Killing all core threads
[22:11:26] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown at user request.
[22:11:26] ***** Got a SIGTERM signal (2)
[22:11:26] Killing all core threads
[22:11:26] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown.

glussier
Posts: 9
Joined: Wed Nov 18, 2009 3:57 am

Re: Project 6013 (Run 0, Clone 95, Gen 85)

Post by glussier »

I had a few of those, my q9650@4ghz can barely make the deadline on these workunits. I decided to stop this machine until I can get something else. Usually, this computer can do 9.5k ppd with the 6013, but for the past few days, my 9650 will only get the base points.

Folding should be something we set an forget. If Stanford can't get 6 months without any problem, I think I'll do something else with my computers, I don't have time to keep babysitting these computers.
Image
hootis
Posts: 70
Joined: Fri Nov 27, 2009 3:34 am
Hardware configuration: AMD Phenom II 955 @ 3.8 +ATi HD 5870
AMD Athlon II 610e @ 2.5 + 2x Nvidia 560Ti @ 1ghz
AMD Athlon II 605e @ 2.4 +Nvidia 550Ti + Nvidia GT 240
AMD Athlon II 620 @ 3.3ghz

Re: Project 6013 (Run 0, Clone 95, Gen 85)

Post by hootis »

you have to Ctrl-c to close properly
Things yet to materialize.
Bob8421
Posts: 53
Joined: Tue Dec 22, 2009 5:16 pm

Re: Project 6013 (Run 0, Clone 95, Gen 85)

Post by Bob8421 »

hootis wrote:you have to Ctrl-c to close properly
I've never done that before, whether with the console client or the SMP client, and I never had a problem until now.

And isn't Ctrl-C the Copy command???
Bob8421
Posts: 53
Joined: Tue Dec 22, 2009 5:16 pm

Re: Project 6013 (Run 0, Clone 95, Gen 85)

Post by Bob8421 »

glussier wrote:I don't have time to keep babysitting these computers.
I kind of have the same feeling, but by choosing a beta client we are agreeing to accept a certain amount of babysitting.
PantherX
Site Moderator
Posts: 6986
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
§
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud
Contact:

Re: Project 6013 (Run 0, Clone 95, Gen 85)

Post by PantherX »

Bob8421 wrote:
hootis wrote:you have to Ctrl-c to close properly
I've never done that before, whether with the console client or the SMP client, and I never had a problem until now.

And isn't Ctrl-C the Copy command???
Not when you are using it on a Command Line Interface. I have also read somewhere in this forum that the message about improper shutdown is a cosmetic one and doesn't effect the SMP2 Clients. The advance features (-forceasm) are hardcoded in the Core itself thus there isn't any need to use it. You can also use the X to Close. I too have seen this messages in my FAHLog and my system runs F@H smoothly.
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time

Welcome To The F@H Support Forum Ӂ Troubleshooting Bad WUs Ӂ Troubleshooting Server Connectivity Issues
7im
Posts: 10179
Joined: Thu Nov 29, 2007 4:30 pm
Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
Location: Arizona
Contact:

Re: Project 6013 (Run 0, Clone 95, Gen 85)

Post by 7im »

PantherX wrote:...
Not when you are using it on a Command Line Interface. ... You can also use the X to Close...
You can use a lot of things to end the CLI client, but we only recommend the Ctrl+C as the correct way to close the command line client gracefully. X, ending the task, alt+F4, holding in the power button on the PC for 5 seconds, and a 12 gauge shotgun will all shut down the client too, just not as risk free as ctrl+c. ;) ctrl+c is the best answer.
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
stevehat1
Posts: 23
Joined: Fri Jun 06, 2008 12:33 pm

Re: Project 6013 (Run 0, Clone 95, Gen 85)

Post by stevehat1 »

I too have had issues with 6013's in the last two days, both units have caused an "A3 core exe shutdown" message to appear in Vista 32. This happens almost instantly as there is no progress shown in the console. The machine that this is happening on is probably a 99%+ effective machine and is a designated folder with 2 instances of GPU3 and 1 instance SMP2 running.

Both of these events were followed with 6701 WU's upon restarting the client and they finished just fine (kinda like adding insult to injury, 12 hours of down from bad WU's and then being rewarded with the crappiest A3's to date) :roll:
ImageImage
58Enfield
Posts: 26
Joined: Sun Dec 02, 2007 1:35 pm
Location: Cedar Wilds of North Central Arizona

Re: Project 6013 (Run 0, Clone 95, Gen 85)

Post by 58Enfield »

I received a P6013 R0 C95 G85 also, and it displays the same slow behavior as shown in the log.

Code: Select all

[06:12:55] + Number of Units Completed: 184

[06:12:56] Trying to send all finished work units
[06:12:56] + No unsent completed units remaining.
[06:12:56] - Preparing to get new work unit...
[06:12:56] Cleaning up work directory
[06:12:56] + Attempting to get work packet
[06:12:56] Passkey found
[06:12:56] - Will indicate memory of 1956 MB
[06:12:56] - Connecting to assignment server
[06:12:56] Connecting to http://assign.stanford.edu:8080/
[06:12:57] Posted data.
[06:12:57] Initial: ED82; - Successful: assigned to (130.237.232.140).
[06:12:57] + News From Folding@Home: Welcome to Folding@Home
[06:12:57] Loaded queue successfully.
[06:12:57] Connecting to http://130.237.232.140:8080/
[06:13:01] Posted data.
[06:13:01] Initial: 0000; - Receiving payload (expected size: 979931)
[06:13:04] - Downloaded at ~318 kB/s
[06:13:04] - Averaged speed for that direction ~636 kB/s
[06:13:04] + Received work.
[06:13:04] Trying to send all finished work units
[06:13:04] + No unsent completed units remaining.
[06:13:04] + Closed connections
[06:13:04]
[06:13:04] + Processing work unit
[06:13:04] Core required: FahCore_a3.exe
[06:13:04] Core found.
[06:13:04] Working on queue slot 00 [June 20 06:13:04 UTC]
[06:13:04] + Working ...
[06:13:04] - Calling './FahCore_a3.exe -dir work/ -nice 19 -suffix 00 -np 4 -checkpoint 15 -verbose -lifeline 2430 -version 629'

[06:13:04]
[06:13:04] *------------------------------*
[06:13:04] Folding@Home Gromacs SMP Core
[06:13:04] Version 2.22 (June 10, 2010)
[06:13:04]
[06:13:04] Preparing to commence simulation
[06:13:04] - Looking at optimizations...
[06:13:04] - Created dyn
[06:13:04] - Files status OK
[06:13:05] - Expanded 979419 -> 10427873 (decompressed 1064.6 percent)
[06:13:05] Called DecompressByteArray: compressed_data_size=979419 data_size=10427873, decompressed_data_size=10427873 diff=0
[06:13:05] - Digital signature verified
[06:13:05]
[06:13:05] Project: 6013 (Run 0, Clone 95, Gen 85)
[06:13:05]
[06:13:05] Assembly optimizations on if available.
[06:13:05] Entering M.D.
Starting 4 threads
NNODES=4, MYRANK=2, HOSTNAME=thread #2
NNODES=4, MYRANK=3, HOSTNAME=thread #3
NNODES=4, MYRANK=1, HOSTNAME=thread #1
NNODES=4, MYRANK=0, HOSTNAME=thread #0
Reading file work/wudata_00.tpr, VERSION 4.0.99_development_20090605 (single precision)
Note: tpx file_version 68, software version 70
Making 1D domain decomposition 1 x 1 x 4
starting mdrun 'IBX in water'
21500002 steps,  43000.0 ps (continuing from step 21250002,  42500.0 ps).
[06:13:23] Completed 0 out of 250000 steps  (0%)
[07:04:27] Completed 2500 out of 250000 steps  (1%)
[07:55:30] Completed 5000 out of 250000 steps  (2%)
[08:46:34] Completed 7500 out of 250000 steps  (3%)
[09:37:37] Completed 10000 out of 250000 steps  (4%)
[10:28:41] Completed 12500 out of 250000 steps  (5%)
[11:19:44] Completed 15000 out of 250000 steps  (6%)
[11:54:23] - Autosending finished units... [June 20 11:54:23 UTC]
[11:54:23] Trying to send all finished work units
[11:54:23] + No unsent completed units remaining.
[11:54:23] - Autosend completed
[12:10:48] Completed 17500 out of 250000 steps  (7%)
51:03 tpf is not normal for that machine as QD shows.....

Code: Select all

Index 7: finished 921.00 pts (47.591 pt/hr, 1141.83 ppd) 7.44 X min speed
   bonus pts: 5086.26 (262.743 pt/hr, 6305.83 ppd); bonus factor: 5.52; kfactor: 4.10
   server: 171.64.65.56:8080; project: 6701
   Folding: run 68, clone 26, generation 3; benchmark 0; misc: 500, 200, 12 (le)
   issue: Fri Jun 18 10:26:41 2010; begin: Fri Jun 18 10:27:02 2010
   end: Sat Jun 19 05:48:11 2010; due: Thu Jun 24 10:27:02 2010 (6 days)
   preferred: Mon Jun 21 15:15:02 2010 (3 days)
   user: 58Enfield; team: 131; ID: XXXXXXXXXXXXXXXX; mach ID: 1

(switched to new version Fahcore_a3.exe)

 Index 8: finished 470.00 pts (49.703 pt/hr, 1191.93 ppd) 15.2 X min speed
   bonus pts: 2592.78 (273.974 pt/hr, 6575.37 ppd); bonus factor: 5.52; kfactor: 2.00
   server: 130.237.232.140:8080; project: 6012
   Folding: run 1, clone 302, generation 75; benchmark 0; misc: 500, 600, 12 (le)
   issue: Sat Jun 19 05:55:50 2010; begin: Sat Jun 19 05:56:17 2010
   end: Sat Jun 19 15:23:39 2010; due: Fri Jun 25 05:56:17 2010 (6 days)
   preferred: Tue Jun 22 05:56:17 2010 (3 days)
   user: 58Enfield; team: 131; ID: XXXXXXXXXXXXXXXX; mach ID: 1

 Index 9: finished 380.00 pts (49.520 pt/hr, 1187.03 ppd) 9.38 X min speed
   bonus pts: 2132.32 (277.535 pt/hr, 6660.85 ppd); bonus factor: 5.61; kfactor: 3.36
   server: 130.237.232.140:8080; project: 6013
   Folding: run 0, clone 84, generation 184; benchmark 0; misc: 500, 600, 12 (le)
   issue: Sat Jun 19 15:27:06 2010; begin: Sat Jun 19 15:27:40 2010
   end: Sat Jun 19 23:08:05 2010; due: Tue Jun 22 15:27:40 2010 (3 days)
   preferred: Tue Jun 22 15:27:40 2010 (3 days)
   user: 58Enfield; team: 131; ID: XXXXXXXXXXXXXXXXX; mach ID: 1

 Index 0: folding now 380.00 pts (4.461 pt/hr, 107.06 ppd) 0.845 X min speed; 7% complete
   server: 130.237.232.140:8080; project: 6013
   Folding: run 0, clone 95, generation 85; benchmark 0; misc: 500, 600, 12 (le)
   issue: Sat Jun 19 23:12:31 2010; begin: Sat Jun 19 23:13:04 2010
   expect: Wed Jun 23 12:23:32 2010; due: Tue Jun 22 23:13:04 2010 (3 days)
   preferred: Tue Jun 22 23:13:04 2010 (3 days)
   user: 58Enfield; team: 131; ID: XXXXXXXXXXXXXXXXX; mach ID: 1

Average download rate 652.035 KB/s (u=4); upload rate 85.111 KB/s (u=4)
Performance fraction 0.904590 (u=4)
Average pph: 45.692, ppd: 1096.60, ppw: 7676.2, ppy: 400523
Average bonus pph: 261.897, ppd: 6285.53, ppw: 43998.7, ppy: 2295726
Average alternate pph: 30.159, ppd: 723.82, ppw: 5066.7, ppy: 264367
Average alternate bonus pph: 261.897, ppd: 6285.53, ppw: 43998.7, ppy: 2295726
Given the information on the other 6013 thread about getting the same defective work unit back over and over, I have already renamed the folding directory and setup a new folder on that machine.

The only wrinkles are that I did upgrade to the new core version three work units back, and all work units have been running faster except this one (other machines included). This machine was getting marginal on heat under the old core (59-61C @ 30C ambient)...and went to 63-64C under the new core. No complaints...it is also working harder as the QD log shows. It is going to be offline today while I re-validate it at a lower overclock (and hopefully lower heat).

Old specs: 3.4 gh Q6600 dedicated Kubuntu 8.04.4 2.6.24-28 generic
bollix47
Posts: 2958
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: Project: 6013 (Run 0, Clone 95, Gen 85)

Post by bollix47 »

Apparently this one is still being assigned.

Code: Select all

[17:10:23] - Calling './FahCore_a3.exe -dir work/ -nice 19 -suffix 04 -np 8 -checkpoint 30 -verbose -lifeline 3975 -version 629'

[17:10:24] 
[17:10:24] *------------------------------*
[17:10:24] Folding@Home Gromacs SMP Core
[17:10:24] Version 2.22 (June 10, 2010)
[17:10:24] 
[17:10:24] Preparing to commence simulation
[17:10:24] - Looking at optimizations...
[17:10:24] - Created dyn
[17:10:24] - Files status OK
[17:10:24] - Expanded 979419 -> 10427873 (decompressed 1064.6 percent)
[17:10:24] Called DecompressByteArray: compressed_data_size=979419 data_size=10427873, decompressed_data_size=10427873 diff=0
[17:10:24] - Digital signature verified
[17:10:24] 
[17:10:24] Project: 6013 (Run 0, Clone 95, Gen 85)
[17:10:24] 
[17:10:24] Assembly optimizations on if available.
[17:10:24] Entering M.D.
Note: tpx file_version 68, software version 70
Making 3D domain decomposition 2 x 2 x 2
starting mdrun 'IBX in water'
21500002 steps,  43000.0 ps (continuing from step 21250002,  42500.0 ps).
[17:10:51] Completed 0 out of 250000 steps  (0%)
tMPI error: Invalid buffer (null pointer in send or receive buffer) (in valid comm)
tMPI error: Invalid buffer (null pointer in send or receive buffer) (in valid comm)
tMPI error: Invalid buffer (null pointer in send or receive buffer) (in valid comm)
Aborted
[17:40:55] CoreStatus = 86 (134)
[17:40:55] Client-core communications error: ERROR 0x86
[17:40:55] Deleting current work unit & continuing...
[17:41:05] Trying to send all finished work units
[17:41:05] + No unsent completed units remaining.
[17:41:05] - Preparing to get new work unit...
[17:41:05] Cleaning up work directory
[17:41:05] + Attempting to get work packet
Image
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Project 6013 (Run 0, Clone 95, Gen 85)

Post by bruce »

p6013 has been suspended.
Post Reply