Page 1 of 1

Project: 6880 (Run 451, Clone 13, Gen 34)

Posted: Wed Feb 23, 2011 9:53 pm
by topodisc
It seems stuck after 49% on the same error. See log below.

I deleted the queue, work directory, and unitinfo file and restarted but it gets stuck at the same point. This is a new machine, stock settings. It's serving as the NAS for the home network but it has plenty of spare cycles to run one instance of F@H.

Code: Select all



--- Opening Log file [February 23 15:56:13] 


# Linux Console Edition #######################################################
###############################################################################

                       Folding@Home Client Version 6.02

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /c/home/drago/fah/nas1
Executable: ./fah6
Arguments: -verbosity 9 

[15:56:13] - Ask before connecting: No
[15:56:13] - User name: topodisc (Team 103062)
[15:56:13] - User ID: <hidden>
[15:56:13] - Machine ID: 5
[15:56:13] 
[15:56:13] Loaded queue successfully.
[15:56:13] - Autosending finished units...
[15:56:13] Trying to send all finished work units
[15:56:13] + No unsent completed units remaining.
[15:56:13] - Autosend completed
[15:56:13] 
[15:56:13] + Processing work unit
[15:56:13] Core required: FahCore_78.exe
[15:56:13] Core found.
[15:56:14] Working on Unit 04 [February 23 15:56:14]
[15:56:14] + Working ...
[15:56:14] - Calling './FahCore_78.exe -dir work/ -suffix 04 -checkpoint 10 -verbose -lifeline 21314 -version 602'

[15:56:14] 
[15:56:14] *------------------------------*
[15:56:14] Folding@Home Gromacs Core
[15:56:14] Version 1.90 (March 8, 2006)
[15:56:14] 
[15:56:14] Preparing to commence simulation
[15:56:14] - Ensuring status. Please wait.
[15:56:31] - Looking at optimizations...
[15:56:31] - Working with standard loops on this execution.
[15:56:31] - Previous termination of core was improper.
[15:56:31] - Files status OK
[15:56:31] - Expanded 375782 -> 1806556 (decompressed 480.7 percent)
[15:56:31] 
[15:56:31] Project: 6880 (Run 451, Clone 13, Gen 34)
[15:56:31] 
[15:56:31] Entering M.D.
[15:56:52] (Starting from checkpoint)
[15:56:52] Protein: ALZHEIMER DISEASE AMYLOID
[15:56:52] 
[15:56:52] Writing local files
[15:57:00] Completed 122500 out of 250000 steps  (49%)
[15:59:58] ***** Got an Activate signal (2)
[15:59:58] Killing all core threads

Folding@Home Client Shutdown.


--- Opening Log file [February 23 16:01:00] 


# Linux Console Edition #######################################################
###############################################################################

                       Folding@Home Client Version 6.02

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /c/home/drago/fah/nas1
Executable: ./fah6
Arguments: -verbosity 9 

[16:01:00] - Ask before connecting: No
[16:01:00] - User name: topodisc (Team 103062)
[16:01:00] - User ID: <hidden>
[16:01:00] - Machine ID: 5
[16:01:00] 
[16:01:00] Work directory not found. Creating...
[16:01:00] Could not open work queue, generating new queue...
[16:01:00] - Preparing to get new work unit...
[16:01:00] + Attempting to get work packet
[16:01:00] - Will indicate memory of 1000 MB
[16:01:00] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 15, Stepping: 13
[16:01:00] - Connecting to assignment server
[16:01:00] Connecting to http://assign.stanford.edu:8080/
[16:01:00] - Autosending finished units...
[16:01:00] Trying to send all finished work units
[16:01:00] + No unsent completed units remaining.
[16:01:00] - Autosend completed
[16:01:00] Posted data.
[16:01:00] Initial: 43AB; - Successful: assigned to (171.67.108.33).
[16:01:00] + News From Folding@Home: Welcome to Folding@Home
[16:01:00] Loaded queue successfully.
[16:01:00] Connecting to http://171.67.108.33:8080/
[16:01:01] Posted data.
[16:01:01] Initial: 0000; - Receiving payload (expected size: 376294)
[16:01:02] - Downloaded at ~367 kB/s
[16:01:02] - Averaged speed for that direction ~367 kB/s
[16:01:02] + Received work.
[16:01:02] + Closed connections
[16:01:02] 
[16:01:02] + Processing work unit
[16:01:02] Core required: FahCore_78.exe
[16:01:02] Core not found.
[16:01:02] - Core is not present or corrupted.
[16:01:02] - Attempting to download new core...
[16:01:02] + Downloading new core: FahCore_78.exe
[16:01:02] Downloading core (/~pande/Linux/x86/Core_78.fah from http://www.stanford.edu)
[16:01:02] Initial: AFDE; + 10240 bytes downloaded
[16:01:02] Initial: CC14; + 20480 bytes downloaded
[16:01:02] Initial: BEE8; + 30720 bytes downloaded
[16:01:02] Initial: DAAF; + 40960 bytes downloaded
[16:01:02] Initial: 1C7A; + 51200 bytes downloaded
[16:01:02] Initial: 5758; + 61440 bytes downloaded
[16:01:02] Initial: 4CCD; + 71680 bytes downloaded
[16:01:02] Initial: 15EF; + 81920 bytes downloaded
[16:01:02] Initial: 48E8; + 92160 bytes downloaded
[16:01:02] Initial: D320; + 102400 bytes downloaded
[16:01:02] Initial: 82DB; + 112640 bytes downloaded
[16:01:02] Initial: 4576; + 122880 bytes downloaded
[16:01:02] Initial: FB62; + 133120 bytes downloaded
[16:01:02] Initial: 71CD; + 143360 bytes downloaded
[16:01:02] Initial: F63A; + 153600 bytes downloaded
[16:01:02] Initial: 0B66; + 163840 bytes downloaded
[16:01:02] Initial: C516; + 174080 bytes downloaded
[16:01:02] Initial: 3E7D; + 184320 bytes downloaded
[16:01:02] Initial: D29C; + 194560 bytes downloaded
[16:01:02] Initial: E3AD; + 204800 bytes downloaded
[16:01:02] Initial: ACFA; + 215040 bytes downloaded
[16:01:02] Initial: 348C; + 225280 bytes downloaded
[16:01:02] Initial: F2B6; + 235520 bytes downloaded
[16:01:02] Initial: CC9E; + 245760 bytes downloaded
[16:01:02] Initial: 1231; + 256000 bytes downloaded
[16:01:02] Initial: 9693; + 266240 bytes downloaded
[16:01:02] Initial: 4073; + 276480 bytes downloaded
[16:01:02] Initial: 616B; + 286720 bytes downloaded
[16:01:02] Initial: 5E96; + 296960 bytes downloaded
[16:01:02] Initial: 4430; + 307200 bytes downloaded
[16:01:02] Initial: B959; + 317440 bytes downloaded
[16:01:02] Initial: 48AC; + 327680 bytes downloaded
[16:01:02] Initial: 7846; + 337920 bytes downloaded
[16:01:02] Initial: 0B78; + 348160 bytes downloaded
[16:01:02] Initial: 653D; + 358400 bytes downloaded
[16:01:02] Initial: B0D6; + 368640 bytes downloaded
[16:01:02] Initial: 841B; + 378880 bytes downloaded
[16:01:02] Initial: 75AF; + 389120 bytes downloaded
[16:01:02] Initial: B47C; + 399360 bytes downloaded
[16:01:02] Initial: 4DC0; + 409600 bytes downloaded
[16:01:02] Initial: 8F7E; + 419840 bytes downloaded
[16:01:02] Initial: 9EF4; + 430080 bytes downloaded
[16:01:02] Initial: 0181; + 440320 bytes downloaded
[16:01:02] Initial: 503C; + 450560 bytes downloaded
[16:01:02] Initial: 2D30; + 460800 bytes downloaded
[16:01:02] Initial: 8867; + 471040 bytes downloaded
[16:01:02] Initial: CE43; + 481280 bytes downloaded
[16:01:02] Initial: 614C; + 491520 bytes downloaded
[16:01:02] Initial: 96F2; + 501760 bytes downloaded
[16:01:02] Initial: 252D; + 512000 bytes downloaded
[16:01:02] Initial: 97FE; + 522240 bytes downloaded
[16:01:02] Initial: 1024; + 532480 bytes downloaded
[16:01:02] Initial: 0666; + 542720 bytes downloaded
[16:01:02] Initial: 53CF; + 552960 bytes downloaded
[16:01:02] Initial: D31E; + 563200 bytes downloaded
[16:01:02] Initial: 1A46; + 573440 bytes downloaded
[16:01:02] Initial: B2C1; + 583680 bytes downloaded
[16:01:02] Initial: 17AF; + 593920 bytes downloaded
[16:01:02] Initial: BE0D; + 604160 bytes downloaded
[16:01:02] Initial: 79C2; + 614400 bytes downloaded
[16:01:02] Initial: 6B14; + 624640 bytes downloaded
[16:01:02] Initial: 1611; + 634880 bytes downloaded
[16:01:02] Initial: 4B64; + 645120 bytes downloaded
[16:01:02] Initial: E520; + 655360 bytes downloaded
[16:01:02] Initial: ADD2; + 665600 bytes downloaded
[16:01:02] Initial: 4218; + 675840 bytes downloaded
[16:01:02] Initial: 7E58; + 686080 bytes downloaded
[16:01:02] Initial: 913F; + 696320 bytes downloaded
[16:01:02] Initial: A369; + 706560 bytes downloaded
[16:01:02] Initial: 8E3A; + 716800 bytes downloaded
[16:01:02] Initial: D3A6; + 727040 bytes downloaded
[16:01:02] Initial: D3CB; + 737280 bytes downloaded
[16:01:02] Initial: 6736; + 747520 bytes downloaded
[16:01:02] Initial: 071F; + 757760 bytes downloaded
[16:01:02] Initial: AC46; + 768000 bytes downloaded
[16:01:02] Initial: 1B7F; + 778240 bytes downloaded
[16:01:02] Initial: 1E88; + 788480 bytes downloaded
[16:01:02] Initial: 5A90; + 798720 bytes downloaded
[16:01:02] Initial: 5F2E; + 808960 bytes downloaded
[16:01:02] Initial: AC86; + 819200 bytes downloaded
[16:01:02] Initial: 0E27; + 829440 bytes downloaded
[16:01:02] Initial: 9AFA; + 839680 bytes downloaded
[16:01:02] Initial: 5A8B; + 849920 bytes downloaded
[16:01:02] Initial: 9D8E; + 860160 bytes downloaded
[16:01:02] Initial: 63B7; + 870400 bytes downloaded
[16:01:02] Initial: 7E7F; + 880640 bytes downloaded
[16:01:02] Initial: CC68; + 890880 bytes downloaded
[16:01:02] Initial: 0C12; + 901120 bytes downloaded
[16:01:02] Initial: EA6C; + 911360 bytes downloaded
[16:01:02] Initial: 07EE; + 921600 bytes downloaded
[16:01:02] Initial: 45B7; + 931840 bytes downloaded
[16:01:02] Initial: F8C7; + 942080 bytes downloaded
[16:01:02] Initial: DEE6; + 952320 bytes downloaded
[16:01:02] Initial: C4DF; + 962560 bytes downloaded
[16:01:02] Initial: 5CEC; + 972800 bytes downloaded
[16:01:02] Initial: C871; + 983040 bytes downloaded
[16:01:02] Initial: F427; + 993280 bytes downloaded
[16:01:02] Initial: F6DF; + 1003520 bytes downloaded
[16:01:02] Initial: 19B3; + 1013760 bytes downloaded
[16:01:03] Initial: 1DE1; + 1024000 bytes downloaded
[16:01:03] Initial: F17C; + 1034240 bytes downloaded
[16:01:03] Initial: A200; + 1044480 bytes downloaded
[16:01:03] Initial: 93DE; + 1054720 bytes downloaded
[16:01:03] Initial: 5E7D; + 1064960 bytes downloaded
[16:01:03] Initial: F350; + 1075200 bytes downloaded
[16:01:03] Initial: C54F; + 1085440 bytes downloaded
[16:01:03] Initial: 4D25; + 1095680 bytes downloaded
[16:01:03] Initial: 1289; + 1105920 bytes downloaded
[16:01:03] Initial: B74E; + 1116160 bytes downloaded
[16:01:03] Initial: EF43; + 1126400 bytes downloaded
[16:01:03] Initial: 6B45; + 1134407 bytes downloaded
[16:01:03] Verifying core Core_78.fah...
[16:01:03] Signature is VALID
[16:01:03] 
[16:01:03] Trying to unzip core FahCore_78.exe
[16:01:03] Decompressed FahCore_78.exe (3435296 bytes) successfully
[16:01:03] + Core successfully engaged
[16:01:20] 
[16:01:20] + Processing work unit
[16:01:20] Core required: FahCore_78.exe
[16:01:20] Core found.
[16:01:20] Working on Unit 01 [February 23 16:01:20]
[16:01:20] + Working ...
[16:01:20] - Calling './FahCore_78.exe -dir work/ -suffix 01 -checkpoint 10 -verbose -lifeline 21411 -version 602'

[16:01:20] 
[16:01:20] *------------------------------*
[16:01:20] Folding@Home Gromacs Core
[16:01:20] Version 1.90 (March 8, 2006)
[16:01:20] 
[16:01:20] Preparing to commence simulation
[16:01:20] - Looking at optimizations...
[16:01:20] - Created dyn
[16:01:20] - Files status OK
[16:01:20] - Expanded 375782 -> 1806556 (decompressed 480.7 percent)
[16:01:20] - Starting from initial work packet
[16:01:20] 
[16:01:20] Project: 6880 (Run 451, Clone 13, Gen 34)
[16:01:20] 
[16:01:20] Assembly optimizations on if available.
[16:01:20] Entering M.D.
[16:01:26] Protein: ALZHEIMER DISEASE AMYLOID
[16:01:26] 
[16:01:26] Writing local files
[16:01:35] Extra SSE boost OK.
[16:01:35] Writing local files
[16:01:35] Completed 0 out of 250000 steps  (0%)
[16:07:35] Writing local files
[16:07:35] Completed 2500 out of 250000 steps  (1%)
[16:13:35] Writing local files
[16:13:35] Completed 5000 out of 250000 steps  (2%)
[16:19:34] Writing local files
[16:19:34] Completed 7500 out of 250000 steps  (3%)
[16:25:33] Writing local files
[16:25:33] Completed 10000 out of 250000 steps  (4%)
[16:31:33] Writing local files
[16:31:33] Completed 12500 out of 250000 steps  (5%)
[16:37:33] Writing local files
[16:37:33] Completed 15000 out of 250000 steps  (6%)
[16:43:36] Writing local files
[16:43:36] Completed 17500 out of 250000 steps  (7%)
[16:49:35] Writing local files
[16:49:35] Completed 20000 out of 250000 steps  (8%)
[16:55:34] Writing local files
[16:55:34] Completed 22500 out of 250000 steps  (9%)
[17:01:33] Writing local files
[17:01:33] Completed 25000 out of 250000 steps  (10%)
[17:07:32] Writing local files
[17:07:32] Completed 27500 out of 250000 steps  (11%)
[17:13:31] Writing local files
[17:13:31] Completed 30000 out of 250000 steps  (12%)
[17:19:30] Writing local files
[17:19:30] Completed 32500 out of 250000 steps  (13%)
[17:25:29] Writing local files
[17:25:29] Completed 35000 out of 250000 steps  (14%)
[17:31:27] Writing local files
[17:31:27] Completed 37500 out of 250000 steps  (15%)
[17:37:26] Writing local files
[17:37:26] Completed 40000 out of 250000 steps  (16%)
[17:43:29] Writing local files
[17:43:29] Completed 42500 out of 250000 steps  (17%)
[17:49:29] Writing local files
[17:49:29] Completed 45000 out of 250000 steps  (18%)
[17:55:30] Writing local files
[17:55:30] Completed 47500 out of 250000 steps  (19%)
[18:01:29] Writing local files
[18:01:29] Completed 50000 out of 250000 steps  (20%)
[18:07:28] Writing local files
[18:07:28] Completed 52500 out of 250000 steps  (21%)
[18:13:27] Writing local files
[18:13:27] Completed 55000 out of 250000 steps  (22%)
[18:19:27] Writing local files
[18:19:27] Completed 57500 out of 250000 steps  (23%)
[18:25:26] Writing local files
[18:25:26] Completed 60000 out of 250000 steps  (24%)
[18:31:25] Writing local files
[18:31:25] Completed 62500 out of 250000 steps  (25%)
[18:37:24] Writing local files
[18:37:24] Completed 65000 out of 250000 steps  (26%)
[18:43:27] Writing local files
[18:43:27] Completed 67500 out of 250000 steps  (27%)
[18:49:25] Writing local files
[18:49:25] Completed 70000 out of 250000 steps  (28%)
[18:55:25] Writing local files
[18:55:25] Completed 72500 out of 250000 steps  (29%)
[19:01:25] Writing local files
[19:01:25] Completed 75000 out of 250000 steps  (30%)
[19:07:23] Writing local files
[19:07:23] Completed 77500 out of 250000 steps  (31%)
[19:13:22] Writing local files
[19:13:22] Completed 80000 out of 250000 steps  (32%)
[19:19:21] Writing local files
[19:19:21] Completed 82500 out of 250000 steps  (33%)
[19:25:19] Writing local files
[19:25:19] Completed 85000 out of 250000 steps  (34%)
[19:31:19] Writing local files
[19:31:20] Completed 87500 out of 250000 steps  (35%)
[19:37:19] Writing local files
[19:37:19] Completed 90000 out of 250000 steps  (36%)
[19:43:22] Writing local files
[19:43:22] Completed 92500 out of 250000 steps  (37%)
[19:49:21] Writing local files
[19:49:21] Completed 95000 out of 250000 steps  (38%)
[19:55:21] Writing local files
[19:55:21] Completed 97500 out of 250000 steps  (39%)
[20:01:21] Writing local files
[20:01:21] Completed 100000 out of 250000 steps  (40%)
[20:07:20] Writing local files
[20:07:20] Completed 102500 out of 250000 steps  (41%)
[20:13:20] Writing local files
[20:13:20] Completed 105000 out of 250000 steps  (42%)
[20:19:20] Writing local files
[20:19:20] Completed 107500 out of 250000 steps  (43%)
[20:25:19] Writing local files
[20:25:20] Completed 110000 out of 250000 steps  (44%)
[20:31:19] Writing local files
[20:31:19] Completed 112500 out of 250000 steps  (45%)
[20:37:20] Writing local files
[20:37:20] Completed 115000 out of 250000 steps  (46%)
[20:43:23] Writing local files
[20:43:23] Completed 117500 out of 250000 steps  (47%)
[20:49:23] Writing local files
[20:49:23] Completed 120000 out of 250000 steps  (48%)
[20:55:23] Writing local files
[20:55:23] Completed 122500 out of 250000 steps  (49%)
[20:58:53] CoreStatus = 0 (0)
[20:58:53] Client-core communications error: ERROR 0x0
[20:58:53] Deleting current work unit & continuing...
[20:59:49] Trying to send all finished work units
[20:59:49] + No unsent completed units remaining.
[20:59:49] - Preparing to get new work unit...
[20:59:49] + Attempting to get work packet
[20:59:49] - Will indicate memory of 1000 MB
[20:59:49] - Connecting to assignment server
[20:59:49] Connecting to http://assign.stanford.edu:8080/
[20:59:49] Posted data.
[20:59:49] Initial: 43AB; - Successful: assigned to (171.67.108.33).
[20:59:49] + News From Folding@Home: Welcome to Folding@Home
[20:59:49] Loaded queue successfully.
[20:59:49] Connecting to http://171.67.108.33:8080/
[20:59:51] Posted data.
[20:59:51] Initial: 0000; - Receiving payload (expected size: 376294)
[20:59:51] Conversation time very short, giving reduced weight in bandwidth avg
[20:59:51] - Downloaded at ~734 kB/s
[20:59:51] - Averaged speed for that direction ~489 kB/s
[20:59:51] + Received work.
[20:59:51] + Closed connections
[20:59:56] 
[20:59:56] + Processing work unit
[20:59:56] Core required: FahCore_78.exe
[20:59:56] Core found.
[20:59:56] Working on Unit 02 [February 23 20:59:56]
[20:59:56] + Working ...
[20:59:56] - Calling './FahCore_78.exe -dir work/ -suffix 02 -checkpoint 10 -verbose -lifeline 21411 -version 602'

[20:59:56] 
[20:59:56] *------------------------------*
[20:59:56] Folding@Home Gromacs Core
[20:59:56] Version 1.90 (March 8, 2006)
[20:59:56] 
[20:59:56] Preparing to commence simulation
[20:59:56] - Looking at optimizations...
[20:59:56] - Created dyn
[20:59:56] - Files status OK
[20:59:57] - Expanded 375782 -> 1806556 (decompressed 480.7 percent)
[20:59:57] - Starting from initial work packet
[20:59:57] 
[20:59:57] Project: 6880 (Run 451, Clone 13, Gen 34)
[20:59:57] 
[20:59:57] Assembly optimizations on if available.
[20:59:57] Entering M.D.
[21:00:03] Protein: ALZHEIMER DISEASE AMYLOID
[21:00:03] 
[21:00:03] Writing local files
[21:00:12] Extra SSE boost OK.
[21:00:12] Writing local files
[21:00:12] Completed 0 out of 250000 steps  (0%)
[21:06:11] Writing local files
[21:06:11] Completed 2500 out of 250000 steps  (1%)
[21:12:10] Writing local files
[21:12:10] Completed 5000 out of 250000 steps  (2%)
[21:18:09] Writing local files
[21:18:09] Completed 7500 out of 250000 steps  (3%)
[21:24:08] Writing local files
[21:24:08] Completed 10000 out of 250000 steps  (4%)
[21:30:07] Writing local files
[21:30:07] Completed 12500 out of 250000 steps  (5%)
[21:36:05] Writing local files
[21:36:05] Completed 15000 out of 250000 steps  (6%)
[21:42:06] Writing local files
[21:42:06] Completed 17500 out of 250000 steps  (7%)
[21:48:05] Writing local files
[21:48:05] Completed 20000 out of 250000 steps  (8%)

Re: Project: 6880 (Run 451, Clone 13, Gen 34)

Posted: Wed Feb 23, 2011 9:57 pm
by topodisc
I should also add that this is a dual core CPU running the classic client (not SMP). There is only 1GB of memory on this machine (cannot change this). It's my understanding that the classic client does not have excessive memory requirements.

Re: Project: 6880 (Run 451, Clone 13, Gen 34)

Posted: Wed Feb 23, 2011 11:47 pm
by 7im
A work unit that dies in the same place twice is probably bad.

Excessive is relative. And using the BigWU work unit size setting shifts that relativity.

1 GB should be fine, even with 2 CPU clients running BigWUs. 512 MB probably not. WinXP alone doesn't even run that well with only 512.

Re: Project: 6880 (Run 451, Clone 13, Gen 34)

Posted: Thu Feb 24, 2011 5:34 am
by bruce
7im wrote:A work unit that dies in the same place twice is probably bad.

Excessive is relative. . . .
@ topodisc:

I can't tell yet whether it's a bad WU or some limitation in your system.
You said it was a Linux device running NAS and you could not change the 1GB RAM, so I'm envisioning a box with some definite limitations. How big is the swap file?

Re: Project: 6880 (Run 451, Clone 13, Gen 34)

Posted: Thu Feb 24, 2011 7:39 am
by topodisc
2GB of swap.

Code: Select all

nas1:~# free -t
             total       used       free     shared    buffers     cached
Mem:       1022640     927924      94716          0      66096     410240
-/+ buffers/cache:     451588     571052
Swap:      2096888      10004    2086884
Total:     3119528     937928    2181600

Re: Project: 6880 (Run 451, Clone 13, Gen 34)

Posted: Thu Feb 24, 2011 8:54 pm
by topodisc
I deleted the project from my queue and changed my machine id. I got assigned a new/different WU and it was successful.

Re: Project: 6880 (Run 451, Clone 13, Gen 34)

Posted: Mon Mar 14, 2011 9:39 pm
by PantherX
We finally have some data in the WU Database:
Your WU (P6880 R451 C13 G34) was added to the stats database on 2011-02-19 08:08:15 for 0 points of credit.
Looks like a bad WU but we still need to wait.

Re: Project: 6880 (Run 451, Clone 13, Gen 34)

Posted: Sun Mar 27, 2011 11:16 pm
by PantherX
It was finally completed:
Your WU (P6880 R451 C13 G34) was added to the stats database on 2011-03-27 07:06:01 for 69 points of credit.