Project: 2665 (Run 0, Clone 973, Gen 62)

Moderators: Site Moderators, FAHC Science Team

Post Reply
jbiel
Posts: 3
Joined: Thu Nov 06, 2008 11:49 pm
Hardware configuration: K8N NEO4 Plat. SLI (7100-020)
Socket 939 AMD Opteron 170
2 nVidia 7600GS 256MB (DVR 97.92)
PCP&C 750 Silencer 60 12v Amps
2 Seagate Barracuda ST380817AS 80gig SATA HD Raid
300 gig Wd IDE HD (WD3000JB)
Win XP Pro w/SP2
DVD+/-ROM NEC ND3550A
DVDROM ASUS E616AG
2G OCZ EB PC-4000 /500MHz

Project: 2665 (Run 0, Clone 973, Gen 62)

Post by jbiel »

Long time folder, first post.

I did try to search this out to no avail. Hopefully someone can help me figure out my issue.

Seems that that WU does not like me. On severlal occasions over the past few days the WU encounters an error @ about 5%.
I hve deleted the "Work Folder", The core, the queue.dat and the unitinfo files.
Restart the F@H with the 'smp' and 'verbosity 9' flags as instructed. I'll dl the a new core and of course the same WU and stall @ 5% again.

The specs for the rig are listed and there is no OC or overheat. The core has worked just fine for quite awhile until now. The log is pasted below.

Thanks in advance.

Code: Select all

Arguments: -smp -verbosity 9 

[19:49:14] - Ask before connecting: No
[19:49:14] - User name: jbiel (Team 37766)
[19:49:14] - User ID: 37C15045171A453A
[19:49:14] - Machine ID: 1
[19:49:14] 
[19:49:14] Work directory not found. Creating...
[19:49:14] Could not open work queue, generating new queue...
[19:49:14] - Preparing to get new work unit...
[19:49:14] + Attempting to get work packet
[19:49:14] - Autosending finished units... [November 6 19:49:14 UTC]
[19:49:14] Trying to send all finished work units
[19:49:14] - Will indicate memory of 2047 MB
[19:49:14] + No unsent completed units remaining.
[19:49:14] - Autosend completed
[19:49:14] - Detect CPU. Vendor: AuthenticAMD, Family: 15, Model: 3, Stepping: 2
[19:49:14] - Connecting to assignment server
[19:49:14] Connecting to http://assign.stanford.edu:8080/
[19:49:15] Posted data.
[19:49:15] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[19:49:15] + News From Folding@Home: Welcome to Folding@Home
[19:49:15] Loaded queue successfully.
[19:49:15] Connecting to http://171.64.65.64:8080/
[19:49:20] Posted data.
[19:49:20] Initial: 0000; - Receiving payload (expected size: 4738136)
[19:49:30] - Downloaded at ~462 kB/s
[19:49:30] - Averaged speed for that direction ~462 kB/s
[19:49:30] + Received work.
[19:49:30] + Closed connections
[19:49:30] 
[19:49:30] + Processing work unit
[19:49:30] Work type a1 not eligible for variable processors
[19:49:30] Core required: FahCore_a1.exe
[19:49:30] Core not found.
[19:49:30] - Core is not present or corrupted.
[19:49:30] - Attempting to download new core...
[19:49:30] + Downloading new core: FahCore_a1.exe
[19:49:30] Downloading core (/~pande/Win32/x86/Core_a1.fah from www.stanford.edu)
[19:49:30] Initial: AFDE; + 10240 bytes downloaded
[19:49:31] Initial: AD21; + 20480 bytes downloaded
[19:49:31] Initial: CC38; + 30720 bytes downloaded
[19:49:31] Initial: 8501; + 40960 bytes downloaded
[19:49:31] Initial: F56A; + 51200 bytes downloaded
[19:49:31] Initial: ABAE; + 61440 bytes downloaded
[19:49:31] Initial: B6B0; + 71680 bytes downloaded
[19:49:31] Initial: 783A; + 81920 bytes downloaded
[19:49:31] Initial: B2A6; + 92160 bytes downloaded
[19:49:31] Initial: 1409; + 102400 bytes downloaded
[19:49:31] Initial: BBF0; + 112640 bytes downloaded
[19:49:31] Initial: 1861; + 122880 bytes downloaded
[19:49:31] Initial: 5950; + 133120 bytes downloaded
[19:49:31] Initial: 1081; + 143360 bytes downloaded
[19:49:31] Initial: 26BC; + 153600 bytes downloaded
[19:49:31] Initial: FE4A; + 163840 bytes downloaded
[19:49:31] Initial: C1C3; + 174080 bytes downloaded
[19:49:31] Initial: 9B49; + 184320 bytes downloaded
[19:49:31] Initial: 9EE5; + 194560 bytes downloaded
[19:49:31] Initial: D79D; + 204800 bytes downloaded
[19:49:31] Initial: 7801; + 215040 bytes downloaded
[19:49:31] Initial: 8B51; + 225280 bytes downloaded
[19:49:31] Initial: E26E; + 235520 bytes downloaded
[19:49:31] Initial: EDB0; + 245760 bytes downloaded
[19:49:31] Initial: 0919; + 256000 bytes downloaded
[19:49:31] Initial: CDDE; + 266240 bytes downloaded
[19:49:31] Initial: 7A7E; + 276480 bytes downloaded
[19:49:31] Initial: 034E; + 286720 bytes downloaded
[19:49:31] Initial: 88D0; + 296960 bytes downloaded
[19:49:31] Initial: D66D; + 307200 bytes downloaded
[19:49:31] Initial: 6A52; + 317440 bytes downloaded
[19:49:31] Initial: B478; + 327680 bytes downloaded
[19:49:31] Initial: CF8A; + 337920 bytes downloaded
[19:49:31] Initial: 8407; + 348160 bytes downloaded
[19:49:31] Initial: 2246; + 358400 bytes downloaded
[19:49:31] Initial: 1C69; + 368640 bytes downloaded
[19:49:31] Initial: 1287; + 378880 bytes downloaded
[19:49:31] Initial: 19B3; + 389120 bytes downloaded
[19:49:32] Initial: 1AD1; + 399360 bytes downloaded
[19:49:32] Initial: 5791; + 409600 bytes downloaded
[19:49:32] Initial: 76C5; + 419840 bytes downloaded
[19:49:32] Initial: 9B77; + 430080 bytes downloaded
[19:49:32] Initial: E82F; + 440320 bytes downloaded
[19:49:32] Initial: D0D3; + 450560 bytes downloaded
[19:49:32] Initial: 0F5E; + 460800 bytes downloaded
[19:49:32] Initial: D743; + 471040 bytes downloaded
[19:49:32] Initial: 0B7C; + 481280 bytes downloaded
[19:49:32] Initial: FAFD; + 491520 bytes downloaded
[19:49:32] Initial: 0E14; + 501760 bytes downloaded
[19:49:32] Initial: 4048; + 512000 bytes downloaded
[19:49:32] Initial: 21A5; + 522240 bytes downloaded
[19:49:32] Initial: C1A5; + 532480 bytes downloaded
[19:49:32] Initial: F716; + 542720 bytes downloaded
[19:49:32] Initial: DD98; + 552960 bytes downloaded
[19:49:32] Initial: 9F7B; + 563200 bytes downloaded
[19:49:32] Initial: 1CC0; + 573440 bytes downloaded
[19:49:32] Initial: 4D37; + 583680 bytes downloaded
[19:49:32] Initial: 222A; + 593920 bytes downloaded
[19:49:32] Initial: 8E33; + 604160 bytes downloaded
[19:49:32] Initial: D3C9; + 614400 bytes downloaded
[19:49:32] Initial: 9821; + 624640 bytes downloaded
[19:49:32] Initial: 236E; + 634880 bytes downloaded
[19:49:32] Initial: 1A7A; + 645120 bytes downloaded
[19:49:32] Initial: 6D64; + 655360 bytes downloaded
[19:49:32] Initial: 4ADC; + 665600 bytes downloaded
[19:49:32] Initial: 3854; + 675840 bytes downloaded
[19:49:32] Initial: CB5C; + 686080 bytes downloaded
[19:49:32] Initial: 2A88; + 696320 bytes downloaded
[19:49:32] Initial: 1199; + 706560 bytes downloaded
[19:49:32] Initial: 0512; + 716800 bytes downloaded
[19:49:32] Initial: 316E; + 727040 bytes downloaded
[19:49:32] Initial: D89D; + 737280 bytes downloaded
[19:49:32] Initial: E6A3; + 747520 bytes downloaded
[19:49:32] Initial: B488; + 757760 bytes downloaded
[19:49:32] Initial: BAFD; + 768000 bytes downloaded
[19:49:32] Initial: 34A0; + 778240 bytes downloaded
[19:49:32] Initial: DD6C; + 788480 bytes downloaded
[19:49:32] Initial: D2E9; + 789667 bytes downloaded
[19:49:32] Verifying core Core_a1.fah...
[19:49:32] Signature is VALID
[19:49:32] 
[19:49:32] Trying to unzip core FahCore_a1.exe
[19:49:32] Decompressed FahCore_a1.exe (2035712 bytes) successfully
[19:49:37] + Core successfully engaged
[19:49:42] 
[19:49:42] + Processing work unit
[19:49:42] Work type a1 not eligible for variable processors
[19:49:42] Core required: FahCore_a1.exe
[19:49:42] Core found.
[19:49:42] Using generic mpiexec calls
[19:49:42] Working on queue slot 01 [November 6 19:49:42 UTC]
[19:49:42] + Working ...
[19:49:42] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 01 -checkpoint 15 -verbose -lifeline 1460 -version 622'

[19:49:43] 
[19:49:43] *------------------------------*
[19:49:43] Folding@Home Gromacs SMP Core
[19:49:43] Version 1.74 (March 10, 2007)
[19:49:43] 
[19:49:43] Preparing to commence simulation
[19:49:43] - Ensuring status. Please wait.
[19:49:50] - Starting from initial work packet
[19:49:50] 
[19:49:50] Project: 2665 (Run 0, Clone 973, Gen 62)
[19:49:50] 
[19:49:51] Assembly optimizations on if available.
[19:49:51] Entering M.D.
[19:50:14]  percent)
[19:50:14] - Starting from initial work packet
[19:50:14] 
[19:50:14] Project: 2665 (Run 0, Clone 973, Gen 62)
[19:50:14] 
[19:50:18] Entering M.D.
[19:50:24] Rejecting checkpoint
[19:50:27] Protein: HGG in water
[19:50:27] Writing local files
[19:50:39] Extra SSE boost OK.
[19:50:40] Writing local files
[19:50:40] Completed 0 out of 250000 steps  (0 percent)
[20:05:41] Timered checkpoint triggered.
[20:20:41] Timered checkpoint triggered.
[20:33:35] Writing local files
[20:33:35] Completed 2500 out of 250000 steps  (1 percent)
[20:48:36] Timered checkpoint triggered.
[21:03:37] Timered checkpoint triggered.
[21:16:14] Writing local files
[21:16:14] Completed 5000 out of 250000 steps  (2 percent)
[21:31:16] Timered checkpoint triggered.
[21:46:18] Timered checkpoint triggered.
[21:58:55] Writing local files
[21:58:56] Completed 7500 out of 250000 steps  (3 percent)
[22:13:55] Timered checkpoint triggered.
[22:28:58] Timered checkpoint triggered.
[22:41:36] Writing local files
[22:41:36] Completed 10000 out of 250000 steps  (4 percent)
[22:56:37] Timered checkpoint triggered.
[23:11:38] Timered checkpoint triggered.
[23:24:21] Writing local files
[23:24:22] Completed 12500 out of 250000 steps  (5 percent)
[23:30:04] Warning:  long 1-4 interactions
[23:30:05] Gromacs cannot continue further.
[23:30:05] Going to send back what have done.
[23:30:05] logfile size: 18843
[23:30:05] - Writing 19379 bytes of core data to disk...
[23:30:05]   ... Done.
[23:30:05] - Failed to delete work/wudata_01.sas
[23:30:05] - Failed to delete work/wudata_01.goe
[23:30:05] Warning:  check for stray files
[23:32:05] 
[23:32:05] Folding@home Core Shutdown: EARLY_UNIT_END
[23:32:05] 
[23:32:05] Folding@home Core Shutdown: EARLY_UNIT_END
[23:32:09] CoreStatus = 7B (123)
[23:32:09] Client-core communications error: ERROR 0x7b
[23:32:09] This is a sign of more serious problems, shutting down.
[
anandhanju
Posts: 522
Joined: Mon Dec 03, 2007 4:33 am
Location: Australia

Re: Project: 2665 (Run 0, Clone 973, Gen 62)

Post by anandhanju »

Welcome to the forum, jbiel :)

Are you running the 6.23 client? It has better EUE handling and reports EUEs like these back to Stanford so that you don't get the same WU again. This probably is a faulty WU although I cannot be the judge for that.

The 6.23 client can be downloaded from viewtopic.php?f=46&t=6642
jbiel
Posts: 3
Joined: Thu Nov 06, 2008 11:49 pm
Hardware configuration: K8N NEO4 Plat. SLI (7100-020)
Socket 939 AMD Opteron 170
2 nVidia 7600GS 256MB (DVR 97.92)
PCP&C 750 Silencer 60 12v Amps
2 Seagate Barracuda ST380817AS 80gig SATA HD Raid
300 gig Wd IDE HD (WD3000JB)
Win XP Pro w/SP2
DVD+/-ROM NEC ND3550A
DVDROM ASUS E616AG
2G OCZ EB PC-4000 /500MHz

Re: Project: 2665 (Run 0, Clone 973, Gen 62)

Post by jbiel »

I dropped in the linked EXE. and got the same WU again. It crashed @ 5% agian, but F@H kept running and got me a different WU after sending the info back.
anandhanju
Posts: 522
Joined: Mon Dec 03, 2007 4:33 am
Location: Australia

Re: Project: 2665 (Run 0, Clone 973, Gen 62)

Post by anandhanju »

Nice. Hope this is the last you'll see of that WU. +1 to kasson's credit right there ;)
toTOW
Site Moderator
Posts: 6453
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France
Contact:

Re: Project: 2665 (Run 0, Clone 973, Gen 62)

Post by toTOW »

Hi jbiel (team 37766),
Your WU (P2665 R0 C973 G62) was added to the stats database on 2008-11-07 14:46:17 for 49.22 points of credit.

Four other people tried to fold it without success ... it's probably a bad WU :(
Image

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
jbiel
Posts: 3
Joined: Thu Nov 06, 2008 11:49 pm
Hardware configuration: K8N NEO4 Plat. SLI (7100-020)
Socket 939 AMD Opteron 170
2 nVidia 7600GS 256MB (DVR 97.92)
PCP&C 750 Silencer 60 12v Amps
2 Seagate Barracuda ST380817AS 80gig SATA HD Raid
300 gig Wd IDE HD (WD3000JB)
Win XP Pro w/SP2
DVD+/-ROM NEC ND3550A
DVDROM ASUS E616AG
2G OCZ EB PC-4000 /500MHz

Re: Project: 2665 (Run 0, Clone 973, Gen 62)

Post by jbiel »

toTOW wrote:Hi jbiel (team 37766),
Your WU (P2665 R0 C973 G62) was added to the stats database on 2008-11-07 14:46:17 for 49.22 points of credit.

Four other people tried to fold it without success ... it's probably a bad WU :(
Thank you for that info.
I do think that without the new .exe I would have recieved no credit at all, as was the case for the previous attempts with this WU.
Post Reply