Project: 4743 (Run 3, Clone 7, Gen 4)

Posted: Sun Mar 01, 2009 1:11 pm
by Oldhat
Hi,

Possibly a faulty WU: it fails with NaN errors right at the start. As always, though, it could be faulty hardware.

Windows 7, Q9450, 4 GB RAM, HD 4850 running 9.2 drivers and the GPU 6.23 client

Code:

[06:49:31] Project: 4743 (Run 3, Clone 7, Gen 4)
[06:49:31] 
[06:49:31] Assembly optimizations on if available.
[06:49:31] Entering M.D.
[06:49:37] Working on p4743_lam5w_300K
[06:49:38] Client config found, loading data.
[06:49:38] Starting GUI Server
[06:49:40] mdrun_gpu returned 
[06:49:40] NANs detected on GPU
[06:49:40] 
[06:49:40] Folding@home Core Shutdown: UNSTABLE_MACHINE
[06:49:43] CoreStatus = 7A (122)
[06:49:43] Sending work to server
[06:49:43] Project: 4743 (Run 3, Clone 7, Gen 4)
[06:49:43] - Read packet limit of 540015616... Set to 524286976.
[06:49:43] - Error: Could not get length of results file work/wuresults_04.dat
[06:49:43] - Error: Could not read unit 04 file. Removing from queue.
[06:49:43] - Preparing to get new work unit...
[06:49:43] + Attempting to get work packet
[06:49:43] - Connecting to assignment server
[06:49:44] - Successful: assigned to (171.64.65.103).
[06:49:44] + News From Folding@Home: GPU folding beta
[06:49:44] Loaded queue successfully.
[06:49:46] + Closed connections
[06:49:51] 
[06:49:51] + Processing work unit
[06:49:51] Core required: FahCore_11.exe
[06:49:51] Core found.
[06:49:51] Working on queue slot 05 [March 1 06:49:51 UTC]
[06:49:51] + Working ...
[06:49:51] 
[06:49:51] *------------------------------*
[06:49:51] Folding@Home GPU Core - Beta
[06:49:51] Version 1.22 (Mon Dec 8 12:57:56 PST 2008)
[06:49:51] 
[06:49:51] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[06:49:51] Build host: amoeba
[06:49:51] Board Type: AMD
[06:49:51] Core      : 
[06:49:51] Preparing to commence simulation
[06:49:51] - Looking at optimizations...
[06:49:51] - Created dyn
[06:49:51] - Files status OK
[06:49:51] - Expanded 88337 -> 447304 (decompressed 506.3 percent)
[06:49:51] Called DecompressByteArray: compressed_data_size=88337 data_size=447304, decompressed_data_size=447304 diff=0
[06:49:51] - Digital signature verified
[06:49:51] 
[06:49:51] Project: 4743 (Run 3, Clone 7, Gen 4)
[06:49:51] 
[06:49:51] Assembly optimizations on if available.
[06:49:51] Entering M.D.
[06:49:57] Working on p4743_lam5w_300K
[06:49:57] Client config found, loading data.
[06:49:57] Starting GUI Server
[06:50:00] mdrun_gpu returned 
[06:50:00] NANs detected on GPU
[06:50:00] 
[06:50:00] Folding@home Core Shutdown: UNSTABLE_MACHINE
[06:50:03] CoreStatus = 7A (122)
[06:50:03] Sending work to server
[06:50:03] Project: 4743 (Run 3, Clone 7, Gen 4)
[06:50:03] - Read packet limit of 540015616... Set to 524286976.
[06:50:03] - Error: Could not get length of results file work/wuresults_05.dat
[06:50:03] - Error: Could not read unit 05 file. Removing from queue.
[06:50:03] EUE limit exceeded. Pausing 24 hours.
Cheers

Re: Project: 4743 (Run 3, Clone 7, Gen 4)

Posted: Sun Mar 01, 2009 6:05 pm
by toTOW
Someone else was able to complete it successfully.

Re: Project: 4743 (Run 3, Clone 7, Gen 4)

Posted: Tue Mar 24, 2009 5:40 pm
by valleton
Same problem here: NANs detected at start. I've never had any stability issues or EUEs, and I've been folding for over 2 months now.

Same project, but Run 9, Clone 139, Gen 10.
WinXP 32bit, E8600, HD4870, Catalyst 8.12, GPU Systray client 6.23

Code:

[14:53:47] Project: 4743 (Run 9, Clone 139, Gen 10)
[14:53:47] 
[14:53:47] Assembly optimizations on if available.
[14:53:47] Entering M.D.
[14:53:53] Working on p4743_lam5w_300K
[14:53:53] Client config found, loading data.
[14:53:53] Starting GUI Server
[14:53:55] mdrun_gpu returned 
[14:53:55] NANs detected on GPU
[14:53:55] 
[14:53:55] Folding@home Core Shutdown: UNSTABLE_MACHINE
[14:53:59] CoreStatus = 7A (122)
[14:53:59] Sending work to server
[14:53:59] Project: 4743 (Run 9, Clone 139, Gen 10)
[14:53:59] - Read packet limit of 540015616... Set to 524286976.
[14:53:59] - Error: Could not get length of results file work/wuresults_08.dat
[14:53:59] - Error: Could not read unit 08 file. Removing from queue.
[14:53:59] - Preparing to get new work unit...
[14:53:59] + Attempting to get work packet
[14:53:59] - Connecting to assignment server
[14:53:59] - Successful: assigned to (171.64.65.103).
[14:53:59] + News From Folding@Home: GPU folding beta
[14:54:00] Loaded queue successfully.
[14:54:01] + Closed connections
[14:54:06] 
[14:54:06] + Processing work unit
[14:54:06] Core required: FahCore_11.exe
[14:54:06] Core found.
[14:54:06] Working on queue slot 09 [March 24 14:54:06 UTC]
[14:54:06] + Working ...
[14:54:06] 
[14:54:06] *------------------------------*
[14:54:06] Folding@Home GPU Core - Beta
[14:54:06] Version 1.22 (Mon Dec 8 12:57:56 PST 2008)
[14:54:06] 
[14:54:06] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[14:54:06] Build host: amoeba
[14:54:06] Board Type: AMD
[14:54:06] Core      : 
[14:54:06] Preparing to commence simulation
[14:54:06] - Looking at optimizations...
[14:54:06] - Created dyn
[14:54:06] - Files status OK
[14:54:06] - Expanded 57514 -> 447304 (decompressed 777.7 percent)
[14:54:06] Called DecompressByteArray: compressed_data_size=57514 data_size=447304, decompressed_data_size=447304 diff=0
[14:54:06] - Digital signature verified
[14:54:06] 
[14:54:06] Project: 4743 (Run 9, Clone 139, Gen 10)
[14:54:06] 
[14:54:06] Assembly optimizations on if available.
[14:54:06] Entering M.D.
[14:54:12] Working on p4743_lam5w_300K
[14:54:13] Client config found, loading data.
[14:54:13] Starting GUI Server
[14:54:15] mdrun_gpu returned 
[14:54:15] NANs detected on GPU
[14:54:15] 
[14:54:15] Folding@home Core Shutdown: UNSTABLE_MACHINE
[14:54:18] CoreStatus = 7A (122)
[14:54:18] Sending work to server
[14:54:18] Project: 4743 (Run 9, Clone 139, Gen 10)
[14:54:18] - Read packet limit of 540015616... Set to 524286976.
[14:54:18] - Error: Could not get length of results file work/wuresults_09.dat
[14:54:18] - Error: Could not read unit 09 file. Removing from queue.
[14:54:18] EUE limit exceeded. Pausing 24 hours.

EDIT: Maybe I should have started a new topic, since these are different specific WUs and only the project is the same?

Re: Project: 4743 (Run 3, Clone 7, Gen 4)

Posted: Tue Mar 24, 2009 8:08 pm
by toTOW
valleton> Project: 4743 (Run 9, Clone 139, Gen 10) is a bad WU: there are 10 reports of immediate failure.

I've marked the WU as bad.
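
For anyone who wants to check their own log for the same pattern, here is a rough sketch that counts NaN failures per Project/Run/Clone/Gen. It is not part of the client: the function name `count_nan_failures` and the parsing approach are assumptions based only on the log lines quoted in this thread, so adjust it if your log looks different.

```python
import re
from collections import Counter

# WU identifier line, as it appears in the logs quoted in this thread.
WU_LINE = re.compile(r"Project: (\d+) \(Run (\d+), Clone (\d+), Gen (\d+)\)")
# Failure marker printed by the GPU core in the quoted logs.
NAN_MARKER = "NANs detected on GPU"


def count_nan_failures(log_text):
    """Map (project, run, clone, gen) -> number of NaN failures seen in the log.

    Hypothetical helper: each NaN marker is attributed to the most recent
    WU identifier line that precedes it.
    """
    failures = Counter()
    current_wu = None
    for line in log_text.splitlines():
        match = WU_LINE.search(line)
        if match:
            current_wu = tuple(int(g) for g in match.groups())
        elif NAN_MARKER in line and current_wu is not None:
            failures[current_wu] += 1
    return failures


# Shortened sample built from lines quoted earlier in the thread.
sample = """\
[14:53:47] Project: 4743 (Run 9, Clone 139, Gen 10)
[14:53:55] NANs detected on GPU
[14:53:59] Project: 4743 (Run 9, Clone 139, Gen 10)
[14:54:06] Project: 4743 (Run 9, Clone 139, Gen 10)
[14:54:15] NANs detected on GPU
"""

for wu, count in count_nan_failures(sample).items():
    print(wu, count)
```

If the failures cluster on a single WU while other WUs complete normally, a bad WU is the more likely explanation. NaNs spread across many different WUs would point more towards the hardware or drivers.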

Re: Project: 4743 (Run 3, Clone 7, Gen 4)

Posted: Wed Mar 25, 2009 6:24 pm
by rhavern
I've had the same problem with the same WU. Thanks for the info in this thread; I'll restart and see what happens.

Rick