Page 1 of 10

Project 5801 issues. [Should be Offline]

Posted: Tue Oct 28, 2008 5:24 pm
by BrgHW
Both do not fold at all. Same error at same place for these.
Edit: Now also on R4,C125,G0 and you can add 3-4 other P5801 I have tried now.

ALL P5801 DO NOT WORK AT ALL. 5 of 6 GPU's are now whitout work!!!

[17:19:13] - Expanded 42882 -> 246265 (decompressed 574.2 percent)
[17:19:13] Called DecompressByteArray: compressed_data_size=42882 data_size=246265, decompressed_data_size=246265 diff=0
[17:19:13] - Digital signature verified
[17:19:13]
[17:19:13] Project: 5801 (Run 9, Clone 63, Gen 0)
[17:19:13]
[17:19:13] Assembly optimizations on if available.
[17:19:13] Entering M.D.
[17:19:19] Working on p5801_supervillin_e1
[17:19:20] Client config found, loading data.
[17:19:20] Starting GUI Server
[17:19:32] mdrun_gpu returned
[17:19:32] NANs detected on GPU
[17:19:32]
[17:19:32] Folding@home Core Shutdown: UNSTABLE_MACHINE
[17:19:35] CoreStatus = 7A (122)
[17:19:35] Sending work to server
[17:19:35] Project: 5801 (Run 9, Clone 63, Gen 0)
[17:19:35] - Read packet limit of 540015616... Set to 524286976.
[17:19:35] - Error: Could not get length of results file work/wuresults_01.dat
[17:19:35] - Error: Could not read unit 01 file. Removing from queue.

Edit by toTOW :
VijayPande wrote:We've taken these off line until we can see what's up.

Re: Project 5801(R0,C4,G0) and (R9,C63,G0)

Posted: Tue Oct 28, 2008 5:32 pm
by jevans64
Project 5801 is NO GOOD!!! WinXP SP3 nV 178.24 9800GTX+ FahCore_11 v1.15

EUEs from 3 clients.

Code: Select all

[17:26:10] Project: 5801 (Run 5, Clone 179, Gen 0)
[17:26:10] 
[17:26:10] Assembly optimizations on if available.
[17:26:10] Entering M.D.
[17:26:17] Working on p5801_supervillin_e1
[17:26:17] Client config found, loading data.
[17:26:17] Starting GUI Server
[17:26:30] mdrun_gpu returned 
[17:26:30] NANs detected on GPU
[17:26:30] 
[17:26:30] Folding@home Core Shutdown: UNSTABLE_MACHINE
[17:26:33] CoreStatus = 7A (122)
[17:26:33] Sending work to server
[17:26:33] Project: 5801 (Run 5, Clone 179, Gen 0)
[17:26:33] - Read packet limit of 540015616... Set to 524286976.
[17:26:33] - Error: Could not get length of results file work/wuresults_07.dat
[17:26:33] - Error: Could not read unit 07 file. Removing from queue.
[17:26:33] EUE limit exceeded. Pausing 24 hours.

-----

[17:26:10] Project: 5801 (Run 3, Clone 209, Gen 0)
[17:26:10] 
[17:26:10] Assembly optimizations on if available.
[17:26:10] Entering M.D.
[17:26:17] Working on p5801_supervillin_e1
[17:26:17] Client config found, loading data.
[17:26:17] Starting GUI Server
[17:26:30] mdrun_gpu returned 
[17:26:30] NANs detected on GPU
[17:26:30] 
[17:26:30] Folding@home Core Shutdown: UNSTABLE_MACHINE
[17:26:32] CoreStatus = 7A (122)
[17:26:32] Sending work to server
[17:26:32] Project: 5801 (Run 3, Clone 209, Gen 0)
[17:26:32] - Read packet limit of 540015616... Set to 524286976.
[17:26:32] - Error: Could not get length of results file work/wuresults_03.dat
[17:26:32] - Error: Could not read unit 03 file. Removing from queue.
[17:26:32] EUE limit exceeded. Pausing 24 hours.

-----

[17:31:53] Project: 5801 (Run 8, Clone 216, Gen 0)
[17:31:53] 
[17:31:53] Assembly optimizations on if available.
[17:31:53] Entering M.D.
[17:32:00] Working on p5801_supervillin_e1
[17:32:00] Client config found, loading data.
[17:32:00] Starting GUI Server
[17:32:13] mdrun_gpu returned 
[17:32:13] NANs detected on GPU
[17:32:13] 
[17:32:13] Folding@home Core Shutdown: UNSTABLE_MACHINE
[17:32:16] CoreStatus = 7A (122)
[17:32:16] Sending work to server
[17:32:16] Project: 5801 (Run 8, Clone 216, Gen 0)
[17:32:16] - Read packet limit of 540015616... Set to 524286976.
[17:32:16] - Error: Could not get length of results file work/wuresults_01.dat
[17:32:16] - Error: Could not read unit 01 file. Removing from queue.
[17:32:16] Trying to send all finished work units
[17:32:16] Project: 5506 (Run 6, Clone 977, Gen 58)
[17:32:16] - Read packet limit of 540015616... Set to 524286976.


Re: Project 5801(R0,C4,G0) and (R9,C63,G0)

Posted: Tue Oct 28, 2008 5:33 pm
by Leganfuh
I agree, 5801 Sucks. I have 24 systems that run a stock 2.4 Quad and a stock nVidia 8800 GT, I get UNSTABLE_MACHINE also, this is the first GPU-2 project that will not run on my systems.

Code: Select all

[17:33:38] Folding@Home GPU Core - Beta
[17:33:38] Version 1.15 (Mon Oct 13 11:11:30 PDT 2008)
[17:33:38] 
[17:33:38] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[17:33:38] Build host: amoeba
[17:33:38] Board Type: Nvidia
[17:33:38] Core      : 
[17:33:38] Preparing to commence simulation
[17:33:38] - Looking at optimizations...
[17:33:38] - Created dyn
[17:33:38] - Files status OK
[17:33:38] - Expanded 42934 -> 246265 (decompressed 573.5 percent)
[17:33:38] Called DecompressByteArray: compressed_data_size=42934 data_size=246265, decompressed_data_size=246265 diff=0
[17:33:38] - Digital signature verified
[17:33:38] 
[17:33:38] Project: 5801 (Run 6, Clone 296, Gen 0)
[17:33:38] 
[17:33:38] Assembly optimizations on if available.
[17:33:38] Entering M.D.
[17:33:44] Working on p5801_supervillin_e1
[17:33:45] Client config found, loading data.
[17:33:45] Starting GUI Server
[17:34:01] mdrun_gpu returned 
[17:34:01] NANs detected on GPU
[17:34:01] 
[17:34:01] Folding@home Core Shutdown: UNSTABLE_MACHINE
[17:34:04] CoreStatus = 7A (122)
[17:34:04] Sending work to server
[17:34:04] Project: 5801 (Run 6, Clone 296, Gen 0)
[17:34:04] - Read packet limit of 540015616... Set to 524286976.
[17:34:04] - Error: Could not get length of results file work/wuresults_08.dat
[17:34:04] - Error: Could not read unit 08 file. Removing from queue.
[17:34:04] Project: 5506 (Run 3, Clone 687, Gen 133)
[17:34:04] - Read packet limit of 540015616... Set to 524286976.


[17:34:04] + Attempting to send results [October 28 17:34:04 UTC]
Mike
Team XCPUs - Leganfuh
aka: The Commander
1) Dell XPS 720 2.4 Quad Core, GeForce 8800GTX 2-SMP-MPICH 1-GPU2
(20) Dell Vostro 400's 2.4 Quad GeForce 8800GT 40-SMP-MPICH 20-GPU2
(3) Dell Vostro 410's 2.4 Quad GeForce 8800GT 6-SMP-MPICH 3-GPU2

Image

Re: Project 5801(R0,C4,G0) and (R9,C63,G0)

Posted: Tue Oct 28, 2008 5:42 pm
by MrBooMY
I concur. My mom's PC started getting 5801s, and it crashes the same as all of you guys. I clocked it back to STOCK, same results!

Re: Project 5801(R0,C4,G0) and (R9,C63,G0)

Posted: Tue Oct 28, 2008 5:46 pm
by JX99
Yep, 5801's are fubar

Stock clocked 8800GT EVGA Akimbo 600/900/1524 on XP home with 6.20r1 console client

Code: Select all

--- Opening Log file [October 28 17:31:14 UTC] 


# Windows GPU Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.20r1

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Program Files\Folding@home\Folding@home-gpu
Executable: C:\Program Files\Folding@home\Folding@home-gpu\Folding@home-Win32-GPU.exe
Arguments: -gpu 0 

[17:31:14] - Ask before connecting: No
[17:31:14] - User name: JammerX99 (Team 12912)
[17:31:14] - User ID: 7A8992C2612BDCBA
[17:31:14] - Machine ID: 1
[17:31:14] 
[17:31:15] Loaded queue successfully.
[17:31:15] 
[17:31:15] Project: 5506 (Run 6, Clone 601, Gen 174)
[17:31:15] + Processing work unit
[17:31:15] Core required: FahCore_11.exe


[17:31:15] + Attempting to send results [October 28 17:31:15 UTC]
[17:31:15] Core found.
[17:31:15] Working on queue slot 04 [October 28 17:31:15 UTC]
[17:31:15] + Working ...
[17:31:15] 
[17:31:15] *------------------------------*
[17:31:15] Folding@Home GPU Core - Beta
[17:31:15] Version 1.15 (Mon Oct 13 11:11:30 PDT 2008)
[17:31:15] 
[17:31:15] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[17:31:15] Build host: amoeba
[17:31:15] Board Type: Nvidia
[17:31:15] Core      : 
[17:31:15] Preparing to commence simulation
[17:31:15] - Looking at optimizations...
[17:31:15] - Created dyn
[17:31:15] - Files status OK
[17:31:15] - Expanded 42805 -> 246265 (decompressed 575.3 percent)
[17:31:15] Called DecompressByteArray: compressed_data_size=42805 data_size=246265, decompressed_data_size=246265 diff=0
[17:31:15] - Digital signature verified
[17:31:15] 
[17:31:15] Project: 5801 (Run 9, Clone 68, Gen 0)
[17:31:15] 
[17:31:15] Assembly optimizations on if available.
[17:31:15] Entering M.D.
[17:31:21] Working on p5801_supervillin_e1
[17:31:22] Client config found, loading data.
[17:31:22] Starting GUI Server
[17:31:35] - Couldn't send HTTP request to server
[17:31:35] + Could not connect to Work Server (results)
[17:31:35]     (171.64.65.106:8080)
[17:31:35] + Retrying using alternative port
[17:31:38] mdrun_gpu returned 
[17:31:38] NANs detected on GPU
[17:31:38] 
[17:31:38] Folding@home Core Shutdown: UNSTABLE_MACHINE
[17:31:41] CoreStatus = 7A (122)
[17:31:41] Sending work to server
[17:31:41] - Preparing to get new work unit...
[17:31:41] + Attempting to get work packet
[17:31:41] - Connecting to assignment server
[17:31:41] - Successful: assigned to (171.67.108.11).
[17:31:41] + News From Folding@Home: GPU folding beta
[17:31:41] Loaded queue successfully.
[17:31:42] + Closed connections
[17:31:47] 
[17:31:47] + Processing work unit
[17:31:47] Core required: FahCore_11.exe
[17:31:47] Core found.
[17:31:47] Working on queue slot 05 [October 28 17:31:47 UTC]
[17:31:47] + Working ...
[17:31:47] 
[17:31:47] *------------------------------*
[17:31:47] Folding@Home GPU Core - Beta
[17:31:47] Version 1.15 (Mon Oct 13 11:11:30 PDT 2008)
[17:31:47] 
[17:31:47] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[17:31:47] Build host: amoeba
[17:31:47] Board Type: Nvidia
[17:31:47] Core      : 
[17:31:47] Preparing to commence simulation
[17:31:47] - Looking at optimizations...
[17:31:47] - Created dyn
[17:31:47] - Files status OK
[17:31:47] - Expanded 42861 -> 246265 (decompressed 574.5 percent)
[17:31:47] Called DecompressByteArray: compressed_data_size=42861 data_size=246265, decompressed_data_size=246265 diff=0
[17:31:47] - Digital signature verified
[17:31:47] 
[17:31:47] Project: 5801 (Run 4, Clone 179, Gen 0)
[17:31:47] 
[17:31:47] Assembly optimizations on if available.
[17:31:47] Entering M.D.
[17:31:54] Working on p5801_supervillin_e1
[17:31:54] Client config found, loading data.
[17:31:54] Starting GUI Server
[17:31:56] - Couldn't send HTTP request to server
[17:31:56] + Could not connect to Work Server (results)
[17:31:56]     (171.64.65.106:80)
[17:31:56] - Error: Could not transmit unit 06 (completed October 28) to work server.


[17:31:56] + Attempting to send results [October 28 17:31:56 UTC]
[17:31:57] - Couldn't send HTTP request to server
[17:31:57]   (Got status 503)
[17:31:57] + Could not connect to Work Server (results)
[17:31:57]     (171.67.108.25:8080)
[17:31:57] + Retrying using alternative port
[17:31:57] - Couldn't send HTTP request to server
[17:31:57]   (Got status 503)
[17:31:57] + Could not connect to Work Server (results)
[17:31:57]     (171.67.108.25:80)
[17:31:57]   Could not transmit unit 06 to Collection server; keeping in queue.
[17:32:10] mdrun_gpu returned 
[17:32:10] NANs detected on GPU
[17:32:10] 
[17:32:10] Folding@home Core Shutdown: UNSTABLE_MACHINE
[17:32:13] CoreStatus = 7A (122)
[17:32:13] Sending work to server
[17:32:13] Project: 5801 (Run 4, Clone 179, Gen 0)
[17:32:13] - Error: Could not get length of results file work/wuresults_05.dat
[17:32:13] - Error: Could not read unit 05 file. Removing from queue.
[17:32:13] Project: 5801 (Run 9, Clone 68, Gen 0)
[17:32:13] - Error: Could not get length of results file work/wuresults_04.dat
[17:32:13] - Error: Could not read unit 04 file. Removing from queue.
[17:32:13] Project: 5506 (Run 6, Clone 601, Gen 174)


[17:32:13] + Attempting to send results [October 28 17:32:13 UTC]
[17:32:34] - Couldn't send HTTP request to server
[17:32:34] + Could not connect to Work Server (results)
[17:32:34]     (171.64.65.106:8080)
[17:32:34] + Retrying using alternative port
[17:32:55] - Couldn't send HTTP request to server
[17:32:55] + Could not connect to Work Server (results)
[17:32:55]     (171.64.65.106:80)
[17:32:55] - Error: Could not transmit unit 06 (completed October 28) to work server.


[17:32:55] + Attempting to send results [October 28 17:32:55 UTC]
[17:32:55] - Couldn't send HTTP request to server
[17:32:55]   (Got status 503)
[17:32:55] + Could not connect to Work Server (results)
[17:32:55]     (171.67.108.25:8080)
[17:32:55] + Retrying using alternative port
[17:32:56] - Couldn't send HTTP request to server
[17:32:56]   (Got status 503)
[17:32:56] + Could not connect to Work Server (results)
[17:32:56]     (171.67.108.25:80)
[17:32:56]   Could not transmit unit 06 to Collection server; keeping in queue.
[17:32:56] - Preparing to get new work unit...
[17:32:56] + Attempting to get work packet
[17:32:56] - Connecting to assignment server
[17:32:56] - Successful: assigned to (171.67.108.11).
[17:32:56] + News From Folding@Home: GPU folding beta
[17:32:56] Loaded queue successfully.
[17:32:56] - Attempt #1  to get work failed, and no other work to do.
Waiting before retry.
[17:33:13] + Attempting to get work packet
[17:33:13] - Connecting to assignment server
[17:33:14] - Successful: assigned to (171.67.108.11).
[17:33:14] + News From Folding@Home: GPU folding beta
[17:33:14] Loaded queue successfully.
[17:33:15] Project: 5506 (Run 6, Clone 601, Gen 174)


[17:33:15] + Attempting to send results [October 28 17:33:15 UTC]
[17:33:36] - Couldn't send HTTP request to server
[17:33:36] + Could not connect to Work Server (results)
[17:33:36]     (171.64.65.106:8080)
[17:33:36] + Retrying using alternative port
[17:33:57] - Couldn't send HTTP request to server
[17:33:57] + Could not connect to Work Server (results)
[17:33:57]     (171.64.65.106:80)
[17:33:57] - Error: Could not transmit unit 06 (completed October 28) to work server.


[17:33:57] + Attempting to send results [October 28 17:33:57 UTC]
[17:33:57] - Couldn't send HTTP request to server
[17:33:57]   (Got status 503)
[17:33:57] + Could not connect to Work Server (results)
[17:33:57]     (171.67.108.25:8080)
[17:33:57] + Retrying using alternative port
[17:33:57] - Couldn't send HTTP request to server
[17:33:57]   (Got status 503)
[17:33:57] + Could not connect to Work Server (results)
[17:33:57]     (171.67.108.25:80)
[17:33:57]   Could not transmit unit 06 to Collection server; keeping in queue.
[17:33:57] + Closed connections
[17:34:02] 
[17:34:02] + Processing work unit
[17:34:02] Core required: FahCore_11.exe
[17:34:02] Core found.
[17:34:02] Working on queue slot 07 [October 28 17:34:02 UTC]
[17:34:02] + Working ...
[17:34:02] 
[17:34:02] *------------------------------*
[17:34:02] Folding@Home GPU Core - Beta
[17:34:02] Version 1.15 (Mon Oct 13 11:11:30 PDT 2008)
[17:34:02] 
[17:34:02] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[17:34:02] Build host: amoeba
[17:34:02] Board Type: Nvidia
[17:34:02] Core      : 
[17:34:02] Preparing to commence simulation
[17:34:02] - Looking at optimizations...
[17:34:02] - Created dyn
[17:34:02] - Files status OK
[17:34:02] - Expanded 42905 -> 246265 (decompressed 573.9 percent)
[17:34:02] Called DecompressByteArray: compressed_data_size=42905 data_size=246265, decompressed_data_size=246265 diff=0
[17:34:02] - Digital signature verified
[17:34:02] 
[17:34:02] Project: 5801 (Run 1, Clone 260, Gen 0)
[17:34:02] 
[17:34:02] Assembly optimizations on if available.
[17:34:02] Entering M.D.
[17:34:08] Working on p5801_supervillin_e1
[17:34:09] Client config found, loading data.
[17:34:09] Starting GUI Server
[17:34:25] mdrun_gpu returned 
[17:34:25] NANs detected on GPU
[17:34:25] 
[17:34:25] Folding@home Core Shutdown: UNSTABLE_MACHINE
[17:34:28] CoreStatus = 7A (122)
[17:34:28] Sending work to server
[17:34:28] Project: 5801 (Run 1, Clone 260, Gen 0)
[17:34:28] - Error: Could not get length of results file work/wuresults_07.dat
[17:34:28] - Error: Could not read unit 07 file. Removing from queue.
[17:34:28] Project: 5506 (Run 6, Clone 601, Gen 174)


[17:34:28] + Attempting to send results [October 28 17:34:28 UTC]
[17:34:49] - Couldn't send HTTP request to server
[17:34:49] + Could not connect to Work Server (results)
[17:34:49]     (171.64.65.106:8080)
[17:34:49] + Retrying using alternative port
[17:35:10] - Couldn't send HTTP request to server
[17:35:10] + Could not connect to Work Server (results)
[17:35:10]     (171.64.65.106:80)
[17:35:10] - Error: Could not transmit unit 06 (completed October 28) to work server.


[17:35:10] + Attempting to send results [October 28 17:35:10 UTC]
[17:35:10] - Couldn't send HTTP request to server
[17:35:10]   (Got status 503)
[17:35:10] + Could not connect to Work Server (results)
[17:35:10]     (171.67.108.25:8080)
[17:35:10] + Retrying using alternative port
[17:35:11] - Couldn't send HTTP request to server
[17:35:11]   (Got status 503)
[17:35:11] + Could not connect to Work Server (results)
[17:35:11]     (171.67.108.25:80)
[17:35:11]   Could not transmit unit 06 to Collection server; keeping in queue.
[17:35:11] - Preparing to get new work unit...
[17:35:11] + Attempting to get work packet
[17:35:11] - Connecting to assignment server
[17:35:11] - Successful: assigned to (171.67.108.11).
[17:35:11] + News From Folding@Home: GPU folding beta
[17:35:11] Loaded queue successfully.
[17:35:12] Project: 5506 (Run 6, Clone 601, Gen 174)


[17:35:12] + Attempting to send results [October 28 17:35:12 UTC]
[17:35:33] - Couldn't send HTTP request to server
[17:35:33] + Could not connect to Work Server (results)
[17:35:33]     (171.64.65.106:8080)
[17:35:33] + Retrying using alternative port
[17:35:54] - Couldn't send HTTP request to server
[17:35:54] + Could not connect to Work Server (results)
[17:35:54]     (171.64.65.106:80)
[17:35:54] - Error: Could not transmit unit 06 (completed October 28) to work server.


[17:35:54] + Attempting to send results [October 28 17:35:54 UTC]
[17:35:54] - Couldn't send HTTP request to server
[17:35:54]   (Got status 503)
[17:35:54] + Could not connect to Work Server (results)
[17:35:54]     (171.67.108.25:8080)
[17:35:54] + Retrying using alternative port
[17:35:54] - Couldn't send HTTP request to server
[17:35:54]   (Got status 503)
[17:35:54] + Could not connect to Work Server (results)
[17:35:54]     (171.67.108.25:80)
[17:35:54]   Could not transmit unit 06 to Collection server; keeping in queue.
[17:35:54] + Closed connections
[17:35:59] 
[17:35:59] + Processing work unit
[17:35:59] Core required: FahCore_11.exe
[17:35:59] Core found.
[17:35:59] Working on queue slot 08 [October 28 17:35:59 UTC]
[17:35:59] + Working ...
[17:35:59] 
[17:35:59] *------------------------------*
[17:35:59] Folding@Home GPU Core - Beta
[17:35:59] Version 1.15 (Mon Oct 13 11:11:30 PDT 2008)
[17:35:59] 
[17:35:59] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[17:35:59] Build host: amoeba
[17:35:59] Board Type: Nvidia
[17:35:59] Core      : 
[17:35:59] Preparing to commence simulation
[17:35:59] - Looking at optimizations...
[17:35:59] - Created dyn
[17:35:59] - Files status OK
[17:35:59] - Expanded 42905 -> 246265 (decompressed 573.9 percent)
[17:35:59] Called DecompressByteArray: compressed_data_size=42905 data_size=246265, decompressed_data_size=246265 diff=0
[17:35:59] - Digital signature verified
[17:35:59] 
[17:35:59] Project: 5801 (Run 1, Clone 260, Gen 0)
[17:35:59] 
[17:35:59] Assembly optimizations on if available.
[17:35:59] Entering M.D.
[17:36:06] Working on p5801_supervillin_e1
[17:36:06] Client config found, loading data.
[17:36:06] Starting GUI Server
[17:36:22] mdrun_gpu returned 
[17:36:22] NANs detected on GPU
[17:36:22] 
[17:36:22] Folding@home Core Shutdown: UNSTABLE_MACHINE
[17:36:25] CoreStatus = 7A (122)
[17:36:25] Sending work to server
[17:36:25] Project: 5801 (Run 1, Clone 260, Gen 0)
[17:36:25] - Error: Could not get length of results file work/wuresults_08.dat
[17:36:25] - Error: Could not read unit 08 file. Removing from queue.
[17:36:25] Project: 5506 (Run 6, Clone 601, Gen 174)


[17:36:25] + Attempting to send results [October 28 17:36:25 UTC]
[17:36:46] - Couldn't send HTTP request to server
[17:36:46] + Could not connect to Work Server (results)
[17:36:46]     (171.64.65.106:8080)
[17:36:46] + Retrying using alternative port
[17:37:08] - Couldn't send HTTP request to server
[17:37:08] + Could not connect to Work Server (results)
[17:37:08]     (171.64.65.106:80)
[17:37:08] - Error: Could not transmit unit 06 (completed October 28) to work server.


[17:37:08] + Attempting to send results [October 28 17:37:08 UTC]
[17:37:08] - Couldn't send HTTP request to server
[17:37:08]   (Got status 503)
[17:37:08] + Could not connect to Work Server (results)
[17:37:08]     (171.67.108.25:8080)
[17:37:08] + Retrying using alternative port
[17:37:08] - Couldn't send HTTP request to server
[17:37:08]   (Got status 503)
[17:37:08] + Could not connect to Work Server (results)
[17:37:08]     (171.67.108.25:80)
[17:37:08]   Could not transmit unit 06 to Collection server; keeping in queue.
[17:37:08] - Preparing to get new work unit...
[17:37:08] + Attempting to get work packet
[17:37:08] - Connecting to assignment server
[17:37:08] - Successful: assigned to (171.67.108.11).
[17:37:08] + News From Folding@Home: GPU folding beta
[17:37:08] Loaded queue successfully.
[17:37:10] Project: 5506 (Run 6, Clone 601, Gen 174)


[17:37:10] + Attempting to send results [October 28 17:37:10 UTC]
[17:37:31] - Couldn't send HTTP request to server
[17:37:31] + Could not connect to Work Server (results)
[17:37:31]     (171.64.65.106:8080)
[17:37:31] + Retrying using alternative port
[17:37:52] - Couldn't send HTTP request to server
[17:37:52] + Could not connect to Work Server (results)
[17:37:52]     (171.64.65.106:80)
[17:37:52] - Error: Could not transmit unit 06 (completed October 28) to work server.


[17:37:52] + Attempting to send results [October 28 17:37:52 UTC]
[17:37:52] - Couldn't send HTTP request to server
[17:37:52]   (Got status 503)
[17:37:52] + Could not connect to Work Server (results)
[17:37:52]     (171.67.108.25:8080)
[17:37:52] + Retrying using alternative port
[17:37:52] - Couldn't send HTTP request to server
[17:37:52]   (Got status 503)
[17:37:52] + Could not connect to Work Server (results)
[17:37:52]     (171.67.108.25:80)
[17:37:52]   Could not transmit unit 06 to Collection server; keeping in queue.
[17:37:52] + Closed connections
[17:37:57] 
[17:37:57] + Processing work unit
[17:37:57] Core required: FahCore_11.exe
[17:37:57] Core found.
[17:37:57] Working on queue slot 09 [October 28 17:37:57 UTC]
[17:37:57] + Working ...
[17:37:57] 
[17:37:57] *------------------------------*
[17:37:57] Folding@Home GPU Core - Beta
[17:37:57] Version 1.15 (Mon Oct 13 11:11:30 PDT 2008)
[17:37:57] 
[17:37:57] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[17:37:57] Build host: amoeba
[17:37:57] Board Type: Nvidia
[17:37:57] Core      : 
[17:37:57] Preparing to commence simulation
[17:37:57] - Looking at optimizations...
[17:37:57] - Created dyn
[17:37:57] - Files status OK
[17:37:57] - Expanded 42905 -> 246265 (decompressed 573.9 percent)
[17:37:57] Called DecompressByteArray: compressed_data_size=42905 data_size=246265, decompressed_data_size=246265 diff=0
[17:37:57] - Digital signature verified
[17:37:57] 
[17:37:57] Project: 5801 (Run 1, Clone 260, Gen 0)
[17:37:57] 
[17:37:57] Assembly optimizations on if available.
[17:37:57] Entering M.D.
[17:38:03] Working on p5801_supervillin_e1
[17:38:04] Client config found, loading data.
[17:38:04] Starting GUI Server
[17:38:20] mdrun_gpu returned 
[17:38:20] NANs detected on GPU
[17:38:20] 
[17:38:20] Folding@home Core Shutdown: UNSTABLE_MACHINE
[17:38:23] CoreStatus = 7A (122)
[17:38:23] Sending work to server
[17:38:23] Project: 5801 (Run 1, Clone 260, Gen 0)
[17:38:23] - Error: Could not get length of results file work/wuresults_09.dat
[17:38:23] - Error: Could not read unit 09 file. Removing from queue.
[17:38:23] EUE limit exceeded. Pausing 24 hours

Re: Project 5801(R0,C4,G0) and (R9,C63,G0)

Posted: Tue Oct 28, 2008 5:55 pm
by ElectricVehicle
I'm also seeing all the 5801's EUE before they even get to 1%.

The same Project: 5801 (Run 7, Clone 176, Gen 0) WU is assigned after every EUE, so the client is stuck executing with this same exact (R,C,G) bad WU 5 times until it reaches the "EUE limit exceeded. Pausing 24 hours." taking it offline for 24 hours.

I understand that there's a need to keep clients from trashing too many WU's, but there's a similar need to keep a bad WU from trashing too many clients! Please revise the algorithm next time you work on the client or server software responsible for this.

Folding@Home Client Version 6.20
[17:28:02] Project: 5801 (Run 7, Clone 176, Gen 0)
[17:28:02]
[17:28:02] Assembly optimizations on if available.
[17:28:02] Entering M.D.
[17:28:08] Working on p5801_supervillin_e1
[17:28:09] Client config found, loading data.
[17:28:09] Starting GUI Server
[17:28:25] mdrun_gpu returned
[17:28:25] NANs detected on GPU
[17:28:25]
[17:28:25] Folding@home Core Shutdown: UNSTABLE_MACHINE
[17:28:28] CoreStatus = 7A (122)
[17:28:28] Sending work to server

As others have pointed out, this is BAD WU and not UNSTABLE_MACHINE. 5 of these units in a row all with the exact same "NANs detected on GPU" error about 16 seconds after "Starting GUI Server".

Code: Select all

# Windows GPU Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.20

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Users\GatoPrimo\AppData\Roaming\Folding@home-gpu2
Arguments: -gpu 1 -verbosity 9 

[17:27:04] - Ask before connecting: No
[17:27:04] - User name: [EV]Aptera (Team 104636)
[17:27:04] - User ID: 3AB3EB432D76466E
[17:27:04] - Machine ID: 4
[17:27:04] 
[17:27:04] Loaded queue successfully.
[17:27:04] Initialization complete
[17:27:04] - Preparing to get new work unit...
[17:27:04] + Attempting to get work packet
[17:27:04] - Autosending finished units... [17:27:04]
[17:27:04] Trying to send all finished work units
[17:27:04] + No unsent completed units remaining.
[17:27:04] - Autosend completed
[17:27:04] - Will indicate memory of 3070 MB
[17:27:04] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 15, Stepping: 11
[17:27:04] - Connecting to assignment server
[17:27:04] Connecting to http://assign-GPU.stanford.edu:8080/
[17:27:04] Posted data.
[17:27:04] Initial: 43AB; - Successful: assigned to (171.67.108.11).
[17:27:04] + News From Folding@Home: GPU folding beta
[17:27:05] Loaded queue successfully.
[17:27:05] Connecting to http://171.67.108.11:8080/
[17:27:08] Posted data.
[17:27:08] Initial: 0000; + Could not connect to Work Server
[17:27:08] - Attempt #1  to get work failed, and no other work to do.
Waiting before retry.
[17:27:18] + Attempting to get work packet
[17:27:18] - Will indicate memory of 3070 MB
[17:27:18] - Connecting to assignment server
[17:27:18] Connecting to http://assign-GPU.stanford.edu:8080/
[17:27:18] Posted data.
[17:27:18] Initial: 43AB; - Successful: assigned to (171.67.108.11).
[17:27:18] + News From Folding@Home: GPU folding beta
[17:27:18] Loaded queue successfully.
[17:27:18] Connecting to http://171.67.108.11:8080/
[17:27:18] Posted data.
[17:27:18] Initial: 0000; + Could not connect to Work Server
[17:27:18] - Attempt #2  to get work failed, and no other work to do.
Waiting before retry.
[17:27:34] + Attempting to get work packet
[17:27:34] - Will indicate memory of 3070 MB
[17:27:34] - Connecting to assignment server
[17:27:34] Connecting to http://assign-GPU.stanford.edu:8080/
[17:27:35] Posted data.
[17:27:35] Initial: 43AB; - Successful: assigned to (171.67.108.11).
[17:27:35] + News From Folding@Home: GPU folding beta
[17:27:35] Loaded queue successfully.
[17:27:35] Connecting to http://171.67.108.11:8080/
[17:27:35] Posted data.
[17:27:35] Initial: 0000; + Could not connect to Work Server
[17:27:35] - Attempt #3  to get work failed, and no other work to do.
Waiting before retry.
[17:28:01] + Attempting to get work packet
[17:28:01] - Will indicate memory of 3070 MB
[17:28:01] - Connecting to assignment server
[17:28:01] Connecting to http://assign-GPU.stanford.edu:8080/
[17:28:01] Posted data.
[17:28:01] Initial: 43AB; - Successful: assigned to (171.67.108.11).
[17:28:01] + News From Folding@Home: GPU folding beta
[17:28:01] Loaded queue successfully.
[17:28:01] Connecting to http://171.67.108.11:8080/
[17:28:01] Posted data.
[17:28:01] Initial: 0000; - Receiving payload (expected size: 43404)
[17:28:02] - Downloaded at ~42 kB/s
[17:28:02] - Averaged speed for that direction ~66 kB/s
[17:28:02] + Received work.
[17:28:02] + Closed connections
[17:28:02] 
[17:28:02] + Processing work unit
[17:28:02] Core required: FahCore_11.exe
[17:28:02] Core found.
[17:28:02] Working on queue slot 08 [October 28 17:28:02 UTC]
[17:28:02] + Working ...
[17:28:02] - Calling '.\FahCore_11.exe -dir work/ -suffix 08 -priority 96 -checkpoint 30 -verbose -lifeline 4028 -version 620'

[17:28:02] 
[17:28:02] *------------------------------*
[17:28:02] Folding@Home GPU Core - Beta
[17:28:02] Version 1.15 (Mon Oct 13 11:11:30 PDT 2008)
[17:28:02] 
[17:28:02] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[17:28:02] Build host: amoeba
[17:28:02] Board Type: Nvidia
[17:28:02] Core      : 
[17:28:02] Preparing to commence simulation
[17:28:02] - Looking at optimizations...
[17:28:02] - Created dyn
[17:28:02] - Files status OK
[17:28:02] - Expanded 42892 -> 246265 (decompressed 574.1 percent)
[17:28:02] Called DecompressByteArray: compressed_data_size=42892 data_size=246265, decompressed_data_size=246265 diff=0
[17:28:02] - Digital signature verified
[17:28:02] 
[17:28:02] Project: 5801 (Run 7, Clone 176, Gen 0)
[17:28:02] 
[17:28:02] Assembly optimizations on if available.
[17:28:02] Entering M.D.
[17:28:08] Working on p5801_supervillin_e1
[17:28:09] Client config found, loading data.
[17:28:09] Starting GUI Server
[17:28:25] mdrun_gpu returned 
[17:28:25] NANs detected on GPU
[17:28:25] 
[17:28:25] Folding@home Core Shutdown: UNSTABLE_MACHINE
[17:28:28] CoreStatus = 7A (122)
[17:28:28] Sending work to server
[17:28:28] Project: 5801 (Run 7, Clone 176, Gen 0)
[17:28:28] - Read packet limit of 540015616... Set to 524286976.
[17:28:28] - Error: Could not get length of results file work/wuresults_08.dat
[17:28:28] - Error: Could not read unit 08 file. Removing from queue.
[17:28:28] Trying to send all finished work units
[17:28:28] + No unsent completed units remaining.
[17:28:28] - Preparing to get new work unit...
[17:28:28] + Attempting to get work packet
[17:28:28] - Will indicate memory of 3070 MB
[17:28:28] - Connecting to assignment server
[17:28:28] Connecting to http://assign-GPU.stanford.edu:8080/
[17:28:28] Posted data.
[17:28:28] Initial: 43AB; - Successful: assigned to (171.67.108.11).
[17:28:28] + News From Folding@Home: GPU folding beta
[17:28:28] Loaded queue successfully.
[17:28:28] Connecting to http://171.67.108.11:8080/
[17:28:28] Posted data.
[17:28:28] Initial: 0000; - Receiving payload (expected size: 43404)
[17:28:29] - Downloaded at ~42 kB/s
[17:28:29] - Averaged speed for that direction ~61 kB/s
[17:28:29] + Received work.
[17:28:29] Trying to send all finished work units
[17:28:29] + No unsent completed units remaining.
[17:28:29] + Closed connections
[17:28:34] 
[17:28:34] + Processing work unit
[17:28:34] Core required: FahCore_11.exe
[17:28:34] Core found.
[17:28:34] Working on queue slot 09 [October 28 17:28:34 UTC]
[17:28:34] + Working ...
[17:28:34] - Calling '.\FahCore_11.exe -dir work/ -suffix 09 -priority 96 -checkpoint 30 -verbose -lifeline 4028 -version 620'

[17:28:34] 
[17:28:34] *------------------------------*
[17:28:34] Folding@Home GPU Core - Beta
[17:28:34] Version 1.15 (Mon Oct 13 11:11:30 PDT 2008)
[17:28:34] 
[17:28:34] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[17:28:34] Build host: amoeba
[17:28:34] Board Type: Nvidia
[17:28:34] Core      : 
[17:28:34] Preparing to commence simulation
[17:28:34] - Looking at optimizations...
[17:28:34] - Created dyn
[17:28:34] - Files status OK
[17:28:34] - Expanded 42892 -> 246265 (decompressed 574.1 percent)
[17:28:34] Called DecompressByteArray: compressed_data_size=42892 data_size=246265, decompressed_data_size=246265 diff=0
[17:28:34] - Digital signature verified
[17:28:34] 
[17:28:34] Project: 5801 (Run 7, Clone 176, Gen 0)
[17:28:34] 
[17:28:34] Assembly optimizations on if available.
[17:28:34] Entering M.D.
[17:28:40] Working on p5801_supervillin_e1
[17:28:41] Client config found, loading data.
[17:28:41] Starting GUI Server
[17:28:57] mdrun_gpu returned 
[17:28:57] NANs detected on GPU
[17:28:57] 
[17:28:57] Folding@home Core Shutdown: UNSTABLE_MACHINE
[17:29:00] CoreStatus = 7A (122)
[17:29:00] Sending work to server
[17:29:00] Project: 5801 (Run 7, Clone 176, Gen 0)
[17:29:00] - Read packet limit of 540015616... Set to 524286976.
[17:29:00] - Error: Could not get length of results file work/wuresults_09.dat
[17:29:00] - Error: Could not read unit 09 file. Removing from queue.
[17:29:00] Trying to send all finished work units
[17:29:00] + No unsent completed units remaining.
[17:29:00] - Preparing to get new work unit...
[17:29:00] + Attempting to get work packet
[17:29:00] - Will indicate memory of 3070 MB
[17:29:00] - Connecting to assignment server
[17:29:00] Connecting to http://assign-GPU.stanford.edu:8080/
[17:29:00] Posted data.
[17:29:00] Initial: 43AB; - Successful: assigned to (171.67.108.11).
[17:29:00] + News From Folding@Home: GPU folding beta
[17:29:00] Loaded queue successfully.
[17:29:00] Connecting to http://171.67.108.11:8080/
[17:29:01] Posted data.
[17:29:01] Initial: 0000; - Receiving payload (expected size: 43404)
[17:29:01] Conversation time very short, giving reduced weight in bandwidth avg
[17:29:01] - Downloaded at ~84 kB/s
[17:29:01] - Averaged speed for that direction ~64 kB/s
[17:29:01] + Received work.
[17:29:01] Trying to send all finished work units
[17:29:01] + No unsent completed units remaining.
[17:29:01] + Closed connections
[17:29:06] 
[17:29:06] + Processing work unit
[17:29:06] Core required: FahCore_11.exe
[17:29:06] Core found.
[17:29:06] Working on queue slot 00 [October 28 17:29:06 UTC]
[17:29:06] + Working ...
[17:29:06] - Calling '.\FahCore_11.exe -dir work/ -suffix 00 -priority 96 -checkpoint 30 -verbose -lifeline 4028 -version 620'

[17:29:06] 
[17:29:06] *------------------------------*
[17:29:06] Folding@Home GPU Core - Beta
[17:29:06] Version 1.15 (Mon Oct 13 11:11:30 PDT 2008)
[17:29:06] 
[17:29:06] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[17:29:06] Build host: amoeba
[17:29:06] Board Type: Nvidia
[17:29:06] Core      : 
[17:29:06] Preparing to commence simulation
[17:29:06] - Looking at optimizations...
[17:29:06] - Created dyn
[17:29:06] - Files status OK
[17:29:06] - Expanded 42892 -> 246265 (decompressed 574.1 percent)
[17:29:06] Called DecompressByteArray: compressed_data_size=42892 data_size=246265, decompressed_data_size=246265 diff=0
[17:29:06] - Digital signature verified
[17:29:06] 
[17:29:06] Project: 5801 (Run 7, Clone 176, Gen 0)
[17:29:06] 
[17:29:06] Assembly optimizations on if available.
[17:29:06] Entering M.D.
[17:29:12] Working on p5801_supervillin_e1
[17:29:13] Client config found, loading data.
[17:29:13] Starting GUI Server
[17:29:30] mdrun_gpu returned 
[17:29:30] NANs detected on GPU
[17:29:30] 
[17:29:30] Folding@home Core Shutdown: UNSTABLE_MACHINE
[17:29:32] CoreStatus = 7A (122)
[17:29:32] Sending work to server
[17:29:32] Project: 5801 (Run 7, Clone 176, Gen 0)
[17:29:32] - Read packet limit of 540015616... Set to 524286976.
[17:29:32] - Error: Could not get length of results file work/wuresults_00.dat
[17:29:32] - Error: Could not read unit 00 file. Removing from queue.
[17:29:32] Trying to send all finished work units
[17:29:32] + No unsent completed units remaining.
[17:29:32] - Preparing to get new work unit...
[17:29:32] + Attempting to get work packet
[17:29:32] - Will indicate memory of 3070 MB
[17:29:32] - Connecting to assignment server
[17:29:32] Connecting to http://assign-GPU.stanford.edu:8080/
[17:29:33] Posted data.
[17:29:33] Initial: 43AB; - Successful: assigned to (171.67.108.11).
[17:29:33] + News From Folding@Home: GPU folding beta
[17:29:33] Loaded queue successfully.
[17:29:33] Connecting to http://171.67.108.11:8080/
[17:29:33] Posted data.
[17:29:33] Initial: 0000; - Receiving payload (expected size: 43404)
[17:29:33] Conversation time very short, giving reduced weight in bandwidth avg
[17:29:33] - Downloaded at ~84 kB/s
[17:29:33] - Averaged speed for that direction ~66 kB/s
[17:29:33] + Received work.
[17:29:33] Trying to send all finished work units
[17:29:33] + No unsent completed units remaining.
[17:29:33] + Closed connections
[17:29:38] 
[17:29:38] + Processing work unit
[17:29:38] Core required: FahCore_11.exe
[17:29:38] Core found.
[17:29:38] Working on queue slot 01 [October 28 17:29:38 UTC]
[17:29:38] + Working ...
[17:29:38] - Calling '.\FahCore_11.exe -dir work/ -suffix 01 -priority 96 -checkpoint 30 -verbose -lifeline 4028 -version 620'

[17:29:39] 
[17:29:39] *------------------------------*
[17:29:39] Folding@Home GPU Core - Beta
[17:29:39] Version 1.15 (Mon Oct 13 11:11:30 PDT 2008)
[17:29:39] 
[17:29:39] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[17:29:39] Build host: amoeba
[17:29:39] Board Type: Nvidia
[17:29:39] Core      : 
[17:29:39] Preparing to commence simulation
[17:29:39] - Looking at optimizations...
[17:29:39] - Created dyn
[17:29:39] - Files status OK
[17:29:39] - Expanded 42892 -> 246265 (decompressed 574.1 percent)
[17:29:39] Called DecompressByteArray: compressed_data_size=42892 data_size=246265, decompressed_data_size=246265 diff=0
[17:29:39] - Digital signature verified
[17:29:39] 
[17:29:39] Project: 5801 (Run 7, Clone 176, Gen 0)
[17:29:39] 
[17:29:39] Assembly optimizations on if available.
[17:29:39] Entering M.D.
[17:29:45] Working on p5801_supervillin_e1
[17:29:46] Client config found, loading data.
[17:29:46] Starting GUI Server
[17:30:02] mdrun_gpu returned 
[17:30:02] NANs detected on GPU
[17:30:02] 
[17:30:02] Folding@home Core Shutdown: UNSTABLE_MACHINE
[17:30:05] CoreStatus = 7A (122)
[17:30:05] Sending work to server
[17:30:05] Project: 5801 (Run 7, Clone 176, Gen 0)
[17:30:05] - Read packet limit of 540015616... Set to 524286976.
[17:30:05] - Error: Could not get length of results file work/wuresults_01.dat
[17:30:05] - Error: Could not read unit 01 file. Removing from queue.
[17:30:05] Trying to send all finished work units
[17:30:05] + No unsent completed units remaining.
[17:30:05] - Preparing to get new work unit...
[17:30:05] + Attempting to get work packet
[17:30:05] - Will indicate memory of 3070 MB
[17:30:05] - Connecting to assignment server
[17:30:05] Connecting to http://assign-GPU.stanford.edu:8080/
[17:30:05] Posted data.
[17:30:05] Initial: 43AB; - Successful: assigned to (171.67.108.11).
[17:30:05] + News From Folding@Home: GPU folding beta
[17:30:05] Loaded queue successfully.
[17:30:05] Connecting to http://171.67.108.11:8080/
[17:30:05] Posted data.
[17:30:05] Initial: 0000; - Receiving payload (expected size: 43404)
[17:30:06] - Downloaded at ~42 kB/s
[17:30:06] - Averaged speed for that direction ~61 kB/s
[17:30:06] + Received work.
[17:30:06] Trying to send all finished work units
[17:30:06] + No unsent completed units remaining.
[17:30:06] + Closed connections
[17:30:11] 
[17:30:11] + Processing work unit
[17:30:11] Core required: FahCore_11.exe
[17:30:11] Core found.
[17:30:11] Working on queue slot 02 [October 28 17:30:11 UTC]
[17:30:11] + Working ...
[17:30:11] - Calling '.\FahCore_11.exe -dir work/ -suffix 02 -priority 96 -checkpoint 30 -verbose -lifeline 4028 -version 620'

[17:30:11] 
[17:30:11] *------------------------------*
[17:30:11] Folding@Home GPU Core - Beta
[17:30:11] Version 1.15 (Mon Oct 13 11:11:30 PDT 2008)
[17:30:11] 
[17:30:11] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[17:30:11] Build host: amoeba
[17:30:11] Board Type: Nvidia
[17:30:11] Core      : 
[17:30:11] Preparing to commence simulation
[17:30:11] - Looking at optimizations...
[17:30:11] - Created dyn
[17:30:11] - Files status OK
[17:30:11] - Expanded 42892 -> 246265 (decompressed 574.1 percent)
[17:30:11] Called DecompressByteArray: compressed_data_size=42892 data_size=246265, decompressed_data_size=246265 diff=0
[17:30:11] - Digital signature verified
[17:30:11] 
[17:30:11] Project: 5801 (Run 7, Clone 176, Gen 0)
[17:30:11] 
[17:30:11] Assembly optimizations on if available.
[17:30:11] Entering M.D.
[17:30:17] Working on p5801_supervillin_e1
[17:30:18] Client config found, loading data.
[17:30:18] Starting GUI Server
[17:30:34] mdrun_gpu returned 
[17:30:34] NANs detected on GPU
[17:30:34] 
[17:30:34] Folding@home Core Shutdown: UNSTABLE_MACHINE
[17:30:37] CoreStatus = 7A (122)
[17:30:37] Sending work to server
[17:30:37] Project: 5801 (Run 7, Clone 176, Gen 0)
[17:30:37] - Read packet limit of 540015616... Set to 524286976.
[17:30:37] - Error: Could not get length of results file work/wuresults_02.dat
[17:30:37] - Error: Could not read unit 02 file. Removing from queue.
[17:30:37] EUE limit exceeded. Pausing 24 hours.

Re: Project 5801(R0,C4,G0) and (R9,C63,G0) +++

Posted: Tue Oct 28, 2008 5:55 pm
by jevans64
I guess the problem is that the client pauses for 24 hours and there unsent WUs in the queue because the servers have been crap for the last 4 days.

I am shutting down ALL of my nVidia clients.

Re: Project 5801(R0,C4,G0) and (R9,C63,G0)

Posted: Tue Oct 28, 2008 5:58 pm
by ei57
Got enough 5801 to put the client to sleep twice. EUE even before the GPU gets warm. Overclock or not doesn't matter.

[17:55:19] Starting GUI Server
[17:55:32] mdrun_gpu returned
[17:55:32] NANs detected on GPU
[17:55:32]
[17:55:32] Folding@home Core Shutdown: UNSTABLE_MACHINE
[17:55:34] CoreStatus = 7A (122)

[17:55:34] Sending work to server
[17:55:34] Project: 5801 (Run 9, Clone 310, Gen 0)

Log from second GPU - turned off too. 5801 is a disaster!

Re: Project 5801(R0,C4,G0) and (R9,C63,G0) +++

Posted: Tue Oct 28, 2008 6:02 pm
by BrgHW
No change. Impossibile to get other Project's. Shutting down all GPU folding.

Re: Project 5801(R0,C4,G0) and (R9,C63,G0) +++

Posted: Tue Oct 28, 2008 6:05 pm
by toTOW
Thanks for the reports, PM sent.

Re: Project 5801(R0,C4,G0) and (R9,C63,G0) +++

Posted: Tue Oct 28, 2008 6:12 pm
by 6rill2000
I've got exactly the same problem, and my queue.dat is full of this 5801 (we can't see this WU on the beta projects summary page), so what are we suppose to do ?

I've got the UNSTABLE_MACHINE every time, and even if I lower the FSB, or the GPU clock, to a lower value than it's suppose to work (My GPU have ever folded at 800 Mhz, but, just to test, I'm now a 700 Mhz and I've got the same problem :( and I have lowered the FSB from 237 Mhz to 200 Mhz and nothing change too)

Code: Select all

[17:57:16] *------------------------------*
[17:57:16] Folding@Home GPU Core - Beta
[17:57:16] Version 1.15 (Mon Oct 13 11:11:30 PDT 2008)
[17:57:16] 
[17:57:16] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[17:57:16] Build host: amoeba
[17:57:16] Board Type: Nvidia
[17:57:16] Core      : 
[17:57:16] Preparing to commence simulation
[17:57:16] - Looking at optimizations...
[17:57:16] - Created dyn
[17:57:16] - Files status OK
[17:57:16] - Expanded 42943 -> 246265 (decompressed 573.4 percent)
[17:57:16] Called DecompressByteArray: compressed_data_size=42943 data_size=246265, decompressed_data_size=246265 diff=0
[17:57:16] - Digital signature verified
[17:57:16] 
[17:57:16] Project: 5801 (Run 2, Clone 404, Gen 0)
[17:57:16] 
[17:57:16] Assembly optimizations on if available.
[17:57:16] Entering M.D.
[17:57:23] Working on p5801_supervillin_e1
[17:57:25] Client config found, loading data.
[17:57:25] Starting GUI Server
[17:58:35] mdrun_gpu returned 
[17:58:35] NANs detected on GPU
[17:58:35] 
[17:58:35] Folding@home Core Shutdown: UNSTABLE_MACHINE
[17:58:40] CoreStatus = 7A (122)
[17:58:40] Sending work to server
[17:58:40] Project: 5801 (Run 2, Clone 404, Gen 0)
[17:58:40] - Read packet limit of 540015616... Set to 524286976.
[17:58:40] - Error: Could not get length of results file work/wuresults_03.dat
[17:58:40] - Error: Could not read unit 03 file. Removing from queue.
[17:58:40] EUE limit exceeded. Pausing 24 hours.

Re: Project 5801 issues.

Posted: Tue Oct 28, 2008 6:13 pm
by alexopth69
Same problems here, NO 5801 will work of any generation. Please remove them for now

Re: Project 5801 issues.

Posted: Tue Oct 28, 2008 6:17 pm
by road-runner
5801 is no good, will not run at default clocks and shaders Image

Re: Project 5801 issues.

Posted: Tue Oct 28, 2008 6:19 pm
by toTOW
I wish I had access to the servers ... :roll:

Re: Project 5801 issues.

Posted: Tue Oct 28, 2008 6:21 pm
by soa_rru
Thankgod for that, thought my new rig was playing up already :lol: