Page 1 of 3

Project: 6806 (Run 3987, Clone 2, Gen 10) - Sudden errors.

Posted: Sat Feb 26, 2011 3:51 am
by Rich_D
I started getting these errors this evening.
I did have the card at a stable overclock and the card was folding along fine for the past week, then all of a sudden this evening it started getting errors at the 1% mark and going to the next WU.
I have tried clocking the card back to stock configuration but I am still getting the same errors.

Code: Select all

[03:35:43] + Processing work unit
[03:35:43] Core required: FahCore_15.exe
[03:35:43] Core found.
[03:35:43] Working on queue slot 03 [February 26 03:35:43 UTC]
[03:35:43] + Working ...
[03:35:43] 
[03:35:43] *------------------------------*
[03:35:43] Folding@Home GPU Core
[03:35:43] Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
[03:35:43] 
[03:35:43] Build host: SimbiosNvdWin7
[03:35:43] Board Type: NVIDIA/CUDA
[03:35:43] Core      : x=15
[03:35:43]  Window's signal control handler registered.
[03:35:43] Preparing to commence simulation
[03:35:43] - Looking at optimizations...
[03:35:43] DeleteFrameFiles: successfully deleted file=work/wudata_03.ckp
[03:35:43] - Created dyn
[03:35:43] - Files status OK
[03:35:43] sizeof(CORE_PACKET_HDR) = 512 file=<>
[03:35:43] - Expanded 43699 -> 172159 (decompressed 393.9 percent)
[03:35:43] Called DecompressByteArray: compressed_data_size=43699 data_size=172159, decompressed_data_size=172159 diff=0
[03:35:43] - Digital signature verified
[03:35:43] 
[03:35:43] Project: 6806 (Run 3987, Clone 2, Gen 10)
[03:35:43] 
[03:35:43] Assembly optimizations on if available.
[03:35:43] Entering M.D.
[03:35:45] Tpr hash work/wudata_03.tpr:  1399213409 2679800626 620498429 1228651168 3715002604
[03:35:45] Working on 2 PEPTIDE (1-42)
[03:35:45] Client config found, loading data.
[03:35:45] Starting GUI Server
[03:35:47] Setting checkpoint frequency: 500000
[03:35:47] Setting checkpoint frequency: 500000
[03:37:17] Completed    500000 out of 50000000 steps (1%).
[03:37:18] mdrun_gpu returned 52
[03:37:18] NANs detected on GPU
[03:37:18] 
[03:37:18] Folding@home Core Shutdown: UNSTABLE_MACHINE
[03:37:21] CoreStatus = 7A (122)
[03:37:21] Sending work to server
[03:37:21] Project: 6806 (Run 3987, Clone 2, Gen 10)
[03:37:21] - Read packet limit of 540015616... Set to 524286976.
[03:37:21] - Error: Could not get length of results file work/wuresults_03.dat
[03:37:21] - Error: Could not read unit 03 file. Removing from queue.
[03:37:21] EUE limit exceeded. Pausing 24 hours.
System specs:
Win 7 Ultimate w/SP1
Athlon II x4 640
GTX 560 Ti OC w/266.66 drivers
8GB Mushkin DDR3 1333

Re: Sudden errors......Help please.

Posted: Sat Feb 26, 2011 7:19 am
by HendricksSA
Rich_D, welcome to the Fold. I guess to get started, were all the EUEs from the same work unit, Project: 6806 (Run 3987, Clone 2, Gen 10) or are they appearing from a variety of work units? Perhaps one of the moderators can check to see if the 6806 was completed. I did not notice any complaints about it in the bad work unit thread.

As for your 560, is this a new card with just a week on it or have you had it for a while? You mentioned that you returned the clock to stock with the same problems. Have you updated its driver lately? I would suggest a sweep and reinstall. If that doesn't work, then I'd suggest PantherX's guide, section 15. It should help you sort out what may be a hardware failure. See it at: viewtopic.php?f=59&t=14683&p=144648#p144648

Good luck! Let us know how it goes.

Re: Sudden errors......Help please.

Posted: Sat Feb 26, 2011 8:03 am
by Rich_D
Driver is newest available for this card.

Yes card is just new last week.

I will take a look at the link you provided and see if it sheds some light.

And yes it looks it was only WU 6806 that it was having the errors on. I have deleted the work folder and queue.dat, changed the machine ID and will try for another WU to see if that changes anything.

I am running memtestg80 right now but looks good so far with no errors after 1000 iterations.

Re: Sudden errors......Help please.

Posted: Sat Feb 26, 2011 8:05 am
by PantherX
There isn't any data in the WU Database so I have marked it for a follow-up.

Re: Sudden errors......Help please.

Posted: Sat Feb 26, 2011 8:31 am
by Rich_D
Looks like it was that Project.....got 6800 Project and seems to be folding fine now.

Code: Select all

[08:27:26] + Processing work unit
[08:27:26] Core required: FahCore_15.exe
[08:27:26] Core found.
[08:27:26] Working on queue slot 01 [February 26 08:27:26 UTC]
[08:27:26] + Working ...
[08:27:26] 
[08:27:26] *------------------------------*
[08:27:26] Folding@Home GPU Core
[08:27:26] Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
[08:27:26] 
[08:27:26] Build host: SimbiosNvdWin7
[08:27:26] Board Type: NVIDIA/CUDA
[08:27:26] Core      : x=15
[08:27:26]  Window's signal control handler registered.
[08:27:26] Preparing to commence simulation
[08:27:26] - Looking at optimizations...
[08:27:26] DeleteFrameFiles: successfully deleted file=work/wudata_01.ckp
[08:27:26] - Created dyn
[08:27:26] - Files status OK
[08:27:26] sizeof(CORE_PACKET_HDR) = 512 file=<>
[08:27:26] - Expanded 43434 -> 169787 (decompressed 390.9 percent)
[08:27:26] Called DecompressByteArray: compressed_data_size=43434 data_size=169787, decompressed_data_size=169787 diff=0
[08:27:26] - Digital signature verified
[08:27:26] 
[08:27:26] Project: 6800 (Run 7043, Clone 0, Gen 11)
[08:27:26] 
[08:27:26] Assembly optimizations on if available.
[08:27:26] Entering M.D.
[08:27:29] Tpr hash work/wudata_01.tpr:  1527850101 1445066742 2228363247 3915626403 1217746235
[08:27:29] Working on PEPTIDE (1-42)
[08:27:29] Client config found, loading data.
[08:27:29] Starting GUI Server
[08:27:30] Setting checkpoint frequency: 500000
[08:27:30] Setting checkpoint frequency: 500000
[08:28:57] Completed    500001 out of 50000064 steps (1%).
[08:30:24] Completed   1000002 out of 50000064 steps (2%).
Still going strong :)

Code: Select all

[09:39:50] Completed  10000001 out of 50000064 steps (19%).
[09:39:50] Completed  10000013 out of 50000064 steps (20%).
[09:41:17] Completed  10500013 out of 50000064 steps (21%).
[09:42:41] Completed  11000014 out of 50000064 steps (22%).
[09:44:06] Completed  11500015 out of 50000064 steps (23%).
[09:45:30] Completed  12000015 out of 50000064 steps (24%).
[09:46:55] Completed  12500016 out of 50000064 steps (25%).
[09:48:20] Completed  13000017 out of 50000064 steps (26%).
[09:49:45] Completed  13500017 out of 50000064 steps (27%).
[09:51:09] Completed  14000018 out of 50000064 steps (28%).
[09:52:34] Completed  14500019 out of 50000064 steps (29%).
[09:53:59] Completed  15000019 out of 50000064 steps (30%).
[09:55:24] Completed  15500020 out of 50000064 steps (31%).
[09:56:49] Completed  16000021 out of 50000064 steps (32%).
[09:58:15] Completed  16500021 out of 50000064 steps (33%).
[09:59:42] Completed  17000021 out of 50000064 steps (34%).
[10:01:09] Completed  17500022 out of 50000064 steps (35%).
[10:02:35] Completed  18000023 out of 50000064 steps (36%).
[10:04:02] Completed  18500023 out of 50000064 steps (37%).
[10:05:28] Completed  19000024 out of 50000064 steps (38%).
[10:06:55] Completed  19500025 out of 50000064 steps (39%).
[10:08:22] Completed  20000025 out of 50000064 steps (40%).
[10:09:48] Completed  20500026 out of 50000064 steps (41%).
[10:11:15] Completed  21000026 out of 50000064 steps (42%).
[10:12:41] Completed  21500027 out of 50000064 steps (43%).
[10:14:08] Completed  22000028 out of 50000064 steps (44%).
[10:15:34] Completed  22500028 out of 50000064 steps (45%).
[10:17:01] Completed  23000029 out of 50000064 steps (46%).
[10:18:28] Completed  23500030 out of 50000064 steps (47%).
[10:19:54] Completed  24000030 out of 50000064 steps (48%).
[10:21:19] Completed  24500031 out of 50000064 steps (49%).
[10:22:45] Completed  25000032 out of 50000064 steps (50%).
[10:24:09] Completed  25500032 out of 50000064 steps (51%).

Re: Sudden errors......Help please.

Posted: Sun Feb 27, 2011 5:45 pm
by HendricksSA
Rich_D, it looks like you might have just swallowed a bad work unit. Hopefully your new card keeps right on working perfectly. Let us know if you have future problems. Fold on!

Re: Sudden errors......Help please.

Posted: Mon Feb 28, 2011 12:48 am
by bruce
I'm going to move this topic to the "issues with specific WUs" forum and flag it for a followup. There are no error reports in the Mod DB yet for Project: 6806 (Run 3987, Clone 2, Gen 10) and make sure it's flagged for followup.

Unfortunately when the client says - Error: .... Removing from queue. there's no electronic report, so letting us know about it is important.

Re: Project: 6806 (Run 3987, Clone 2, Gen 10) - Sudden error

Posted: Sat Mar 05, 2011 6:00 am
by jmaertens
For what it's worth, I started getting this exact same error on my GTX 460 the other day. And every WU I pull is the same "Project: 6806 (Run 3987, Clone 2, Gen 10)", despite my best efforts of getting rid of the queue.dat, unitinfo, work folder, and even blowing away the core and letting it download again. Log snippet below:

Code: Select all

--- Opening Log file [March 5 05:33:18 UTC] 


# Windows GPU Systray Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.30r2

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Users\admin\AppData\Roaming\Folding@home-gpu
Arguments: -gpu 2 -forcegpu nvidia_fermi 

[05:33:18] - Ask before connecting: No
[05:33:18] - User name: jmaertens (Team 111065)
[05:33:18] - User ID: 50EAE9A51BE61CBA
[05:33:18] - Machine ID: 4
[05:33:18] 
[05:33:18] Gpu type=3 species=30.
[05:33:18] Could not open work queue, generating new queue...
[05:33:18] Initialization complete
[05:33:18] - Preparing to get new work unit...
[05:33:18] Cleaning up work directory
[05:33:18] + Attempting to get work packet
[05:33:18] Passkey found
[05:33:18] Gpu type=3 species=30.
[05:33:18] - Connecting to assignment server
[05:33:19] - Successful: assigned to (171.64.65.64).
[05:33:19] + News From Folding@Home: Welcome to Folding@Home
[05:33:19] Loaded queue successfully.
[05:33:19] Gpu type=3 species=30.
[05:33:20] + Closed connections
[05:33:20] 
[05:33:20] + Processing work unit
[05:33:20] Core required: FahCore_15.exe
[05:33:20] Core not found.
[05:33:20] - Core is not present or corrupted.
[05:33:20] - Attempting to download new core...
[05:33:20] + Downloading new core: FahCore_15.exe
[05:33:20] + 10240 bytes downloaded
[05:33:20] + 20480 bytes downloaded
[05:33:20] + 30720 bytes downloaded
[05:33:20] + 40960 bytes downloaded
[05:33:20] + 51200 bytes downloaded
[05:33:20] + 61440 bytes downloaded
[05:33:20] + 71680 bytes downloaded
[05:33:20] + 81920 bytes downloaded
[05:33:20] + 92160 bytes downloaded
[05:33:20] + 102400 bytes downloaded
[05:33:21] + 112640 bytes downloaded
[05:33:21] + 122880 bytes downloaded
[05:33:21] + 133120 bytes downloaded
[05:33:21] + 143360 bytes downloaded
[05:33:21] + 153600 bytes downloaded
[05:33:21] + 163840 bytes downloaded
[05:33:21] + 174080 bytes downloaded
[05:33:21] + 184320 bytes downloaded
[05:33:21] + 194560 bytes downloaded
[05:33:21] + 204800 bytes downloaded
[05:33:21] + 215040 bytes downloaded
[05:33:21] + 225280 bytes downloaded
[05:33:21] + 235520 bytes downloaded
[05:33:21] + 245760 bytes downloaded
[05:33:21] + 256000 bytes downloaded
[05:33:21] + 266240 bytes downloaded
[05:33:21] + 276480 bytes downloaded
[05:33:21] + 286720 bytes downloaded
[05:33:21] + 296960 bytes downloaded
[05:33:21] + 307200 bytes downloaded
[05:33:21] + 317440 bytes downloaded
[05:33:21] + 327680 bytes downloaded
[05:33:21] + 337920 bytes downloaded
[05:33:22] + 348160 bytes downloaded
[05:33:22] + 358400 bytes downloaded
[05:33:22] + 368640 bytes downloaded
[05:33:22] + 378880 bytes downloaded
[05:33:22] + 389120 bytes downloaded
[05:33:22] + 399360 bytes downloaded
[05:33:22] + 409600 bytes downloaded
[05:33:22] + 419840 bytes downloaded
[05:33:22] + 430080 bytes downloaded
[05:33:22] + 440320 bytes downloaded
[05:33:22] + 450560 bytes downloaded
[05:33:22] + 460800 bytes downloaded
[05:33:22] + 471040 bytes downloaded
[05:33:22] + 481280 bytes downloaded
[05:33:22] + 491520 bytes downloaded
[05:33:22] + 501760 bytes downloaded
[05:33:22] + 512000 bytes downloaded
[05:33:22] + 522240 bytes downloaded
[05:33:22] + 532480 bytes downloaded
[05:33:22] + 542720 bytes downloaded
[05:33:22] + 552960 bytes downloaded
[05:33:22] + 563200 bytes downloaded
[05:33:22] + 573440 bytes downloaded
[05:33:22] + 583680 bytes downloaded
[05:33:22] + 593920 bytes downloaded
[05:33:22] + 604160 bytes downloaded
[05:33:22] + 614400 bytes downloaded
[05:33:22] + 624640 bytes downloaded
[05:33:22] + 634880 bytes downloaded
[05:33:22] + 645120 bytes downloaded
[05:33:22] + 655360 bytes downloaded
[05:33:23] + 665600 bytes downloaded
[05:33:23] + 675840 bytes downloaded
[05:33:23] + 686080 bytes downloaded
[05:33:23] + 696320 bytes downloaded
[05:33:23] + 706560 bytes downloaded
[05:33:23] + 716800 bytes downloaded
[05:33:23] + 727040 bytes downloaded
[05:33:23] + 737280 bytes downloaded
[05:33:23] + 747520 bytes downloaded
[05:33:23] + 757760 bytes downloaded
[05:33:23] + 768000 bytes downloaded
[05:33:23] + 778240 bytes downloaded
[05:33:23] + 788480 bytes downloaded
[05:33:23] + 798720 bytes downloaded
[05:33:23] + 808960 bytes downloaded
[05:33:23] + 819200 bytes downloaded
[05:33:23] + 829440 bytes downloaded
[05:33:23] + 839680 bytes downloaded
[05:33:23] + 849920 bytes downloaded
[05:33:23] + 860160 bytes downloaded
[05:33:23] + 870400 bytes downloaded
[05:33:23] + 880640 bytes downloaded
[05:33:23] + 890880 bytes downloaded
[05:33:23] + 901120 bytes downloaded
[05:33:23] + 911360 bytes downloaded
[05:33:23] + 921600 bytes downloaded
[05:33:23] + 931840 bytes downloaded
[05:33:23] + 942080 bytes downloaded
[05:33:24] + 952320 bytes downloaded
[05:33:24] + 962560 bytes downloaded
[05:33:24] + 972800 bytes downloaded
[05:33:24] + 983040 bytes downloaded
[05:33:24] + 993280 bytes downloaded
[05:33:24] + 1003520 bytes downloaded
[05:33:24] + 1013760 bytes downloaded
[05:33:24] + 1024000 bytes downloaded
[05:33:24] + 1034240 bytes downloaded
[05:33:24] + 1044480 bytes downloaded
[05:33:24] + 1054720 bytes downloaded
[05:33:24] + 1064960 bytes downloaded
[05:33:24] + 1075200 bytes downloaded
[05:33:24] + 1085440 bytes downloaded
[05:33:24] + 1095680 bytes downloaded
[05:33:24] + 1105920 bytes downloaded
[05:33:24] + 1116160 bytes downloaded
[05:33:24] + 1126400 bytes downloaded
[05:33:24] + 1136640 bytes downloaded
[05:33:24] + 1146880 bytes downloaded
[05:33:24] + 1157120 bytes downloaded
[05:33:24] + 1167360 bytes downloaded
[05:33:24] + 1177600 bytes downloaded
[05:33:24] + 1187840 bytes downloaded
[05:33:24] + 1198080 bytes downloaded
[05:33:24] + 1208320 bytes downloaded
[05:33:24] + 1218560 bytes downloaded
[05:33:24] + 1228800 bytes downloaded
[05:33:24] + 1239040 bytes downloaded
[05:33:25] + 1249280 bytes downloaded
[05:33:25] + 1259520 bytes downloaded
[05:33:25] + 1269760 bytes downloaded
[05:33:25] + 1280000 bytes downloaded
[05:33:25] + 1290240 bytes downloaded
[05:33:25] + 1300480 bytes downloaded
[05:33:25] + 1310720 bytes downloaded
[05:33:25] + 1320960 bytes downloaded
[05:33:25] + 1331200 bytes downloaded
[05:33:25] + 1341440 bytes downloaded
[05:33:25] + 1351680 bytes downloaded
[05:33:25] + 1361920 bytes downloaded
[05:33:25] + 1372160 bytes downloaded
[05:33:25] + 1382400 bytes downloaded
[05:33:25] + 1392640 bytes downloaded
[05:33:25] + 1402880 bytes downloaded
[05:33:25] + 1413120 bytes downloaded
[05:33:25] + 1422551 bytes downloaded
[05:33:25] Verifying core Core_15.fah...
[05:33:25] Signature is VALID
[05:33:25] 
[05:33:25] Trying to unzip core FahCore_15.exe
[05:33:25] Decompressed FahCore_15.exe (3903488 bytes) successfully
[05:33:30] + Core successfully engaged
[05:33:36] 
[05:33:36] + Processing work unit
[05:33:36] Core required: FahCore_15.exe
[05:33:36] Core found.
[05:33:36] Working on queue slot 01 [March 5 05:33:36 UTC]
[05:33:36] + Working ...
[05:33:36] 
[05:33:36] *------------------------------*
[05:33:36] Folding@Home GPU Core
[05:33:36] Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
[05:33:36] 
[05:33:36] Build host: SimbiosNvdWin7
[05:33:36] Board Type: NVIDIA/CUDA
[05:33:36] Core      : x=15
[05:33:36]  Window's signal control handler registered.
[05:33:36] Preparing to commence simulation
[05:33:36] - Looking at optimizations...
[05:33:36] DeleteFrameFiles: successfully deleted file=work/wudata_01.ckp
[05:33:36] - Created dyn
[05:33:36] - Files status OK
[05:33:36] sizeof(CORE_PACKET_HDR) = 512 file=<>
[05:33:36] - Expanded 43699 -> 172159 (decompressed 393.9 percent)
[05:33:36] Called DecompressByteArray: compressed_data_size=43699 data_size=172159, decompressed_data_size=172159 diff=0
[05:33:36] - Digital signature verified
[05:33:36] 
[05:33:36] Project: 6806 (Run 3987, Clone 2, Gen 10)
[05:33:36] 
[05:33:36] Assembly optimizations on if available.
[05:33:36] Entering M.D.
[05:33:38] Tpr hash work/wudata_01.tpr:  1399213409 2679800626 620498429 1228651168 3715002604
[05:33:38] Working on 2 PEPTIDE (1-42)
[05:33:38] Client config found, loading data.
[05:33:38] Starting GUI Server
[05:33:39] Setting checkpoint frequency: 500000
[05:33:39] Setting checkpoint frequency: 500000
[05:36:25] Completed    500000 out of 50000000 steps (1%).
[05:36:25] mdrun_gpu returned 52
[05:36:25] NANs detected on GPU
[05:36:25] 
[05:36:25] Folding@home Core Shutdown: UNSTABLE_MACHINE
[05:36:28] CoreStatus = 7A (122)
[05:36:28] Sending work to server
[05:36:28] Project: 6806 (Run 3987, Clone 2, Gen 10)
[05:36:28] - Read packet limit of 540015616... Set to 524286976.
[05:36:28] - Error: Could not get length of results file work/wuresults_01.dat
[05:36:28] - Error: Could not read unit 01 file. Removing from queue.
[05:36:28] - Preparing to get new work unit...
[05:36:28] Cleaning up work directory
[05:36:28] + Attempting to get work packet
[05:36:28] Passkey found
[05:36:28] Gpu type=3 species=30.
[05:36:28] - Connecting to assignment server
[05:36:29] - Successful: assigned to (171.64.65.64).
[05:36:29] + News From Folding@Home: Welcome to Folding@Home
[05:36:29] Loaded queue successfully.
[05:36:29] Gpu type=3 species=30.
[05:36:30] + Closed connections
[05:36:35] 
[05:36:35] + Processing work unit
[05:36:35] Core required: FahCore_15.exe
[05:36:35] Core found.
[05:36:35] Working on queue slot 02 [March 5 05:36:35 UTC]
[05:36:35] + Working ...
[05:36:35] 
[05:36:35] *------------------------------*
[05:36:35] Folding@Home GPU Core
[05:36:35] Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
[05:36:35] 
[05:36:35] Build host: SimbiosNvdWin7
[05:36:35] Board Type: NVIDIA/CUDA
[05:36:35] Core      : x=15
[05:36:35]  Window's signal control handler registered.
[05:36:35] Preparing to commence simulation
[05:36:35] - Looking at optimizations...
[05:36:35] DeleteFrameFiles: successfully deleted file=work/wudata_02.ckp
[05:36:35] - Created dyn
[05:36:35] - Files status OK
[05:36:35] sizeof(CORE_PACKET_HDR) = 512 file=<>
[05:36:35] - Expanded 43699 -> 172159 (decompressed 393.9 percent)
[05:36:35] Called DecompressByteArray: compressed_data_size=43699 data_size=172159, decompressed_data_size=172159 diff=0
[05:36:35] - Digital signature verified
[05:36:35] 
[05:36:35] Project: 6806 (Run 3987, Clone 2, Gen 10)
[05:36:35] 
[05:36:35] Assembly optimizations on if available.
[05:36:35] Entering M.D.
[05:36:37] Tpr hash work/wudata_02.tpr:  1399213409 2679800626 620498429 1228651168 3715002604
[05:36:37] Working on 2 PEPTIDE (1-42)
[05:36:37] Client config found, loading data.
[05:36:37] Starting GUI Server
[05:36:38] Setting checkpoint frequency: 500000
[05:36:38] Setting checkpoint frequency: 500000

I'm running Windows Vista Ultimate 64-bit, SP2, with a GTX 460 at stock clock speeds, running 260.99 drivers. I also have 2 GTX 260s in the same box, and both of those are humming right along.

Re: Project: 6806 (Run 3987, Clone 2, Gen 10) - Sudden error

Posted: Sat Mar 05, 2011 10:03 pm
by PantherX
Welcome to the F@H Forum jmaertens,

If you got a bad WU, you will have to do this:
Step 1: Stop the F@h Client
Step 2: Delete the Work folder
Step 3: Delete the queue.dat file
Step 4: Change the Machine ID to another unique value
Step 5: Start the F@h Client

BTW, there is still no data in the WU Databse :(

Re: Project: 6806 (Run 3987, Clone 2, Gen 10) - Sudden error

Posted: Sun Mar 06, 2011 2:13 am
by jmaertens
PantherX wrote: Step 4: Change the Machine ID to another unique value
Wow, I've never heard that was necessary, but it worked like a champ. Thanks for the help!

Re: Project: 6806 (Run 3987, Clone 2, Gen 10) - Sudden error

Posted: Sun Mar 06, 2011 8:01 pm
by Mortlake
Must be a defective WU, I had the exactly the same problem on one of my 460s - your tip about changing the machine ID worked a treat - thanks for that. :D

Re: Project: 6806 (Run 3987, Clone 2, Gen 10) - Sudden error

Posted: Mon Mar 07, 2011 9:06 pm
by PantherX
No data in the WU Database yet.

Re: Project: 6806 (Run 3987, Clone 2, Gen 10) - Sudden error

Posted: Mon Mar 14, 2011 10:27 pm
by PantherX
We have some data:
Your WU (P6806 R3987 C2 G10) was added to the stats database on 2011-03-08 17:08:14 for 0 points of credit.
Maybe it's a bad WU but we still have to wait for more results.

Re: Project: 6806 (Run 3987, Clone 2, Gen 10) - Sudden error

Posted: Tue Mar 22, 2011 4:15 pm
by ChrisM101
Im failing this unit 6806 at a 100% rate on an otherwise stable PC that has put up 1million points already this month.
Fails very shortly into unit maybe at 1% dropped to stock clocks and still failed.

Code: Select all

-- Opening Log file [March 20 18:29:11 UTC]  

# Windows GPU Console Edition ################################################################################################################################ 
                       Folding@Home Client Version 6.30r1 
                          http://folding.stanford.edu 
############################################################################################################################################################## 
Launch directory: D:\Downloads\FAH_GPU_Tracker_V2\FAH GPU Tracker V2\GPU0
Executable: D:\Downloads\FAH_GPU_Tracker_V2\FAH GPU Tracker V2\FAH_GPU3.exe
Arguments: -oneunit -forcegpu nvidia_fermi -verbosity 9 -gpu 0 
 
[18:29:11] - Ask before connecting: No
[18:29:11] - User name: ChrisM101 (Team 111065)
[18:29:11] - User ID: 21BE43D7336DE836
[18:29:11] - Machine ID: 3
[18:29:11] 
[18:29:11] Gpu species not recognized.
[18:29:11] Work directory not found. Creating...
[18:29:11] Could not open work queue, generating new queue...
[18:29:11] - Preparing to get new work unit...
[18:29:11] - Autosending finished units... [March 20 18:29:11 UTC]
[18:29:11] Cleaning up work directory
[18:29:11] Trying to send all finished work units
[18:29:11] + No unsent completed units remaining.
[18:29:11] - Autosend completed
[18:29:11] + Attempting to get work packet
[18:29:11] Passkey found
[18:29:11] - Will indicate memory of 6135 MB
[18:29:11] Gpu species not recognized.
[18:29:11] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 10, Stepping: 5
[18:29:11] - Connecting to assignment server
[18:29:11] Connecting to http://assign-GPU.stanford.edu:8080/
[18:29:12] Posted data.[18:29:12] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[18:29:12] + News From Folding@Home: Welcome to Folding@Home
[18:29:12] Loaded queue successfully.
[18:29:12] Gpu species not recognized.
[18:29:12] Sent data
[18:29:12] Connecting to http://171.64.65.64:8080/
[18:29:12] Posted data.
[18:29:12] Initial: 0000; - Receiving payload (expected size: 44211)
[18:29:12] Conversation time very short, giving reduced weight in bandwidth avg
[18:29:12] - Downloaded at ~86 kB/s
[18:29:12] - Averaged speed for that direction ~86 kB/s
[18:29:12] + Received work.
[18:29:12] + Closed connections
[18:29:12] 
[18:29:12] + Processing work unit
[18:29:12] Core required: FahCore_15.exe
[18:29:12] Core found.
[18:29:12] Working on queue slot 01 [March 20 18:29:12 UTC]
[18:29:12] + Working ...
[18:29:12] - Calling '.\FahCore_15.exe -dir work/ -suffix 01 -nice 19 -priority 96 -nocpulock -checkpoint 3 -verbose -lifeline 5996 -version 630' 
[18:29:13] 
[18:29:13] *------------------------------*
[18:29:13] Folding@Home GPU Core
[18:29:13] Version 2.15 (Tue Nov 16 09:05:18 PST 2010)
[18:29:13] 
[18:29:13] Build host: SimbiosNvdWin7[18:29:13] Board Type: NVIDIA/CUDA
[18:29:13] Core      : x=15
[18:29:13]  Window's signal control handler registered.
[18:29:13] Preparing to commence simulation
[18:29:13] - Looking at optimizations...
[18:29:13] DeleteFrameFiles: successfully deleted file=work/wudata_01.ckp
[18:29:13] - Created dyn
[18:29:13] - Files status OK
[18:29:13] sizeof(CORE_PACKET_HDR) = 512 file=<>
[18:29:13] - Expanded 43699 -> 172159 (decompressed 393.9 percent)
[18:29:13] Called DecompressByteArray: compressed_data_size=43699 data_size=172159, decompressed_data_size=172159 diff=0
[18:29:13] - Digital signature verified
[18:29:13] 
[18:29:13] Project: 6806 (Run 3987, Clone 2, Gen 10)
[18:29:13] 
[18:29:13] Assembly optimizations on if available.
[18:29:13] Entering M.D.
[18:29:15] Tpr hash work/wudata_01.tpr:  1399213409 2679800626 620498429 1228651168 3715002604
[18:29:15] Working on 2 PEPTIDE (1-42)
[18:29:15] Client config found, loading data.
[18:29:15] Starting GUI Server
[18:29:15] Setting checkpoint frequency: 500000
[18:29:15] Setting checkpoint frequency: 500000
[18:30:26] Completed    500000 out of 50000000 steps (1%).
[18:30:27] mdrun_gpu returned 52
[18:30:27] NANs detected on GPU
[18:30:27] 
[18:30:27] Folding@home Core Shutdown: UNSTABLE_MACHINE
[18:30:31] CoreStatus = 7A (122)
[18:30:31] Sending work to server
[18:30:31] Project: 6806 (Run 3987, Clone 2, Gen 10)
[18:30:31] - Read packet limit of 540015616... Set to 524286976.
[18:30:31] - Error: Could not get length of results file work/wuresults_01.dat
[18:30:31] - Error: Could not read unit 01 file. Removing from queue.
[18:30:31] Trying to send all finished work units
[18:30:31] + No unsent completed units remaining.
[18:30:31] + -oneunit flag given and have now finished a unit. Exiting.***** Got a SIGTERM signal (2)
[18:30:31] Killing all core threads 
Folding@Home Client Shutdown. 

Re: Project: 6806 (Run 3987, Clone 2, Gen 10) - Sudden error

Posted: Wed Mar 23, 2011 6:43 pm
by ChrisM101
Please take this work unit off the server until its fixed. It has totally ruined the Folding on the GPU it was running on.
6806 (Run 3987, Clone 2, Gen 10) FAILS everytime