Project: 5506 (Run 7, Clone 133, Gen 229)

Moderators: Site Moderators, FAHC Science Team

Post Reply
Oldhat
Posts: 30
Joined: Mon Dec 03, 2007 11:42 am
Location: Auckland

Project: 5506 (Run 7, Clone 133, Gen 229)

Post by Oldhat »

Just having a small problem with this unit. NAN 7a Unstable machine errors until it decided a 24 hour rest would help. :lol:

Stock e2160 1024Mb RAM 2 x 8800GS Win XP

Code: Select all

[07:46:46] + Attempting to send results [November 9 07:46:46 UTC]
[07:47:03] + Results successfully sent
[07:47:03] Thank you for your contribution to Folding@Home.
[07:47:03] + Number of Units Completed: 740

[07:47:07] - Preparing to get new work unit...
[07:47:07] + Attempting to get work packet
[07:47:07] - Connecting to assignment server
[07:47:08] - Successful: assigned to (171.64.65.106).
[07:47:08] + News From Folding@Home: GPU folding beta
[07:47:08] Loaded queue successfully.
[07:47:11] + Closed connections
[07:47:11] 
[07:47:11] + Processing work unit
[07:47:11] Core required: FahCore_11.exe
[07:47:11] Core found.
[07:47:11] Working on queue slot 00 [November 9 07:47:11 UTC]
[07:47:11] + Working ...
[07:47:11] 
[07:47:11] *------------------------------*
[07:47:11] Folding@Home GPU Core - Beta
[07:47:11] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[07:47:11] 
[07:47:11] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[07:47:11] Build host: amoeba
[07:47:11] Board Type: Nvidia
[07:47:11] Core      : 
[07:47:11] Preparing to commence simulation
[07:47:11] - Looking at optimizations...
[07:47:11] - Created dyn
[07:47:11] - Files status OK
[07:47:11] - Expanded 45874 -> 246249 (decompressed 536.7 percent)
[07:47:11] Called DecompressByteArray: compressed_data_size=45874 data_size=246249, decompressed_data_size=246249 diff=0
[07:47:11] - Digital signature verified
[07:47:11] 
[07:47:11] Project: 5506 (Run 7, Clone 133, Gen 229)
[07:47:11] 
[07:47:11] Assembly optimizations on if available.
[07:47:11] Entering M.D.
[07:47:17] Working on p5506_supervillin_e1
[07:47:19] Client config found, loading data.
[07:47:19] mdrun_gpu returned 
[07:47:19] NANs detected on GPU
[07:47:19] 
[07:47:19] Folding@home Core Shutdown: UNSTABLE_MACHINE
[07:47:23] CoreStatus = 7A (122)
[07:47:23] Sending work to server
[07:47:23] Project: 5506 (Run 7, Clone 133, Gen 229)
[07:47:23] - Read packet limit of 540015616... Set to 524286976.
[07:47:23] - Error: Could not get length of results file work/wuresults_00.dat
[07:47:23] - Error: Could not read unit 00 file. Removing from queue.
[07:47:23] - Preparing to get new work unit...
[07:47:23] + Attempting to get work packet
[07:47:23] - Connecting to assignment server
[07:47:27] - Successful: assigned to (171.64.65.106).
[07:47:27] + News From Folding@Home: GPU folding beta
[07:47:27] Loaded queue successfully.
[07:47:28] + Closed connections
[07:47:34] 
[07:47:34] + Processing work unit
[07:47:34] Core required: FahCore_11.exe
[07:47:34] Core found.
[07:47:34] Working on queue slot 01 [November 9 07:47:34 UTC]
[07:47:34] + Working ...
[07:47:34] 
[07:47:34] *------------------------------*
[07:47:34] Folding@Home GPU Core - Beta
[07:47:34] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[07:47:34] 
[07:47:34] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[07:47:34] Build host: amoeba
[07:47:34] Board Type: Nvidia
[07:47:34] Core      : 
[07:47:34] Preparing to commence simulation
[07:47:34] - Looking at optimizations...
[07:47:34] - Created dyn
[07:47:34] - Files status OK
[07:47:34] - Expanded 45874 -> 246249 (decompressed 536.7 percent)
[07:47:34] Called DecompressByteArray: compressed_data_size=45874 data_size=246249, decompressed_data_size=246249 diff=0
[07:47:34] - Digital signature verified
[07:47:34] 
[07:47:34] Project: 5506 (Run 7, Clone 133, Gen 229)
[07:47:34] 
[07:47:34] Assembly optimizations on if available.
[07:47:34] Entering M.D.
[07:47:40] Working on p5506_supervillin_e1
[07:47:42] Client config found, loading data.
[07:47:42] mdrun_gpu returned 
[07:47:42] NANs detected on GPU
[07:47:42] 
[07:47:42] Folding@home Core Shutdown: UNSTABLE_MACHINE
[07:47:46] CoreStatus = 7A (122)
[07:47:46] Sending work to server
[07:47:46] Project: 5506 (Run 7, Clone 133, Gen 229)
[07:47:46] - Read packet limit of 540015616... Set to 524286976.
[07:47:46] - Error: Could not get length of results file work/wuresults_01.dat
[07:47:46] - Error: Could not read unit 01 file. Removing from queue.
[07:47:46] - Preparing to get new work unit...
[07:47:46] + Attempting to get work packet
[07:47:46] - Connecting to assignment server
[07:47:46] - Successful: assigned to (171.64.65.106).
[07:47:46] + News From Folding@Home: GPU folding beta
[07:47:47] Loaded queue successfully.
[07:47:48] + Closed connections
[07:47:53] 
[07:47:53] + Processing work unit
[07:47:53] Core required: FahCore_11.exe
[07:47:53] Core found.
[07:47:53] Working on queue slot 02 [November 9 07:47:53 UTC]
[07:47:53] + Working ...
[07:47:53] 
Cheers.
toTOW
Site Moderator
Posts: 6334
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France
Contact:

Re: Project: 5506 (Run 7, Clone 133, Gen 229)

Post by toTOW »

It looks like a bad WU ... there are two other reports for 0 credits.
Image

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
Drugless
Posts: 58
Joined: Wed Jan 09, 2008 7:55 pm
Location: Durban, South Africa

Re: Project: 5506 (Run 7, Clone 133, Gen 229)

Post by Drugless »

I agree:

Code: Select all

[19:48:58] Project: 5506 (Run 7, Clone 133, Gen 229)
[19:48:58] 
[19:48:58] Assembly optimizations on if available.
[19:48:58] Entering M.D.
[19:49:04] Working on p5506_supervillin_e1
[19:49:05] Client config found, loading data.
[19:49:05] mdrun_gpu returned 
[19:49:05] NANs detected on GPU
[19:49:05] 
[19:49:05] Folding@home Core Shutdown: UNSTABLE_MACHINE
[19:49:08] CoreStatus = 7A (122)
[19:49:08] Sending work to server
[19:49:08] Project: 5506 (Run 7, Clone 133, Gen 229)
[19:49:08] - Read packet limit of 540015616... Set to 524286976.
[19:49:08] - Error: Could not get length of results file work/wuresults_07.dat
[19:49:08] - Error: Could not read unit 07 file. Removing from queue.
[19:49:08] EUE limit exceeded. Pausing 24 hours.
Image
Folding Tools:8 X PS3's, 5 x GTX280,1 x 8800GS, 8 x 9800GX2 GPU's
Drugless
Posts: 58
Joined: Wed Jan 09, 2008 7:55 pm
Location: Durban, South Africa

Re: Project: 5506 (Run 7, Clone 133, Gen 229)

Post by Drugless »

Woke up to same issue, same machine different board:

Code: Select all

[09:23:10] Project: 5506 (Run 7, Clone 133, Gen 229)
[09:23:10] 
[09:23:10] Assembly optimizations on if available.
[09:23:10] Entering M.D.
[09:23:17] Working on p5506_supervillin_e1
[09:23:17] Client config found, loading data.
[09:23:17] mdrun_gpu returned 
[09:23:17] NANs detected on GPU
[09:23:17] 
[09:23:17] Folding@home Core Shutdown: UNSTABLE_MACHINE
[09:23:20] CoreStatus = 7A (122)
[09:23:20] Sending work to server
[09:23:20] Project: 5506 (Run 7, Clone 133, Gen 229)
[09:23:20] - Read packet limit of 540015616... Set to 524286976.
[09:23:20] - Error: Could not get length of results file work/wuresults_06.dat
[09:23:20] - Error: Could not read unit 06 file. Removing from queue.
[09:23:20] EUE limit exceeded. Pausing 24 hours.
Another 2.5 hrs wasted! Could PG look into trying to fix this type of error loop. ie. If RCG sequentially fails x times then download new RCG?

Code: Select all

[11:49:45] - Ask before connecting: No
Image
Folding Tools:8 X PS3's, 5 x GTX280,1 x 8800GS, 8 x 9800GX2 GPU's
toTOW
Site Moderator
Posts: 6334
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France
Contact:

Re: Project: 5506 (Run 7, Clone 133, Gen 229)

Post by toTOW »

The issue is that this UM is not reported to the server :( :

[09:23:20] - Error: Could not get length of results file work/wuresults_06.dat
[09:23:20] - Error: Could not read unit 06 file. Removing from queue.

It will be assigned 5 or 6 times to the same machine, and then the server will move to another WU ...
Image

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
ChrissyT88
Posts: 9
Joined: Mon Nov 17, 2008 11:15 am

Re: Project: 5506 (Run 7, Clone 133, Gen 229)

Post by ChrissyT88 »

I also encountered this error this morning, and thought i should report it. Error was on the second card of a 9800GX2 at stock. Deleted work files etc and all seems well.
tobor
Posts: 56
Joined: Tue Jul 15, 2008 11:15 pm
Hardware configuration: ASUS M3N-HT deluxe,AMD6400 duel 3.2gig, GeForce9800 GTX C-760 M-1140 S-1900,4 gig OCZ ddr
Location: Missouri,USA

Re: Project: 5506 (Run 7, Clone 133, Gen 229)

Post by tobor »

by toTOW on Sat Nov 15, 2008 8:59 am

The issue is that this UM is not reported to the server :

[09:23:20] - Error: Could not get length of results file work/wuresults_06.dat
[09:23:20] - Error: Could not read unit 06 file. Removing from queue.

It will be assigned 5 or 6 times to the same machine, and then the server will move to another WU ...

So just let it run or delete core or folder or what??
Image
Xilikon
Posts: 155
Joined: Sun Dec 02, 2007 1:34 pm

Re: Project: 5506 (Run 7, Clone 133, Gen 229)

Post by Xilikon »

Received a bunch of this unit on my GTX 260 box which put it to sleep :(
Image
harlam357
Posts: 222
Joined: Fri Jun 27, 2008 11:03 pm
Location: Alabama - USA
Contact:

Re: Project: 5506 (Run 7, Clone 133, Gen 229)

Post by harlam357 »

Same here...

Code: Select all

[21:50:01] *------------------------------*
[21:50:01] Folding@Home GPU Core - Beta
[21:50:01] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[21:50:01] 
[21:50:01] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[21:50:01] Build host: amoeba
[21:50:01] Board Type: Nvidia
[21:50:01] Core      : 
[21:50:01] Preparing to commence simulation
[21:50:01] - Looking at optimizations...
[21:50:01] - Created dyn
[21:50:01] - Files status OK
[21:50:01] - Expanded 45874 -> 246249 (decompressed 536.7 percent)
[21:50:01] Called DecompressByteArray: compressed_data_size=45874 data_size=246249, decompressed_data_size=246249 diff=0
[21:50:01] - Digital signature verified
[21:50:01] 
[21:50:01] Project: 5506 (Run 7, Clone 133, Gen 229)
[21:50:01] 
[21:50:01] Assembly optimizations on if available.
[21:50:01] Entering M.D.
[21:50:07] Working on p5506_supervillin_e1
[21:50:07] Client config found, loading data.
[21:50:07] Starting GUI Server
[21:50:08] mdrun_gpu returned 
[21:50:08] NANs detected on GPU
[21:50:08] 
[21:50:08] Folding@home Core Shutdown: UNSTABLE_MACHINE
[21:50:11] CoreStatus = 7A (122)
Oldhat
Posts: 30
Joined: Mon Dec 03, 2007 11:42 am
Location: Auckland

Re: Project: 5506 (Run 7, Clone 133, Gen 229)

Post by Oldhat »

Just as a matter of interest, are we now running a modified work unit or are we still running the same unit that was reported over a week ago?

Windows XP SP3 AMD 64 4000 1024Mb RAM 2 x GS8800

Code: Select all

[04:28:07] + Processing work unit
[04:28:07] Core required: FahCore_11.exe
[04:28:07] Core found.
[04:28:07] Working on queue slot 02 [November 18 04:28:07 UTC]
[04:28:07] + Working ...
[04:28:07] 
[04:28:07] *------------------------------*
[04:28:07] Folding@Home GPU Core - Beta
[04:28:07] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[04:28:07] 
[04:28:07] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[04:28:07] Build host: amoeba
[04:28:07] Board Type: Nvidia
[04:28:07] Core      : 
[04:28:07] Preparing to commence simulation
[04:28:07] - Looking at optimizations...
[04:28:07] - Created dyn
[04:28:07] - Files status OK
[04:28:07] - Expanded 45874 -> 246249 (decompressed 536.7 percent)
[04:28:07] Called DecompressByteArray: compressed_data_size=45874 data_size=246249, decompressed_data_size=246249 diff=0
[04:28:07] - Digital signature verified
[04:28:07] 
[04:28:07] Project: 5506 (Run 7, Clone 133, Gen 229)
[04:28:07] 
[04:28:07] Assembly optimizations on if available.
[04:28:07] Entering M.D.
[04:28:14] Working on p5506_supervillin_e1
[04:28:14] Client config found, loading data.
[04:28:14] mdrun_gpu returned 
[04:28:14] NANs detected on GPU
[04:28:14] 
[04:28:14] Folding@home Core Shutdown: UNSTABLE_MACHINE
[04:28:18] CoreStatus = 7A (122)
[04:28:18] Sending work to server
[04:28:18] Project: 5506 (Run 7, Clone 133, Gen 229)
[04:28:18] - Read packet limit of 540015616... Set to 524286976.
[04:28:18] - Error: Could not get length of results file work/wuresults_02.dat
[04:28:18] - Error: Could not read unit 02 file. Removing from queue.
[04:28:18] - Preparing to get new work unit...
[04:28:18] + Attempting to get work packet
[04:28:18] - Connecting to assignment server
[04:28:18] - Successful: assigned to (171.64.65.106).
[04:28:18] + News From Folding@Home: GPU folding beta
[04:28:18] Loaded queue successfully.
[04:28:21] + Closed connections
[04:28:26] 
[04:28:26] + Processing work unit
[04:28:26] Core required: FahCore_11.exe
[04:28:26] Core found.
[04:28:26] Working on queue slot 03 [November 18 04:28:26 UTC]
[04:28:26] + Working ...
[04:28:26] 
[04:28:26] *------------------------------*
[04:28:26] Folding@Home GPU Core - Beta
[04:28:26] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[04:28:26] 
[04:28:26] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[04:28:26] Build host: amoeba
[04:28:26] Board Type: Nvidia
[04:28:26] Core      : 
[04:28:26] Preparing to commence simulation
[04:28:26] - Looking at optimizations...
[04:28:26] - Created dyn
[04:28:26] - Files status OK
[04:28:26] - Expanded 45874 -> 246249 (decompressed 536.7 percent)
[04:28:26] Called DecompressByteArray: compressed_data_size=45874 data_size=246249, decompressed_data_size=246249 diff=0
[04:28:26] - Digital signature verified
[04:28:26] 
[04:28:26] Project: 5506 (Run 7, Clone 133, Gen 229)
Cheers
Drugless
Posts: 58
Joined: Wed Jan 09, 2008 7:55 pm
Location: Durban, South Africa

Re: Project: 5506 (Run 7, Clone 133, Gen 229)

Post by Drugless »

Got it AGAIN! Client been asleep for 10 hours!

Code: Select all

[02:04:57] Project: 5506 (Run 7, Clone 133, Gen 229)
[02:04:57] 
[02:04:57] Assembly optimizations on if available.
[02:04:57] Entering M.D.
[02:05:04] Working on p5506_supervillin_e1
[02:05:04] Client config found, loading data.
[02:05:04] mdrun_gpu returned 
[02:05:04] NANs detected on GPU
[02:05:04] 
[02:05:04] Folding@home Core Shutdown: UNSTABLE_MACHINE
[02:05:08] CoreStatus = 7A (122)
[02:05:08] Sending work to server
[02:05:08] Project: 5506 (Run 7, Clone 133, Gen 229)
[02:05:08] - Read packet limit of 540015616... Set to 524286976.
[02:05:08] - Error: Could not get length of results file work/wuresults_00.dat
[02:05:08] - Error: Could not read unit 00 file. Removing from queue.
[02:05:08] EUE limit exceeded. Pausing 24 hours.
Image
Folding Tools:8 X PS3's, 5 x GTX280,1 x 8800GS, 8 x 9800GX2 GPU's
VijayPande
Pande Group Member
Posts: 2058
Joined: Fri Nov 30, 2007 6:25 am
Location: Stanford

Re: Project: 5506 (Run 7, Clone 133, Gen 229)

Post by VijayPande »

We've manually stopped this WU. We're also looking into new client and/or server code to better handle these situations.
Prof. Vijay Pande, PhD
Departments of Chemistry, Structural Biology, and Computer Science
Chair, Biophysics
Director, Folding@home Distributed Computing Project
Stanford University
Drugless
Posts: 58
Joined: Wed Jan 09, 2008 7:55 pm
Location: Durban, South Africa

Re: Project: 5506 (Run 7, Clone 133, Gen 229)

Post by Drugless »

Thanks VJ.
Just a bit frustrating since I cannot babysit 24hrs a day. Got to go to work sometime else cannot pay for folding costs! ;-)
Image
Folding Tools:8 X PS3's, 5 x GTX280,1 x 8800GS, 8 x 9800GX2 GPU's
VijayPande
Pande Group Member
Posts: 2058
Joined: Fri Nov 30, 2007 6:25 am
Location: Stanford

Re: Project: 5506 (Run 7, Clone 133, Gen 229)

Post by VijayPande »

Understood. A plan is underway to try to solve this issue.
Prof. Vijay Pande, PhD
Departments of Chemistry, Structural Biology, and Computer Science
Chair, Biophysics
Director, Folding@home Distributed Computing Project
Stanford University
Post Reply