Project: 5506 (Run 8, Clone 13, Gen 297)

Moderators: Site Moderators, FAHC Science Team

Post Reply
harlam357
Posts: 222
Joined: Fri Jun 27, 2008 11:03 pm
Location: Alabama - USA
Contact:

Project: 5506 (Run 8, Clone 13, Gen 297)

Post by harlam357 »

UNSTABLE_MACHINE Loop... put the client into sleep mode...

XFX 8800GT 256MB (600/1782/700)
Opteron 165 @ (323x9) 2.9GHz
DFI Lanparty nF4 Ultra-D
XP SP3 w/178.24

Code: Select all

[01:31:49] *------------------------------*
[01:31:49] Folding@Home GPU Core - Beta
[01:31:49] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[01:31:49] 
[01:31:49] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[01:31:49] Build host: amoeba
[01:31:49] Board Type: Nvidia
[01:31:49] Core      : 
[01:31:49] Preparing to commence simulation
[01:31:49] - Looking at optimizations...
[01:31:49] - Created dyn
[01:31:49] - Files status OK
[01:31:49] - Expanded 45481 -> 246249 (decompressed 541.4 percent)
[01:31:49] Called DecompressByteArray: compressed_data_size=45481 data_size=246249, decompressed_data_size=246249 diff=0
[01:31:49] - Digital signature verified
[01:31:49] 
[01:31:49] Project: 5506 (Run 8, Clone 13, Gen 297)
[01:31:49] 
[01:31:49] Assembly optimizations on if available.
[01:31:49] Entering M.D.
[01:31:56] Working on p5506_supervillin_e1
[01:31:56] Client config found, loading data.
[01:31:56] mdrun_gpu returned 
[01:31:56] NANs detected on GPU
[01:31:56] 
[01:31:56] Folding@home Core Shutdown: UNSTABLE_MACHINE
[01:31:59] CoreStatus = 7A (122)
toTOW
Site Moderator
Posts: 6453
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France
Contact:

Re: Project: 5506 (Run 8, Clone 13, Gen 297)

Post by toTOW »

That's another bad WU ... no body was able to even start it :(
Image

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
VijayPande
Pande Group Member
Posts: 2058
Joined: Fri Nov 30, 2007 6:25 am
Location: Stanford

Re: Project: 5506 (Run 8, Clone 13, Gen 297)

Post by VijayPande »

We've manually stopped this WU. We're also looking into new client and/or server code to better handle these situations.
Prof. Vijay Pande, PhD
Departments of Chemistry, Structural Biology, and Computer Science
Chair, Biophysics
Director, Folding@home Distributed Computing Project
Stanford University
BrokenWolf
Posts: 126
Joined: Sat Aug 02, 2008 3:08 am

P5506, (R8, C13, G297)

Post by BrokenWolf »

I am getting a 7A error, UNSTABLE_MACHINE, when my client tries this WU. I have had other 5506 WU's before and they fold fine on this system. Deleting the core did not help. All that has happened now is that a new WU has been downloaded.

BW

Code: Select all

07:19:45] Working on queue slot 02 [November 18 07:19:45 UTC]
[07:19:45] + Working ...
[07:19:45] - Calling '.\FahCore_11.exe -dir work/ -suffix 02 -priority 96 -checkpoint 15 -verbose -lifeline 840 -version 620'

[07:19:45] 
[07:19:45] *------------------------------*
[07:19:45] Folding@Home GPU Core - Beta
[07:19:45] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[07:19:45] 
[07:19:45] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[07:19:45] Build host: amoeba
[07:19:45] Board Type: Nvidia
[07:19:45] Core      : 
[07:19:45] Preparing to commence simulation
[07:19:45] - Looking at optimizations...
[07:19:45] - Created dyn
[07:19:45] - Files status OK
[07:19:45] - Expanded 45481 -> 246249 (decompressed 541.4 percent)
[07:19:45] Called DecompressByteArray: compressed_data_size=45481 data_size=246249, decompressed_data_size=246249 diff=0
[07:19:45] - Digital signature verified
[07:19:45] 
[07:19:45] Project: 5506 (Run 8, Clone 13, Gen 297)
[07:19:45] 
[07:19:45] Assembly optimizations on if available.
[07:19:45] Entering M.D.
[07:19:52] Working on p5506_supervillin_e1
[07:19:52] Client config found, loading data.
[07:19:52] mdrun_gpu returned 
[07:19:52] NANs detected on GPU
[07:19:52] 
[07:19:52] Folding@home Core Shutdown: UNSTABLE_MACHINE
[07:19:55] CoreStatus = 7A (122)
[07:19:55] Sending work to server
[07:19:55] Project: 5506 (Run 8, Clone 13, Gen 297)
[07:19:55] - Read packet limit of 540015616... Set to 524286976.
[07:19:55] - Error: Could not get length of results file work/wuresults_02.dat
[07:19:55] - Error: Could not read unit 02 file. Removing from queue.
[07:19:55] EUE limit exceeded. Pausing 24 hours.
[12:27:54] - Autosending finished units... [November 18 12:27:54 UTC]
[12:27:54] Trying to send all finished work units
[12:27:54] + No unsent completed units remaining.
[12:27:54] - Autosend completed
[12:27:54] + Working...
[18:27:52] - Autosending finished units... [November 18 18:27:52 UTC]
[18:27:52] Trying to send all finished work units
[18:27:52] + No unsent completed units remaining.
[18:27:52] - Autosend completed
[18:27:52] + Working...
[00:27:49] - Autosending finished units... [November 19 00:27:49 UTC]
[00:27:49] Trying to send all finished work units
[00:27:49] + No unsent completed units remaining.
[00:27:49] - Autosend completed
[00:27:49] + Working...
Image
Post Reply