harlam357
Posts: 222 Joined: Fri Jun 27, 2008 11:03 pm
Location: Alabama - USA
by harlam357 » Tue Nov 18, 2008 2:39 am
UNSTABLE_MACHINE Loop... put the client into sleep mode...
XFX 8800GT 256MB (600/1782/700)
Opteron 165 @ (323x9) 2.9GHz
DFI Lanparty nF4 Ultra-D
XP SP3 w/178.24
[01:31:49] *------------------------------*
[01:31:49] Folding@Home GPU Core - Beta
[01:31:49] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[01:31:49]
[01:31:49] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[01:31:49] Build host: amoeba
[01:31:49] Board Type: Nvidia
[01:31:49] Core :
[01:31:49] Preparing to commence simulation
[01:31:49] - Looking at optimizations...
[01:31:49] - Created dyn
[01:31:49] - Files status OK
[01:31:49] - Expanded 45481 -> 246249 (decompressed 541.4 percent)
[01:31:49] Called DecompressByteArray: compressed_data_size=45481 data_size=246249, decompressed_data_size=246249 diff=0
[01:31:49] - Digital signature verified
[01:31:49]
[01:31:49] Project: 5506 (Run 8, Clone 13, Gen 297)
[01:31:49]
[01:31:49] Assembly optimizations on if available.
[01:31:49] Entering M.D.
[01:31:56] Working on p5506_supervillin_e1
[01:31:56] Client config found, loading data.
[01:31:56] mdrun_gpu returned
[01:31:56] NANs detected on GPU
[01:31:56]
[01:31:56] Folding@home Core Shutdown: UNSTABLE_MACHINE
[01:31:59] CoreStatus = 7A (122)
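For anyone comparing logs to see whether they hit the same work unit, here is a minimal Python sketch that pulls the Project/Run/Clone/Gen, the shutdown reason, and the CoreStatus out of a log excerpt. The function name and the regular expressions are my own assumptions, based only on the log lines quoted in this thread; this is not part of the client.

```python
import re

# Hypothetical patterns, derived only from the log lines quoted in this thread.
PRCG_RE = re.compile(r"Project: (\d+) \(Run (\d+), Clone (\d+), Gen (\d+)\)")
SHUTDOWN_RE = re.compile(r"Folding@home Core Shutdown: (\w+)")
STATUS_RE = re.compile(r"CoreStatus = ([0-9A-Fa-f]+) \((\d+)\)")


def summarize_log(text):
    """Return the work unit identity and core exit details found in a log excerpt."""
    summary = {"prcg": None, "shutdown": None, "status_hex": None, "status_dec": None}

    match = PRCG_RE.search(text)
    if match:
        # (project, run, clone, gen) as integers
        summary["prcg"] = tuple(int(group) for group in match.groups())

    match = SHUTDOWN_RE.search(text)
    if match:
        summary["shutdown"] = match.group(1)

    match = STATUS_RE.search(text)
    if match:
        summary["status_hex"] = match.group(1)
        summary["status_dec"] = int(match.group(2))

    return summary


SAMPLE = """\
[01:31:49] Project: 5506 (Run 8, Clone 13, Gen 297)
[01:31:56] NANs detected on GPU
[01:31:56] Folding@home Core Shutdown: UNSTABLE_MACHINE
[01:31:59] CoreStatus = 7A (122)
"""

if __name__ == "__main__":
    info = summarize_log(SAMPLE)
    print(info)
    # The hex and decimal forms printed by the core should agree (0x7A == 122).
    print(int(info["status_hex"], 16) == info["status_dec"])
```

If two people get the same `prcg` tuple together with the same shutdown reason, that points at the WU rather than at either machine.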
toTOW
Site Moderator
Posts: 6453 Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France
by toTOW » Tue Nov 18, 2008 4:44 pm
That's another bad WU ... nobody was able to even start it.
Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
VijayPande
Pande Group Member
Posts: 2058 Joined: Fri Nov 30, 2007 6:25 am
Location: Stanford
by VijayPande » Tue Nov 18, 2008 4:59 pm
We've manually stopped this WU. We're also looking into new client and/or server code to better handle these situations.
Prof. Vijay Pande, PhD
Departments of Chemistry, Structural Biology, and Computer Science
Chair, Biophysics
Director, Folding@home Distributed Computing Project
Stanford University
BrokenWolf
Posts: 126 Joined: Sat Aug 02, 2008 3:08 am
by BrokenWolf » Wed Nov 19, 2008 3:19 am
I am getting a 7A error, UNSTABLE_MACHINE, when my client tries this WU. I have had other 5506 WUs before and they folded fine on this system. Deleting the core did not help. All that has happened now is that a new WU has been downloaded.
BW
[07:19:45] Working on queue slot 02 [November 18 07:19:45 UTC]
[07:19:45] + Working ...
[07:19:45] - Calling '.\FahCore_11.exe -dir work/ -suffix 02 -priority 96 -checkpoint 15 -verbose -lifeline 840 -version 620'
[07:19:45]
[07:19:45] *------------------------------*
[07:19:45] Folding@Home GPU Core - Beta
[07:19:45] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[07:19:45]
[07:19:45] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[07:19:45] Build host: amoeba
[07:19:45] Board Type: Nvidia
[07:19:45] Core :
[07:19:45] Preparing to commence simulation
[07:19:45] - Looking at optimizations...
[07:19:45] - Created dyn
[07:19:45] - Files status OK
[07:19:45] - Expanded 45481 -> 246249 (decompressed 541.4 percent)
[07:19:45] Called DecompressByteArray: compressed_data_size=45481 data_size=246249, decompressed_data_size=246249 diff=0
[07:19:45] - Digital signature verified
[07:19:45]
[07:19:45] Project: 5506 (Run 8, Clone 13, Gen 297)
[07:19:45]
[07:19:45] Assembly optimizations on if available.
[07:19:45] Entering M.D.
[07:19:52] Working on p5506_supervillin_e1
[07:19:52] Client config found, loading data.
[07:19:52] mdrun_gpu returned
[07:19:52] NANs detected on GPU
[07:19:52]
[07:19:52] Folding@home Core Shutdown: UNSTABLE_MACHINE
[07:19:55] CoreStatus = 7A (122)
[07:19:55] Sending work to server
[07:19:55] Project: 5506 (Run 8, Clone 13, Gen 297)
[07:19:55] - Read packet limit of 540015616... Set to 524286976.
[07:19:55] - Error: Could not get length of results file work/wuresults_02.dat
[07:19:55] - Error: Could not read unit 02 file. Removing from queue.
[07:19:55] EUE limit exceeded. Pausing 24 hours.
[12:27:54] - Autosending finished units... [November 18 12:27:54 UTC]
[12:27:54] Trying to send all finished work units
[12:27:54] + No unsent completed units remaining.
[12:27:54] - Autosend completed
[12:27:54] + Working...
[18:27:52] - Autosending finished units... [November 18 18:27:52 UTC]
[18:27:52] Trying to send all finished work units
[18:27:52] + No unsent completed units remaining.
[18:27:52] - Autosend completed
[18:27:52] + Working...
[00:27:49] - Autosending finished units... [November 19 00:27:49 UTC]
[00:27:49] Trying to send all finished work units
[00:27:49] + No unsent completed units remaining.
[00:27:49] - Autosend completed
[00:27:49] + Working...
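To get a rough idea of when the client in the log above should come back from the "Pausing 24 hours" state, here is a small Python sketch of the arithmetic. It assumes the 24 hours are counted from the moment the EUE message was logged and that the year is 2008 (taken from the post date); both are assumptions, and the client may count differently. The function name is hypothetical.

```python
from datetime import datetime, timedelta


def resume_time(pause_logged_at, pause_hours=24):
    """Estimate when the pause ends, assuming it starts when the message is logged."""
    return pause_logged_at + timedelta(hours=pause_hours)


# "[07:19:55] EUE limit exceeded. Pausing 24 hours." on November 18 (UTC)
paused_at = datetime(2008, 11, 18, 7, 19, 55)

# Last autosend entry in the log: "[November 19 00:27:49 UTC]"
last_autosend = datetime(2008, 11, 19, 0, 27, 49)

expected_resume = resume_time(paused_at)
remaining = expected_resume - last_autosend

if __name__ == "__main__":
    print(expected_resume)  # 2008-11-19 07:19:55
    print(remaining)        # 6:52:06
```

Under those assumptions, the client still had just under seven hours of pause left at the last autosend shown, which would fit with the log showing only autosend entries and no new work starting.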