Project: 6502 (Run 17, Clone 153, Gen 20)

Posted: Wed Jul 07, 2010 1:53 pm
by fredex
I keep getting handed the same bad WU. So far it's been logged as a failure 57 times and it's still cranking on the 58th attempt (it's always the same WU). It fails at 83% every time, having wasted a bunch of CPU cycles and electricity...

How can I get rid of this (apparently) bad WU and move on? Here is the relevant log:

[20:43:51] + Processing work unit
[20:43:51] Core required: FahCore_78.exe
[20:43:51] Core found.
[20:43:51] Working on Unit 04 [July 6 20:43:51]
[20:43:51] + Working ...
[20:43:51] - Calling './FahCore_78.exe -dir work/ -suffix 04 -checkpoint 15 -verbose -lifeline 5548 -version 602'

[20:43:51]
[20:43:51] *------------------------------*
[20:43:51] Folding@Home Gromacs Core
[20:43:51] Version 1.90 (March 8, 2006)
[20:43:51]
[20:43:51] Preparing to commence simulation
[20:43:51] - Looking at optimizations...
[20:43:51] - Created dyn
[20:43:51] - Files status OK
[20:43:51] - Expanded 419875 -> 2019997 (decompressed 481.0 percent)
[20:43:51] - Starting from initial work packet
[20:43:51]
[20:43:51] Project: 6502 (Run 17, Clone 153, Gen 20)
[20:43:51]
[20:43:51] Assembly optimizations on if available.
[20:43:51] Entering M.D.
[20:43:57] Protein: TR462_A_18 in water
[20:43:57]
[20:43:57] Writing local files
[20:43:57] Extra SSE boost OK.
[20:43:57] Writing local files
[20:43:57] Completed 0 out of 250000 steps (0%)
[20:47:37] Writing local files

<snip>

[01:45:03] Completed 205000 out of 250000 steps (82%)
[01:48:43] Writing local files
[01:48:43] Completed 207500 out of 250000 steps (83%)
[01:49:46] CoreStatus = 0 (0)
[01:49:46] Client-core communications error: ERROR 0x0
[01:49:46] Deleting current work unit & continuing...
[01:50:03] Trying to send all finished work units
[01:50:03] + No unsent completed units remaining.
[01:50:03] - Preparing to get new work unit...
[01:50:03] + Attempting to get work packet
[01:50:03] - Connecting to assignment server
[01:50:03] Connecting to http://assign.stanford.edu:8080/
[01:50:04] Posted data.
[01:50:04] Initial: 40AB; - Successful: assigned to (171.64.65.111).

Re: Project: 6502 (Run 17, Clone 153, Gen 20)

Posted: Wed Jul 07, 2010 4:48 pm
by bruce
The WU (P6502,R17,C153,G20) has been reported as a bad WU.

Moderator reports will stop a WU from being issued, but they are not processed instantly; they sit in a queue for maybe an hour or so.

The official method to dump a WU is to run the client with the -delete 04 flag (since your log says "Working on Unit 04"). Alternatively, if you're certain that you don't have anything in your queue waiting to upload, you can stop the client and then delete the work folder and queue.dat from the client directory. The server may reissue the same WU, so you may have to do that more than once.
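
If you end up repeating the manual cleanup several times, it could be scripted. The following is only a rough sketch, not an official tool: it assumes the client is already stopped, that nothing is waiting to upload, and that the client directory contains a work folder and a queue.dat file as described above. The function name reset_queue and its dry_run parameter are made up for illustration.

```python
import shutil
from pathlib import Path


def reset_queue(client_dir, dry_run=True):
    """Remove work/ and queue.dat from client_dir and return the paths affected.

    Assumption: the client is stopped and no finished WU is waiting to upload.
    With dry_run=True nothing is deleted; the paths are only reported.
    """
    base = Path(client_dir)
    targets = [base / "work", base / "queue.dat"]
    affected = []
    for target in targets:
        if not target.exists():
            continue  # already gone, nothing to do
        affected.append(target)
        if dry_run:
            continue  # report only, leave the file or folder in place
        if target.is_dir():
            shutil.rmtree(target)  # the work folder and everything in it
        else:
            target.unlink()  # queue.dat
    return affected
```

Calling reset_queue with the path to your own client directory lists what would be removed; passing dry_run=False actually deletes it. The dry run is the default so that a mistyped path does not cost you a finished WU.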