Page 1 of 1

Project: 9012 (Run 326, Clone 0, Gen 33)

Posted: Fri Dec 12, 2014 3:53 pm
by billford
Client has been looping on this for the last couple of hours:

Code: Select all

14:03:10:WU00:FS00:Starting
14:03:10:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/web.stanford.edu/~pande/Linux/AMD64/Core_a4.fah/FahCore_a4 -dir 00 -suffix 01 -version 704 -lifeline 1308 -checkpoint 15 -np 3
14:03:10:WU00:FS00:Started FahCore on PID 29451
14:03:10:WU00:FS00:Core PID:29455
14:03:10:WU00:FS00:FahCore 0xa4 started
14:03:11:WU00:FS00:0xa4:
14:03:11:WU00:FS00:0xa4:*------------------------------*
14:03:11:WU00:FS00:0xa4:Folding@Home Gromacs GB Core
14:03:11:WU00:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
14:03:11:WU00:FS00:0xa4:
14:03:11:WU00:FS00:0xa4:Preparing to commence simulation
14:03:11:WU00:FS00:0xa4:- Ensuring status. Please wait.
14:03:20:WU00:FS00:0xa4:- Looking at optimizations...
14:03:20:WU00:FS00:0xa4:- Working with standard loops on this execution.
14:03:20:WU00:FS00:0xa4:- Previous termination of core was improper.
14:03:20:WU00:FS00:0xa4:- Files status OK
14:03:20:WU00:FS00:0xa4:- Expanded 912893 -> 1513440 (decompressed 165.7 percent)
14:03:20:WU00:FS00:0xa4:Called DecompressByteArray: compressed_data_size=912893 data_size=1513440, decompressed_data_size=1513440 diff=0
14:03:20:WU00:FS00:0xa4:- Digital signature verified
14:03:20:WU00:FS00:0xa4:
14:03:20:WU00:FS00:0xa4:Project: 9012 (Run 326, Clone 0, Gen 33)
14:03:20:WU00:FS00:0xa4:
14:03:20:WU00:FS00:0xa4:Entering M.D.
14:03:26:WU00:FS00:0xa4:Completed 0 out of 250000 steps  (0%)
14:03:26:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
I dumped the WU (deleted/recreated the slot), now running OK.

Re: Project: 9012 (Run 326, Clone 0, Gen 33)

Posted: Fri Dec 12, 2014 5:21 pm
by sryckbos
Definitely something funky going on with this WS right now. Trying to figure it out now.

Steven

Re: Project: 9012 (Run 326, Clone 0, Gen 33)

Posted: Fri Dec 12, 2014 5:51 pm
by billford
I've got several other WUs from that server (different projects) currently running, they all seem to be OK.

I thought it was just a possible bad WU.

Re: Project: 9012 (Run 326, Clone 0, Gen 33)

Posted: Fri Dec 12, 2014 6:39 pm
by sryckbos
Hmm, I'll look more closely but at first glance you seem right, it looks like a bad WU. The oddness I'm seeing is unrelated. It's good that everything is running for you well though.

Re: Project: 9012 (Run 326, Clone 0, Gen 33)

Posted: Fri Dec 12, 2014 6:46 pm
by billford
I'll keep an eye on it for the rest of the evening, but the AS seems to be favouring a different server now.