Page 1 of 1

Project: 2675 (Run 0, Clone 193, Gen 51)

Posted: Sat Feb 07, 2009 3:03 pm
by Aardvark
Downloaded the subject WU this morning. I stopped the Client upon start to disconnect from my ISP. (This is standard practice for me due to a chronic "hang on start" problem with fahv6). On restart the Client started folding but immediately reported being at the 192% completion point. IS THIS A FLAWED WU, doomed to failure???

The FAHlog after the restart is as follows:

Code: Select all

# Mac OS X SMP Console Edition ################################################
###############################################################################

                       Folding@Home Client Version 6.24beta

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /Users/tedkreuserIII/Library/Folding@home
Executable: ./fah6
Arguments: -smp -local -verbosity 9 

[14:12:40] - Ask before connecting: Yes
[14:12:40] - User name: Aardvark (Team 48057)
[14:12:40] - User ID: 7B68D95E256C0686
[14:12:40] - Machine ID: 1
[14:12:40] 
[14:12:41] Loaded queue successfully.
[14:12:41] 
[14:12:41] + Processing work unit
[14:12:41] At least 4 processors must be requested.Core required: FahCore_a2.exe
- Autosending finished units... [14:12:41]
[14:12:41] Core found.
[14:12:41] - Using generic ./mpiexec
[14:12:41] Trying to send all finished work units
[14:12:41] + No unsent completed units remaining.
[14:12:41] - Autosend completed
[14:12:41] Working on queue slot 01 [February 7 14:12:41 UTC]
[14:12:41] + Working ...
[14:12:41] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 01 -checkpoint 15 -verbose -lifeline 6744 -version 624'

[14:12:41] 
[14:12:41] *------------------------------*
[14:12:41] Folding@Home Gromacs SMP Core
[14:12:41] Version 2.02 (Mon Nov 24 13:48:35 PST 2008)
[14:12:41] 
[14:12:41] Preparing to commence simulation
[14:12:41] - Ensuring status. Please wait.
[14:12:42] Called DecompressByteArray: compressed_data_size=4844471 data_size=24001453, decompressed_data_size=24001453 diff=0
[14:12:43] - Digital signature verified
[14:12:43] 
[14:12:43] Project: 2675 (Run 0, Clone 193, Gen 51)
[14:12:43] 
[14:12:43] Assembly optimizations on if available.
[14:12:43] Entering M.D.
[14:12:49] Will resume from checkpoint file
[14:12:53] ng M.D.
[14:12:59] Will resume from checkpoint file
[14:13:03] Resuming from checkpoint
[14:13:04] Verified work/wudata_01.log
[14:13:05] Verified work/wudata_01.trr
[14:13:05] Verified work/wudata_01.xtc
[14:13:05] Verified work/wudata_01.edr
[14:13:05] Completed 480019 out of 250000 steps  (192%)
[14:33:10] Completed 482509 out of 250000 steps  (193%)

Re: Project: 2675 (Run 0, Clone 193, Gen 51)

Posted: Sat Feb 07, 2009 4:58 pm
by toTOW
This is a checkpoint issue. Try to force a core upgrade to v2.04, it should help.

Re: Project: 2675 (Run 0, Clone 193, Gen 51)

Posted: Sat Feb 07, 2009 5:10 pm
by Aardvark
I have already tried to discard the a2 core on my machine and the "System" downloaded another v2.02 core. I am of the opinion that Pande Group has not yet seen fit to provide the v2.04 core for MacOSX folders. Am I right??

I tried the replacement action after reading the announcement by kasson in the Linux area of the Forum. Is v2.04 limited to Linux users for the time being??

Re: Project: 2675 (Run 0, Clone 193, Gen 51)

Posted: Sat Feb 07, 2009 5:20 pm
by toTOW
Well I don't know if they released the OSX version yet ...

Try to delete any existing .chk files in your /work folder. That should remove invalid checkpoints from previous WU that hadn't been deleted correctly.

Re: Project: 2675 (Run 0, Clone 193, Gen 51)

Posted: Sat Feb 07, 2009 6:19 pm
by Aardvark
@toTOW:

I stopped the Client and deleted the .chk files (There was only one). On restart of the Client the output to Terminal indicated it was resuming at 203% which is where it was when I shut it down.

No Joy Yet but thanks for the advice. With this problem is there any chance the WU will finish and upload successfully??

Re: Project: 2675 (Run 0, Clone 193, Gen 51)

Posted: Sun Feb 08, 2009 1:47 pm
by Aardvark
I am still folding on the Subject WU. At the last checkpoint it reports being at the 262% point.

Is this WU of any value???? Should it be trashed???

Re: Project: 2675 (Run 0, Clone 193, Gen 51)

Posted: Sun Feb 08, 2009 7:28 pm
by toTOW
I'd say yes ... delete the /work folder and queue.dat file to make sure you removed all bad checkpoints.

Re: Project: 2675 (Run 0, Clone 193, Gen 51)

Posted: Mon Feb 09, 2009 1:15 am
by Aardvark
For those who keep the records:

This WU has been TRASHED. There was no evidence that it would terminate in a manner usable to F@H.

Have moved on to new WUs with v2.04 core a2.