Page 1 of 1

Project: 2669 (Run 4, Clone 19, Gen 111)

Posted: Mon Jun 29, 2009 10:41 pm
by Gemini Cricket
This is one of those giant work units. I will delete mine and start again.
  • [18:49:21] *------------------------------*
    [18:49:21] Folding@Home Gromacs SMP Core
    [18:49:21] Version 1.95 (2007)
    [18:49:21]
    [18:49:21] Preparing to commence simulation
    [18:49:21] - Ensuring status. Please wait.
    [18:49:38] - Assembly optimizations manually forced on.
    [18:49:38] - Not checking prior termination.
    [18:49:38] Need version 207
    [18:49:38] Error: Work unit read from disk is invalid
    [18:49:38] Finalizing output
    [18:49:41] - Expanded 4844194 -> 23991465 (decompressed 495.2 percent)
    [18:49:42]
    [18:49:42] Project: 2669 (Run 4, Clone 19, Gen 111)
    [18:49:42]
    [18:49:43] Assembly optimizations on if available.
    [18:49:43] Entering M.D.
    NNODES=4, MYRANK=0, HOSTNAME=Macintosh-3.local
    NNODES=4, MYRANK=1, HOSTNAME=Macintosh-3.local
    NNODES=4, MYRANK=3, HOSTNAME=Macintosh-3.local
    NNODES=4, MYRANK=2, HOSTNAME=Macintosh-3.local
    NODEID=0 argc=19
    NODEID=1 argc=19
    :-) G R O M A C S (-:

    Groningen Machine for Chemical Simulation

    :-) VERSION 3.3.99_development_20080208 (-:


    Written by David van der Spoel, Erik Lindahl, Berk Hess, and others.
    Copyright (c) 1991-2000, University of Groningen, The Netherlands.
    Copyright (c) 2001-2008, The GROMACS development team,
    check out http://www.gromacs.org for more information.


    :-) mdrun (-:

    NODEID=2 argc=19
    Reading file work/wudata_05.tpr, VERSION 3.3.99_development_20070618 (single precision)
    NODEID=3 argc=19
    Note: tpx file_version 48, software version 54
    Making 1D domain decomposition 1 x 1 x 4
    starting mdrun '22908 system'
    6781886 steps, 13563.8 ps.
    [18:49:53] Completed 0 out of 6781886 steps (0 %)
    [19:04:58] Timer requesting checkpoint

    Writing checkpoint, step 21219500 at Tue Jun 30 04:04:59 2009

    Writing checkpoint, step 21220880 at Tue Jun 30 04:19:58 2009
    [19:20:05] Timer requesting checkpoint

    Writing checkpoint, step 21222260 at Tue Jun 30 04:34:57 2009
    [19:35:10] Timer requesting checkpoint

    Writing checkpoint, step 21223640 at Tue Jun 30 04:49:56 2009
    [19:50:16] Timer requesting checkpoint

    Writing checkpoint, step 21225020 at Tue Jun 30 05:04:55 2009
    [20:05:22] Timer requesting checkpoint

    Writing checkpoint, step 21226400 at Tue Jun 30 05:19:54 2009
    [20:20:27] Timer requesting checkpoint

    Writing checkpoint, step 21227790 at Tue Jun 30 05:35:00 2009
    [20:35:33] Timer requesting checkpoint

    Writing checkpoint, step 21229170 at Tue Jun 30 05:49:58 2009
    [20:50:38] Timer requesting checkpoint

    Writing checkpoint, step 21230550 at Tue Jun 30 06:04:57 2009
    [21:05:43] Timer requesting checkpoint

    Writing checkpoint, step 21231930 at Tue Jun 30 06:19:56 2009
    [21:20:48] Timer requesting checkpoint

    Writing checkpoint, step 21233310 at Tue Jun 30 06:34:55 2009
    [21:35:54] Timer requesting checkpoint

    Writing checkpoint, step 21234690 at Tue Jun 30 06:49:54 2009
    [21:50:59] Timer requesting checkpoint

    Writing checkpoint, step 21236080 at Tue Jun 30 07:05:00 2009
    [22:06:05] Timer requesting checkpoint

    Writing checkpoint, step 21237460 at Tue Jun 30 07:20:00 2009
    [22:21:10] Timer requesting checkpoint

Re: Project 2669 (Run 4, Clone 19, Gen 111)

Posted: Tue Jun 30, 2009 12:03 am
by Gemini Cricket
Please kill this corrupted unit. I trash my work folder and unit info file, but I keep getting this same unit repeatedly. My computer will remain off until you post that this unit has been killed.

Re: Project 2669 (Run 4, Clone 19, Gen 111)

Posted: Tue Jun 30, 2009 12:14 am
by bruce
I notified the appropriate Pande Group member soon after your first post. Until they can take action, there's nothing else that can be done.

If you can't get rid of it by deleting it, change your MachineID to a number you are not using.

Re: Project 2669 (Run 4, Clone 19, Gen 111)

Posted: Tue Jun 30, 2009 1:02 am
by Gemini Cricket
Thanks, Bruce, for alerting the researcher in charge. Also, your trick of changing the MachineID worked, so I am up and running again with a different WU.

Re: Project 2669 (Run 4, Clone 19, Gen 111)

Posted: Sat Jul 11, 2009 6:15 am
by Tigerbiten
This bad work unit is still in the wild as I've just got it again .......... :(

Code: Select all

[06:05:16] Folding@Home Gromacs SMP Core
[06:05:16] Version 2.07 (Sun Apr 19 14:51:09 PDT 2009)
[06:05:16] 
[06:05:16] Preparing to commence simulation
[06:05:16] - Ensuring status. Please wait.
[06:05:17] Called DecompressByteArray: compressed_data_size=4844194 data_size=23991465, decompressed_data_size=23991465 diff=0
[06:05:19] - Digital signature verified
[06:05:19] 
[06:05:19] Project: 2669 (Run 4, Clone 19, Gen 111)
[06:05:19] 
[06:05:19] Assembly optimizations on if available.
[06:05:19] Entering M.D.
[06:05:28]  on if available.
[06:05:28] Entering M.D.
[06:05:38] Completed 0 out of 6781886 steps  (0%)
Can you put the word out again.

Luck ............... :D

Re: Project 2669 (Run 4, Clone 19, Gen 111)

Posted: Sat Jul 11, 2009 4:14 pm
by susato
PM sent. Thanks for the report.

Re: Project 2669 (Run 4, Clone 19, Gen 111)

Posted: Sat Jul 11, 2009 8:34 pm
by markp1989

Code: Select all

[20:26:06] - Ask before connecting: No
[20:26:06] - User name: markp1989 (Team 45032)
[20:26:06] - User ID: 3F94ED4322D297FC
[20:26:06] - Machine ID: 1
[20:26:06] 
[20:26:06] Loaded queue successfully.
[20:26:06] 
[20:26:06] + Processing work unit
[20:26:06] Core required: FahCore_a2.exe
[20:26:06] Core found.
[20:26:06] Working on Unit 01 [July 11 20:26:06]
[20:26:06] + Working ...
[20:26:06] 
[20:26:06] *------------------------------*
[20:26:06] Folding@Home Gromacs SMP Core
[20:26:06] Version 2.07 (Sun Apr 19 14:51:09 PDT 2009)
[20:26:06] 
[20:26:06] Preparing to commence simulation
[20:26:06] - Ensuring status. Please wait.
[20:26:07] Called DecompressByteArray: compressed_data_size=4844194 data_size=23
991465, decompressed_data_size=23991465 diff=0
[20:26:07] Called DecompressByteArray: compressed_data_size=4844194 data_size=23
991465, decompressed_data_size=23991465 diff=0
[20:26:07] - Digital signature verified
[20:26:07] 
[20:26:07] Project: 2669 (Run 4, Clone 19, Gen 111)
[20:26:07] 
[20:26:07] Assembly optimizations on if available.
[20:26:07] Entering M.D.
[20:26:07] - Digital signature verified
[20:26:07] 
[20:26:07] Project: 2669 (Run 4, Clone 19, Gen 111)
[20:26:07] 
[20:26:07] Assembly optimizations on if available.
[20:26:07] Entering M.D.
[20:26:13] Using Gromacs checkpoints
[20:26:13] Using Gromacs checkpoints
[20:26:17] 
[20:26:17] Entering M.D.
[20:26:17] 
[20:26:17] Entering M.D.
[20:26:23] Using Gromacs checkpoints
[20:26:23] Using Gromacs checkpoints
[20:26:28] Resuming from checkpoint
[20:26:28] Verified work/wudata_01.log
[20:26:28] Verified work/wudata_01.trr
[20:26:28] Verified work/wudata_01.xtc
[20:26:28] Verified work/wudata_01.edr
[20:26:28] Resuming from checkpoint
[20:26:28] Verified work/wudata_01.log
[20:26:28] Verified work/wudata_01.trr
[20:26:28] Verified work/wudata_01.xtc
[20:26:28] Verified work/wudata_01.edr
[20:26:28] Completed 40496 out of 6781886 steps  (0%)
i have been given this work uint, is it ok to keep folding it, or is this 1 a waste of time?

edit: just looked in the unit info file. it says i have completed 56117697% of the wu so il gues il delete it and cary on with another wu

Re: Project 2669 (Run 4, Clone 19, Gen 111)

Posted: Sat Jul 11, 2009 9:54 pm
by susato
Mark, it's a known bad WU, so just delete it. If you're using the console client, the best way is to stop folding, then start folding again using the -delete xx flag where xx is the queue position of the bad unit. (in your case xx = 01). The client will start up, delete the bad WU from the queue, then shut down again. Then you can restart Folding using your usual flags.

Re: Project 2669 (Run 4, Clone 19, Gen 111)

Posted: Fri Jul 17, 2009 4:36 pm
by Phantom
I just got this one assigned to me this morning... Hmmm... Deleting and moving on.

Re: Project: 2669 (Run 4, Clone 19, Gen 111)

Posted: Fri Jul 24, 2009 5:23 pm
by Phantom
I just got this one again!... Hmmm!!!... Deleting again and moving on.

Re: Project: 2669 (Run 4, Clone 19, Gen 111)

Posted: Fri Jul 24, 2009 6:18 pm
by susato
Thanks for the new report Don - Another PM sent.