Page 1 of 1

Project: 3770 (Run 6677, Clone 0, Gen 0)

Posted: Mon Jul 20, 2009 1:04 am
by Hammerhead
Got about 16 of these that failed, that I know of. All with this type of error:

Code: Select all

[00:31:17] Completed 1150000 out of 5000000 steps  (23%)
[00:46:17] Timered checkpoint triggered.
Warning: 1-4 interaction at distance larger than 1.96
These are ignored for the rest of the simulation
turn on -debug for more information
[00:46:30] CoreStatus = 0 (0)
[00:46:30] Client-core communications error: ERROR 0x0
[00:46:30] Deleting current work unit & continuing...
This is on a the Linux client 6.02.

Re: Project: 3770 (Run 6677, Clone 0, Gen 0)

Posted: Mon Jul 20, 2009 9:17 am
by Hammerhead
Bruce,

I read your PM. Not sure what you mean by 'all four coordinates '. All 16 have the same Run/Clone/Gen. Here's the log for one of them.

Code: Select all

--- Opening Log file [July 11 14:25:30]


# Linux Console Edition #######################################################
###############################################################################

                       Folding@Home Client Version 6.02

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /home/xxxxx/df/fah6
Executable: ./fah6
Arguments: -local -verbosity 9 -forceasm

Warning:
 By using the -forceasm flag, you are overriding
 safeguards in the program. If you did not intend to
 do this, please restart the program without -forceasm.
 If work units are not completing fully (and particularly
 if your machine is overclocked), then please discontinue
 use of the flag.

[14:25:30] - Ask before connecting: No
[14:25:30] - Proxy: xxx.xxx.xxx.xxx:8080
[14:25:30] - User name: xxxxxxxx (Team 0)
[14:25:30] - User ID: 7FE50BF96B51E462
[14:25:30] - Machine ID: 6
[14:25:30]
[14:25:30] Loaded queue successfully.
[14:25:30]
.
.
.
[10:51:12] + Processing work unit
[10:51:12] Core required: FahCore_78.exe
[10:51:12] Core found.
[10:51:12] Working on Unit 02 [July 13 10:51:12]
[10:51:12] + Working ...
[10:51:12] - Calling './FahCore_78.exe -dir work/ -suffix 02 -checkpoint 15 -forceasm -verbose -lifeline 6633 -version 602'

[10:51:13]
[10:51:13] *------------------------------*
[10:51:13] Folding@Home Gromacs Core
[10:51:13] Version 1.90 (March 8, 2006)
[10:51:13]
[10:51:13] Preparing to commence simulation
[10:51:13] - Assembly optimizations manually forced on.
[10:51:13] - Not checking prior termination.
[10:51:13] - Expanded 256825 -> 1461492 (decompressed 569.0 percent)
[10:51:13] - Starting from initial work packet
[10:51:13]
[10:51:13] Project: 3770 (Run 6677, Clone 0, Gen 0)
[10:51:13]
[10:51:13] Assembly optimizations on if available.
[10:51:13] Entering M.D.

  Gromacs is Copyright (c) 1991-2003, University of Groningen, The Netherlands
        This inclusion of Gromacs code in the Folding@Home Core is under
        a special license (see http://folding.stanford.edu/gromacs.html)
         specially granted to Stanford by the copyright holders. If you
          are interested in using Gromacs, visit www.gromacs.org where
                you can download a free version of Gromacs under
         the terms of the GNU General Public License (GPL) as published
       by the Free Software Foundation; either version 2 of the License,
                     or (at your option) any later version.

[10:51:19] Protein: p3040_supervillin-03
[10:51:19]
[10:51:19] Writing local files
[10:51:19] Extra SSE boost OK.
[10:51:19] Writing local files
[10:51:19] Completed 0 out of 5000000 steps  (0%)
[11:06:19] Timered checkpoint triggered.
[11:07:24] Writing local files
[11:07:24] Completed 50000 out of 5000000 steps  (1%)
[11:22:23] Timered checkpoint triggered.
[11:23:27] Writing local files
[11:23:27] Completed 100000 out of 5000000 steps  (2%)
[11:38:27] Timered checkpoint triggered.
.
.
.
[16:14:03] Completed 1000000 out of 5000000 steps  (20%)
[16:29:03] Timered checkpoint triggered.
[16:30:10] Writing local files
[16:30:10] Completed 1050000 out of 5000000 steps  (21%)
[16:45:10] Timered checkpoint triggered.
[16:46:16] Writing local files
[16:46:16] Completed 1100000 out of 5000000 steps  (22%)
[17:01:17] Timered checkpoint triggered.
[17:02:26] Writing local files
[17:02:26] Completed 1150000 out of 5000000 steps  (23%)
[17:17:27] Timered checkpoint triggered.
Warning: 1-4 interaction at distance larger than 1.96
These are ignored for the rest of the simulation
turn on -debug for more information
[17:17:40] CoreStatus = 0 (0)
[17:17:40] Client-core communications error: ERROR 0x0
[17:17:40] Deleting current work unit & continuing...
[17:18:01] Trying to send all finished work units
[17:18:01] + No unsent completed units remaining.
[17:18:01] - Preparing to get new work unit...
[17:18:01] + Attempting to get work packet
[17:18:01] - Connecting to assignment server
[17:18:01] Connecting to http://assign.stanford.edu:8080/
[17:18:02] Posted data.
[17:18:02] Initial: 43AB; - Successful: assigned to (171.67.108.20).
[17:18:02] + News From Folding@Home: Welcome to Folding@Home
[17:18:02] Loaded queue successfully.
[17:18:02] Connecting to http://171.67.108.20:8080/
[17:18:03] Posted data.
[17:18:03] Initial: 0000; - Receiving payload (expected size: 257337)
[17:18:04] - Downloaded at ~251 kB/s
[17:18:04] - Averaged speed for that direction ~309 kB/s
[17:18:04] + Received work.
[17:18:04] + Closed connections


Re: Project: 3770 (Run 6677, Clone 0, Gen 0)

Posted: Mon Jul 20, 2009 7:56 pm
by bruce
It wasn't clear whether you meant 16 different P 3770 WUs or 16 of the same one. Thanks' for adding that information.

A unique WU is identified by four numbers or coordinates: Project, Run, Clone and Gen.