Page 1 of 1

Project: 2665 (Run 1, Clone 478, Gen 80)

Posted: Sun Dec 28, 2008 2:42 am
by poiuyut
Failed at 70% and would not upload. I deleted queue.dat, unitinfo.txt, and the work folder, but when FaH restarted I got the same WU again. Should I continue to fold this unit and see if it fails again?

Code: Select all

[01:26:09] Writing local files
[01:26:09] Completed 172500 out of 250000 steps  (69 percent)
[01:40:12] Writing local files
[01:40:13] Completed 175000 out of 250000 steps  (70 percent)
[01:44:36] Gromacs cannot continue further.
[01:44:36] Going to send back what have done.
[01:44:36] logfile size: 185627
[01:44:36] - Writing 186163 bytes of core data to disk...
[01:44:36]   ... Done.
[01:44:36] - Failed to delete work/wudata_07.sas
[01:44:36] - Failed to delete work/wudata_07.goe
[01:44:36] Warning:  check for stray files
[01:44:36] 
[01:44:36] Folding@home Core Shutdown: EARLY_UNIT_END
[01:44:36] 
[01:44:36] Folding@home Core Shutdown: EARLY_UNIT_END
[01:44:39] CoreStatus = 63 (99)
[01:44:39] + Error starting Folding@Home core.
[01:44:44] 
[01:44:44] + Processing work unit
[01:44:44] Work type a1 not eligible for variable processors
[01:44:44] Core required: FahCore_a1.exe
[01:44:44] Core found.
[01:44:44] Working on queue slot 07 [December 28 01:44:44 UTC]
[01:44:44] + Working ...
[01:44:44] - Calling 'mpiexec -np 4 -channel shm -env MPICH_USE_SMP_OPTIMIZATIONS 1 -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 07 -checkpoint 15 -verbose -lifeline 1160 -version 622'

[01:44:45] 
[01:44:45] *------------------------------*
[01:44:45] Folding@Home Gromacs SMP Core
[01:44:45] Version 1.76 (February 23, 2008)
[01:44:45] 
[01:44:45] Preparing to commence simulation
[01:44:45] - Looking at optimizations...
[01:44:45] - Created dyn
[01:44:45] - Files status OK
[01:44:45] 
[01:44:45] Folding@home Core Shutdown: MISSING_WORK_FILES
[01:44:45] Finalizing output
[01:44:45]  OK
[01:46:45] 
[01:46:45] Folding@home Core Shutdown: MISSING_WORK_FILES
[01:46:45] Finalizing output
[01:46:49] CoreStatus = 1 (1)
[01:46:49] Client-core communications error: ERROR 0x1
[01:46:49] This is a sign of more serious problems, shutting down.
[02:23:32] Killing all core threads
[02:23:32] Killing 4 cores
[02:23:32] Killing core 0
[02:23:32] Killing core 1
[02:23:32] Killing core 2
[02:23:32] Killing core 3

Folding@Home Client Shutdown at user request.
[02:23:32] ***** Got a SIGTERM signal (2)
[02:23:32] Killing all core threads
[02:23:32] Killing 4 cores
[02:23:32] Killing core 0
[02:23:32] Killing core 1
[02:23:32] Killing core 2
[02:23:32] Killing core 3

Folding@Home Client Shutdown.

Re: Project: 2665 (Run 1, Clone 478, Gen 80)

Posted: Sun Dec 28, 2008 10:41 am
by toTOW
Did you upgrade your client to 6.23 ? It should avoid this kind of errors.

Re: Project: 2665 (Run 1, Clone 478, Gen 80)

Posted: Tue Dec 30, 2008 1:12 am
by poiuyut
I installed 6.23 after reading your post, and now the WU has finished and uploaded normally. Looks like that fixed it. Thanks.