Project: 2669 (Run 8, Clone 98, Gen 144) will not process

Moderators: Site Moderators, FAHC Science Team

Mactin
Posts: 222
Joined: Sun Dec 02, 2007 1:08 pm
Location: Côte-des-Neiges, Montréal, Québec

Project: 2669 (Run 8, Clone 98, Gen 144) will not process

Post by Mactin »

Project: 2669 (Run 8, Clone 98, Gen 144) will not process. It failed the same way four times in a row.
To get a different work unit, I had to change my Machine ID from 1 to 2.

Code:

[06:49:23] Working on Unit 07 [August 28 06:49:23]
[06:49:23] + Working ...
[06:49:23] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 07 -checkpoint 10 -verbose -lifeline 3424 -version 602'

[06:49:23] 
[06:49:23] *------------------------------*
[06:49:23] Folding@Home Gromacs SMP Core
[06:49:23] Version 2.08 (Mon May 18 14:47:42 PDT 2009)
[06:49:23] 
[06:49:23] Preparing to commence simulation
[06:49:23] - Ensuring status. Please wait.
[06:49:24] Called DecompressByteArray: compressed_data_size=4829877 data_size=23976217, decompressed_data_size=23976217 diff=0
[06:49:24] - Digital signature verified
[06:49:24] 
[06:49:24] Project: 2669 (Run 8, Clone 98, Gen 144)
[06:49:24] 
[06:49:24] Assembly optimizations on if available.
[06:49:24] Entering M.D.
[06:49:33] Run 8, Clone 98, Gen 144)
[06:49:33] 
[06:49:34] Entering M.D.
[06:49:43] Completed 0 out of 250000 steps  (0%)
[06:49:43] 
[06:49:43] Folding@home Core Shutdown: INTERRUPTED
[06:49:47] CoreStatus = 66 (102)
[06:49:47] + Shutdown requested by user. Exiting.***** Got a SIGTERM signal (15)
[06:49:47] Killing all core threads

Folding@Home Client Shutdown.


--- Opening Log file [August 28 13:18:19] 


# SMP Client ##################################################################
###############################################################################

                       Folding@Home Client Version 6.02

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /home/cormiem/folding
Executable: ./fah6
Arguments: -smp -verbosity 9 

[13:18:19] - Ask before connecting: No
[13:18:19] - User name: Martin_FD_PX3 (Team 96377)
[13:18:19] - User ID: 29E564E4397322CB
[13:18:19] - Machine ID: 1
[13:18:19] 
[13:18:19] Loaded queue successfully.
[13:18:19] 
[13:18:19] - Autosending finished units...
[13:18:19] + Processing work unit
[13:18:19] Trying to send all finished work units
[13:18:19] Core required: FahCore_a2.exe
[13:18:19] + No unsent completed units remaining.
[13:18:19] Core found.
[13:18:19] - Autosend completed
[13:18:19] Working on Unit 07 [August 28 13:18:19]
[13:18:19] + Working ...
[13:18:19] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 07 -checkpoint 10 -verbose -lifeline 23943 -version 602'

[13:18:19] 
[13:18:19] *------------------------------*
[13:18:19] Folding@Home Gromacs SMP Core
[13:18:19] Version 2.08 (Mon May 18 14:47:42 PDT 2009)
[13:18:19] 
[13:18:19] Preparing to commence simulation
[13:18:19] - Ensuring status. Please wait.
[13:18:20] Called DecompressByteArray: compressed_data_size=4829877 data_size=23976217, decompressed_data_size=23976217 diff=0
[13:18:20] - Digital signature verified
[13:18:20] 
[13:18:20] Project: 2669 (Run 8, Clone 98, Gen 144)
[13:18:20] 
[13:18:20] Assembly optimizations on if available.
[13:18:20] Entering M.D.
[13:18:29] Run 8, Clone 98, Gen 144)
[13:18:29] 
[13:18:30] Entering M.D.
[13:18:39] Completed 0 out of 250000 steps  (0%)
[13:18:39] 
[13:18:39] Folding@home Core Shutdown: INTERRUPTED
[13:18:43] CoreStatus = 66 (102)
[13:18:43] + Shutdown requested by user. Exiting.***** Got a SIGTERM signal (15)
[13:18:43] Killing all core threads

Folding@Home Client Shutdown.


--- Opening Log file [August 28 13:20:00] 


# SMP Client ##################################################################
###############################################################################

                       Folding@Home Client Version 6.02

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /home/cormiem/folding
Executable: ./fah6
Arguments: -smp -verbosity 9 

[13:20:00] Configuring Folding@Home...


[13:20:51] - Ask before connecting: No
[13:20:51] - User name: Martin_FD_PX3 (Team 96377)
[13:20:51] - User ID: 29E564E4397322CB
[13:20:51] - Machine ID: 1
[13:20:51] 
[13:20:51] Work directory not found. Creating...
[13:20:51] Could not open work queue, generating new queue...
[13:20:51] - Autosending finished units...
[13:20:51] - Preparing to get new work unit...
[13:20:51] Trying to send all finished work units
[13:20:51] + No unsent completed units remaining.
[13:20:51] + Attempting to get work packet
[13:20:51] - Autosend completed
[13:20:51] - Will indicate memory of 1666 MB
[13:20:51] - Detect CPU. Vendor: AuthenticAMD, Family: 15, Model: 2, Stepping: 3
[13:20:51] - Connecting to assignment server
[13:20:51] Connecting to http://assign.stanford.edu:8080/
[13:20:51] Posted data.
[13:20:51] Initial: 40AB; - Successful: assigned to (171.64.65.56).
[13:20:51] + News From Folding@Home: Welcome to Folding@Home
[13:20:51] Loaded queue successfully.
[13:20:51] Connecting to http://171.64.65.56:8080/
[13:20:58] Posted data.
[13:20:58] Initial: 0000; - Receiving payload (expected size: 4830389)
[13:21:01] - Downloaded at ~1572 kB/s
[13:21:01] - Averaged speed for that direction ~1572 kB/s
[13:21:01] + Received work.
[13:21:01] + Closed connections
[13:21:01] 
[13:21:01] + Processing work unit
[13:21:01] Core required: FahCore_a2.exe
[13:21:01] Core found.
[13:21:01] Working on Unit 01 [August 28 13:21:01]
[13:21:01] + Working ...
[13:21:01] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 01 -checkpoint 5 -verbose -lifeline 23969 -version 602'

[13:21:01] 
[13:21:01] *------------------------------*
[13:21:01] Folding@Home Gromacs SMP Core
[13:21:01] Version 2.08 (Mon May 18 14:47:42 PDT 2009)
[13:21:01] 
[13:21:01] Preparing to commence simulation
[13:21:01] - Ensuring status. Please wait.
[13:21:02] Called DecompressByteArray: compressed_data_size=4829877 data_size=23976217, decompressed_data_size=23976217 diff=0
[13:21:02] - Digital signature verified
[13:21:02] 
[13:21:02] Project: 2669 (Run 8, Clone 98, Gen 144)
[13:21:02] 
[13:21:03] Assembly optimizations on if available.
[13:21:03] Entering M.D.
[13:21:12] Run 8, Clone 98, Gen 144)
[13:21:12] 
[13:21:12] Entering M.D.
[13:21:21] Completed 0 out of 250000 steps  (0%)
[13:21:21] 
[13:21:21] Folding@home Core Shutdown: INTERRUPTED
[13:21:25] CoreStatus = 66 (102)
[13:21:25] + Shutdown requested by user. Exiting.***** Got a SIGTERM signal (15)
[13:21:25] Killing all core threads

Folding@Home Client Shutdown.


--- Opening Log file [August 28 13:22:26] 


# SMP Client ##################################################################
###############################################################################

                       Folding@Home Client Version 6.02

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /home/cormiem/folding
Executable: ./fah6
Arguments: -smp -verbosity 9 -config 

[13:22:26] - Ask before connecting: No
[13:22:26] - User name: Martin_FD_PX3 (Team 96377)
[13:22:26] - User ID: 29E564E4397322CB
[13:22:26] - Machine ID: 1
[13:22:26] 
[13:22:26] Configuring Folding@Home...


[13:22:34] - Ask before connecting: No
[13:22:34] - User name: Martin_FD_PX3 (Team 96377)
[13:22:34] - User ID: 29E564E4397322CB
[13:22:34] - Machine ID: 2
[13:22:34] 
[13:22:34] Work directory not found. Creating...
[13:22:34] Could not open work queue, generating new queue...
[13:22:34] - Autosending finished units...
[13:22:34] - Preparing to get new work unit...
[13:22:34] Trying to send all finished work units
[13:22:34] + No unsent completed units remaining.
[13:22:34] + Attempting to get work packet
[13:22:34] - Autosend completed
[13:22:34] - Will indicate memory of 1666 MB
[13:22:34] - Detect CPU. Vendor: AuthenticAMD, Family: 15, Model: 2, Stepping: 3
[13:22:34] - Connecting to assignment server
[13:22:34] Connecting to http://assign.stanford.edu:8080/
[13:22:35] Posted data.
[13:22:35] Initial: 40AB; - Successful: assigned to (171.64.65.56).
[13:22:35] + News From Folding@Home: Welcome to Folding@Home
[13:22:35] Loaded queue successfully.
[13:22:35] Connecting to http://171.64.65.56:8080/
[13:22:40] Posted data.
[13:22:40] Initial: 0000; - Receiving payload (expected size: 4832396)
[13:22:43] - Downloaded at ~1573 kB/s
[13:22:43] - Averaged speed for that direction ~1573 kB/s
[13:22:43] + Received work.
[13:22:43] + Closed connections
[13:22:43] 
[13:22:43] + Processing work unit
[13:22:43] Core required: FahCore_a2.exe
[13:22:43] Core found.
[13:22:43] Working on Unit 01 [August 28 13:22:43]
[13:22:43] + Working ...
[13:22:43] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 01 -checkpoint 5 -verbose -lifeline 24119 -version 602'

[13:22:44] 
[13:22:44] *------------------------------*
[13:22:44] Folding@Home Gromacs SMP Core
[13:22:44] Version 2.08 (Mon May 18 14:47:42 PDT 2009)
[13:22:44] 
[13:22:44] Preparing to commence simulation
[13:22:44] - Ensuring status. Please wait.
[13:22:45] Called DecompressByteArray: compressed_data_size=4831884 data_size=23973957, decompressed_data_size=23973957 diff=0
[13:22:45] - Digital signature verified
[13:22:45] 
[13:22:45] Project: 2669 (Run 14, Clone 79, Gen 196)
[13:22:45] 
[13:22:45] Assembly optimizations on if available.
[13:22:45] Entering M.D.
[13:22:54] un 14, Clone 79, Gen 196)
[13:22:54] 
[13:22:55] Entering M.D.
[13:23:04] Completed 0 out of 250000 steps  (0%)
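For anyone wanting to confirm the same pattern in their own log, here is a rough Python sketch that counts how often each work unit ends with a given CoreStatus. It only assumes the two line formats visible in the log above ("Project: ... (Run, Clone, Gen)" and "CoreStatus = 66 (102)"); the function name count_core_failures is made up for illustration and is not part of the client.

```python
import re
from collections import Counter

# Full work-unit line, e.g. "Project: 2669 (Run 8, Clone 98, Gen 144)".
# The truncated "Run 8, Clone 98, Gen 144)" echo lines do not match on purpose.
WU_RE = re.compile(r"Project: (\d+) \(Run (\d+), Clone (\d+), Gen (\d+)\)")
# Status line printed when the core exits, e.g. "CoreStatus = 66 (102)".
STATUS_RE = re.compile(r"CoreStatus = ([0-9A-Fa-f]+) \((\d+)\)")


def count_core_failures(log_text, status="66"):
    """Count how often each (project, run, clone, gen) ended with `status`."""
    failures = Counter()
    current = None  # work unit most recently announced in the log
    for line in log_text.splitlines():
        wu = WU_RE.search(line)
        if wu:
            current = tuple(int(g) for g in wu.groups())
            continue
        st = STATUS_RE.search(line)
        if st and current is not None and st.group(1) == status:
            failures[current] += 1
            current = None  # wait for the next "Project:" line
    return failures


if __name__ == "__main__":
    sample = (
        "[06:49:24] Project: 2669 (Run 8, Clone 98, Gen 144)\n"
        "[06:49:47] CoreStatus = 66 (102)\n"
        "[13:18:20] Project: 2669 (Run 8, Clone 98, Gen 144)\n"
        "[13:18:43] CoreStatus = 66 (102)\n"
    )
    for wu, n in count_core_failures(sample).items():
        print(wu, "failed", n, "time(s)")
```

A work unit that shows up more than once here is a reasonable candidate for a bad-WU report like this one.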
susato
Site Moderator
Posts: 511
Joined: Fri Nov 30, 2007 4:57 am
Location: Team MacOSX

Re: Project: 2669 (Run 8, Clone 98, Gen 144) will not process

Post by susato »

Also reported here, on OSX with the 2.10 core:
viewtopic.php?f=13&t=11330&p=110704#p110686

Mactin, PM sent.
Phantom
Posts: 23
Joined: Mon Dec 03, 2007 2:14 am
Location: teammacosx.org

Re: Project: 2669 (Run 8, Clone 98, Gen 144) will not process

Post by Phantom »

Got this assigned to one of my Mac Minis... Same symptoms with the latest 2.11 core.
smartcat99s
Posts: 14
Joined: Sun Dec 02, 2007 7:32 pm

Re: Project: 2669 (Run 8, Clone 98, Gen 144) will not process

Post by smartcat99s »

I just got this one on a Linux SMP box:

Code:

[16:09:53] *------------------------------*
[16:09:53] Folding@Home Gromacs SMP Core
[16:09:53] Version 2.10 (Sun Aug 30 03:43:28 CEST 2009)
[16:09:53]
[16:09:53] Preparing to commence simulation
[16:09:53] - Ensuring status. Please wait.
[16:09:54] Called DecompressByteArray: compressed_data_size=4829877 data_size=23976217, decompressed_data_size=23976217 diff=0
[16:09:54] - Digital signature verified
[16:09:54]
[16:09:54] Project: 2669 (Run 8, Clone 98, Gen 144)
[16:09:54]
[16:09:54] Assembly optimizations on if available.
[16:09:54] Entering M.D.
[16:10:04]  on if available.
[16:10:04] Entering M.D.
NNODES=4, MYRANK=2, HOSTNAME=myhost
NNODES=4, MYRANK=3, HOSTNAME=myhost
NNODES=4, MYRANK=1, HOSTNAME=myhost
NNODES=4, MYRANK=0, HOSTNAME=myhost
NODEID=0 argc=20
NODEID=1 argc=20
Reading file work/wudata_01.tpr, VERSION 3.3.99_development_20070618 (single precision)
NODEID=2 argc=20
NODEID=3 argc=20
Note: tpx file_version 48, software version 68

NOTE: The tpr file used for this simulation is in an old format, for less memory usage and possibly more performance create a new tpr file with an up to date version of grompp

Making 1D domain decomposition 1 x 1 x 4
starting mdrun '22860 system'
36250004 steps,  72500.0 ps (continuing from step 36000004,  72000.0 ps).

t = 72000.011 ps: Water molecule starting at atom 53491 can not be settled.
Check for bad contacts and/or reduce the timestep.
[16:10:17]  (0%)
[16:10:17]
[16:10:17] Folding@home Core Shutdown: INTERRUPTED
application called MPI_Abort(MPI_COMM_WORLD, 102) - process 0
[0]0:Return code = 102
[0]1:Return code = 0, signaled with Quit
[0]2:Return code = 0, signaled with Quit
[0]3:Return code = 0, signaled with Segmentation fault
[16:10:21] CoreStatus = 66 (102)
[16:10:21] + Shutdown requested by user. Exiting.***** Got a SIGTERM signal (15)
[16:10:21] Killing all core threads

Folding@Home Client Shutdown.
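The verbose output above shows a Gromacs message about a water molecule that could not be settled, followed by MPI_Abort with code 102 and per-rank return codes. As a minimal sketch, assuming only the line formats seen in this log, the following Python pulls those details out of captured core output. The name summarize_core_output is hypothetical.

```python
import re

# Gromacs message seen above:
# "t = 72000.011 ps: Water molecule starting at atom 53491 can not be settled."
SETTLE_RE = re.compile(
    r"t = ([\d.]+) ps: Water molecule starting at atom (\d+) can not be settled"
)
# mpiexec per-rank lines, e.g. "[0]3:Return code = 0, signaled with Segmentation fault"
RANK_RE = re.compile(r"\[\d+\](\d+):Return code = (\d+)(?:, signaled with (.+))?")


def summarize_core_output(text):
    """Return the settle error (time in ps, atom index) and per-rank exit info."""
    settle = None
    ranks = {}
    for line in text.splitlines():
        m = SETTLE_RE.search(line)
        if m:
            settle = (float(m.group(1)), int(m.group(2)))
        m = RANK_RE.search(line)
        if m:
            # rank -> (return code, signal name or None)
            ranks[int(m.group(1))] = (int(m.group(2)), m.group(3))
    return {"settle_error": settle, "ranks": ranks}
```

Including the settle time and atom index in a report may help the project owner locate the failing frame, though that is a guess about what is useful on their side.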
CBEugene
Posts: 2
Joined: Tue Sep 15, 2009 4:25 pm

Re: Project: 2669 (Run 8, Clone 98, Gen 144) will not process

Post by CBEugene »

I got this too. The machine is a Pentium 4 3.0 GHz with Hyper-Threading and 1 GB RAM. OS: Ubuntu Linux x64 9.04.

Code:

--- Opening Log file [September 28 06:20:36] 


# SMP Client ##################################################################
###############################################################################

                       Folding@Home Client Version 6.02

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /home/eugene/folding
Executable: ./fah6
Arguments: -smp -verbosity 9 

[06:20:36] - Ask before connecting: No
[06:20:36] - Proxy: 192.168.0.250:3129
[06:20:36] - User name: CBEugene (Team 1971)
[06:20:36] - User ID: 3C62D77334C8BCA
[06:20:36] - Machine ID: 1
[06:20:36] 
[06:20:36] Loaded queue successfully.
[06:20:36] 
[06:20:36] - Autosending finished units...
[06:20:36] + Processing work unit
[06:20:36] Trying to send all finished work units
[06:20:36] Core required: FahCore_a2.exe
[06:20:36] + No unsent completed units remaining.
[06:20:36] - Autosend completed
[06:20:36] Core found.
[06:20:37] Working on Unit 01 [September 28 06:20:37]
[06:20:37] + Working ...
[06:20:37] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 01 -checkpoint 15 -verbose -lifeline 3537 -version 602'

[06:20:37] 
[06:20:37] *------------------------------*
[06:20:37] Folding@Home Gromacs SMP Core
[06:20:37] Version 2.10 (Sun Aug 30 03:43:28 CEST 2009)
[06:20:37] 
[06:20:37] Preparing to commence simulation
[06:20:37] - Ensuring status. Please wait.
[06:20:37] Files status OK
[06:20:38] - Expanded 4829877 -> 23976217 (decompressed 496.4 percent)
[06:20:38] Called DecompressByteArray: compressed_data_size=4829877 data_size=23976217, decompressed_data_size=23976217 diff=0
[06:20:39] - Digital signature verified
[06:20:39] 
[06:20:39] Project: 2669 (Run 8, Clone 98, Gen 144)
[06:20:39] 
[06:20:39] Assembly optimizations on if available.
[06:20:39] Entering M.D.
[06:20:50] Run 8, Clone 98, Gen 144)
[06:20:50] 
[06:20:50] Entering M.D.
[06:21:12] Completed 0 out of 250000 steps  (0%)
[06:21:12] 
[06:21:12] Folding@home Core Shutdown: INTERRUPTED
[06:21:16] CoreStatus = 66 (102)
[06:21:16] + Shutdown requested by user. Exiting.***** Got a SIGTERM signal (15)
[06:21:16] Killing all core threads

Folding@Home Client Shutdown.
toTOW
Site Moderator
Posts: 6359
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France

Re: Project: 2669 (Run 8, Clone 98, Gen 144) will not process

Post by toTOW »

SMP on a P4 with HT... are you sure you can meet the deadlines?

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
Phantom
Posts: 23
Joined: Mon Dec 03, 2007 2:14 am
Location: teammacosx.org

Re: Project: 2669 (Run 8, Clone 98, Gen 144) will not process

Post by Phantom »

Like a bad penny, this one came back to me again today! Flat-lined. Bad WU. Recommend pulling it out of the work queue.
susato
Site Moderator
Posts: 511
Joined: Fri Nov 30, 2007 4:57 am
Location: Team MacOSX

Re: Project: 2669 (Run 8, Clone 98, Gen 144) will not process

Post by susato »

PM sent to the researcher.
parkut
Posts: 363
Joined: Tue Feb 12, 2008 7:33 am
Hardware configuration: Running exclusively Linux headless blades. All are dedicated crunching machines.
Location: SE Michigan, USA

Re: Project: 2669 (Run 8, Clone 98, Gen 144) will not process

Post by parkut »

I got this one too...

Code:

[15:01:32] Folding@Home Gromacs SMP Core
[15:01:32] Version 2.10 (Sun Aug 30 03:43:28 CEST 2009)
[15:01:32] 
[15:01:32] Preparing to commence simulation
[15:01:32] - Ensuring status. Please wait.
[15:01:33] Called DecompressByteArray: compressed_data_size=4829877 data_size=23976217, decompressed_data_size=23976217 diff=0
[15:01:33] - Digital signature verified
[15:01:33] 
[15:01:33] Project: 2669 (Run 8, Clone 98, Gen 144)
[15:01:33] 
[15:01:33] Assembly optimizations on if available.
[15:01:33] Entering M.D.
[15:01:42] Run 8, Clone 98, Gen 144)
[15:01:42] 
[15:01:42] Entering M.D.
[15:01:52] lding@home Core Shutdown: INTERRUPTED
[15:01:56] CoreStatus = 66 (102)
[15:01:56] + Shutdown requested by user. Exiting.***** Got a SIGTERM signal (15)
[15:01:56] Killing all core threads
kasson
Pande Group Member
Posts: 1459
Joined: Thu Nov 29, 2007 9:37 pm

Re: Project: 2669 (Run 8, Clone 98, Gen 144) will not process

Post by kasson »

Stopped it manually. Thanks for the reports.