Page 1 of 1

Project: 2669 (Run 12, Clone 156, Gen 38) seg fault

Posted: Sat Apr 04, 2009 8:04 pm
by alpha754293
console output:

Code: Select all

[01:10:12] *------------------------------*
[01:10:12] Folding@Home Gromacs SMP Core
[01:10:12] Version 2.04 (Thu Jan 29 16:43:57 PST 2009)
[01:10:12]
[01:10:12] Preparing to commence simulation
[01:10:12] - Ensuring status. Please wait.
[01:10:21] - Looking at optimizations...
[01:10:21] - Working with standard loops on this execution.
[01:10:21] - Files status OK
[01:10:22] - Expanded 4834380 -> 23974209 (decompressed 495.9 percent)
[01:10:22] Called DecompressByteArray: compressed_data_size=4834380 data_size=23974209, decompressed_data_size=23974209 diff=0
[01:10:22] - Digital signature verified
[01:10:22]
[01:10:22] Project: 2669 (Run 12, Clone 156, Gen 38)
[01:10:22]
[01:10:23] Entering M.D.
NNODES=4, MYRANK=0, HOSTNAME=computenode
NNODES=4, MYRANK=1, HOSTNAME=computenode
NNODES=4, MYRANK=2, HOSTNAME=computenode
NNODES=4, MYRANK=3, HOSTNAME=computenode
NODEID=0 argc=20
NODEID=1 argc=20
                         :-)  G  R  O  M  A  C  S  (-:

                   Groningen Machine for Chemical Simulation

                          :-)  VERSION 4.0.3_pre  (-:


      Written by David van der Spoel, Erik Lindahl, Berk Hess, and others.
       Copyright (c) 1991-2000, University of Groningen, The Netherlands.
             Copyright (c) 2001-2008, The GROMACS development team,
            check out http://www.gromacs.org for more information.


                                :-)  mdrun  (-:

NODEID=2 argc=20
NODEID=3 argc=20
Reading file work/wudata_08.tpr, VERSION 3.3.99_development_20070618 (single precision)
Note: tpx file_version 48, software version 58

NOTE: The tpr file used for this simulation is in an old format, for less memory usage and possibly more performance create a new tpr file with an up to date version of grompp

Making 1D domain decomposition 1 x 1 x 4
starting mdrun '22866 system'
9750001 steps,  19500.0 ps (continuing from step 9500001,  19000.0 ps).
[01:19:14] Completed 2509 out of 250000 steps  (1%)
[01:27:54] Completed 5009 out of 250000 steps  (2%)
[01:36:33] Completed 7509 out of 250000 steps  (3%)
[01:45:13] Completed 10009 out of 250000 steps  (4%)
[01:53:57] Completed 12509 out of 250000 steps  (5%)
[02:02:42] Completed 15009 out of 250000 steps  (6%)
[02:11:24] Completed 17509 out of 250000 steps  (7%)
[02:20:06] Completed 20009 out of 250000 steps  (8%)
[02:28:48] Completed 22509 out of 250000 steps  (9%)
[02:37:29] Completed 25009 out of 250000 steps  (10%)
[02:46:11] Completed 27509 out of 250000 steps  (11%)
[02:54:52] Completed 30009 out of 250000 steps  (12%)
[03:02:04] - Autosending finished units... [April 4 03:02:04 UTC]
[03:02:04] Trying to send all finished work units
[03:02:04] + No unsent completed units remaining.
[03:02:04] - Autosend completed
[03:03:33] Completed 32509 out of 250000 steps  (13%)
[03:12:16] Completed 35009 out of 250000 steps  (14%)
[03:21:01] Completed 37509 out of 250000 steps  (15%)
[03:29:45] Completed 40009 out of 250000 steps  (16%)
[03:38:29] Completed 42509 out of 250000 steps  (17%)
[03:47:14] Completed 45009 out of 250000 steps  (18%)
[03:55:56] Completed 47509 out of 250000 steps  (19%)
[04:04:39] Completed 50009 out of 250000 steps  (20%)
[04:13:25] Completed 52509 out of 250000 steps  (21%)
[04:22:11] Completed 55009 out of 250000 steps  (22%)
[04:30:57] Completed 57509 out of 250000 steps  (23%)
[04:39:43] Completed 60009 out of 250000 steps  (24%)
[04:48:33] Completed 62509 out of 250000 steps  (25%)
[04:57:23] Completed 65009 out of 250000 steps  (26%)
[05:06:14] Completed 67509 out of 250000 steps  (27%)
[05:15:04] Completed 70009 out of 250000 steps  (28%)
[05:23:54] Completed 72509 out of 250000 steps  (29%)
[05:32:44] Completed 75009 out of 250000 steps  (30%)
[05:41:34] Completed 77509 out of 250000 steps  (31%)
[05:50:23] Completed 80009 out of 250000 steps  (32%)
[05:59:12] Completed 82509 out of 250000 steps  (33%)
[06:08:01] Completed 85009 out of 250000 steps  (34%)
[06:16:49] Completed 87509 out of 250000 steps  (35%)
[06:25:34] Completed 90009 out of 250000 steps  (36%)
[06:34:20] Completed 92509 out of 250000 steps  (37%)
[06:43:08] Completed 95009 out of 250000 steps  (38%)
[06:51:56] Completed 97509 out of 250000 steps  (39%)
[07:00:46] Completed 100009 out of 250000 steps  (40%)
[07:09:35] Completed 102509 out of 250000 steps  (41%)
[07:18:24] Completed 105009 out of 250000 steps  (42%)
[07:26:05]
[07:26:05] Folding@home Core Shutdown: INTERRUPTED
[cli_0]: aborting job:
application called MPI_Abort(MPI_COMM_WORLD, 102) - process 0
[cli_1]: aborting job:
Fatal error in MPI_Sendrecv: Error message texts are not available
[cli_3]: aborting job:
Fatal error in MPI_Sendrecv: Error message texts are not available
[0]0:Return code = 102
[0]1:Return code = 1
[0]2:Return code = 0, signaled with Segmentation fault
[0]3:Return code = 1
[07:26:10] CoreStatus = 66 (102)
[07:26:10] + Shutdown requested by user. Exiting.***** Got a SIGTERM signal (15)[07:26:10] Killing all core threads

Folding@Home Client Shutdown.
return to shell

(message character exceeded)

Re: Project: 2669 (Run 12, Clone 156, Gen 38) seg fault

Posted: Sat Apr 04, 2009 8:06 pm
by alpha754293
fahlog

Code: Select all

*snip*

[06:12:12] 
[06:12:12] *------------------------------*
[06:12:12] Folding@Home Gromacs SMP Core
[06:12:12] Version 2.04 (Thu Jan 29 16:43:57 PST 2009)
[06:12:12] 
[06:12:12] Preparing to commence simulation
[06:12:12] - Ensuring status. Please wait.
[06:12:13] Called DecompressByteArray: compressed_data_size=4836259 data_size=23979541, decompressed_data_size=23979541 diff=0
[06:12:13] - Digital signature verified
[06:12:13] 
[06:12:13] Project: 2669 (Run 11, Clone 149, Gen 102)
[06:12:13] 
[06:12:14] Assembly optimizations on if available.
[06:12:14] Entering M.D.
[06:12:23] n 11, Clone 149, Gen 102)
[06:12:23] 
[06:12:23] Entering M.D.
[06:30:07] Completed 5000 out of 250000 steps  (2%)
[06:38:54] Completed 7500 out of 250000 steps  (3%)
[06:47:41] Completed 10000 out of 250000 steps  (4%)
[06:56:28] Completed 12500 out of 250000 steps  (5%)
[07:05:16] Completed 15000 out of 250000 steps  (6%)
[07:14:03] Completed 17500 out of 250000 steps  (7%)
[07:22:49] Completed 20000 out of 250000 steps  (8%)
[07:31:34] Completed 22500 out of 250000 steps  (9%)
[07:40:20] Completed 25000 out of 250000 steps  (10%)
[07:49:10] Completed 27500 out of 250000 steps  (11%)
[07:58:04] Completed 30000 out of 250000 steps  (12%)
[08:06:57] Completed 32500 out of 250000 steps  (13%)
[08:15:49] Completed 35000 out of 250000 steps  (14%)
[08:24:41] Completed 37500 out of 250000 steps  (15%)
[08:33:34] Completed 40000 out of 250000 steps  (16%)
[08:42:28] Completed 42500 out of 250000 steps  (17%)
[08:51:22] Completed 45000 out of 250000 steps  (18%)
[09:00:15] Completed 47500 out of 250000 steps  (19%)
[09:02:04] - Autosending finished units... [April 3 09:02:04 UTC]
[09:02:04] Trying to send all finished work units
[09:02:04] + No unsent completed units remaining.
[09:02:04] - Autosend completed
[09:09:05] Completed 50000 out of 250000 steps  (20%)
[09:17:56] Completed 52500 out of 250000 steps  (21%)
[09:26:45] Completed 55000 out of 250000 steps  (22%)
[09:35:31] Completed 57500 out of 250000 steps  (23%)
[09:44:16] Completed 60000 out of 250000 steps  (24%)
[09:53:04] Completed 62500 out of 250000 steps  (25%)
[10:01:56] Completed 65000 out of 250000 steps  (26%)
[10:10:47] Completed 67500 out of 250000 steps  (27%)
[10:19:38] Completed 70000 out of 250000 steps  (28%)
[10:28:29] Completed 72500 out of 250000 steps  (29%)
[10:37:21] Completed 75000 out of 250000 steps  (30%)
[10:46:16] Completed 77500 out of 250000 steps  (31%)
[10:55:13] Completed 80000 out of 250000 steps  (32%)
[11:04:09] Completed 82500 out of 250000 steps  (33%)
[11:13:04] Completed 85000 out of 250000 steps  (34%)
[11:21:54] Completed 87500 out of 250000 steps  (35%)
[11:30:44] Completed 90000 out of 250000 steps  (36%)
[11:39:33] Completed 92500 out of 250000 steps  (37%)
[11:48:24] Completed 95000 out of 250000 steps  (38%)
[11:57:15] Completed 97500 out of 250000 steps  (39%)
[12:06:05] Completed 100000 out of 250000 steps  (40%)
[12:14:57] Completed 102500 out of 250000 steps  (41%)
[12:23:51] Completed 105000 out of 250000 steps  (42%)
[12:32:45] Completed 107500 out of 250000 steps  (43%)
[12:41:37] Completed 110000 out of 250000 steps  (44%)
[12:50:28] Completed 112500 out of 250000 steps  (45%)
[12:59:16] Completed 115000 out of 250000 steps  (46%)
[13:08:06] Completed 117500 out of 250000 steps  (47%)
[13:16:57] Completed 120000 out of 250000 steps  (48%)
[13:25:48] Completed 122500 out of 250000 steps  (49%)
[13:34:41] Completed 125000 out of 250000 steps  (50%)
[13:43:34] Completed 127500 out of 250000 steps  (51%)
[13:52:27] Completed 130000 out of 250000 steps  (52%)
[14:01:23] Completed 132500 out of 250000 steps  (53%)
[14:10:16] Completed 135000 out of 250000 steps  (54%)
[14:19:06] Completed 137500 out of 250000 steps  (55%)
[14:27:58] Completed 140000 out of 250000 steps  (56%)
[14:36:47] Completed 142500 out of 250000 steps  (57%)
[14:45:35] Completed 145000 out of 250000 steps  (58%)
[14:54:22] Completed 147500 out of 250000 steps  (59%)
[15:02:04] - Autosending finished units... [April 3 15:02:04 UTC]
[15:02:04] Trying to send all finished work units
[15:02:04] + No unsent completed units remaining.
[15:02:04] - Autosend completed
[15:03:07] Completed 150000 out of 250000 steps  (60%)
[15:11:52] Completed 152500 out of 250000 steps  (61%)
[15:20:37] Completed 155000 out of 250000 steps  (62%)
[15:29:24] Completed 157500 out of 250000 steps  (63%)
[15:38:13] Completed 160000 out of 250000 steps  (64%)
[15:47:00] Completed 162500 out of 250000 steps  (65%)
[15:55:47] Completed 165000 out of 250000 steps  (66%)
[16:04:34] Completed 167500 out of 250000 steps  (67%)
[16:13:23] Completed 170000 out of 250000 steps  (68%)
[16:22:13] Completed 172500 out of 250000 steps  (69%)
[16:31:03] Completed 175000 out of 250000 steps  (70%)
[16:39:53] Completed 177500 out of 250000 steps  (71%)
[16:48:41] Completed 180000 out of 250000 steps  (72%)
[16:57:29] Completed 182500 out of 250000 steps  (73%)
[17:06:17] Completed 185000 out of 250000 steps  (74%)
[17:15:06] Completed 187500 out of 250000 steps  (75%)
[17:23:57] Completed 190000 out of 250000 steps  (76%)
[17:32:52] Completed 192500 out of 250000 steps  (77%)
[17:41:47] Completed 195000 out of 250000 steps  (78%)
[17:50:42] Completed 197500 out of 250000 steps  (79%)
[17:59:36] Completed 200000 out of 250000 steps  (80%)
[18:08:27] Completed 202500 out of 250000 steps  (81%)
[18:17:24] Completed 205000 out of 250000 steps  (82%)
[18:26:21] Completed 207500 out of 250000 steps  (83%)
[18:35:16] Completed 210000 out of 250000 steps  (84%)
[18:44:09] Completed 212500 out of 250000 steps  (85%)
[18:53:02] Completed 215000 out of 250000 steps  (86%)
[19:01:55] Completed 217500 out of 250000 steps  (87%)
[19:10:45] Completed 220000 out of 250000 steps  (88%)
[19:19:34] Completed 222500 out of 250000 steps  (89%)
[19:28:24] Completed 225000 out of 250000 steps  (90%)
[19:37:14] Completed 227500 out of 250000 steps  (91%)
[19:46:03] Completed 230000 out of 250000 steps  (92%)
[19:54:53] Completed 232500 out of 250000 steps  (93%)
[20:03:43] Completed 235000 out of 250000 steps  (94%)
[20:12:34] Completed 237500 out of 250000 steps  (95%)
[20:21:24] Completed 240000 out of 250000 steps  (96%)
[20:30:13] Completed 242500 out of 250000 steps  (97%)
[20:39:02] Completed 245000 out of 250000 steps  (98%)
[20:47:51] Completed 247500 out of 250000 steps  (99%)
[20:56:40] Completed 250000 out of 250000 steps  (100%)
[20:56:41] DynamicWrapper: Finished Work Unit: sleep=1000
[20:56:42] 
[20:56:42] Finished Work Unit:
[20:56:42] - Reading up to 21126384 from "work/wudata_06.trr": Read 21126384
[20:56:43] trr file hash check passed.
[20:56:43] - Reading up to 4511608 from "work/wudata_06.xtc": Read 4511608
[20:56:43] xtc file hash check passed.
[20:56:43] edr file hash check passed.
[20:56:43] logfile size: 175879
[20:56:43] Leaving Run
[20:56:43] Done with run, master node
[20:56:43] - Writing 25995775 bytes of core data to disk...
[20:56:44]   ... Done.
[20:56:48] - Shutting down core
[20:56:48] 
[20:56:48] Folding@home Core Shutdown: FINISHED_UNIT
[21:00:04] CoreStatus = 64 (100)
[21:00:04] Unit 6 finished with 79 percent of time to deadline remaining.
[21:00:04] Updated performance fraction: 0.782534
[21:00:04] Sending work to server
[21:00:04] Project: 2669 (Run 11, Clone 149, Gen 102)


[21:00:04] + Attempting to send results [April 3 21:00:04 UTC]
[21:00:04] - Reading file work/wuresults_06.dat from core
[21:00:04]   (Read 25995775 bytes from disk)
[21:00:04] Connecting to http://171.64.65.56:8080/
[21:02:04] - Autosending finished units... [April 3 21:02:04 UTC]
[21:02:04] Trying to send all finished work units
[21:02:04] - Already sending work
[21:02:04] + Sent 0 of 1 completed units to the server
[21:02:04] - Autosend completed
[21:03:40] Posted data.
[21:03:40] Initial: 0000; - Uploaded at ~116 kB/s
[21:03:41] - Averaged speed for that direction ~131 kB/s
[21:03:41] + Results successfully sent
[21:03:41] Thank you for your contribution to Folding@Home.
[21:03:41] + Number of Units Completed: 32

[21:03:42] - Warning: Could not delete all work unit files (6): Core file absent
[21:03:42] Trying to send all finished work units
[21:03:42] + No unsent completed units remaining.
[21:03:42] - Preparing to get new work unit...
[21:03:42] + Attempting to get work packet
[21:03:42] - Will indicate memory of 16003 MB
[21:03:42] - Connecting to assignment server
[21:03:42] Connecting to http://assign.stanford.edu:8080/
[21:03:42] Posted data.
[21:03:42] Initial: 43AB; - Successful: assigned to (171.67.108.24).
[21:03:42] + News From Folding@Home: Welcome to Folding@Home
[21:03:43] Loaded queue successfully.
[21:03:43] Connecting to http://171.67.108.24:8080/
[21:03:49] Posted data.
[21:03:49] Initial: 0000; - Receiving payload (expected size: 4822926)
[21:04:13] - Downloaded at ~196 kB/s
[21:04:13] - Averaged speed for that direction ~343 kB/s
[21:04:13] + Received work.
[21:04:13] Trying to send all finished work units
[21:04:13] + No unsent completed units remaining.
[21:04:13] + Closed connections
[21:04:13] 
[21:04:13] + Processing work unit
[21:04:13] Core required: FahCore_a2.exe
[21:04:13] Core found.
[21:04:13] Working on queue slot 07 [April 3 21:04:13 UTC]
[21:04:13] + Working ...
[21:04:13] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 07 -checkpoint 15 -verbose -lifeline 9819 -version 624'

[21:04:13] 
[21:04:13] *------------------------------*
[21:04:13] Folding@Home Gromacs SMP Core
[21:04:13] Version 2.04 (Thu Jan 29 16:43:57 PST 2009)
[21:04:13] 
[21:04:13] Preparing to commence simulation
[21:04:13] - Ensuring status. Please wait.
[21:04:14] Called DecompressByteArray: compressed_data_size=4822414 data_size=24052929, decompressed_data_size=24052929 diff=0
[21:04:14] - Digital signature verified
[21:04:14] 
[21:04:14] Project: 2671 (Run 17, Clone 90, Gen 3)
[21:04:14] 
[21:04:14] Assembly optimizations on if available.
[21:04:14] Entering M.D.
[21:04:23] (Run 17, Clone 90, Gen 3)
[21:04:23] 
[21:04:24] Entering M.D.
[21:21:48] Completed 5000 out of 250000 steps  (2%)
[21:30:28] Completed 7500 out of 250000 steps  (3%)
[21:39:09] Completed 10000 out of 250000 steps  (4%)
[21:47:49] Completed 12500 out of 250000 steps  (5%)
[21:56:31] Completed 15000 out of 250000 steps  (6%)
[22:05:11] Completed 17500 out of 250000 steps  (7%)
[22:13:52] Completed 20000 out of 250000 steps  (8%)
[22:22:33] Completed 22500 out of 250000 steps  (9%)
[22:31:14] Completed 25000 out of 250000 steps  (10%)
[22:39:55] Completed 27500 out of 250000 steps  (11%)
[22:48:35] Completed 30000 out of 250000 steps  (12%)
[22:57:16] Completed 32500 out of 250000 steps  (13%)
[23:05:57] Completed 35000 out of 250000 steps  (14%)
[23:14:38] Completed 37500 out of 250000 steps  (15%)
[23:23:21] Completed 40000 out of 250000 steps  (16%)
[23:32:03] Completed 42500 out of 250000 steps  (17%)
[23:40:46] Completed 45000 out of 250000 steps  (18%)
[23:49:25] Completed 47500 out of 250000 steps  (19%)
[23:58:06] Completed 50000 out of 250000 steps  (20%)
[00:06:46] Completed 52500 out of 250000 steps  (21%)
[00:15:31] Completed 55000 out of 250000 steps  (22%)
[00:24:13] Completed 57500 out of 250000 steps  (23%)
[00:32:56] Completed 60000 out of 250000 steps  (24%)
[00:41:36] Completed 62500 out of 250000 steps  (25%)
[00:50:15] Completed 65000 out of 250000 steps  (26%)
[00:58:53] Completed 67500 out of 250000 steps  (27%)
[01:07:31] Completed 70000 out of 250000 steps  (28%)
[01:09:47] CoreStatus = FF (255)
[01:09:47] Sending work to server
[01:09:47] Project: 2671 (Run 17, Clone 90, Gen 3)
[01:09:47] - Error: Could not get length of results file work/wuresults_07.dat
[01:09:47] - Error: Could not read unit 07 file. Removing from queue.
[01:09:47] Trying to send all finished work units
[01:09:47] + No unsent completed units remaining.
[01:09:47] - Preparing to get new work unit...
[01:09:47] + Attempting to get work packet
[01:09:47] - Will indicate memory of 16003 MB
[01:09:47] - Connecting to assignment server
[01:09:47] Connecting to http://assign.stanford.edu:8080/
[01:09:48] Posted data.
[01:09:48] Initial: 40AB; - Successful: assigned to (171.64.65.56).
[01:09:48] + News From Folding@Home: Welcome to Folding@Home
[01:09:48] Loaded queue successfully.
[01:09:48] Connecting to http://171.64.65.56:8080/
[01:09:53] Posted data.
[01:09:53] Initial: 0000; - Receiving payload (expected size: 4834892)
[01:10:07] - Downloaded at ~337 kB/s
[01:10:07] - Averaged speed for that direction ~342 kB/s
[01:10:07] + Received work.
[01:10:07] Trying to send all finished work units
[01:10:07] + No unsent completed units remaining.
[01:10:07] + Closed connections
[01:10:12] 
[01:10:12] + Processing work unit
[01:10:12] Core required: FahCore_a2.exe
[01:10:12] Core found.
[01:10:12] Working on queue slot 08 [April 4 01:10:12 UTC]
[01:10:12] + Working ...
[01:10:12] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 08 -checkpoint 15 -verbose -lifeline 9819 -version 624'

[01:10:12] 
[01:10:12] *------------------------------*
[01:10:12] Folding@Home Gromacs SMP Core
[01:10:12] Version 2.04 (Thu Jan 29 16:43:57 PST 2009)
[01:10:12] 
[01:10:12] Preparing to commence simulation
[01:10:12] - Ensuring status. Please wait.
[01:10:21] - Looking at optimizations...
[01:10:21] - Working with standard loops on this execution.
[01:10:21] - Files status OK
[01:10:22] - Expanded 4834380 -> 23974209 (decompressed 495.9 percent)
[01:10:22] Called DecompressByteArray: compressed_data_size=4834380 data_size=23974209, decompressed_data_size=23974209 diff=0
[01:10:22] - Digital signature verified
[01:10:22] 
[01:10:22] Project: 2669 (Run 12, Clone 156, Gen 38)
[01:10:22] 
[01:10:23] Entering M.D.
[01:19:14] Completed 2509 out of 250000 steps  (1%)
[01:27:54] Completed 5009 out of 250000 steps  (2%)
[01:36:33] Completed 7509 out of 250000 steps  (3%)
[01:45:13] Completed 10009 out of 250000 steps  (4%)
[01:53:57] Completed 12509 out of 250000 steps  (5%)
[02:02:42] Completed 15009 out of 250000 steps  (6%)
[02:11:24] Completed 17509 out of 250000 steps  (7%)
[02:20:06] Completed 20009 out of 250000 steps  (8%)
[02:28:48] Completed 22509 out of 250000 steps  (9%)
[02:37:29] Completed 25009 out of 250000 steps  (10%)
[02:46:11] Completed 27509 out of 250000 steps  (11%)
[02:54:52] Completed 30009 out of 250000 steps  (12%)
[03:02:04] - Autosending finished units... [April 4 03:02:04 UTC]
[03:02:04] Trying to send all finished work units
[03:02:04] + No unsent completed units remaining.
[03:02:04] - Autosend completed
[03:03:33] Completed 32509 out of 250000 steps  (13%)
[03:12:16] Completed 35009 out of 250000 steps  (14%)
[03:21:01] Completed 37509 out of 250000 steps  (15%)
[03:29:45] Completed 40009 out of 250000 steps  (16%)
[03:38:29] Completed 42509 out of 250000 steps  (17%)
[03:47:14] Completed 45009 out of 250000 steps  (18%)
[03:55:56] Completed 47509 out of 250000 steps  (19%)
[04:04:39] Completed 50009 out of 250000 steps  (20%)
[04:13:25] Completed 52509 out of 250000 steps  (21%)
[04:22:11] Completed 55009 out of 250000 steps  (22%)
[04:30:57] Completed 57509 out of 250000 steps  (23%)
[04:39:43] Completed 60009 out of 250000 steps  (24%)
[04:48:33] Completed 62509 out of 250000 steps  (25%)
[04:57:23] Completed 65009 out of 250000 steps  (26%)
[05:06:14] Completed 67509 out of 250000 steps  (27%)
[05:15:04] Completed 70009 out of 250000 steps  (28%)
[05:23:54] Completed 72509 out of 250000 steps  (29%)
[05:32:44] Completed 75009 out of 250000 steps  (30%)
[05:41:34] Completed 77509 out of 250000 steps  (31%)
[05:50:23] Completed 80009 out of 250000 steps  (32%)
[05:59:12] Completed 82509 out of 250000 steps  (33%)
[06:08:01] Completed 85009 out of 250000 steps  (34%)
[06:16:49] Completed 87509 out of 250000 steps  (35%)
[06:25:34] Completed 90009 out of 250000 steps  (36%)
[06:34:20] Completed 92509 out of 250000 steps  (37%)
[06:43:08] Completed 95009 out of 250000 steps  (38%)
[06:51:56] Completed 97509 out of 250000 steps  (39%)
[07:00:46] Completed 100009 out of 250000 steps  (40%)
[07:09:35] Completed 102509 out of 250000 steps  (41%)
[07:18:24] Completed 105009 out of 250000 steps  (42%)
[07:26:05] 
[07:26:05] Folding@home Core Shutdown: INTERRUPTED
[07:26:10] CoreStatus = 66 (102)
[07:26:10] + Shutdown requested by user. Exiting.***** Got a SIGTERM signal (15)
[07:26:10] Killing all core threads

Folding@Home Client Shutdown.
restarting client...

Re: Project: 2669 (Run 12, Clone 156, Gen 38) seg fault

Posted: Sat Apr 04, 2009 8:36 pm
by bruce
Windows or Linux?

Re: Project: 2669 (Run 12, Clone 156, Gen 38) seg fault

Posted: Sat Apr 04, 2009 10:48 pm
by toTOW
Linux ... that's an A2 WU ;)

Unfortunately, that's a common issue on alpha754293's machines ... which we've never been able to resolve :(

Re: Project: 2669 (Run 12, Clone 156, Gen 38) seg fault

Posted: Sun Apr 05, 2009 11:48 pm
by alpha754293
toTOW wrote:Linux ... that's an A2 WU ;)

Unfortunately, that's a common issue on alpha754293's machines ... which we've never been able to resolve :(
The hardware itself is known good. I've already ran memtest and everything and it passed.

I can enable ECC/Reg to see if it will help with it, but know that that is going to cause a slow down in the runs.

The hardware is also COTS (Tyan B4882 2U barebones, with four AMD Opteron 880s, and 16 GB of Kingston KVR DDR-400 ECC/Reg. RAM, and two Hitachi 146GB 15krpm U320 hard drives -- nothing out of the ordinary).

Also interesting is that even when I run my own sims (FEA/CFD); I don't get seg faults with them, and the OS was rebuilt maybe like 6 months or so ago.

So, I don't know.

*edit*
Also, if you look in the log; I've never really noticed this, but there are apparently a number of WUs that failed that the client took care of by itself.

I only glanced at the log, and saw that there were some WU's with core status FF, so I don't know what's going on there.