Project: 2665 (Run 2, Clone 300, Gen 50) hang/segfault

Moderators: Site Moderators, FAHC Science Team

Post Reply
parkut
Posts: 366
Joined: Tue Feb 12, 2008 7:33 am
Hardware configuration: Running exclusively Linux headless blades. All are dedicated crunching machines.
Location: SE Michigan, USA

Project: 2665 (Run 2, Clone 300, Gen 50) hang/segfault

Post by parkut »

WU got stuck at 43%. Restarted, and immediately segfaults
with error 66 (102)

model name : Intel(R) Core(TM)2 Duo CPU E6750 @ 2.66GHz
cpu MHz : 1998.000
cache size : 4096 KB
Memory: 975.93 MB physical, 1.94 GB virtual
...
Folding@Home Client Version 6.23 Beta R1
Current Work Unit
-----------------
Name: p2665_IBX in water
Tag: P2665R2C300G50
Download time: November 4 14:15:21
Due time: November 10 14:15:21
Progress: 43% [||||______]

[14:15:21] Working on queue slot 00 [November 4 14:15:21 UTC]
[14:15:21] + Working ...
[14:15:39] Project: 2665 (Run 2, Clone 300, Gen 50)
[14:15:39] Entering M.D.
[14:15:47] Protein: HGG with glycosylations
[14:15:47] Writing local files
[14:15:47] Extra SSE boost OK.
[14:30:49] t triggered.
[14:33:57] Writing local files
[14:33:57] Completed 2500 out of 250000 steps (1 percent)

[02:57:08] Completed 105000 out of 250000 steps (42 percent)
[03:12:09] Timered checkpoint triggered.
[03:15:15] Writing local files
[03:15:16] Completed 107500 out of 250000 steps (43 percent)
[03:20:37]
[03:20:37] Folding@home Core Shutdown: INTERRUPTED
[03:31:48] - Autosending finished units... [November 5 03:31:48 UTC]
[03:31:48] Trying to send all finished work units
[03:31:48] + No unsent completed units remaining.
[03:31:48] - Autosend completed
[04:01:01] ***** Got a SIGTERM signal (15)
[04:01:01] Killing all core threads

Folding@Home Client Shutdown.

[12:31:49] Project: 2665 (Run 2, Clone 300, Gen 50)
[12:31:49]
[12:31:49] Entering M.D.
[12:31:57] Protein: HGG with glycosylations
[12:31:57] Writing local files
[12:31:57] Completed 107500 out of 250000 steps (43 percent)
[12:32:02] Extra SSE boost OK.
[12:37:35]
[12:37:35] Folding@home Core Shutdown: INTERRUPTED
[12:37:39] CoreStatus = 66 (102)
[12:37:39] + Shutdown requested by user. Exiting.***** Got a SIGTERM signal (15)
[12:37:39] Killing all core threads

Folding@Home Client Shutdown.

[0]0:Return code = 102
[0]1:Return code = 0, signaled with Quit
[0]2:Return code = 0, signaled with Segmentation fault
[0]3:Return code = 0, signaled with Segmentation fault
toTOW
Site Moderator
Posts: 6453
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France
Contact:

Re: Project: 2665 (Run 2, Clone 300, Gen 50) hang/segfault

Post by toTOW »

There is no data for this WU in the DB yet ...
Image

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
preet.to
Posts: 19
Joined: Sun Dec 16, 2007 3:20 pm

Re: Project: 2665 (Run 2, Clone 300, Gen 50) hang/segfault

Post by preet.to »

I am getting the same problem. All forward progress is halted as I cannot restart his WU and keep getting it assigned.

Code: Select all

[03:38:42] *------------------------------*
[03:38:42] Folding@Home Gromacs SMP Core
[03:38:42] Version 1.74 (November 27, 2006)
[03:38:42]
[03:38:42] Preparing to commence simulation
[03:38:42] - Ensuring status. Please wait.
[03:38:43] - Starting from initial work packet
[03:38:43]
[03:38:43] Project: 2665 (Run 0, Clone 208, Gen 64)
[03:38:43]
[03:38:43] Assembly optimizations on if available.
[03:38:43] Entering M.D.
[03:39:00]  percent)
[03:39:00] cket
[03:39:00]
[03:39:00] Project: 2665 (Run 0, Clone 208, Gen 64)
[03:39:00]
[03:39:00] 65 (Run 0, Clone 208, Gen 64)
[03:39:00]
[03:39:00] Entering M.D.
[03:39:08] Protein: HGG in water
[03:39:08] Writing local files
[03:39:08] Extra SSE boost OK.
[03:54:11] t triggered.
[04:06:04] Writing local files
[04:06:04] Completed 2500 out of 250000 steps  (1 percent)
[04:21:06] Timered checkpoint triggered.
[04:33:02] Writing local files
[04:33:02] Completed 5000 out of 250000 steps  (2 percent)
[04:48:03] Timered checkpoint triggered.
[04:59:55] Writing local files
[04:59:56] Completed 7500 out of 250000 steps  (3 percent)
[05:14:57] Timered checkpoint triggered.
[05:26:53] Writing local files
[05:26:53] Completed 10000 out of 250000 steps  (4 percent)
[05:41:54] Timered checkpoint triggered.
[05:53:54] Writing local files
[05:53:54] Completed 12500 out of 250000 steps  (5 percent)
[06:08:54] Timered checkpoint triggered.
[06:20:48] Writing local files
[06:20:48] Completed 15000 out of 250000 steps  (6 percent)
[06:35:49] Timered checkpoint triggered.
[06:47:44] Writing local files
[06:47:44] Completed 17500 out of 250000 steps  (7 percent)
[06:57:18] - Autosending finished units...
[06:57:18] Trying to send all finished work units
[06:57:18] + No unsent completed units remaining.
[06:57:18] - Autosend completed
[07:02:45] Timered checkpoint triggered.
[07:14:42] Writing local files
[07:14:42] Completed 20000 out of 250000 steps  (8 percent)
[07:29:43] Timered checkpoint triggered.
[07:35:02]
[07:35:02] Folding@home Core Shutdown: INTERRUPTED
[07:35:06] CoreStatus = 66 (102)
[07:35:06] + Shutdown requested by user. Exiting.***** Got a SIGTERM signal (15)
[07:35:06] Killing all core threads

Folding@Home Client Shutdown.
Post Reply