Project: 2665 (Run 1, Clone 649, Gen 6)
Posted: Wed Jun 25, 2008 5:35 am
One of my dedicated folders is having a problem with Project: 2665 (Run 1, Clone 649, Gen 6). It immediately dies with a CoreStatus = 66 (102), which is the "Shutdown requested by user." I did not, of course, attempt to shut it down. I also see in the syslog that 3 of the 4 cores segfault at that time. It has done this multiple times. As I would love to get past this WU and get this folder folding again, I'll try to dump it, but wanted to report it first.
System is Ubuntu 8.04, Linux 6.02beta1 client, q6600@2.88GHz, 2GB mem.
FAHlog.txt:
syslog:
System is Ubuntu 8.04, Linux 6.02beta1 client, q6600@2.88GHz, 2GB mem.
FAHlog.txt:
Code: Select all
--- Opening Log file [June 25 05:06:16]
# SMP Client ##################################################################
###############################################################################
Folding@Home Client Version 6.02beta
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: /home/smpfold/foldingathome/CPU1
Executable: /home/smpfold/foldingathome/CPU1/fah6
Arguments: -forceasm -smp -verbosity 9
Warning:
By using the -forceasm flag, you are overriding
safeguards in the program. If you did not intend to
do this, please restart the program without -forceasm.
If work units are not completing fully (and particularly
if your machine is overclocked), then please discontinue
use of the flag.
[05:06:16] - Ask before connecting: No
[05:06:16] - User name: GTron (Team 0)
[05:06:16] - User ID: 76E5E3D439736F7C
[05:06:16] - Machine ID: 5
[05:06:16]
[05:06:16] Could not open work queue, generating new queue...
[05:06:16] - Autosending finished units...
[05:06:16] Trying to send all finished work units
[05:06:16] + No unsent completed units remaining.
[05:06:16] - Autosend completed
[05:06:16] - Preparing to get new work unit...
[05:06:16] + Attempting to get work packet
[05:06:16] - Will indicate memory of 1536 MB
[05:06:16] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 15, Stepping: 7
[05:06:16] - Connecting to assignment server
[05:06:16] Connecting to http://assign.stanford.edu:8080/
[05:06:16] Posted data.
[05:06:16] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[05:06:16] + News From Folding@Home: Welcome to Folding@Home
[05:06:16] Loaded queue successfully.
[05:06:16] Connecting to http://171.64.65.64:8080/
[05:06:21] Posted data.
[05:06:21] Initial: 0000; - Receiving payload (expected size: 4659162)
[05:06:30] - Downloaded at ~505 kB/s
[05:06:30] - Averaged speed for that direction ~505 kB/s
[05:06:30] + Received work.
[05:06:30] + Closed connections
[05:06:30]
[05:06:30] + Processing work unit
[05:06:30] Core required: FahCore_a1.exe
[05:06:30] Core found.
[05:06:30] Working on Unit 01 [June 25 05:06:30]
[05:06:30] + Working ...
[05:06:30] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a1.exe -dir work/ -suffix 01 -checkpoint 15 -forceasm -verbose -lifeline 7412 -version 602'
[05:06:30]
[05:06:30] *------------------------------*
[05:06:30] Folding@Home Gromacs SMP Core
[05:06:30] Version 1.74 (November 27, 2006)
[05:06:30]
[05:06:30] Preparing to commence simulation
[05:06:30] - Ensuring status. Please wait.
[05:06:30] - Starting from initial work packet
[05:06:31]
[05:06:31] Project: 2665 (Run 1, Clone 649, Gen 6)
[05:06:31]
[05:06:31] Assembly optimizations on if available.
[05:06:31] Entering M.D.
[05:06:48] on if available.
[05:06:48] Entering M.D.
[05:06:55] X in water
[05:06:55] Writing local files
[05:06:55]
[05:06:55] Folding@hoFinalizing output
[05:06:55] Extra SSE boost OK.
[05:06:55] E boost OK.
[05:06:59] CoreStatus = 66 (102)
[05:06:59] + Shutdown requested by user. Exiting.***** Got a SIGTERM signal (15)
[05:06:59] Killing all core threads
Folding@Home Client Shutdown.
Code: Select all
Jun 24 23:06:55 GHARVCO-17 kernel: [ 708.238208] FahCore_a1.exe[7492]: segfault at 11114c0 rip 5ce05e rsp 40ef3aa0 error 4
Jun 24 23:06:55 GHARVCO-17 kernel: [ 708.256264] FahCore_a1.exe[7493]: segfault at 1112360 rip 5ce07f rsp 40ef3aa0 error 4
Jun 24 23:06:55 GHARVCO-17 kernel: [ 708.298413] FahCore_a1.exe[7496]: segfault at 11154c0 rip 5ce074 rsp 408c5aa0 error 4