Project: 6055 (Run 0, Clone 152, Gen 472)

Foxbat
Posts: 94
Joined: Wed Dec 05, 2007 10:23 pm
Hardware configuration: Apple Mac Pro 1,1 2x2.66 GHz Dual-Core Xeon w/10 GB RAM | EVGA GTX 960, Zotac GTX 750 Ti | Ubuntu 14.04 LTS
Dell Precision T7400 2x3.0 GHz Quad-Core Xeon w/16 GB RAM | Zotac GTX 970 | Ubuntu 14.04 LTS
Apple iMac Retina 5K 4.00 GHz Core i7 w/8 GB RAM | OS X 10.11.3 (El Capitan)
Location: Michiana, USA

Post by Foxbat »

My Mac Pro (twin 2.66 GHz dual-core Xeon CPUs w/10 GB RAM) running OS X Snow Leopard 10.6.8 has been spinning its multi-core wheels on this WU. Activity Monitor has shown 100% CPU for the last 36 hours, yet the client still reports 0% progress. Here's the log file, including the restart last night:


--- Opening Log file [November 9 10:16:26 UTC] 

# Mac OS X SMP Console Edition ################################################
###############################################################################

                       Folding@Home Client Version 6.29r3

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /Users/Foxbat/Library/FAH-SMP-Term1
Executable: /Users/Foxbat/Library/FAH-SMP-Term1/fah6
Arguments: -local -advmethods -forceasm -verbosity 9 -smp 

[10:16:26] - Ask before connecting: No
[10:16:26] - User name: Foxbat (Team 55236)
[10:16:26] - User ID: <snip>
[10:16:26] - Machine ID: 1
[10:16:26] 
[10:16:26] Loaded queue successfully.
[10:16:26] 
[10:16:26] + Processing work unit
[10:16:26] Core required: FahCore_a3.exe
[10:16:26] - Autosending finished units... [November 9 10:16:26 UTC]
[10:16:26] Trying to send all finished work units
[10:16:26] + No unsent completed units remaining.
[10:16:26] - Autosend completed
[10:16:26] Core found.
[10:16:26] Working on queue slot 01 [November 9 10:16:26 UTC]
[10:16:26] + Working ...
[10:16:26] - Calling './FahCore_a3.exe -dir work/ -nice 19 -suffix 01 -np 4 -nocpulock -checkpoint 8 -forceasm -verbose -lifeline 303 -version 629'

[10:16:26] 
[10:16:26] *------------------------------*
[10:16:26] Folding@Home Gromacs SMP Core
[10:16:26] Version 2.22 (May 7 2010)
[10:16:26] 
[10:16:26] Preparing to commence simulation
[10:16:26] - Ensuring status. Please wait.
[10:16:35] - Assembly optimizations manually forced on.
[10:16:35] - Not checking prior termination.
[10:16:36] - Expanded 1765317 -> 2257001 (decompressed 127.8 percent)
[10:16:36] Called DecompressByteArray: compressed_data_size=1765317 data_size=2257001, decompressed_data_size=2257001 diff=0
[10:16:36] - Digital signature verified
[10:16:36] 
[10:16:36] Project: 6060 (Run 1, Clone 150, Gen 366)
[10:16:36] 
[10:16:36] Assembly optimizations on if available.
[10:16:36] Entering M.D.
[10:16:42] Using Gromacs checkpoints
[10:16:43] Resuming from checkpoint
[10:16:43] Verified work/wudata_01.log
[10:16:43] Verified work/wudata_01.trr
[10:16:43] Verified work/wudata_01.edr
[10:16:44] Completed 47574 out of 500000 steps  (9%)
[10:20:54] Completed 50000 out of 500000 steps  (10%)

<normal progress reports removed>

[21:30:38] Completed 490000 out of 500000 steps  (98%)
[21:38:12] Completed 495000 out of 500000 steps  (99%)
[21:45:48] Completed 500000 out of 500000 steps  (100%)
[21:45:48] DynamicWrapper: Finished Work Unit: sleep=10000
[21:45:58] 
[21:45:58] Finished Work Unit:
[21:45:58] - Reading up to 3701520 from "work/wudata_01.trr": Read 3701520
[21:45:58] trr file hash check passed.
[21:45:58] edr file hash check passed.
[21:45:58] logfile size: 60505
[21:45:58] Leaving Run
[21:46:02] - Writing 3797577 bytes of core data to disk...
[21:46:02]   ... Done.
[21:46:03] - Shutting down core
[21:46:03] 
[21:46:03] Folding@home Core Shutdown: FINISHED_UNIT
[21:46:03] CoreStatus = 64 (100)
[21:46:03] Unit 1 finished with 90 percent of time to deadline remaining.
[21:46:03] Updated performance fraction: 0.909645
[21:46:03] Sending work to server
[21:46:03] Project: 6060 (Run 1, Clone 150, Gen 366)


[21:46:03] + Attempting to send results [November 9 21:46:03 UTC]
[21:46:03] - Reading file work/wuresults_01.dat from core
[21:46:03]   (Read 3797577 bytes from disk)
[21:46:03] Connecting to http://171.64.65.54:8080/
[21:46:52] Posted data.
[21:46:52] Initial: 0000; - Uploaded at ~74 kB/s
[21:46:53] - Averaged speed for that direction ~76 kB/s
[21:46:53] + Results successfully sent
[21:46:53] Thank you for your contribution to Folding@Home.
[21:46:53] + Number of Units Completed: 2376

[21:46:54] Trying to send all finished work units
[21:46:54] + No unsent completed units remaining.
[21:46:54] - Preparing to get new work unit...
[21:46:54] Cleaning up work directory
[21:46:54] + Attempting to get work packet
[21:46:54] Passkey found
[21:46:54] - Will indicate memory of 8192 MB
[21:46:54] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 15, Stepping: 6
[21:46:54] - Connecting to assignment server
[21:46:54] Connecting to http://assign.stanford.edu:8080/
[21:46:54] Posted data.
[21:46:54] Initial: 0000; + No appropriate work server was available; will try again in a bit.
[21:46:54] + Couldn't get work instructions.
[21:46:54] - Attempt #1  to get work failed, and no other work to do.
Waiting before retry.
[21:47:11] + Attempting to get work packet
[21:47:11] Passkey found
[21:47:11] - Will indicate memory of 8192 MB
[21:47:11] - Connecting to assignment server
[21:47:11] Connecting to http://assign.stanford.edu:8080/
[21:47:11] Posted data.
[21:47:11] Initial: 0000; + No appropriate work server was available; will try again in a bit.
[21:47:11] + Couldn't get work instructions.
[21:47:11] - Attempt #2  to get work failed, and no other work to do.
Waiting before retry.
[21:47:27] + Attempting to get work packet
[21:47:27] Passkey found
[21:47:27] - Will indicate memory of 8192 MB
[21:47:27] - Connecting to assignment server
[21:47:27] Connecting to http://assign.stanford.edu:8080/
[21:47:27] Posted data.
[21:47:27] Initial: 0000; + No appropriate work server was available; will try again in a bit.
[21:47:27] + Couldn't get work instructions.
[21:47:27] - Attempt #3  to get work failed, and no other work to do.
Waiting before retry.
[21:47:59] + Attempting to get work packet
[21:47:59] Passkey found
[21:47:59] - Will indicate memory of 8192 MB
[21:47:59] - Connecting to assignment server
[21:47:59] Connecting to http://assign.stanford.edu:8080/
[21:48:00] Posted data.
[21:48:00] Initial: 0000; + No appropriate work server was available; will try again in a bit.
[21:48:00] + Couldn't get work instructions.
[21:48:00] - Attempt #4  to get work failed, and no other work to do.
Waiting before retry.
[21:48:48] + Attempting to get work packet
[21:48:48] Passkey found
[21:48:48] - Will indicate memory of 8192 MB
[21:48:48] - Connecting to assignment server
[21:48:48] Connecting to http://assign.stanford.edu:8080/
[21:48:49] Posted data.
[21:48:49] Initial: 0000; + No appropriate work server was available; will try again in a bit.
[21:48:49] + Couldn't get work instructions.
[21:48:49] - Attempt #5  to get work failed, and no other work to do.
Waiting before retry.
[21:50:11] + Attempting to get work packet
[21:50:11] Passkey found
[21:50:11] - Will indicate memory of 8192 MB
[21:50:11] - Connecting to assignment server
[21:50:11] Connecting to http://assign.stanford.edu:8080/
[21:50:11] Posted data.
[21:50:11] Initial: 0000; + No appropriate work server was available; will try again in a bit.
[21:50:11] + Couldn't get work instructions.
[21:50:11] - Attempt #6  to get work failed, and no other work to do.
Waiting before retry.
[21:53:04] + Attempting to get work packet
[21:53:04] Passkey found
[21:53:04] - Will indicate memory of 8192 MB
[21:53:04] - Connecting to assignment server
[21:53:04] Connecting to http://assign.stanford.edu:8080/
[21:53:04] Posted data.
[21:53:04] Initial: 0000; + No appropriate work server was available; will try again in a bit.
[21:53:04] + Couldn't get work instructions.
[21:53:04] - Attempt #7  to get work failed, and no other work to do.
Waiting before retry.
[21:58:31] + Attempting to get work packet
[21:58:31] Passkey found
[21:58:31] - Will indicate memory of 8192 MB
[21:58:31] - Connecting to assignment server
[21:58:31] Connecting to http://assign.stanford.edu:8080/
[21:58:32] Posted data.
[21:58:32] Initial: 0000; + No appropriate work server was available; will try again in a bit.
[21:58:32] + Couldn't get work instructions.
[21:58:32] - Attempt #8  to get work failed, and no other work to do.
Waiting before retry.
[22:09:19] + Attempting to get work packet
[22:09:19] Passkey found
[22:09:19] - Will indicate memory of 8192 MB
[22:09:19] - Connecting to assignment server
[22:09:19] Connecting to http://assign.stanford.edu:8080/
[22:09:20] Posted data.
[22:09:20] Initial: 40AB; - Successful: assigned to (171.64.65.54).
[22:09:20] + News From Folding@Home: Welcome to Folding@Home
[22:09:20] Loaded queue successfully.
[22:09:20] Sent data
[22:09:20] Connecting to http://171.64.65.54:8080/
[22:09:21] Posted data.
[22:09:21] Initial: 0000; - Receiving payload (expected size: 1390318)
[22:09:31] - Downloaded at ~135 kB/s
[22:09:31] - Averaged speed for that direction ~128 kB/s
[22:09:31] + Received work.
[22:09:31] Trying to send all finished work units
[22:09:31] + No unsent completed units remaining.
[22:09:31] + Closed connections
[22:09:31] 
[22:09:31] + Processing work unit
[22:09:31] Core required: FahCore_a3.exe
[22:09:31] Core found.
[22:09:31] Working on queue slot 02 [November 9 22:09:31 UTC]
[22:09:31] + Working ...
[22:09:31] - Calling './FahCore_a3.exe -dir work/ -nice 19 -suffix 02 -np 4 -nocpulock -checkpoint 8 -forceasm -verbose -lifeline 303 -version 629'

[22:09:31] 
[22:09:31] *------------------------------*
[22:09:31] Folding@Home Gromacs SMP Core
[22:09:31] Version 2.22 (May 7 2010)
[22:09:31] 
[22:09:31] Preparing to commence simulation
[22:09:31] - Assembly optimizations manually forced on.
[22:09:31] - Not checking prior termination.
[22:09:32] - Expanded 1389806 -> 2251569 (decompressed 162.0 percent)
[22:09:32] Called DecompressByteArray: compressed_data_size=1389806 data_size=2251569, decompressed_data_size=2251569 diff=0
[22:09:32] - Digital signature verified
[22:09:32] 
[22:09:32] Project: 6055 (Run 0, Clone 152, Gen 472)
[22:09:32] 
[22:09:32] Assembly optimizations on if available.
[22:09:32] Entering M.D.
[22:09:38] Completed 0 out of 236500016 steps  (0%)
[22:16:28] - Autosending finished units... [November 9 22:16:28 UTC]
[22:16:28] Trying to send all finished work units
[22:16:28] + No unsent completed units remaining.
[22:16:28] - Autosend completed
[04:16:29] - Autosending finished units... [November 10 04:16:29 UTC]
[04:16:29] Trying to send all finished work units
[04:16:29] + No unsent completed units remaining.
[04:16:29] - Autosend completed
[10:16:31] - Autosending finished units... [November 10 10:16:31 UTC]
[10:16:31] Trying to send all finished work units
[10:16:31] + No unsent completed units remaining.
[10:16:31] - Autosend completed
[16:16:32] - Autosending finished units... [November 10 16:16:32 UTC]
[16:16:32] Trying to send all finished work units
[16:16:32] + No unsent completed units remaining.
[16:16:32] - Autosend completed
[22:16:33] - Autosending finished units... [November 10 22:16:33 UTC]
[22:16:33] Trying to send all finished work units
[22:16:33] + No unsent completed units remaining.
[22:16:33] - Autosend completed
[04:12:47] ***** Got a SIGTERM signal (15)
[04:12:47] Killing all core threads

Folding@Home Client Shutdown.

<deleted Core_A3 Executable, Queue file, and Work folder>

--- Opening Log file [November 11 04:15:10 UTC] 


# Mac OS X SMP Console Edition ################################################
###############################################################################

                       Folding@Home Client Version 6.29r3

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /Users/Foxbat/Library/FAH-SMP-Term1
Executable: /Users/Foxbat/Library/FAH-SMP-Term1/fah6
Arguments: -local -advmethods -forceasm -verbosity 9 -smp 

[04:15:10] - Ask before connecting: No
[04:15:10] - User name: Foxbat (Team 55236)
[04:15:10] - User ID: <snip>
[04:15:10] - Machine ID: 1
[04:15:10] 
[04:15:10] Work directory not found. Creating...
[04:15:10] Could not open work queue, generating new queue...
[04:15:10] - Preparing to get new work unit...
[04:15:10] - Autosending finished units... [04:15:10]
[04:15:10] Cleaning up work directory
[04:15:10] Trying to send all finished work units
[04:15:10] + No unsent completed units remaining.
[04:15:10] - Autosend completed
[04:15:10] + Attempting to get work packet
[04:15:10] Passkey found
[04:15:10] - Will indicate memory of 8192 MB
[04:15:10] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 15, Stepping: 6
[04:15:10] - Connecting to assignment server
[04:15:10] Connecting to http://assign.stanford.edu:8080/
[04:15:11] Posted data.
[04:15:11] Initial: 40AB; - Successful: assigned to (171.64.65.54).
[04:15:11] + News From Folding@Home: Welcome to Folding@Home
[04:15:11] Loaded queue successfully.
[04:15:11] Sent data
[04:15:11] Connecting to http://171.64.65.54:8080/
[04:15:12] Posted data.
[04:15:12] Initial: 0000; - Receiving payload (expected size: 1390318)
[04:15:23] - Downloaded at ~123 kB/s
[04:15:23] - Averaged speed for that direction ~123 kB/s
[04:15:23] + Received work.
[04:15:23] + Closed connections
[04:15:23] 
[04:15:23] + Processing work unit
[04:15:23] Core required: FahCore_a3.exe
[04:15:23] Core not found.
[04:15:23] - Core is not present or corrupted.
[04:15:23] - Attempting to download new core...
[04:15:23] + Downloading new core: FahCore_a3.exe
[04:15:23] Downloading core (/~pande/OSX/x86/Core_a3.fah from www.stanford.edu)
[04:15:24] Initial: AFDE; + 10240 bytes downloaded
[04:15:24] Initial: B94C; + 20480 bytes downloaded
[04:15:24] Initial: 8884; + 30720 bytes downloaded

<removed normal download messages>

[04:15:29] Initial: E2AA; + 1249280 bytes downloaded
[04:15:29] Initial: 3C7A; + 1250687 bytes downloaded
[04:15:29] Verifying core Core_a3.fah...
[04:15:29] Signature is VALID
[04:15:29] 
[04:15:29] Trying to unzip core FahCore_a3.exe
[04:15:29] Decompressed FahCore_a3.exe (3554912 bytes) successfully
[04:15:29] + Core successfully engaged
[04:15:35] 
[04:15:35] + Processing work unit
[04:15:35] Core required: FahCore_a3.exe
[04:15:35] Core found.
[04:15:35] Working on queue slot 01 [November 11 04:15:35 UTC]
[04:15:35] + Working ...
[04:15:35] - Calling './FahCore_a3.exe -dir work/ -nice 19 -suffix 01 -np 4 -nocpulock -checkpoint 8 -forceasm -verbose -lifeline 7590 -version 629'

[04:15:35] 
[04:15:35] *------------------------------*
[04:15:35] Folding@Home Gromacs SMP Core
[04:15:35] Version 2.22 (May 7 2010)
[04:15:35] 
[04:15:35] Preparing to commence simulation
[04:15:35] - Assembly optimizations manually forced on.
[04:15:35] - Not checking prior termination.
[04:15:35] - Expanded 1389806 -> 2251569 (decompressed 162.0 percent)
[04:15:35] Called DecompressByteArray: compressed_data_size=1389806 data_size=2251569, decompressed_data_size=2251569 diff=0
[04:15:35] - Digital signature verified
[04:15:35] 
[04:15:35] Project: 6055 (Run 0, Clone 152, Gen 472)
[04:15:35] 
[04:15:35] Assembly optimizations on if available.
[04:15:35] Entering M.D.
[04:15:42] Completed 0 out of 236500016 steps  (0%)
[10:15:11] - Autosending finished units... [November 11 10:15:11 UTC]
[10:15:11] Trying to send all finished work units
[10:15:11] + No unsent completed units remaining.
[10:15:11] - Autosend completed
It's been a long time since I've seen an abnormally large WU like this.
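For a sense of scale, here is a rough back-of-the-envelope sketch in Python. It assumes the pace of the previous WU's last frame (5000 steps between 21:38:12 and 21:45:48 in the log above) would carry over to this one, which is only a guess since that was a different project (6060). The helper names `seconds_between` and `estimated_runtime` are made up for illustration.

```python
from datetime import datetime, timedelta

def seconds_between(start, end):
    """Seconds between two same-day HH:MM:SS log timestamps."""
    fmt = "%H:%M:%S"
    return (datetime.strptime(end, fmt) - datetime.strptime(start, fmt)).total_seconds()

def estimated_runtime(total_steps, steps_per_frame, seconds_per_frame):
    """Projected wall-clock time if every frame took the same time."""
    return timedelta(seconds=total_steps / steps_per_frame * seconds_per_frame)

# Assumption: frame time borrowed from the Project 6060 WU that finished earlier.
frame_seconds = seconds_between("21:38:12", "21:45:48")
estimate = estimated_runtime(236500016, 5000, frame_seconds)
print(f"{frame_seconds:.0f} s per 5000 steps -> roughly {estimate.days} days")
```

Under that assumption the WU would need well over half a year of continuous running, which fits the 0% shown after 36 hours.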
gwildperson
Posts: 450
Joined: Tue Dec 04, 2007 8:36 pm

Post by gwildperson »

I'm (almost) 100% sure you have a bad WU or some problem with the OSX Version 2.22 core. Here's a quote from the log of a Windows user who is also running a Project 6055 assignment. Your WU has 236500016 steps and his has 500000 steps. In all my experience here, the total number of steps has always been the same for all WUs from a single project.

viewtopic.php?f=58&t=19026#p190280
[08:18:58] *------------------------------*
[08:18:58] Folding@Home Gromacs SMP Core
[08:18:58] Version 2.27 (Dec. 15, 2010)
[08:18:58]
[08:18:58] Preparing to commence simulation
[08:18:58] - Looking at optimizations...
[08:18:58] - Created dyn
[08:18:58] - Files status OK
[08:18:59] - Expanded 1764417 -> 2251569 (decompressed 127.6 percent)
[08:18:59] Called DecompressByteArray: compressed_data_size=1764417 data_size=2251569, decompressed_data_size=2251569 diff=0
[08:18:59] - Digital signature verified
[08:18:59]
[08:18:59] Project: 6055 (Run 0, Clone 46, Gen 305)
[08:18:59]
[08:18:59] Assembly optimizations on if available.
[08:18:59] Entering M.D.
[08:19:05] Mapping NT from 2 to 2
[08:19:06] Completed 0 out of 500000 steps (0%)
[08:34:02] Completed 5000 out of 500000 steps (1%)
[08:48:07] Completed 10000 out of 500000 steps (2%)
[08:49:06] mdrun returned 255
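
The step-count comparison above can also be done mechanically. This is a minimal sketch, assuming progress lines keep the "Completed X out of Y steps" wording seen in both logs; the function names and the `expected_total` parameter are hypothetical, and 500000 is simply the total from the Windows user's log, not an official figure for the project.

```python
import re

# Matches progress lines such as
# "[22:09:38] Completed 0 out of 236500016 steps  (0%)"
PROGRESS_RE = re.compile(r"Completed (\d+) out of (\d+) steps")

def step_totals(log_text):
    """Return the set of distinct total-step counts reported in a log."""
    return {int(m.group(2)) for m in PROGRESS_RE.finditer(log_text)}

def suspicious_totals(log_text, expected_total):
    """Return the sorted totals that differ from the expected one."""
    return sorted(t for t in step_totals(log_text) if t != expected_total)

sample = """\
[22:09:38] Completed 0 out of 236500016 steps  (0%)
[08:19:06] Completed 0 out of 500000 steps (0%)
[08:34:02] Completed 5000 out of 500000 steps (1%)
"""
print(suspicious_totals(sample, expected_total=500000))  # prints [236500016]
```

Running something like this over a saved FAHlog would flag a WU whose total differs from what other users report for the same project.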
bruce
Posts: 20822
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Post by bruce »

I'll make sure the Pande Group knows about this problem.
PantherX
Site Moderator
Posts: 6986
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
§
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud

Post by PantherX »

There is a single error report in the WU Database by another user:
Your WU (P6055 R0 C152 G472) was added to the stats database on 2011-11-14 06:06:12 for 0 points of credit.