8014 Folding@home Core Shutdown: UNSTABLE_MACHINE

shunter
Posts: 84
Joined: Sun Apr 06, 2008 8:22 am
Location: Hertfordshire, United Kingdom

Post by shunter »

One of my PCs picked up this unit today and it keeps failing; see the log file below. Until now this PC has had no UNSTABLE_MACHINE issues, and its temperatures are now much lower than last week. This PC is also set with the "-smp" flag, so I would not normally expect it to acquire an 8014 unit, although I do not profess to be an expert on Folding.

A similar thing happened at about the same time on another PC running an i7 CPU with the same SMP settings.

Can anyone explain or advise whether it's my systems or faulty units? Can I get round this by removing -smp until the units complete, or am I just barking at the moon?

Thanks
Shunter

                       http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\fah
Executable: C:\fah\Folding@home-Win32-x86.exe
Arguments: -smp -verbosity 9 -forceasm -config 

[13:16:21] - Ask before connecting: No
[13:16:21] - User name: shunter (Team 46590)
[13:16:21] - User ID: 6BB49B5203015B79
[13:16:21] - Machine ID: 16
[13:16:21] 
[13:16:21] Configuring Folding@Home...


[13:16:26] - Ask before connecting: No
[13:16:26] - User name: shunter (Team 46590)
[13:16:26] - User ID: 6BB49B5203015B79
[13:16:26] - Machine ID: 16
[13:16:26] 
[13:16:27] Loaded queue successfully.
[13:16:27] 
[13:16:27] - Autosending finished units... [August 24 13:16:27 UTC]
[13:16:27] + Processing work unit
[13:16:27] Trying to send all finished work units
[13:16:27] Work type a4 not eligible for variable processors
[13:16:27] + No unsent completed units remaining.
[13:16:27] - Autosend completed
[13:16:27] Core required: FahCore_a4.exe
[13:16:27] Core found.
[13:16:27] Working on queue slot 02 [August 24 13:16:27 UTC]
[13:16:28] + Working ...
[13:16:28] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a4.exe -dir work/ -suffix 02 -checkpoint 15 -forceasm -verbose -lifeline 1160 -version 629'

[13:16:29] 
[13:16:29] *------------------------------*
[13:16:29] Folding@Home Gromacs GB Core
[13:16:29] Version 2.27 (Dec. 15, 2010)
[13:16:29] 
[13:16:29] Preparing to commence simulation
[13:16:29] - Assembly optimizations manually forced on.
[13:16:29] - Not checking prior termination.
[13:16:29] Error: Missing work file=<>
[13:16:29] 
[13:16:30] Folding@home Core Shutdown: MISSING_WORK_FILES
[13:16:32] CoreStatus = 74 (116)
[13:16:32] The core could not find the work files specified. Removing from queue
[13:16:33] Deleting current work unit & continuing...
[13:16:37] Trying to send all finished work units
[13:16:37] + No unsent completed units remaining.
[13:16:37] - Preparing to get new work unit...
[13:16:37] Cleaning up work directory
[13:16:37] + Attempting to get work packet
[13:16:37] Passkey found
[13:16:38] - Will indicate memory of 1917 MB
[13:16:38] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 15, Stepping: 11
[13:16:38] - Connecting to assignment server
[13:16:38] Connecting to http://assign.stanford.edu:8080/
[13:16:39] Posted data.
[13:16:39] Initial: 43AB; - Successful: assigned to (171.67.108.58).
[13:16:39] + News From Folding@Home: Welcome to Folding@Home
[13:16:40] Loaded queue successfully.
[13:16:40] Connecting to http://171.67.108.58:8080/
[13:16:41] Posted data.
[13:16:41] Initial: 0000; - Receiving payload (expected size: 547351)
[13:16:46] - Downloaded at ~106 kB/s
[13:16:47] - Averaged speed for that direction ~122 kB/s
[13:16:47] + Received work.
[13:16:47] + Closed connections
[13:16:52] 
[13:16:52] + Processing work unit
[13:16:52] Work type a4 not eligible for variable processors
[13:16:52] Core required: FahCore_a4.exe
[13:16:52] Core found.
[13:16:52] Working on queue slot 03 [August 24 13:16:52 UTC]
[13:16:52] + Working ...
[13:16:52] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a4.exe -dir work/ -suffix 03 -checkpoint 15 -forceasm -verbose -lifeline 1160 -version 629'

[13:16:53] 
[13:16:53] *------------------------------*
[13:16:53] Folding@Home Gromacs GB Core
[13:16:53] Version 2.27 (Dec. 15, 2010)
[13:16:53] 
[13:16:53] Preparing to commence simulation
[13:16:54] - Ensuring status. Please wait.
[13:16:54] Called DecompressByteArray: compressed_data_size=546839 data_size=1326096, decompressed_data_size=1326096 diff=0
[13:16:54] - Digital signature verified
[13:16:54] 
[13:16:54] Project: 8014 (Run 5, Clone 627, Gen 89)
[13:16:54] 
[13:16:54] Assembly optimizations on if available.
[13:16:54] Entering M.D.
[13:16:59] Mapping NT from 1 to 1 
[13:17:00] Completed 0 out of 250000 steps  (0%)
[13:17:09] ted 0 out of 250000 steps  (0%)
[13:23:25] ps  (1%)
[13:23:27]  2500 out of 250000 steps  (1%)
[13:29:43] Completed 5000 out of 250000 steps  (2%)
[13:35:59] Completed 7500 out of 250000 steps  (3%)
[13:42:28] Completed 10000 out of 250000 steps  (4%)
[13:48:51] Completed 12500 out of 250000 steps  (5%)
[13:55:15] Completed 15000 out of 250000 steps  (6%)
[14:01:55] Completed 17500 out of 250000 steps  (7%)
[14:08:47] Completed 20000 out of 250000 steps  (8%)
[14:15:12] Completed 22500 out of 250000 steps  (9%)
[14:21:28] Completed 25000 out of 250000 steps  (10%)
[14:27:49] Completed 27500 out of 250000 steps  (11%)
[14:34:07] Completed 30000 out of 250000 steps  (12%)
[14:40:22] Completed 32500 out of 250000 steps  (13%)
[14:46:36] Completed 35000 out of 250000 steps  (14%)
[14:53:04] Completed 37500 out of 250000 steps  (15%)
[14:59:21] Completed 40000 out of 250000 steps  (16%)
[15:05:40] Completed 42500 out of 250000 steps  (17%)
[15:11:54] Completed 45000 out of 250000 steps  (18%)
[15:18:06] Completed 47500 out of 250000 steps  (19%)
[15:24:23] Completed 50000 out of 250000 steps  (20%)
[15:30:36] Completed 52500 out of 250000 steps  (21%)
[15:36:52] Completed 55000 out of 250000 steps  (22%)
[15:43:05] Completed 57500 out of 250000 steps  (23%)
[15:49:21] Completed 60000 out of 250000 steps  (24%)
[15:55:35] Completed 62500 out of 250000 steps  (25%)
[16:01:47] Completed 65000 out of 250000 steps  (26%)
[16:02:12] mdrun returned 255
[16:02:12] Going to send back what have done -- stepsTotalG=250000
[16:02:12] Work fraction=0.2602 steps=250000.
[16:02:16] logfile size=11673 infoLength=11673 edr=0 trr=25
[16:02:16] logfile size: 11673 info=11673 bed=0 hdr=25
[16:02:16] - Writing 12211 bytes of core data to disk...
[16:02:17] Done: 11699 -> 4022 (compressed to 34.3 percent)
[16:02:17]   ... Done.
[16:02:17] 
[16:02:17] Folding@home Core Shutdown: UNSTABLE_MACHINE
[16:57:32] Completed 87500 out of 250000 steps  (35%)
[17:03:47] Completed 90000 out of 250000 steps  (36%)
[17:09:56] Completed 92500 out of 250000 steps  (37%)
[17:16:05] Completed 95000 out of 250000 steps  (38%)
[17:22:17] Completed 97500 out of 250000 steps  (39%)
[17:28:26] Completed 100000 out of 250000 steps  (40%)
[17:34:37] Completed 102500 out of 250000 steps  (41%)
[17:40:46] Completed 105000 out of 250000 steps  (42%)
[17:46:54] Completed 107500 out of 250000 steps  (43%)
[17:53:04] Completed 110000 out of 250000 steps  (44%)
[17:59:13] Completed 112500 out of 250000 steps  (45%)
[18:05:22] Completed 115000 out of 250000 steps  (46%)
[18:11:32] Completed 117500 out of 250000 steps  (47%)
[18:17:40] Completed 120000 out of 250000 steps  (48%)
[18:23:50] Completed 122500 out of 250000 steps  (49%)
[18:29:59] Completed 125000 out of 250000 steps  (50%)
[18:36:08] Completed 127500 out of 250000 steps  (51%)
[18:42:22] Completed 130000 out of 250000 steps  (52%)
[18:48:40] Completed 132500 out of 250000 steps  (53%)
[18:55:00] Completed 135000 out of 250000 steps  (54%)
[19:01:10] Completed 137500 out of 250000 steps  (55%)
[19:07:21] Completed 140000 out of 250000 steps  (56%)
[19:13:43] Completed 142500 out of 250000 steps  (57%)
[19:16:27] - Autosending finished units... [August 24 19:16:27 UTC]
[19:16:27] Trying to send all finished work units
[19:16:27] + No unsent completed units remaining.
[19:16:27] - Autosend completed
[19:20:03] Completed 145000 out of 250000 steps  (58%)
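
For anyone who wants to check whether a run is slowing down before it fails, the "Completed N out of M steps" lines above can be turned into a time per 1% of progress. The following is only a rough sketch: the function name `seconds_per_percent` and the sample text are made up for illustration, and it assumes the timestamps are in the `[HH:MM:SS]` form shown in this log.

```python
import re
from datetime import datetime

# Matches lines such as "[13:29:43] Completed 5000 out of 250000 steps  (2%)".
PROGRESS = re.compile(r"^\[(\d{2}:\d{2}:\d{2})\] Completed (\d+) out of (\d+) steps")


def seconds_per_percent(log_text):
    """Return the seconds spent per 1% between consecutive progress lines."""
    points = []
    for line in log_text.splitlines():
        match = PROGRESS.match(line.strip())
        if match:
            stamp = datetime.strptime(match.group(1), "%H:%M:%S")
            points.append((stamp, int(match.group(2)), int(match.group(3))))

    results = []
    for (t0, s0, _), (t1, s1, total) in zip(points, points[1:]):
        if s1 <= s0:
            continue  # restarted or repeated line, nothing to measure
        elapsed = (t1 - t0).total_seconds()
        if elapsed < 0:
            elapsed += 24 * 3600  # the log has no dates, so allow for midnight
        results.append(elapsed * total / (100 * (s1 - s0)))
    return results


# Three lines copied from the log above.
SAMPLE = """\
[13:29:43] Completed 5000 out of 250000 steps  (2%)
[13:35:59] Completed 7500 out of 250000 steps  (3%)
[13:42:28] Completed 10000 out of 250000 steps  (4%)
"""

print(seconds_per_percent(SAMPLE))
```

Garbled progress lines are simply skipped, so the figures on either side of one cover more than a single 1% step.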
Joe_H
Site Admin
Posts: 8158
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Studio M1 Max 32 GB smp6
Mac Hack i7-7700K 48 GB smp4
Location: W. MA

Re: 8014 Folding@home Core Shutdown: UNSTABLE_MACHINE

Post by Joe_H »

8014 is processed by the A4 core, which can run either as a uniprocessor or SMP core, so it should have been able to run as SMP 4 with your settings. Which version of the folding client are you using? That was not included in your log clip. From what I see in the log, I cannot say why it did not run as SMP. There is no report yet on this WU in the database.
artoar_11
Posts: 652
Joined: Sun Nov 22, 2009 8:42 pm
Hardware configuration: AMD R7 3700X @ 4.0 GHz; ASUS ROG STRIX X470-F GAMING; DDR4 2x8GB @ 3.0 GHz; GByte RTX 3060 Ti @ 1890 MHz; Fortron-550W 80+ bronze; Win10 Pro/64
Location: Bulgaria/Team #224497/artoar11_ALL_....

Re: 8014 Folding@home Core Shutdown: UNSTABLE_MACHINE

Post by artoar_11 »

Joe_H wrote: 8014 is processed by the A4 core, which can run either as a uniprocessor or SMP core, so it should have been able to run as SMP 4 with your settings. Which version of the folding client are you using? That was not included in your log clip. From what I see in the log, I cannot say why it did not run as SMP. There is no report yet on this WU in the database.
In the log file I see the old client V6.29:

[13:16:28] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a4.exe -dir work/ -suffix 02 -checkpoint 15 -forceasm -verbose -lifeline 1160 -version 629'
To work correctly (using all cores), I think it should be client V6.34.

The "Calling 'mpiexec" line points to something very old :?
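
If the client version is not stated anywhere else in a log clip, it can be read from the `-version` argument on the "Calling" line, as was done above. This is a small sketch only: the helper name `client_version` is hypothetical, and reading `629` as "6.29" (first digit as the major version) is an assumption taken from how the number is interpreted in this thread.

```python
import re


def client_version(log_text):
    """Return the client version from a "Calling '... -version NNN'" line, or None."""
    match = re.search(r"Calling '.*-version (\d+)'", log_text)
    if match is None:
        return None
    digits = match.group(1)
    if len(digits) < 2:
        return digits
    # Assumption: the first digit is the major version, the rest the minor.
    return f"{digits[0]}.{digits[1:]}"


# The line quoted from the log above.
LINE = (
    "[13:16:28] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 "
    "FahCore_a4.exe -dir work/ -suffix 02 -checkpoint 15 -forceasm -verbose "
    "-lifeline 1160 -version 629'"
)

print(client_version(LINE))
```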
7im
Posts: 10179
Joined: Thu Nov 29, 2007 4:30 pm
Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
Location: Arizona

Re: 8014 Folding@home Core Shutdown: UNSTABLE_MACHINE

Post by 7im »

How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
shunter
Posts: 84
Joined: Sun Apr 06, 2008 8:22 am
Location: Hertfordshire, United Kingdom

Re: 8014 Folding@home Core Shutdown: UNSTABLE_MACHINE

Post by shunter »

Thanks, gents. It looks as though both are using 6.29, so I will try to upgrade over the weekend.

Update Sat 11.30: both are now updated and running. One is still doing an 8014 but is going through 1% every 55 secs at the moment.
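
To put the 55 secs figure in context, here is a quick back-of-envelope calculation of how long a whole unit takes at a given time per 1%. It is only an illustration: the helper name `full_unit_time` is made up, and the 375 secs value is a rough reading of the roughly 6 min 15 s gaps between progress lines in the earlier log, not a measured average.

```python
def full_unit_time(seconds_per_percent):
    """Format the time needed for 100% at a steady number of seconds per 1%."""
    total = round(seconds_per_percent * 100)
    hours, rest = divmod(total, 3600)
    minutes, seconds = divmod(rest, 60)
    return f"{hours}h {minutes:02d}m {seconds:02d}s"


print("after update: ", full_unit_time(55))
print("before update:", full_unit_time(375))  # assumed from the earlier log
```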