Failing SMP WUs - help wanted

Breach
Posts: 212
Joined: Sat Mar 09, 2013 8:07 pm
Location: Brussels, Belgium

Failing SMP WUs - help wanted

Post by Breach »

Hi,

It seems that as of yesterday FAH started downloading CPU Core A3 units, and since then every one of them has failed:

SEND error:FAULTY project:10139 run:41 clone:4 gen:0 core:0xa3
SEND error:FAULTY project:10138 run:47 clone:4 gen:0 core:0xa3
SEND error:FAULTY project:10139 run:44 clone:4 gen:0 core:0xa3
SEND error:FAULTY project:10140 run:28 clone:4 gen:0 core:0xa3

Until now my 3770K had only received Core A4 WUs, which work fine. Any ideas?

Code:

11:31:14:WU02:FS02:0xa4:Completed 5000 out of 500000 steps  (1%)
11:34:19:WU02:FS02:0xa4:Completed 10000 out of 500000 steps  (2%)
11:37:08:WU02:FS02:0xa4:Completed 15000 out of 500000 steps  (3%)
11:39:56:WU02:FS02:0xa4:Completed 20000 out of 500000 steps  (4%)
11:42:51:WU02:FS02:0xa4:Completed 25000 out of 500000 steps  (5%)
11:45:42:WU02:FS02:0xa4:Completed 30000 out of 500000 steps  (6%)
11:48:42:WU02:FS02:0xa4:Completed 35000 out of 500000 steps  (7%)
11:51:37:WU02:FS02:0xa4:Completed 40000 out of 500000 steps  (8%)
11:54:24:WU02:FS02:0xa4:Completed 45000 out of 500000 steps  (9%)
11:57:10:WU02:FS02:0xa4:Completed 50000 out of 500000 steps  (10%)
11:59:59:WU02:FS02:0xa4:Completed 55000 out of 500000 steps  (11%)
12:02:48:WU02:FS02:0xa4:Completed 60000 out of 500000 steps  (12%)
12:05:28:WU02:FS02:0xa4:Completed 65000 out of 500000 steps  (13%)
12:08:10:WU02:FS02:0xa4:Completed 70000 out of 500000 steps  (14%)
12:11:06:WU02:FS02:0xa4:Completed 75000 out of 500000 steps  (15%)
12:13:42:WU02:FS02:0xa4:Completed 80000 out of 500000 steps  (16%)
12:16:24:WU02:FS02:0xa4:Completed 85000 out of 500000 steps  (17%)
12:19:04:WU02:FS02:0xa4:Completed 90000 out of 500000 steps  (18%)
12:21:47:WU02:FS02:0xa4:Completed 95000 out of 500000 steps  (19%)
12:24:21:WU02:FS02:0xa4:Completed 100000 out of 500000 steps  (20%)
12:26:53:WU02:FS02:0xa4:Completed 105000 out of 500000 steps  (21%)
12:29:27:WU02:FS02:0xa4:Completed 110000 out of 500000 steps  (22%)
12:31:54:WU02:FS02:0xa4:Completed 115000 out of 500000 steps  (23%)
12:34:18:WU02:FS02:0xa4:Completed 120000 out of 500000 steps  (24%)
12:36:45:WU02:FS02:0xa4:Completed 125000 out of 500000 steps  (25%)
12:39:21:WU02:FS02:0xa4:Completed 130000 out of 500000 steps  (26%)
12:42:20:WU02:FS02:0xa4:Completed 135000 out of 500000 steps  (27%)
12:46:01:WU02:FS02:0xa4:Completed 140000 out of 500000 steps  (28%)
12:51:25:WU02:FS02:0xa4:Completed 145000 out of 500000 steps  (29%)
12:54:02:WU02:FS02:0xa4:Completed 150000 out of 500000 steps  (30%)
12:56:32:WU02:FS02:0xa4:Completed 155000 out of 500000 steps  (31%)
12:59:01:WU02:FS02:0xa4:Completed 160000 out of 500000 steps  (32%)
13:01:30:WU02:FS02:0xa4:Completed 165000 out of 500000 steps  (33%)
13:03:58:WU02:FS02:0xa4:Completed 170000 out of 500000 steps  (34%)
13:06:30:WU02:FS02:0xa4:Completed 175000 out of 500000 steps  (35%)
13:09:02:WU02:FS02:0xa4:Completed 180000 out of 500000 steps  (36%)
13:11:32:WU02:FS02:0xa4:Completed 185000 out of 500000 steps  (37%)
13:14:01:WU02:FS02:0xa4:Completed 190000 out of 500000 steps  (38%)
13:16:07:FS02:Shutting core down
13:16:17:WU02:FS02:0xa4:Client no longer detected. Shutting down core 
13:16:17:WU02:FS02:0xa4:
13:16:17:WU02:FS02:0xa4:Folding@home Core Shutdown: CLIENT_DIED
13:16:17:WU02:FS02:FahCore returned: INTERRUPTED (102 = 0x66)
13:16:19:WU02:FS02:Starting
13:16:19:WARNING:WU02:FS02:Changed SMP threads from 8 to 7 this can cause some work units to fail
13:16:19:WU02:FS02:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 02 -suffix 01 -version 703 -lifeline 6832 -checkpoint 10 -np 7
13:16:19:WU02:FS02:Started FahCore on PID 1132
13:16:19:WU02:FS02:Core PID:4992
13:16:19:WU02:FS02:FahCore 0xa4 started
13:16:20:WU02:FS02:0xa4:
13:16:20:WU02:FS02:0xa4:*------------------------------*
13:16:20:WU02:FS02:0xa4:Folding@Home Gromacs GB Core
13:16:20:WU02:FS02:0xa4:Version 2.27 (Dec. 15, 2010)
13:16:20:WU02:FS02:0xa4:
13:16:20:WU02:FS02:0xa4:Preparing to commence simulation
13:16:20:WU02:FS02:0xa4:- Looking at optimizations...
13:16:20:WU02:FS02:0xa4:- Files status OK
13:16:20:WU02:FS02:0xa4:- Expanded 1084426 -> 3054920 (decompressed 281.7 percent)
13:16:20:WU02:FS02:0xa4:Called DecompressByteArray: compressed_data_size=1084426 data_size=3054920, decompressed_data_size=3054920 diff=0
13:16:20:WU02:FS02:0xa4:- Digital signature verified
13:16:20:WU02:FS02:0xa4:
13:16:20:WU02:FS02:0xa4:Project: 8082 (Run 20, Clone 41, Gen 51)
13:16:20:WU02:FS02:0xa4:
13:16:20:WU02:FS02:0xa4:Assembly optimizations on if available.
13:16:20:WU02:FS02:0xa4:Entering M.D.
13:16:26:WU02:FS02:0xa4:Using Gromacs checkpoints
13:16:26:WU02:FS02:0xa4:Mapping NT from 7 to 7 
13:16:26:WU02:FS02:0xa4:Resuming from checkpoint
13:16:26:WU02:FS02:0xa4:Verified 02/wudata_01.log
13:16:26:WU02:FS02:0xa4:Verified 02/wudata_01.trr
13:16:26:WU02:FS02:0xa4:Verified 02/wudata_01.xtc
13:16:26:WU02:FS02:0xa4:Verified 02/wudata_01.edr
13:16:26:WU02:FS02:0xa4:Completed 178520 out of 500000 steps  (35%)
13:17:05:WU02:FS02:0xa4:Completed 180000 out of 500000 steps  (36%)
13:19:16:WU02:FS02:0xa4:Completed 185000 out of 500000 steps  (37%)
13:21:27:WU02:FS02:0xa4:Completed 190000 out of 500000 steps  (38%)
13:23:38:WU02:FS02:0xa4:Completed 195000 out of 500000 steps  (39%)
13:25:50:WU02:FS02:0xa4:Completed 200000 out of 500000 steps  (40%)
13:28:07:WU02:FS02:0xa4:Completed 205000 out of 500000 steps  (41%)
13:30:25:WU02:FS02:0xa4:Completed 210000 out of 500000 steps  (42%)
13:32:43:WU02:FS02:0xa4:Completed 215000 out of 500000 steps  (43%)
13:35:01:WU02:FS02:0xa4:Completed 220000 out of 500000 steps  (44%)
13:37:18:WU02:FS02:0xa4:Completed 225000 out of 500000 steps  (45%)
13:39:36:WU02:FS02:0xa4:Completed 230000 out of 500000 steps  (46%)
13:41:53:WU02:FS02:0xa4:Completed 235000 out of 500000 steps  (47%)
13:44:11:WU02:FS02:0xa4:Completed 240000 out of 500000 steps  (48%)
13:46:29:WU02:FS02:0xa4:Completed 245000 out of 500000 steps  (49%)
13:48:46:WU02:FS02:0xa4:Completed 250000 out of 500000 steps  (50%)
13:51:04:WU02:FS02:0xa4:Completed 255000 out of 500000 steps  (51%)
13:53:22:WU02:FS02:0xa4:Completed 260000 out of 500000 steps  (52%)
13:55:39:WU02:FS02:0xa4:Completed 265000 out of 500000 steps  (53%)
13:57:57:WU02:FS02:0xa4:Completed 270000 out of 500000 steps  (54%)
14:00:16:WU02:FS02:0xa4:Completed 275000 out of 500000 steps  (55%)
14:02:34:WU02:FS02:0xa4:Completed 280000 out of 500000 steps  (56%)
14:04:51:WU02:FS02:0xa4:Completed 285000 out of 500000 steps  (57%)
14:07:09:WU02:FS02:0xa4:Completed 290000 out of 500000 steps  (58%)
14:09:26:WU02:FS02:0xa4:Completed 295000 out of 500000 steps  (59%)
14:11:44:WU02:FS02:0xa4:Completed 300000 out of 500000 steps  (60%)
14:14:02:WU02:FS02:0xa4:Completed 305000 out of 500000 steps  (61%)
14:16:19:WU02:FS02:0xa4:Completed 310000 out of 500000 steps  (62%)
14:18:36:WU02:FS02:0xa4:Completed 315000 out of 500000 steps  (63%)
14:20:54:WU02:FS02:0xa4:Completed 320000 out of 500000 steps  (64%)
14:23:12:WU02:FS02:0xa4:Completed 325000 out of 500000 steps  (65%)
14:25:30:WU02:FS02:0xa4:Completed 330000 out of 500000 steps  (66%)
14:27:48:WU02:FS02:0xa4:Completed 335000 out of 500000 steps  (67%)
14:30:08:WU02:FS02:0xa4:Completed 340000 out of 500000 steps  (68%)
14:32:26:WU02:FS02:0xa4:Completed 345000 out of 500000 steps  (69%)
14:34:44:WU02:FS02:0xa4:Completed 350000 out of 500000 steps  (70%)
14:37:02:WU02:FS02:0xa4:Completed 355000 out of 500000 steps  (71%)
14:39:19:WU02:FS02:0xa4:Completed 360000 out of 500000 steps  (72%)
14:41:37:WU02:FS02:0xa4:Completed 365000 out of 500000 steps  (73%)
14:43:54:WU02:FS02:0xa4:Completed 370000 out of 500000 steps  (74%)
14:46:12:WU02:FS02:0xa4:Completed 375000 out of 500000 steps  (75%)
14:48:30:WU02:FS02:0xa4:Completed 380000 out of 500000 steps  (76%)
14:50:47:WU02:FS02:0xa4:Completed 385000 out of 500000 steps  (77%)
14:53:05:WU02:FS02:0xa4:Completed 390000 out of 500000 steps  (78%)
14:55:22:WU02:FS02:0xa4:Completed 395000 out of 500000 steps  (79%)
14:57:33:WU02:FS02:0xa4:Completed 400000 out of 500000 steps  (80%)
14:59:44:WU02:FS02:0xa4:Completed 405000 out of 500000 steps  (81%)
15:01:55:WU02:FS02:0xa4:Completed 410000 out of 500000 steps  (82%)
15:04:06:WU02:FS02:0xa4:Completed 415000 out of 500000 steps  (83%)
15:06:25:WU02:FS02:0xa4:Completed 420000 out of 500000 steps  (84%)
15:08:44:WU02:FS02:0xa4:Completed 425000 out of 500000 steps  (85%)
15:11:03:WU02:FS02:0xa4:Completed 430000 out of 500000 steps  (86%)
15:13:24:WU02:FS02:0xa4:Completed 435000 out of 500000 steps  (87%)
15:15:41:WU02:FS02:0xa4:Completed 440000 out of 500000 steps  (88%)
15:17:59:WU02:FS02:0xa4:Completed 445000 out of 500000 steps  (89%)
15:20:16:WU02:FS02:0xa4:Completed 450000 out of 500000 steps  (90%)
15:22:33:WU02:FS02:0xa4:Completed 455000 out of 500000 steps  (91%)
15:24:51:WU02:FS02:0xa4:Completed 460000 out of 500000 steps  (92%)
15:27:08:WU02:FS02:0xa4:Completed 465000 out of 500000 steps  (93%)
15:29:26:WU02:FS02:0xa4:Completed 470000 out of 500000 steps  (94%)
15:31:44:WU02:FS02:0xa4:Completed 475000 out of 500000 steps  (95%)
15:34:01:WU02:FS02:0xa4:Completed 480000 out of 500000 steps  (96%)
15:36:19:WU02:FS02:0xa4:Completed 485000 out of 500000 steps  (97%)
15:38:37:WU02:FS02:0xa4:Completed 490000 out of 500000 steps  (98%)
15:38:37:WU00:FS02:Connecting to assign3.stanford.edu:8080
15:38:37:WU00:FS02:News: Welcome to Folding@Home
15:38:37:WU00:FS02:Assigned to work server 171.64.65.75
15:38:37:WU00:FS02:Requesting new work unit for slot 02: RUNNING cpu:7 from 171.64.65.75
15:38:37:WU00:FS02:Connecting to 171.64.65.75:8080
15:38:39:WU00:FS02:Downloading 763.98KiB
15:38:41:WU00:FS02:Download complete
15:38:41:WU00:FS02:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:10140 run:28 clone:4 gen:0 core:0xa3 unit:0x000000010a3b1e6f5149ed740e9414bd
15:40:54:WU02:FS02:0xa4:Completed 495000 out of 500000 steps  (99%)
15:43:12:WU02:FS02:0xa4:Completed 500000 out of 500000 steps  (100%)
15:43:12:WU02:FS02:0xa4:DynamicWrapper: Finished Work Unit: sleep=10000
15:43:22:WU02:FS02:0xa4:
15:43:22:WU02:FS02:0xa4:Finished Work Unit:
15:43:22:WU02:FS02:0xa4:- Reading up to 1352328 from "02/wudata_01.trr": Read 1352328
15:43:22:WU02:FS02:0xa4:trr file hash check passed.
15:43:22:WU02:FS02:0xa4:- Reading up to 1506492 from "02/wudata_01.xtc": Read 1506492
15:43:22:WU02:FS02:0xa4:xtc file hash check passed.
15:43:22:WU02:FS02:0xa4:edr file hash check passed.
15:43:22:WU02:FS02:0xa4:logfile size: 28339
15:43:22:WU02:FS02:0xa4:Leaving Run
15:43:22:WU02:FS02:0xa4:- Writing 2895983 bytes of core data to disk...
15:43:22:WU02:FS02:0xa4:Done: 2895471 -> 2806555 (compressed to 96.9 percent)
15:43:22:WU02:FS02:0xa4:  ... Done.
15:43:22:WU02:FS02:0xa4:- Shutting down core
15:43:22:WU02:FS02:0xa4:
15:43:22:WU02:FS02:0xa4:Folding@home Core Shutdown: FINISHED_UNIT
15:43:23:WU02:FS02:FahCore returned: FINISHED_UNIT (100 = 0x64)
15:43:23:WU02:FS02:Sending unit results: id:02 state:SEND error:NO_ERROR project:8082 run:20 clone:41 gen:51 core:0xa4 unit:0x000000376652edb3512a0b586198ab71
15:43:23:WU02:FS02:Uploading 2.68MiB to 171.67.108.35
15:43:23:WU02:FS02:Connecting to 171.67.108.35:8080
15:43:23:WU00:FS02:Starting
15:43:23:WU00:FS02:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a3.fah/FahCore_a3.exe -dir 00 -suffix 01 -version 703 -lifeline 6832 -checkpoint 10 -np 7
15:43:23:WU00:FS02:Started FahCore on PID 3512
15:43:23:WU00:FS02:Core PID:6132
15:43:23:WU00:FS02:FahCore 0xa3 started
15:43:23:WU00:FS02:0xa3:
15:43:23:WU00:FS02:0xa3:*------------------------------*
15:43:23:WU00:FS02:0xa3:Folding@Home Gromacs SMP Core
15:43:23:WU00:FS02:0xa3:Version 2.27 (Dec. 15, 2010)
15:43:23:WU00:FS02:0xa3:
15:43:23:WU00:FS02:0xa3:Preparing to commence simulation
15:43:23:WU00:FS02:0xa3:- Looking at optimizations...
15:43:23:WU00:FS02:0xa3:- Created dyn
15:43:23:WU00:FS02:0xa3:- Files status OK
15:43:23:WU00:FS02:0xa3:- Expanded 781802 -> 2021624 (decompressed 258.5 percent)
15:43:23:WU00:FS02:0xa3:Called DecompressByteArray: compressed_data_size=781802 data_size=2021624, decompressed_data_size=2021624 diff=0
15:43:23:WU00:FS02:0xa3:- Digital signature verified
15:43:23:WU00:FS02:0xa3:
15:43:23:WU00:FS02:0xa3:Project: 10140 (Run 28, Clone 4, Gen 0)
15:43:23:WU00:FS02:0xa3:
15:43:23:WU00:FS02:0xa3:Assembly optimizations on if available.
15:43:23:WU00:FS02:0xa3:Entering M.D.
15:43:29:WU00:FS02:0xa3:Mapping NT from 7 to 7 
15:43:29:WU00:FS02:0xa3:mdrun returned 255
15:43:29:WU00:FS02:0xa3:Going to send back what have done -- stepsTotalG=2000000
15:43:29:WU00:FS02:0xa3:Work fraction=0.0000 steps=2000000.
15:43:30:WU02:FS02:Upload 98.06%
15:43:31:WU02:FS02:Upload complete
15:43:31:WU02:FS02:Server responded WORK_ACK (400)
15:43:31:WU02:FS02:Final credit estimate, 4221.00 points
15:43:31:WU02:FS02:Cleaning up
15:43:33:WARNING:WU00:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
15:43:33:WU00:FS02:Sending unit results: id:00 state:SEND error:FAULTY project:10140 run:28 clone:4 gen:0 core:0xa3 unit:0x000000010a3b1e6f5149ed740e9414bd
15:43:33:WU00:FS02:Uploading 656B to 171.64.65.75
15:43:33:WU00:FS02:Connecting to 171.64.65.75:8080
15:43:34:WU02:FS02:Connecting to assign3.stanford.edu:8080
15:43:34:WU00:FS02:Upload complete
15:43:34:WU00:FS02:Server responded WORK_ACK (400)
15:43:34:WU00:FS02:Cleaning up
15:43:34:WU02:FS02:News: Welcome to Folding@Home
15:43:34:WU02:FS02:Assigned to work server 171.64.65.75
15:43:34:WU02:FS02:Requesting new work unit for slot 02: READY cpu:7 from 171.64.65.75
15:43:34:WU02:FS02:Connecting to 171.64.65.75:8080
15:43:35:WU02:FS02:Downloading 769.34KiB
15:43:37:WU02:FS02:Download complete
15:43:37:WU02:FS02:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:10139 run:41 clone:4 gen:0 core:0xa3 unit:0x000000050a3b1e6f51474affbe1daf0d
15:43:37:WU02:FS02:Starting
15:43:37:WU02:FS02:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a3.fah/FahCore_a3.exe -dir 02 -suffix 01 -version 703 -lifeline 6832 -checkpoint 10 -np 7
15:43:37:WU02:FS02:Started FahCore on PID 6140
15:43:38:WU02:FS02:Core PID:11032
15:43:38:WU02:FS02:FahCore 0xa3 started
15:43:38:WU02:FS02:0xa3:
15:43:38:WU02:FS02:0xa3:*------------------------------*
15:43:38:WU02:FS02:0xa3:Folding@Home Gromacs SMP Core
15:43:38:WU02:FS02:0xa3:Version 2.27 (Dec. 15, 2010)
15:43:38:WU02:FS02:0xa3:
15:43:38:WU02:FS02:0xa3:Preparing to commence simulation
15:43:38:WU02:FS02:0xa3:- Looking at optimizations...
15:43:38:WU02:FS02:0xa3:- Created dyn
15:43:38:WU02:FS02:0xa3:- Files status OK
15:43:38:WU02:FS02:0xa3:- Expanded 787297 -> 2031392 (decompressed 258.0 percent)
15:43:38:WU02:FS02:0xa3:Called DecompressByteArray: compressed_data_size=787297 data_size=2031392, decompressed_data_size=2031392 diff=0
15:43:38:WU02:FS02:0xa3:- Digital signature verified
15:43:38:WU02:FS02:0xa3:
15:43:38:WU02:FS02:0xa3:Project: 10139 (Run 41, Clone 4, Gen 0)
15:43:38:WU02:FS02:0xa3:
15:43:38:WU02:FS02:0xa3:Assembly optimizations on if available.
15:43:38:WU02:FS02:0xa3:Entering M.D.
15:43:44:WU02:FS02:0xa3:Mapping NT from 7 to 7 
15:43:44:WU02:FS02:0xa3:mdrun returned 255
15:43:44:WU02:FS02:0xa3:Going to send back what have done -- stepsTotalG=2000000
15:43:44:WU02:FS02:0xa3:Work fraction=0.0000 steps=2000000.
15:43:48:WARNING:WU02:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
15:43:48:WU02:FS02:Sending unit results: id:02 state:SEND error:FAULTY project:10139 run:41 clone:4 gen:0 core:0xa3 unit:0x000000050a3b1e6f51474affbe1daf0d
15:43:48:WU02:FS02:Uploading 656B to 171.64.65.75
15:43:48:WU02:FS02:Connecting to 171.64.65.75:8080
15:43:48:WU00:FS02:Connecting to assign3.stanford.edu:8080
15:43:49:WU02:FS02:Upload complete
15:43:49:WU02:FS02:Server responded WORK_ACK (400)
15:43:49:WU02:FS02:Cleaning up
15:43:49:WU00:FS02:News: Welcome to Folding@Home
15:43:49:WU00:FS02:Assigned to work server 171.64.65.75
15:43:49:WU00:FS02:Requesting new work unit for slot 02: READY cpu:7 from 171.64.65.75
15:43:49:WU00:FS02:Connecting to 171.64.65.75:8080
15:43:50:WU00:FS02:Downloading 770.64KiB
15:43:53:WU00:FS02:Download complete
15:43:53:WU00:FS02:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:10138 run:46 clone:4 gen:0 core:0xa3 unit:0x000000030a3b1e6f514749797836d701
15:43:53:WU00:FS02:Starting
15:43:53:WU00:FS02:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a3.fah/FahCore_a3.exe -dir 00 -suffix 01 -version 703 -lifeline 6832 -checkpoint 10 -np 7
15:43:53:WU00:FS02:Started FahCore on PID 5000
15:43:53:WU00:FS02:Core PID:9280
15:43:53:WU00:FS02:FahCore 0xa3 started
15:43:53:WU00:FS02:0xa3:
15:43:53:WU00:FS02:0xa3:*------------------------------*
15:43:53:WU00:FS02:0xa3:Folding@Home Gromacs SMP Core
15:43:53:WU00:FS02:0xa3:Version 2.27 (Dec. 15, 2010)
15:43:53:WU00:FS02:0xa3:
15:43:53:WU00:FS02:0xa3:Preparing to commence simulation
15:43:53:WU00:FS02:0xa3:- Looking at optimizations...
15:43:53:WU00:FS02:0xa3:- Created dyn
15:43:53:WU00:FS02:0xa3:- Files status OK
15:43:53:WU00:FS02:0xa3:- Expanded 788628 -> 2035048 (decompressed 258.0 percent)
15:43:53:WU00:FS02:0xa3:Called DecompressByteArray: compressed_data_size=788628 data_size=2035048, decompressed_data_size=2035048 diff=0
15:43:53:WU00:FS02:0xa3:- Digital signature verified
15:43:53:WU00:FS02:0xa3:
15:43:53:WU00:FS02:0xa3:Project: 10138 (Run 46, Clone 4, Gen 0)
15:43:53:WU00:FS02:0xa3:
15:43:53:WU00:FS02:0xa3:Assembly optimizations on if available.
15:43:53:WU00:FS02:0xa3:Entering M.D.
15:43:59:WU00:FS02:0xa3:Mapping NT from 7 to 7 
15:43:59:WU00:FS02:0xa3:mdrun returned 255
15:43:59:WU00:FS02:0xa3:Going to send back what have done -- stepsTotalG=2000000
15:43:59:WU00:FS02:0xa3:Work fraction=0.0000 steps=2000000.
15:44:03:WARNING:WU00:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
15:44:03:WU00:FS02:Sending unit results: id:00 state:SEND error:FAULTY project:10138 run:46 clone:4 gen:0 core:0xa3 unit:0x000000030a3b1e6f514749797836d701
15:44:03:WU00:FS02:Uploading 656B to 171.64.65.75
15:44:03:WU00:FS02:Connecting to 171.64.65.75:8080
15:44:04:WU02:FS02:Connecting to assign3.stanford.edu:8080
15:44:04:WU00:FS02:Upload complete
15:44:04:WU00:FS02:Server responded WORK_ACK (400)
15:44:04:WU00:FS02:Cleaning up
15:44:04:WU02:FS02:News: Welcome to Folding@Home
15:44:04:WU02:FS02:Assigned to work server 171.64.65.75
15:44:04:WU02:FS02:Requesting new work unit for slot 02: READY cpu:7 from 171.64.65.75
15:44:04:WU02:FS02:Connecting to 171.64.65.75:8080
15:44:05:WU02:FS02:Downloading 770.93KiB
15:44:07:WU02:FS02:Download complete
15:44:08:WU02:FS02:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:10138 run:47 clone:4 gen:0 core:0xa3 unit:0x000000020a3b1e6f5147497e9ece7f5c
15:44:08:WU02:FS02:Starting
15:44:08:WU02:FS02:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a3.fah/FahCore_a3.exe -dir 02 -suffix 01 -version 703 -lifeline 6832 -checkpoint 10 -np 7
15:44:08:WU02:FS02:Started FahCore on PID 3496
15:44:08:WU02:FS02:Core PID:4776
15:44:08:WU02:FS02:FahCore 0xa3 started
15:44:08:WU02:FS02:0xa3:
15:44:08:WU02:FS02:0xa3:*------------------------------*
15:44:08:WU02:FS02:0xa3:Folding@Home Gromacs SMP Core
15:44:08:WU02:FS02:0xa3:Version 2.27 (Dec. 15, 2010)
15:44:08:WU02:FS02:0xa3:
15:44:08:WU02:FS02:0xa3:Preparing to commence simulation
15:44:08:WU02:FS02:0xa3:- Looking at optimizations...
15:44:08:WU02:FS02:0xa3:- Created dyn
15:44:08:WU02:FS02:0xa3:- Files status OK
15:44:08:WU02:FS02:0xa3:- Expanded 788916 -> 2035048 (decompressed 257.9 percent)
15:44:08:WU02:FS02:0xa3:Called DecompressByteArray: compressed_data_size=788916 data_size=2035048, decompressed_data_size=2035048 diff=0
15:44:08:WU02:FS02:0xa3:- Digital signature verified
15:44:08:WU02:FS02:0xa3:
15:44:08:WU02:FS02:0xa3:Project: 10138 (Run 47, Clone 4, Gen 0)
15:44:08:WU02:FS02:0xa3:
15:44:08:WU02:FS02:0xa3:Assembly optimizations on if available.
15:44:08:WU02:FS02:0xa3:Entering M.D.
15:44:14:WU02:FS02:0xa3:Mapping NT from 7 to 7 
15:44:14:WU02:FS02:0xa3:mdrun returned 255
15:44:14:WU02:FS02:0xa3:Going to send back what have done -- stepsTotalG=2000000
15:44:14:WU02:FS02:0xa3:Work fraction=0.0000 steps=2000000.
15:44:18:WU02:FS02:0xa3:logfile size=0 infoLength=0 edr=0 trr=25
15:44:18:WU02:FS02:0xa3:logfile size: 0 info=0 bed=0 hdr=25
15:44:18:WU02:FS02:0xa3:- Writing 641 bytes of core data to disk...
15:44:18:WU02:FS02:0xa3:Done: 129 -> 144 (compressed to 111.6 percent)
15:44:18:WU02:FS02:0xa3:  ... Done.
15:44:18:WU02:FS02:0xa3:
15:44:18:WU02:FS02:0xa3:Folding@home Core Shutdown: EARLY_UNIT_END
15:44:18:WARNING:WU02:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
15:44:18:WU02:FS02:Sending unit results: id:02 state:SEND error:FAULTY project:10138 run:47 clone:4 gen:0 core:0xa3 unit:0x000000020a3b1e6f5147497e9ece7f5c
15:44:18:WU02:FS02:Uploading 656B to 171.64.65.75
15:44:18:WU02:FS02:Connecting to 171.64.65.75:8080
15:44:19:WU00:FS02:Connecting to assign3.stanford.edu:8080
15:44:19:WU02:FS02:Upload complete
15:44:19:WU02:FS02:Server responded WORK_ACK (400)
15:44:19:WU02:FS02:Cleaning up
15:44:19:WU00:FS02:News: Welcome to Folding@Home
15:44:19:WU00:FS02:Assigned to work server 171.64.65.75
15:44:19:WU00:FS02:Requesting new work unit for slot 02: READY cpu:7 from 171.64.65.75
15:44:19:WU00:FS02:Connecting to 171.64.65.75:8080
15:44:20:WU00:FS02:Downloading 769.48KiB
15:44:22:WU00:FS02:Download complete
15:44:22:WU00:FS02:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:10139 run:44 clone:4 gen:0 core:0xa3 unit:0x000000020a3b1e6f51474b0b8c6642aa
15:44:22:WU00:FS02:Starting
15:44:22:WU00:FS02:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a3.fah/FahCore_a3.exe -dir 00 -suffix 01 -version 703 -lifeline 6832 -checkpoint 10 -np 7
15:44:22:WU00:FS02:Started FahCore on PID 4292
15:44:22:WU00:FS02:Core PID:8676
15:44:22:WU00:FS02:FahCore 0xa3 started
15:44:23:WU00:FS02:0xa3:
15:44:23:WU00:FS02:0xa3:*------------------------------*
15:44:23:WU00:FS02:0xa3:Folding@Home Gromacs SMP Core
15:44:23:WU00:FS02:0xa3:Version 2.27 (Dec. 15, 2010)
15:44:23:WU00:FS02:0xa3:
15:44:23:WU00:FS02:0xa3:Preparing to commence simulation
15:44:23:WU00:FS02:0xa3:- Looking at optimizations...
15:44:23:WU00:FS02:0xa3:- Created dyn
15:44:23:WU00:FS02:0xa3:- Files status OK
15:44:23:WU00:FS02:0xa3:- Expanded 787437 -> 2031392 (decompressed 257.9 percent)
15:44:23:WU00:FS02:0xa3:Called DecompressByteArray: compressed_data_size=787437 data_size=2031392, decompressed_data_size=2031392 diff=0
15:44:23:WU00:FS02:0xa3:- Digital signature verified
15:44:23:WU00:FS02:0xa3:
15:44:23:WU00:FS02:0xa3:Project: 10139 (Run 44, Clone 4, Gen 0)
15:44:23:WU00:FS02:0xa3:
15:44:23:WU00:FS02:0xa3:Assembly optimizations on if available.
15:44:23:WU00:FS02:0xa3:Entering M.D.
15:44:29:WU00:FS02:0xa3:Mapping NT from 7 to 7 
15:44:29:WU00:FS02:0xa3:mdrun returned 255
15:44:29:WU00:FS02:0xa3:Going to send back what have done -- stepsTotalG=2000000
15:44:29:WU00:FS02:0xa3:Work fraction=0.0000 steps=2000000.
15:44:33:WU00:FS02:0xa3:logfile size=0 infoLength=0 edr=0 trr=25
15:44:33:WU00:FS02:0xa3:logfile size: 0 info=0 bed=0 hdr=25
15:44:33:WU00:FS02:0xa3:- Writing 641 bytes of core data to disk...
15:44:33:WU00:FS02:0xa3:Done: 129 -> 144 (compressed to 111.6 percent)
15:44:33:WU00:FS02:0xa3:  ... Done.
15:44:33:WU00:FS02:0xa3:
15:44:33:WU00:FS02:0xa3:Folding@home Core Shutdown: EARLY_UNIT_END
15:44:33:WARNING:WU00:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
15:44:33:WU00:FS02:Sending unit results: id:00 state:SEND error:FAULTY project:10139 run:44 clone:4 gen:0 core:0xa3 unit:0x000000020a3b1e6f51474b0b8c6642aa
15:44:33:WU00:FS02:Uploading 656B to 171.64.65.75
15:44:33:WU00:FS02:Connecting to 171.64.65.75:8080
15:44:33:WU00:FS02:Upload complete
15:44:34:WU00:FS02:Server responded WORK_ACK (400)
15:44:34:WU00:FS02:Cleaning up
******************************* Date: 2013-03-29 *******************************
Windows 11 x64 / 9800X3D PBO / 32GB DDR5 6400 1:1 / 5090 FE / Sennheiser 650 / PSU Corsair AX1600i
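For anyone who wants to pull the failed units out of a long log like the one above, here is a minimal Python sketch. It assumes the `error:FAULTY project:... run:... clone:... gen:... core:...` layout seen in this log; the name `faulty_units` is made up for illustration and is not part of FAHClient.

```python
import re

# Matches the "Sending unit results" lines as they appear in the log above.
FAULTY_RE = re.compile(
    r"error:FAULTY project:(\d+) run:(\d+) clone:(\d+) gen:(\d+) core:(0x[0-9a-fA-F]+)"
)

def faulty_units(log_lines):
    """Return (project, run, clone, gen, core) for each FAULTY unit, without duplicates."""
    seen = []
    for line in log_lines:
        m = FAULTY_RE.search(line)
        if m:
            project, run, clone, gen, core = m.groups()
            unit = (int(project), int(run), int(clone), int(gen), core)
            if unit not in seen:
                seen.append(unit)
    return seen

# Two lines copied from the log: one FAULTY upload and one normal upload.
sample = [
    "15:43:33:WU00:FS02:Sending unit results: id:00 state:SEND error:FAULTY project:10140 run:28 clone:4 gen:0 core:0xa3 unit:0x000000010a3b1e6f5149ed740e9414bd",
    "15:43:23:WU02:FS02:Sending unit results: id:02 state:SEND error:NO_ERROR project:8082 run:20 clone:41 gen:51 core:0xa4 unit:0x000000376652edb3512a0b586198ab71",
]
print(faulty_units(sample))  # [(10140, 28, 4, 0, '0xa3')]
```

Run over the whole log, this should list only the core 0xa3 units, which matches the pattern described in the post.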
bollix47
Posts: 2974
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: Failing SMP WUs - help wanted

Post by bollix47 »

15:44:29:WU00:FS02:0xa3:Mapping NT from 7 to 7
Some projects have difficulty with SMP:7. Either switch to Full Power using the slider, or use FAHControl (a.k.a. Advanced Control) and edit your CPU slot to use 8 or 6 CPUs (i.e. change the number of CPUs from -1 to 8 or 6). Using 8 should be fine unless you're running a GPU slot as well; without that information I can't be more specific.
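The choice between 8 and 6 can be sketched as a small Python helper. This is only an illustration of the reasoning in the post: the function name is hypothetical, and the set of problematic thread counts is an assumption that contains only 7, the one value named in this thread.

```python
def candidate_thread_counts(logical_cpus, gpu_slots, problem_counts=(7,)):
    """Suggest CPU-slot thread counts to try, skipping counts reported as problematic.

    problem_counts is an assumption: only 7 is named in this thread.
    """
    # Option A: use every logical CPU.
    # Option B: leave one thread free per GPU slot.
    options = [logical_cpus, logical_cpus - gpu_slots]
    result = []
    for n in options:
        if n >= 1 and n not in problem_counts and n not in result:
            result.append(n)
    return result

# An 8-thread CPU such as the 3770K with two GPU slots.
print(candidate_thread_counts(8, 2))  # [8, 6]
```

The helper only lists values to try. Which one folds faster still has to be measured on the machine itself.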
Breach
Posts: 212
Joined: Sat Mar 09, 2013 8:07 pm
Location: Brussels, Belgium

Re: Failing SMP WUs - help wanted

Post by Breach »

I am running 2x GPU slots as well. I'll fix this to 8 now.
bollix47
Posts: 2974
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: Failing SMP WUs - help wanted

Post by bollix47 »

Currently, if the 2 GPU slots are running on your GTX 295 or other Nvidia hardware, then 8 might be fine. If they're AMD/ATI GPUs, or you see your CPU slot slow down while the GPUs are running, then 6 might be better.
Breach
Posts: 212
Joined: Sat Mar 09, 2013 8:07 pm
Location: Brussels, Belgium

Re: Failing SMP WUs - help wanted

Post by Breach »

Well, I know that the GPUs cause some CPU load, but nothing like a full core, at least on Nvidia. Is it better to leave the CPU slot at 8 threads, so that 2 of them share with the GPUs, or to switch to 6 (thus dedicating part of two CPU threads to the GPUs' processing)? Thanks.
P5-133XL
Posts: 2948
Joined: Sun Dec 02, 2007 4:36 am
Hardware configuration: Machine #1:

Intel Q9450; 2x2GB=8GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460; Windows Server 2008 X64 (SP1).

Machine #2:

Intel Q6600; 2x2GB=4GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460 video card; Windows 7 X64.

Machine 3:

Dell Dimension 8400, 3.2GHz P4 4x512MB Ram, Video card GTX 460, Windows 7 X32

I am currently folding just on the 5x GTX 460's for approx. 70K PPD
Location: Salem. OR USA

Re: Failing SMP WUs - help wanted

Post by P5-133XL »

There is no clear answer, but my bet is that SMP:6 will work better. It does not take a full core of usage to cause a very significant SMP slowdown. My best suggestion is to test both settings and see which works best for you.
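One way to compare the two settings is to measure the time between progress lines in the client log. The following is a minimal Python sketch, assuming the `HH:MM:SS:...Completed N out of M steps` layout from the log earlier in the thread; `mean_seconds_per_progress_line` is a made-up name, not a FAHClient feature.

```python
import re

# Timestamp at the start of the line, then a "Completed N out of M steps" message.
PROGRESS_RE = re.compile(r"^(\d{2}):(\d{2}):(\d{2}):.*Completed (\d+) out of (\d+) steps")

def mean_seconds_per_progress_line(log_lines):
    """Average wall-clock seconds between consecutive progress lines, or None if fewer than two."""
    stamps = []
    for line in log_lines:
        m = PROGRESS_RE.match(line)
        if m:
            hours, minutes, seconds = (int(x) for x in m.groups()[:3])
            stamps.append(hours * 3600 + minutes * 60 + seconds)
    if len(stamps) < 2:
        return None
    # Modulo one day so a run that crosses midnight still gives positive gaps.
    gaps = [(later - earlier) % 86400 for earlier, later in zip(stamps, stamps[1:])]
    return sum(gaps) / len(gaps)

# Three consecutive progress lines copied from the log above.
sample = [
    "13:19:16:WU02:FS02:0xa4:Completed 185000 out of 500000 steps  (37%)",
    "13:21:27:WU02:FS02:0xa4:Completed 190000 out of 500000 steps  (38%)",
    "13:23:38:WU02:FS02:0xa4:Completed 195000 out of 500000 steps  (39%)",
]
print(mean_seconds_per_progress_line(sample))  # 131.0
```

For a fair comparison, feed it lines from a single uninterrupted stretch of one WU at one thread count. A restart from a checkpoint prints an extra progress line and would skew the average, and different projects are not directly comparable.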
Breach
Posts: 212
Joined: Sat Mar 09, 2013 8:07 pm
Location: Brussels, Belgium

Re: Failing SMP WUs - help wanted

Post by Breach »

Thanks. Is there any way to configure FAH so that it uses X CPU cores when there are GPU slots running and Y when they are not?
P5-133XL
Posts: 2948
Joined: Sun Dec 02, 2007 4:36 am
Location: Salem. OR USA

Re: Failing SMP WUs - help wanted

Post by P5-133XL »

No
Jesse_V
Site Moderator
Posts: 2850
Joined: Mon Jul 18, 2011 4:44 am
Hardware configuration: OS: Windows 10, Kubuntu 19.04
CPU: i7-6700k
GPU: GTX 970, GTX 1080 TI
RAM: 24 GB DDR4
Location: Western Washington

Re: Failing SMP WUs - help wanted

Post by Jesse_V »

Breach wrote: Thanks. Is there any way to configure FAH so that it uses X CPU cores when there are GPU slots running and Y when they are not?
You'd have to make this adjustment manually in Advanced Control.
F@h is now the top computing platform on the planet and nothing unites people like a dedicated fight against a common enemy. This virus affects all of us. Lets end it together.
PantherX
Site Moderator
Posts: 6986
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
§
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud

Re: Failing SMP WUs - help wanted

Post by PantherX »

Jesse_V wrote:
Breach wrote: Thanks. Is there any way to configure FAH so that it uses X CPU cores when there are GPU slots running and Y when they are not?
You'd have to make this adjustment manually in Advanced Control.
While that is true, note that some SMP WUs don't tolerate a change in the number of threads mid-WU and can error out.
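The client logs a warning when the thread count changes under a running WU, as in the first post ("Changed SMP threads from 8 to 7"). A minimal Python sketch for spotting those events in a log follows. The regular expression assumes the exact warning wording from that log, and `thread_changes` is a hypothetical helper name.

```python
import re

# Matches the client warning exactly as it is worded in the log in the first post.
CHANGE_RE = re.compile(
    r"WARNING:(WU\d+):(FS\d+):Changed SMP threads from (\d+) to (\d+)"
)

def thread_changes(log_lines):
    """Return (work_unit, slot, old_threads, new_threads) for each mid-WU thread change."""
    changes = []
    for line in log_lines:
        m = CHANGE_RE.search(line)
        if m:
            wu, slot, old, new = m.groups()
            changes.append((wu, slot, int(old), int(new)))
    return changes

# One line copied from the log in the first post.
sample = [
    "13:16:19:WARNING:WU02:FS02:Changed SMP threads from 8 to 7 this can cause some work units to fail",
]
print(thread_changes(sample))  # [('WU02', 'FS02', 8, 7)]
```

If a WU fails shortly after such a line, the thread change is a plausible suspect. In the log above, though, WU02 survived the change and finished normally.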