Failing SMP WUs - help wanted

Posted: Fri Mar 29, 2013 6:01 pm
by Breach
Hi,

It seems that as of yesterday FAH started downloading some CPU Core A3 units, and since then all I get is failures on them:

SEND error:FAULTY project:10139 run:41 clone:4 gen:0 core:0xa3
SEND error:FAULTY project:10138 run:47 clone:4 gen:0 core:0xa3
SEND error:FAULTY project:10139 run:44 clone:4 gen:0 core:0xa3
SEND error:FAULTY project:10140 run:28 clone:4 gen:0 core:0xa3

Until now my 3770K had been receiving only Core A4 WUs, which work fine. Any ideas?

11:31:14:WU02:FS02:0xa4:Completed 5000 out of 500000 steps  (1%)
11:34:19:WU02:FS02:0xa4:Completed 10000 out of 500000 steps  (2%)
11:37:08:WU02:FS02:0xa4:Completed 15000 out of 500000 steps  (3%)
11:39:56:WU02:FS02:0xa4:Completed 20000 out of 500000 steps  (4%)
11:42:51:WU02:FS02:0xa4:Completed 25000 out of 500000 steps  (5%)
11:45:42:WU02:FS02:0xa4:Completed 30000 out of 500000 steps  (6%)
11:48:42:WU02:FS02:0xa4:Completed 35000 out of 500000 steps  (7%)
11:51:37:WU02:FS02:0xa4:Completed 40000 out of 500000 steps  (8%)
11:54:24:WU02:FS02:0xa4:Completed 45000 out of 500000 steps  (9%)
11:57:10:WU02:FS02:0xa4:Completed 50000 out of 500000 steps  (10%)
11:59:59:WU02:FS02:0xa4:Completed 55000 out of 500000 steps  (11%)
12:02:48:WU02:FS02:0xa4:Completed 60000 out of 500000 steps  (12%)
12:05:28:WU02:FS02:0xa4:Completed 65000 out of 500000 steps  (13%)
12:08:10:WU02:FS02:0xa4:Completed 70000 out of 500000 steps  (14%)
12:11:06:WU02:FS02:0xa4:Completed 75000 out of 500000 steps  (15%)
12:13:42:WU02:FS02:0xa4:Completed 80000 out of 500000 steps  (16%)
12:16:24:WU02:FS02:0xa4:Completed 85000 out of 500000 steps  (17%)
12:19:04:WU02:FS02:0xa4:Completed 90000 out of 500000 steps  (18%)
12:21:47:WU02:FS02:0xa4:Completed 95000 out of 500000 steps  (19%)
12:24:21:WU02:FS02:0xa4:Completed 100000 out of 500000 steps  (20%)
12:26:53:WU02:FS02:0xa4:Completed 105000 out of 500000 steps  (21%)
12:29:27:WU02:FS02:0xa4:Completed 110000 out of 500000 steps  (22%)
12:31:54:WU02:FS02:0xa4:Completed 115000 out of 500000 steps  (23%)
12:34:18:WU02:FS02:0xa4:Completed 120000 out of 500000 steps  (24%)
12:36:45:WU02:FS02:0xa4:Completed 125000 out of 500000 steps  (25%)
12:39:21:WU02:FS02:0xa4:Completed 130000 out of 500000 steps  (26%)
12:42:20:WU02:FS02:0xa4:Completed 135000 out of 500000 steps  (27%)
12:46:01:WU02:FS02:0xa4:Completed 140000 out of 500000 steps  (28%)
12:51:25:WU02:FS02:0xa4:Completed 145000 out of 500000 steps  (29%)
12:54:02:WU02:FS02:0xa4:Completed 150000 out of 500000 steps  (30%)
12:56:32:WU02:FS02:0xa4:Completed 155000 out of 500000 steps  (31%)
12:59:01:WU02:FS02:0xa4:Completed 160000 out of 500000 steps  (32%)
13:01:30:WU02:FS02:0xa4:Completed 165000 out of 500000 steps  (33%)
13:03:58:WU02:FS02:0xa4:Completed 170000 out of 500000 steps  (34%)
13:06:30:WU02:FS02:0xa4:Completed 175000 out of 500000 steps  (35%)
13:09:02:WU02:FS02:0xa4:Completed 180000 out of 500000 steps  (36%)
13:11:32:WU02:FS02:0xa4:Completed 185000 out of 500000 steps  (37%)
13:14:01:WU02:FS02:0xa4:Completed 190000 out of 500000 steps  (38%)
13:16:07:FS02:Shutting core down
13:16:17:WU02:FS02:0xa4:Client no longer detected. Shutting down core 
13:16:17:WU02:FS02:0xa4:
13:16:17:WU02:FS02:0xa4:Folding@home Core Shutdown: CLIENT_DIED
13:16:17:WU02:FS02:FahCore returned: INTERRUPTED (102 = 0x66)
13:16:19:WU02:FS02:Starting
13:16:19:WARNING:WU02:FS02:Changed SMP threads from 8 to 7 this can cause some work units to fail
13:16:19:WU02:FS02:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 02 -suffix 01 -version 703 -lifeline 6832 -checkpoint 10 -np 7
13:16:19:WU02:FS02:Started FahCore on PID 1132
13:16:19:WU02:FS02:Core PID:4992
13:16:19:WU02:FS02:FahCore 0xa4 started
13:16:20:WU02:FS02:0xa4:
13:16:20:WU02:FS02:0xa4:*------------------------------*
13:16:20:WU02:FS02:0xa4:Folding@Home Gromacs GB Core
13:16:20:WU02:FS02:0xa4:Version 2.27 (Dec. 15, 2010)
13:16:20:WU02:FS02:0xa4:
13:16:20:WU02:FS02:0xa4:Preparing to commence simulation
13:16:20:WU02:FS02:0xa4:- Looking at optimizations...
13:16:20:WU02:FS02:0xa4:- Files status OK
13:16:20:WU02:FS02:0xa4:- Expanded 1084426 -> 3054920 (decompressed 281.7 percent)
13:16:20:WU02:FS02:0xa4:Called DecompressByteArray: compressed_data_size=1084426 data_size=3054920, decompressed_data_size=3054920 diff=0
13:16:20:WU02:FS02:0xa4:- Digital signature verified
13:16:20:WU02:FS02:0xa4:
13:16:20:WU02:FS02:0xa4:Project: 8082 (Run 20, Clone 41, Gen 51)
13:16:20:WU02:FS02:0xa4:
13:16:20:WU02:FS02:0xa4:Assembly optimizations on if available.
13:16:20:WU02:FS02:0xa4:Entering M.D.
13:16:26:WU02:FS02:0xa4:Using Gromacs checkpoints
13:16:26:WU02:FS02:0xa4:Mapping NT from 7 to 7 
13:16:26:WU02:FS02:0xa4:Resuming from checkpoint
13:16:26:WU02:FS02:0xa4:Verified 02/wudata_01.log
13:16:26:WU02:FS02:0xa4:Verified 02/wudata_01.trr
13:16:26:WU02:FS02:0xa4:Verified 02/wudata_01.xtc
13:16:26:WU02:FS02:0xa4:Verified 02/wudata_01.edr
13:16:26:WU02:FS02:0xa4:Completed 178520 out of 500000 steps  (35%)
13:17:05:WU02:FS02:0xa4:Completed 180000 out of 500000 steps  (36%)
13:19:16:WU02:FS02:0xa4:Completed 185000 out of 500000 steps  (37%)
13:21:27:WU02:FS02:0xa4:Completed 190000 out of 500000 steps  (38%)
13:23:38:WU02:FS02:0xa4:Completed 195000 out of 500000 steps  (39%)
13:25:50:WU02:FS02:0xa4:Completed 200000 out of 500000 steps  (40%)
13:28:07:WU02:FS02:0xa4:Completed 205000 out of 500000 steps  (41%)
13:30:25:WU02:FS02:0xa4:Completed 210000 out of 500000 steps  (42%)
13:32:43:WU02:FS02:0xa4:Completed 215000 out of 500000 steps  (43%)
13:35:01:WU02:FS02:0xa4:Completed 220000 out of 500000 steps  (44%)
13:37:18:WU02:FS02:0xa4:Completed 225000 out of 500000 steps  (45%)
13:39:36:WU02:FS02:0xa4:Completed 230000 out of 500000 steps  (46%)
13:41:53:WU02:FS02:0xa4:Completed 235000 out of 500000 steps  (47%)
13:44:11:WU02:FS02:0xa4:Completed 240000 out of 500000 steps  (48%)
13:46:29:WU02:FS02:0xa4:Completed 245000 out of 500000 steps  (49%)
13:48:46:WU02:FS02:0xa4:Completed 250000 out of 500000 steps  (50%)
13:51:04:WU02:FS02:0xa4:Completed 255000 out of 500000 steps  (51%)
13:53:22:WU02:FS02:0xa4:Completed 260000 out of 500000 steps  (52%)
13:55:39:WU02:FS02:0xa4:Completed 265000 out of 500000 steps  (53%)
13:57:57:WU02:FS02:0xa4:Completed 270000 out of 500000 steps  (54%)
14:00:16:WU02:FS02:0xa4:Completed 275000 out of 500000 steps  (55%)
14:02:34:WU02:FS02:0xa4:Completed 280000 out of 500000 steps  (56%)
14:04:51:WU02:FS02:0xa4:Completed 285000 out of 500000 steps  (57%)
14:07:09:WU02:FS02:0xa4:Completed 290000 out of 500000 steps  (58%)
14:09:26:WU02:FS02:0xa4:Completed 295000 out of 500000 steps  (59%)
14:11:44:WU02:FS02:0xa4:Completed 300000 out of 500000 steps  (60%)
14:14:02:WU02:FS02:0xa4:Completed 305000 out of 500000 steps  (61%)
14:16:19:WU02:FS02:0xa4:Completed 310000 out of 500000 steps  (62%)
14:18:36:WU02:FS02:0xa4:Completed 315000 out of 500000 steps  (63%)
14:20:54:WU02:FS02:0xa4:Completed 320000 out of 500000 steps  (64%)
14:23:12:WU02:FS02:0xa4:Completed 325000 out of 500000 steps  (65%)
14:25:30:WU02:FS02:0xa4:Completed 330000 out of 500000 steps  (66%)
14:27:48:WU02:FS02:0xa4:Completed 335000 out of 500000 steps  (67%)
14:30:08:WU02:FS02:0xa4:Completed 340000 out of 500000 steps  (68%)
14:32:26:WU02:FS02:0xa4:Completed 345000 out of 500000 steps  (69%)
14:34:44:WU02:FS02:0xa4:Completed 350000 out of 500000 steps  (70%)
14:37:02:WU02:FS02:0xa4:Completed 355000 out of 500000 steps  (71%)
14:39:19:WU02:FS02:0xa4:Completed 360000 out of 500000 steps  (72%)
14:41:37:WU02:FS02:0xa4:Completed 365000 out of 500000 steps  (73%)
14:43:54:WU02:FS02:0xa4:Completed 370000 out of 500000 steps  (74%)
14:46:12:WU02:FS02:0xa4:Completed 375000 out of 500000 steps  (75%)
14:48:30:WU02:FS02:0xa4:Completed 380000 out of 500000 steps  (76%)
14:50:47:WU02:FS02:0xa4:Completed 385000 out of 500000 steps  (77%)
14:53:05:WU02:FS02:0xa4:Completed 390000 out of 500000 steps  (78%)
14:55:22:WU02:FS02:0xa4:Completed 395000 out of 500000 steps  (79%)
14:57:33:WU02:FS02:0xa4:Completed 400000 out of 500000 steps  (80%)
14:59:44:WU02:FS02:0xa4:Completed 405000 out of 500000 steps  (81%)
15:01:55:WU02:FS02:0xa4:Completed 410000 out of 500000 steps  (82%)
15:04:06:WU02:FS02:0xa4:Completed 415000 out of 500000 steps  (83%)
15:06:25:WU02:FS02:0xa4:Completed 420000 out of 500000 steps  (84%)
15:08:44:WU02:FS02:0xa4:Completed 425000 out of 500000 steps  (85%)
15:11:03:WU02:FS02:0xa4:Completed 430000 out of 500000 steps  (86%)
15:13:24:WU02:FS02:0xa4:Completed 435000 out of 500000 steps  (87%)
15:15:41:WU02:FS02:0xa4:Completed 440000 out of 500000 steps  (88%)
15:17:59:WU02:FS02:0xa4:Completed 445000 out of 500000 steps  (89%)
15:20:16:WU02:FS02:0xa4:Completed 450000 out of 500000 steps  (90%)
15:22:33:WU02:FS02:0xa4:Completed 455000 out of 500000 steps  (91%)
15:24:51:WU02:FS02:0xa4:Completed 460000 out of 500000 steps  (92%)
15:27:08:WU02:FS02:0xa4:Completed 465000 out of 500000 steps  (93%)
15:29:26:WU02:FS02:0xa4:Completed 470000 out of 500000 steps  (94%)
15:31:44:WU02:FS02:0xa4:Completed 475000 out of 500000 steps  (95%)
15:34:01:WU02:FS02:0xa4:Completed 480000 out of 500000 steps  (96%)
15:36:19:WU02:FS02:0xa4:Completed 485000 out of 500000 steps  (97%)
15:38:37:WU02:FS02:0xa4:Completed 490000 out of 500000 steps  (98%)
15:38:37:WU00:FS02:Connecting to assign3.stanford.edu:8080
15:38:37:WU00:FS02:News: Welcome to Folding@Home
15:38:37:WU00:FS02:Assigned to work server 171.64.65.75
15:38:37:WU00:FS02:Requesting new work unit for slot 02: RUNNING cpu:7 from 171.64.65.75
15:38:37:WU00:FS02:Connecting to 171.64.65.75:8080
15:38:39:WU00:FS02:Downloading 763.98KiB
15:38:41:WU00:FS02:Download complete
15:38:41:WU00:FS02:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:10140 run:28 clone:4 gen:0 core:0xa3 unit:0x000000010a3b1e6f5149ed740e9414bd
15:40:54:WU02:FS02:0xa4:Completed 495000 out of 500000 steps  (99%)
15:43:12:WU02:FS02:0xa4:Completed 500000 out of 500000 steps  (100%)
15:43:12:WU02:FS02:0xa4:DynamicWrapper: Finished Work Unit: sleep=10000
15:43:22:WU02:FS02:0xa4:
15:43:22:WU02:FS02:0xa4:Finished Work Unit:
15:43:22:WU02:FS02:0xa4:- Reading up to 1352328 from "02/wudata_01.trr": Read 1352328
15:43:22:WU02:FS02:0xa4:trr file hash check passed.
15:43:22:WU02:FS02:0xa4:- Reading up to 1506492 from "02/wudata_01.xtc": Read 1506492
15:43:22:WU02:FS02:0xa4:xtc file hash check passed.
15:43:22:WU02:FS02:0xa4:edr file hash check passed.
15:43:22:WU02:FS02:0xa4:logfile size: 28339
15:43:22:WU02:FS02:0xa4:Leaving Run
15:43:22:WU02:FS02:0xa4:- Writing 2895983 bytes of core data to disk...
15:43:22:WU02:FS02:0xa4:Done: 2895471 -> 2806555 (compressed to 96.9 percent)
15:43:22:WU02:FS02:0xa4:  ... Done.
15:43:22:WU02:FS02:0xa4:- Shutting down core
15:43:22:WU02:FS02:0xa4:
15:43:22:WU02:FS02:0xa4:Folding@home Core Shutdown: FINISHED_UNIT
15:43:23:WU02:FS02:FahCore returned: FINISHED_UNIT (100 = 0x64)
15:43:23:WU02:FS02:Sending unit results: id:02 state:SEND error:NO_ERROR project:8082 run:20 clone:41 gen:51 core:0xa4 unit:0x000000376652edb3512a0b586198ab71
15:43:23:WU02:FS02:Uploading 2.68MiB to 171.67.108.35
15:43:23:WU02:FS02:Connecting to 171.67.108.35:8080
15:43:23:WU00:FS02:Starting
15:43:23:WU00:FS02:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a3.fah/FahCore_a3.exe -dir 00 -suffix 01 -version 703 -lifeline 6832 -checkpoint 10 -np 7
15:43:23:WU00:FS02:Started FahCore on PID 3512
15:43:23:WU00:FS02:Core PID:6132
15:43:23:WU00:FS02:FahCore 0xa3 started
15:43:23:WU00:FS02:0xa3:
15:43:23:WU00:FS02:0xa3:*------------------------------*
15:43:23:WU00:FS02:0xa3:Folding@Home Gromacs SMP Core
15:43:23:WU00:FS02:0xa3:Version 2.27 (Dec. 15, 2010)
15:43:23:WU00:FS02:0xa3:
15:43:23:WU00:FS02:0xa3:Preparing to commence simulation
15:43:23:WU00:FS02:0xa3:- Looking at optimizations...
15:43:23:WU00:FS02:0xa3:- Created dyn
15:43:23:WU00:FS02:0xa3:- Files status OK
15:43:23:WU00:FS02:0xa3:- Expanded 781802 -> 2021624 (decompressed 258.5 percent)
15:43:23:WU00:FS02:0xa3:Called DecompressByteArray: compressed_data_size=781802 data_size=2021624, decompressed_data_size=2021624 diff=0
15:43:23:WU00:FS02:0xa3:- Digital signature verified
15:43:23:WU00:FS02:0xa3:
15:43:23:WU00:FS02:0xa3:Project: 10140 (Run 28, Clone 4, Gen 0)
15:43:23:WU00:FS02:0xa3:
15:43:23:WU00:FS02:0xa3:Assembly optimizations on if available.
15:43:23:WU00:FS02:0xa3:Entering M.D.
15:43:29:WU00:FS02:0xa3:Mapping NT from 7 to 7 
15:43:29:WU00:FS02:0xa3:mdrun returned 255
15:43:29:WU00:FS02:0xa3:Going to send back what have done -- stepsTotalG=2000000
15:43:29:WU00:FS02:0xa3:Work fraction=0.0000 steps=2000000.
15:43:30:WU02:FS02:Upload 98.06%
15:43:31:WU02:FS02:Upload complete
15:43:31:WU02:FS02:Server responded WORK_ACK (400)
15:43:31:WU02:FS02:Final credit estimate, 4221.00 points
15:43:31:WU02:FS02:Cleaning up
15:43:33:WARNING:WU00:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
15:43:33:WU00:FS02:Sending unit results: id:00 state:SEND error:FAULTY project:10140 run:28 clone:4 gen:0 core:0xa3 unit:0x000000010a3b1e6f5149ed740e9414bd
15:43:33:WU00:FS02:Uploading 656B to 171.64.65.75
15:43:33:WU00:FS02:Connecting to 171.64.65.75:8080
15:43:34:WU02:FS02:Connecting to assign3.stanford.edu:8080
15:43:34:WU00:FS02:Upload complete
15:43:34:WU00:FS02:Server responded WORK_ACK (400)
15:43:34:WU00:FS02:Cleaning up
15:43:34:WU02:FS02:News: Welcome to Folding@Home
15:43:34:WU02:FS02:Assigned to work server 171.64.65.75
15:43:34:WU02:FS02:Requesting new work unit for slot 02: READY cpu:7 from 171.64.65.75
15:43:34:WU02:FS02:Connecting to 171.64.65.75:8080
15:43:35:WU02:FS02:Downloading 769.34KiB
15:43:37:WU02:FS02:Download complete
15:43:37:WU02:FS02:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:10139 run:41 clone:4 gen:0 core:0xa3 unit:0x000000050a3b1e6f51474affbe1daf0d
15:43:37:WU02:FS02:Starting
15:43:37:WU02:FS02:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a3.fah/FahCore_a3.exe -dir 02 -suffix 01 -version 703 -lifeline 6832 -checkpoint 10 -np 7
15:43:37:WU02:FS02:Started FahCore on PID 6140
15:43:38:WU02:FS02:Core PID:11032
15:43:38:WU02:FS02:FahCore 0xa3 started
15:43:38:WU02:FS02:0xa3:
15:43:38:WU02:FS02:0xa3:*------------------------------*
15:43:38:WU02:FS02:0xa3:Folding@Home Gromacs SMP Core
15:43:38:WU02:FS02:0xa3:Version 2.27 (Dec. 15, 2010)
15:43:38:WU02:FS02:0xa3:
15:43:38:WU02:FS02:0xa3:Preparing to commence simulation
15:43:38:WU02:FS02:0xa3:- Looking at optimizations...
15:43:38:WU02:FS02:0xa3:- Created dyn
15:43:38:WU02:FS02:0xa3:- Files status OK
15:43:38:WU02:FS02:0xa3:- Expanded 787297 -> 2031392 (decompressed 258.0 percent)
15:43:38:WU02:FS02:0xa3:Called DecompressByteArray: compressed_data_size=787297 data_size=2031392, decompressed_data_size=2031392 diff=0
15:43:38:WU02:FS02:0xa3:- Digital signature verified
15:43:38:WU02:FS02:0xa3:
15:43:38:WU02:FS02:0xa3:Project: 10139 (Run 41, Clone 4, Gen 0)
15:43:38:WU02:FS02:0xa3:
15:43:38:WU02:FS02:0xa3:Assembly optimizations on if available.
15:43:38:WU02:FS02:0xa3:Entering M.D.
15:43:44:WU02:FS02:0xa3:Mapping NT from 7 to 7 
15:43:44:WU02:FS02:0xa3:mdrun returned 255
15:43:44:WU02:FS02:0xa3:Going to send back what have done -- stepsTotalG=2000000
15:43:44:WU02:FS02:0xa3:Work fraction=0.0000 steps=2000000.
15:43:48:WARNING:WU02:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
15:43:48:WU02:FS02:Sending unit results: id:02 state:SEND error:FAULTY project:10139 run:41 clone:4 gen:0 core:0xa3 unit:0x000000050a3b1e6f51474affbe1daf0d
15:43:48:WU02:FS02:Uploading 656B to 171.64.65.75
15:43:48:WU02:FS02:Connecting to 171.64.65.75:8080
15:43:48:WU00:FS02:Connecting to assign3.stanford.edu:8080
15:43:49:WU02:FS02:Upload complete
15:43:49:WU02:FS02:Server responded WORK_ACK (400)
15:43:49:WU02:FS02:Cleaning up
15:43:49:WU00:FS02:News: Welcome to Folding@Home
15:43:49:WU00:FS02:Assigned to work server 171.64.65.75
15:43:49:WU00:FS02:Requesting new work unit for slot 02: READY cpu:7 from 171.64.65.75
15:43:49:WU00:FS02:Connecting to 171.64.65.75:8080
15:43:50:WU00:FS02:Downloading 770.64KiB
15:43:53:WU00:FS02:Download complete
15:43:53:WU00:FS02:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:10138 run:46 clone:4 gen:0 core:0xa3 unit:0x000000030a3b1e6f514749797836d701
15:43:53:WU00:FS02:Starting
15:43:53:WU00:FS02:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a3.fah/FahCore_a3.exe -dir 00 -suffix 01 -version 703 -lifeline 6832 -checkpoint 10 -np 7
15:43:53:WU00:FS02:Started FahCore on PID 5000
15:43:53:WU00:FS02:Core PID:9280
15:43:53:WU00:FS02:FahCore 0xa3 started
15:43:53:WU00:FS02:0xa3:
15:43:53:WU00:FS02:0xa3:*------------------------------*
15:43:53:WU00:FS02:0xa3:Folding@Home Gromacs SMP Core
15:43:53:WU00:FS02:0xa3:Version 2.27 (Dec. 15, 2010)
15:43:53:WU00:FS02:0xa3:
15:43:53:WU00:FS02:0xa3:Preparing to commence simulation
15:43:53:WU00:FS02:0xa3:- Looking at optimizations...
15:43:53:WU00:FS02:0xa3:- Created dyn
15:43:53:WU00:FS02:0xa3:- Files status OK
15:43:53:WU00:FS02:0xa3:- Expanded 788628 -> 2035048 (decompressed 258.0 percent)
15:43:53:WU00:FS02:0xa3:Called DecompressByteArray: compressed_data_size=788628 data_size=2035048, decompressed_data_size=2035048 diff=0
15:43:53:WU00:FS02:0xa3:- Digital signature verified
15:43:53:WU00:FS02:0xa3:
15:43:53:WU00:FS02:0xa3:Project: 10138 (Run 46, Clone 4, Gen 0)
15:43:53:WU00:FS02:0xa3:
15:43:53:WU00:FS02:0xa3:Assembly optimizations on if available.
15:43:53:WU00:FS02:0xa3:Entering M.D.
15:43:59:WU00:FS02:0xa3:Mapping NT from 7 to 7 
15:43:59:WU00:FS02:0xa3:mdrun returned 255
15:43:59:WU00:FS02:0xa3:Going to send back what have done -- stepsTotalG=2000000
15:43:59:WU00:FS02:0xa3:Work fraction=0.0000 steps=2000000.
15:44:03:WARNING:WU00:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
15:44:03:WU00:FS02:Sending unit results: id:00 state:SEND error:FAULTY project:10138 run:46 clone:4 gen:0 core:0xa3 unit:0x000000030a3b1e6f514749797836d701
15:44:03:WU00:FS02:Uploading 656B to 171.64.65.75
15:44:03:WU00:FS02:Connecting to 171.64.65.75:8080
15:44:04:WU02:FS02:Connecting to assign3.stanford.edu:8080
15:44:04:WU00:FS02:Upload complete
15:44:04:WU00:FS02:Server responded WORK_ACK (400)
15:44:04:WU00:FS02:Cleaning up
15:44:04:WU02:FS02:News: Welcome to Folding@Home
15:44:04:WU02:FS02:Assigned to work server 171.64.65.75
15:44:04:WU02:FS02:Requesting new work unit for slot 02: READY cpu:7 from 171.64.65.75
15:44:04:WU02:FS02:Connecting to 171.64.65.75:8080
15:44:05:WU02:FS02:Downloading 770.93KiB
15:44:07:WU02:FS02:Download complete
15:44:08:WU02:FS02:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:10138 run:47 clone:4 gen:0 core:0xa3 unit:0x000000020a3b1e6f5147497e9ece7f5c
15:44:08:WU02:FS02:Starting
15:44:08:WU02:FS02:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a3.fah/FahCore_a3.exe -dir 02 -suffix 01 -version 703 -lifeline 6832 -checkpoint 10 -np 7
15:44:08:WU02:FS02:Started FahCore on PID 3496
15:44:08:WU02:FS02:Core PID:4776
15:44:08:WU02:FS02:FahCore 0xa3 started
15:44:08:WU02:FS02:0xa3:
15:44:08:WU02:FS02:0xa3:*------------------------------*
15:44:08:WU02:FS02:0xa3:Folding@Home Gromacs SMP Core
15:44:08:WU02:FS02:0xa3:Version 2.27 (Dec. 15, 2010)
15:44:08:WU02:FS02:0xa3:
15:44:08:WU02:FS02:0xa3:Preparing to commence simulation
15:44:08:WU02:FS02:0xa3:- Looking at optimizations...
15:44:08:WU02:FS02:0xa3:- Created dyn
15:44:08:WU02:FS02:0xa3:- Files status OK
15:44:08:WU02:FS02:0xa3:- Expanded 788916 -> 2035048 (decompressed 257.9 percent)
15:44:08:WU02:FS02:0xa3:Called DecompressByteArray: compressed_data_size=788916 data_size=2035048, decompressed_data_size=2035048 diff=0
15:44:08:WU02:FS02:0xa3:- Digital signature verified
15:44:08:WU02:FS02:0xa3:
15:44:08:WU02:FS02:0xa3:Project: 10138 (Run 47, Clone 4, Gen 0)
15:44:08:WU02:FS02:0xa3:
15:44:08:WU02:FS02:0xa3:Assembly optimizations on if available.
15:44:08:WU02:FS02:0xa3:Entering M.D.
15:44:14:WU02:FS02:0xa3:Mapping NT from 7 to 7 
15:44:14:WU02:FS02:0xa3:mdrun returned 255
15:44:14:WU02:FS02:0xa3:Going to send back what have done -- stepsTotalG=2000000
15:44:14:WU02:FS02:0xa3:Work fraction=0.0000 steps=2000000.
15:44:18:WU02:FS02:0xa3:logfile size=0 infoLength=0 edr=0 trr=25
15:44:18:WU02:FS02:0xa3:logfile size: 0 info=0 bed=0 hdr=25
15:44:18:WU02:FS02:0xa3:- Writing 641 bytes of core data to disk...
15:44:18:WU02:FS02:0xa3:Done: 129 -> 144 (compressed to 111.6 percent)
15:44:18:WU02:FS02:0xa3:  ... Done.
15:44:18:WU02:FS02:0xa3:
15:44:18:WU02:FS02:0xa3:Folding@home Core Shutdown: EARLY_UNIT_END
15:44:18:WARNING:WU02:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
15:44:18:WU02:FS02:Sending unit results: id:02 state:SEND error:FAULTY project:10138 run:47 clone:4 gen:0 core:0xa3 unit:0x000000020a3b1e6f5147497e9ece7f5c
15:44:18:WU02:FS02:Uploading 656B to 171.64.65.75
15:44:18:WU02:FS02:Connecting to 171.64.65.75:8080
15:44:19:WU00:FS02:Connecting to assign3.stanford.edu:8080
15:44:19:WU02:FS02:Upload complete
15:44:19:WU02:FS02:Server responded WORK_ACK (400)
15:44:19:WU02:FS02:Cleaning up
15:44:19:WU00:FS02:News: Welcome to Folding@Home
15:44:19:WU00:FS02:Assigned to work server 171.64.65.75
15:44:19:WU00:FS02:Requesting new work unit for slot 02: READY cpu:7 from 171.64.65.75
15:44:19:WU00:FS02:Connecting to 171.64.65.75:8080
15:44:20:WU00:FS02:Downloading 769.48KiB
15:44:22:WU00:FS02:Download complete
15:44:22:WU00:FS02:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:10139 run:44 clone:4 gen:0 core:0xa3 unit:0x000000020a3b1e6f51474b0b8c6642aa
15:44:22:WU00:FS02:Starting
15:44:22:WU00:FS02:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a3.fah/FahCore_a3.exe -dir 00 -suffix 01 -version 703 -lifeline 6832 -checkpoint 10 -np 7
15:44:22:WU00:FS02:Started FahCore on PID 4292
15:44:22:WU00:FS02:Core PID:8676
15:44:22:WU00:FS02:FahCore 0xa3 started
15:44:23:WU00:FS02:0xa3:
15:44:23:WU00:FS02:0xa3:*------------------------------*
15:44:23:WU00:FS02:0xa3:Folding@Home Gromacs SMP Core
15:44:23:WU00:FS02:0xa3:Version 2.27 (Dec. 15, 2010)
15:44:23:WU00:FS02:0xa3:
15:44:23:WU00:FS02:0xa3:Preparing to commence simulation
15:44:23:WU00:FS02:0xa3:- Looking at optimizations...
15:44:23:WU00:FS02:0xa3:- Created dyn
15:44:23:WU00:FS02:0xa3:- Files status OK
15:44:23:WU00:FS02:0xa3:- Expanded 787437 -> 2031392 (decompressed 257.9 percent)
15:44:23:WU00:FS02:0xa3:Called DecompressByteArray: compressed_data_size=787437 data_size=2031392, decompressed_data_size=2031392 diff=0
15:44:23:WU00:FS02:0xa3:- Digital signature verified
15:44:23:WU00:FS02:0xa3:
15:44:23:WU00:FS02:0xa3:Project: 10139 (Run 44, Clone 4, Gen 0)
15:44:23:WU00:FS02:0xa3:
15:44:23:WU00:FS02:0xa3:Assembly optimizations on if available.
15:44:23:WU00:FS02:0xa3:Entering M.D.
15:44:29:WU00:FS02:0xa3:Mapping NT from 7 to 7 
15:44:29:WU00:FS02:0xa3:mdrun returned 255
15:44:29:WU00:FS02:0xa3:Going to send back what have done -- stepsTotalG=2000000
15:44:29:WU00:FS02:0xa3:Work fraction=0.0000 steps=2000000.
15:44:33:WU00:FS02:0xa3:logfile size=0 infoLength=0 edr=0 trr=25
15:44:33:WU00:FS02:0xa3:logfile size: 0 info=0 bed=0 hdr=25
15:44:33:WU00:FS02:0xa3:- Writing 641 bytes of core data to disk...
15:44:33:WU00:FS02:0xa3:Done: 129 -> 144 (compressed to 111.6 percent)
15:44:33:WU00:FS02:0xa3:  ... Done.
15:44:33:WU00:FS02:0xa3:
15:44:33:WU00:FS02:0xa3:Folding@home Core Shutdown: EARLY_UNIT_END
15:44:33:WARNING:WU00:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
15:44:33:WU00:FS02:Sending unit results: id:00 state:SEND error:FAULTY project:10139 run:44 clone:4 gen:0 core:0xa3 unit:0x000000020a3b1e6f51474b0b8c6642aa
15:44:33:WU00:FS02:Uploading 656B to 171.64.65.75
15:44:33:WU00:FS02:Connecting to 171.64.65.75:8080
15:44:33:WU00:FS02:Upload complete
15:44:34:WU00:FS02:Server responded WORK_ACK (400)
15:44:34:WU00:FS02:Cleaning up
******************************* Date: 2013-03-29 *******************************

Re: Failing SMP WUs - help wanted

Posted: Fri Mar 29, 2013 6:14 pm
by bollix47
15:44:29:WU00:FS02:0xa3:Mapping NT from 7 to 7
Some projects have difficulty with SMP:7. Either switch to Full Power using the slider, or use FAHControl (a.k.a. Advanced Control) and edit your CPU slot to use 8 or 6 CPUs (i.e. change the number of CPUs from -1 to 8 or 6). Using 8 should be fine unless you're running a GPU slot as well. Without that information I can't be more specific.
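To check whether the failures line up with a particular thread count, the log can be tallied by the `-np` value each core was launched with. This is only a rough Python sketch, not an official FAH tool; the function name `summarize_core_results` and the regular expressions are my own assumptions, based solely on the log format posted above.

```python
import re
from collections import Counter

# Matches the launch line and captures the work unit, slot and -np value.
NP_RE = re.compile(
    r"^\d\d:\d\d:\d\d:(WU\d+):(FS\d+):Running FahCore:.* -np (\d+)\s*$"
)
# Matches the core's return line (with or without the WARNING: prefix).
RET_RE = re.compile(
    r"^\d\d:\d\d:\d\d:(?:WARNING:)?(WU\d+):(FS\d+):FahCore returned: (\w+)"
)


def summarize_core_results(lines):
    """Count FahCore return codes, keyed by (-np value, return code)."""
    threads = {}  # (WU, FS) -> -np value of the most recent launch
    results = Counter()
    for line in lines:
        m = NP_RE.match(line)
        if m:
            threads[(m.group(1), m.group(2))] = int(m.group(3))
            continue
        m = RET_RE.match(line)
        if m:
            np_value = threads.get((m.group(1), m.group(2)))  # None if unseen
            results[(np_value, m.group(3))] += 1
    return results


# Two lines copied from the log in the first post.
SAMPLE = [
    r'15:43:23:WU00:FS02:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a3.fah/FahCore_a3.exe -dir 00 -suffix 01 -version 703 -lifeline 6832 -checkpoint 10 -np 7',
    "15:43:33:WARNING:WU00:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)",
]

for (np_value, code), count in summarize_core_results(SAMPLE).items():
    print(f"-np {np_value}: {code} x{count}")
```

If every BAD_WORK_UNIT shows up under `-np 7` and none under 6 or 8, that would be consistent with the SMP:7 explanation, though it would not prove it.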

Re: Failing SMP WUs - help wanted

Posted: Fri Mar 29, 2013 6:25 pm
by Breach
I am running 2x GPU slots as well. I'll set this to 8 now.

Re: Failing SMP WUs - help wanted

Posted: Fri Mar 29, 2013 6:29 pm
by bollix47
Currently, if the 2 GPU slots are running on your GTX 295 or other Nvidia hardware, then 8 might be fine, but if they're AMD/ATI GPUs, or you see a slowdown of your CPU slot when your GPUs are running, then 6 might be better.

Re: Failing SMP WUs - help wanted

Posted: Fri Mar 29, 2013 8:30 pm
by Breach
Well, I know that the GPUs do cause some CPU load, but nothing like a full core, at least on NVIDIA. Is it better to leave the CPU slot at 8 threads so that 2 of those share with the GPUs, or switch to 6 (thus dedicating part of two CPU threads to the GPUs' processing)? Thanks.

Re: Failing SMP WUs - help wanted

Posted: Fri Mar 29, 2013 8:43 pm
by P5-133XL
There is no clear answer, but my bet is that SMP:6 will work better. It does not take a full core of usage to cause a very significant SMP slowdown. My best suggestion is to test both choices and see what works best for you.
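One way to compare the two settings is to measure how long each 1% of a WU takes from the "Completed ... steps" lines in the log. The Python sketch below is an illustration only; `mean_seconds_per_percent` is a made-up helper name, and it assumes you feed it lines from a single uninterrupted run at one thread count (a restart from a checkpoint repeats some steps and would skew the number).

```python
import re
from datetime import datetime

# Captures the timestamp, completed steps and total steps of a progress line.
PROGRESS_RE = re.compile(
    r"^(\d\d:\d\d:\d\d):WU\d+:FS\d+:0x[0-9a-f]+:Completed (\d+) out of (\d+) steps"
)


def mean_seconds_per_percent(lines):
    """Average wall-clock seconds per 1% between the first and last progress line."""
    points = []
    for line in lines:
        m = PROGRESS_RE.match(line)
        if m:
            stamp = datetime.strptime(m.group(1), "%H:%M:%S")
            points.append((stamp, int(m.group(2)), int(m.group(3))))
    if len(points) < 2:
        return None  # not enough data to measure anything
    (t0, s0, total), (t1, s1, _) = points[0], points[-1]
    elapsed = (t1 - t0).total_seconds()
    if elapsed < 0:
        elapsed += 24 * 3600  # the log has no dates; assume one midnight rollover
    percent = 100.0 * (s1 - s0) / total
    if percent <= 0:
        return None
    return elapsed / percent


# Lines copied from the log in the first post: before and after the restart.
before = [
    "11:31:14:WU02:FS02:0xa4:Completed 5000 out of 500000 steps  (1%)",
    "11:34:19:WU02:FS02:0xa4:Completed 10000 out of 500000 steps  (2%)",
]
after = [
    "13:17:05:WU02:FS02:0xa4:Completed 180000 out of 500000 steps  (36%)",
    "13:21:27:WU02:FS02:0xa4:Completed 190000 out of 500000 steps  (38%)",
]
print(mean_seconds_per_percent(before))  # 185.0
print(mean_seconds_per_percent(after))   # 131.0
```

Only compare numbers taken from the same project, since different projects run at different speeds, and use a longer stretch of the log than two lines to smooth out noise.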

Re: Failing SMP WUs - help wanted

Posted: Sat Mar 30, 2013 9:38 am
by Breach
Thanks. Is there any way to configure FAH so that it uses X CPU cores when there are GPU slots running and Y when they are not?

Re: Failing SMP WUs - help wanted

Posted: Sat Mar 30, 2013 1:57 pm
by P5-133XL
No

Re: Failing SMP WUs - help wanted

Posted: Sat Mar 30, 2013 2:44 pm
by Jesse_V
Breach wrote:Thanks. Is there any way to configure FAH so that it uses X CPU cores when there are GPU slots running and Y when they are not?
You'd have to make this adjustment manually in Advanced Control.

Re: Failing SMP WUs - help wanted

Posted: Sat Mar 30, 2013 7:49 pm
by PantherX
Jesse_V wrote:
Breach wrote:Thanks. Is there any way to configure FAH so that it uses X CPU cores when there are GPU slots running and Y when they are not?
You'd have to make this adjustment manually in Advanced Control.
While that is true, do note that some SMP WUs don't like having the number of threads changed mid-WU, so they can error out.
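The client does log a warning when this happens (one appears in the log in the first post), so it is easy to check whether a WU was restarted with a different thread count. A small Python sketch, assuming the warning text stays as shown in that log; `find_thread_changes` is a hypothetical helper name, not part of FAH:

```python
import re

# Matches the client's warning and captures the WU plus old and new thread counts.
CHANGE_RE = re.compile(
    r"^\d\d:\d\d:\d\d:WARNING:(WU\d+):FS\d+:Changed SMP threads from (\d+) to (\d+)"
)


def find_thread_changes(lines):
    """Return (work unit, old threads, new threads) for each warning found."""
    changes = []
    for line in lines:
        m = CHANGE_RE.match(line)
        if m:
            changes.append((m.group(1), int(m.group(2)), int(m.group(3))))
    return changes


# Line copied from the log in the first post.
sample = [
    "13:16:19:WARNING:WU02:FS02:Changed SMP threads from 8 to 7 this can cause some work units to fail",
]
print(find_thread_changes(sample))  # [('WU02', 8, 7)]
```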