Page 1 of 1
Project: 6900 (Run 9, Clone 13, Gen 8)
Posted: Thu Dec 09, 2010 7:10 pm
by campbbri
I received a client communication error on project 6900. I'm running a Core i7 @ 3.8 ghz and 6 GB RAM, and this was my first error after running many bigadv WUs. I haven't made any configuration changes in 6 months or so. Interestingly enough I was assigned the exact same WU and it processed all the way through and was sent without any issues. Does this mean it's my hardware somehow? I figure it can't hurt to report it in either case. Here's part of the log:
Code: Select all
[14:48:49] Project: 6900 (Run 9, Clone 13, Gen 8)
[14:48:49]
[14:48:49] Assembly optimizations on if available.
[14:48:49] Entering M.D.
[14:48:58] Completed 0 out of 250000 steps (0%)
[15:21:54] Completed 2500 out of 250000 steps (1%)
[15:54:27] Completed 5000 out of 250000 steps (2%)
[16:26:21] Completed 7500 out of 250000 steps (3%)
[16:58:11] Completed 10000 out of 250000 steps (4%)
[17:36:36] CoreStatus = C0000005 (-1073741819)
[17:36:36] Client-core communications error: ERROR 0xc0000005
[17:36:36] Deleting current work unit & continuing...
[17:36:52] - Preparing to get new work unit...
[17:36:52] Cleaning up work directory
[17:36:58] + Attempting to get work packet
[17:36:58] Passkey found
[17:36:58] - Connecting to assignment server
[17:36:58] - Successful: assigned to (130.237.232.141).
[17:36:58] + News From Folding@Home: Welcome to Folding@Home
[17:36:58] Loaded queue successfully.
[17:38:16] + Closed connections
[17:38:21]
[17:38:21] + Processing work unit
[17:38:21] Core required: FahCore_a3.exe
[17:38:21] Core found.
[17:38:21] Working on queue slot 05 [December 4 17:38:21 UTC]
[17:38:21] + Working ...
[17:38:21]
[17:38:21] *------------------------------*
[17:38:21] Folding@Home Gromacs SMP Core
[17:38:21] Version 2.22 (Mar 12, 2010)
[17:38:21]
[17:38:21] Preparing to commence simulation
[17:38:21] - Looking at optimizations...
[17:38:21] - Created dyn
[17:38:21] - Files status OK
[17:38:25] - Expanded 24861664 -> 30796293 (decompressed 123.8 percent)
[17:38:25] Called DecompressByteArray: compressed_data_size=24861664 data_size=30796293, decompressed_data_size=30796293 diff=0
[17:38:25] - Digital signature verified
[17:38:25]
[17:38:25] Project: 6900 (Run 9, Clone 13, Gen 8)
[17:38:25]
[17:38:25] Assembly optimizations on if available.
[17:38:25] Entering M.D.
[17:38:34] Completed 0 out of 250000 steps (0%)
[18:11:26] Completed 2500 out of 250000 steps (1%)
[18:44:26] Completed 5000 out of 250000 steps (2%)
[19:17:19] Completed 7500 out of 250000 steps (3%)
[19:50:09] Completed 10000 out of 250000 steps (4%)
[20:23:03] Completed 12500 out of 250000 steps (5%)
[20:56:38] Completed 15000 out of 250000 steps (6%)
[21:30:13] Completed 17500 out of 250000 steps (7%)
[continued to 100% and then sent to server without issues]
Re: Project: 6900 (Run 9, Clone 13, Gen 8)
Posted: Thu Dec 09, 2010 7:52 pm
by sortofageek
You really didn't show the whole picture, so I can't really see what happened. I do know the WU was completed, but I'm not sure it was you since I can't see your folding name and team number in the log portion you posted.
Hi Bxxxx (team 6xxx3),
Your WU (P6900 R9 C13 G8) was added to the stats database on 2010-12-06 21:06:04 for 72195.2 points of credit.
Re: Project: 6900 (Run 9, Clone 13, Gen 8)
Posted: Thu Dec 09, 2010 9:30 pm
by campbbri
Sorry, I didn't want to paste too much junk. You can see in the expanded log I've attached where I stopped and restarted the computer that my team is 61483, username is "Brian" and user ID is 75FC0D3265561F5D. I'm not worried about the points, but I wondered if the error was due to the WU or my hardware and I thought it was interesting that I was assigned the exact same WU.
Code: Select all
[14:47:40] - Preparing to get new work unit...
[14:47:40] Cleaning up work directory
[14:47:40] + Attempting to get work packet
[14:47:40] Passkey found
[14:47:40] - Connecting to assignment server
[14:47:41] - Successful: assigned to (130.237.232.141).
[14:47:41] + News From Folding@Home: Welcome to Folding@Home
[14:47:41] Loaded queue successfully.
[14:48:45] + Closed connections
[14:48:45]
[14:48:45] + Processing work unit
[14:48:45] Core required: FahCore_a3.exe
[14:48:45] Core found.
[14:48:45] Working on queue slot 04 [December 4 14:48:45 UTC]
[14:48:45] + Working ...
[14:48:45]
[14:48:45] *------------------------------*
[14:48:45] Folding@Home Gromacs SMP Core
[14:48:45] Version 2.22 (Mar 12, 2010)
[14:48:45]
[14:48:45] Preparing to commence simulation
[14:48:45] - Looking at optimizations...
[14:48:45] - Created dyn
[14:48:45] - Files status OK
[14:48:49] - Expanded 24861664 -> 30796293 (decompressed 123.8 percent)
[14:48:49] Called DecompressByteArray: compressed_data_size=24861664 data_size=30796293, decompressed_data_size=30796293 diff=0
[14:48:49] - Digital signature verified
[14:48:49]
[14:48:49] Project: 6900 (Run 9, Clone 13, Gen 8)
[14:48:49]
[14:48:49] Assembly optimizations on if available.
[14:48:49] Entering M.D.
[14:48:58] Completed 0 out of 250000 steps (0%)
[15:21:54] Completed 2500 out of 250000 steps (1%)
[15:54:27] Completed 5000 out of 250000 steps (2%)
[16:26:21] Completed 7500 out of 250000 steps (3%)
[16:58:11] Completed 10000 out of 250000 steps (4%)
[17:36:36] CoreStatus = C0000005 (-1073741819)
[17:36:36] Client-core communications error: ERROR 0xc0000005
[17:36:36] Deleting current work unit & continuing...
[17:36:52] - Preparing to get new work unit...
[17:36:52] Cleaning up work directory
[17:36:58] + Attempting to get work packet
[17:36:58] Passkey found
[17:36:58] - Connecting to assignment server
[17:36:58] - Successful: assigned to (130.237.232.141).
[17:36:58] + News From Folding@Home: Welcome to Folding@Home
[17:36:58] Loaded queue successfully.
[17:38:16] + Closed connections
[17:38:21]
[17:38:21] + Processing work unit
[17:38:21] Core required: FahCore_a3.exe
[17:38:21] Core found.
[17:38:21] Working on queue slot 05 [December 4 17:38:21 UTC]
[17:38:21] + Working ...
[17:38:21]
[17:38:21] *------------------------------*
[17:38:21] Folding@Home Gromacs SMP Core
[17:38:21] Version 2.22 (Mar 12, 2010)
[17:38:21]
[17:38:21] Preparing to commence simulation
[17:38:21] - Looking at optimizations...
[17:38:21] - Created dyn
[17:38:21] - Files status OK
[17:38:25] - Expanded 24861664 -> 30796293 (decompressed 123.8 percent)
[17:38:25] Called DecompressByteArray: compressed_data_size=24861664 data_size=30796293, decompressed_data_size=30796293 diff=0
[17:38:25] - Digital signature verified
[17:38:25]
[17:38:25] Project: 6900 (Run 9, Clone 13, Gen 8)
[17:38:25]
[17:38:25] Assembly optimizations on if available.
[17:38:25] Entering M.D.
[17:38:34] Completed 0 out of 250000 steps (0%)
[18:11:26] Completed 2500 out of 250000 steps (1%)
[18:44:26] Completed 5000 out of 250000 steps (2%)
[19:17:19] Completed 7500 out of 250000 steps (3%)
[19:50:09] Completed 10000 out of 250000 steps (4%)
[20:23:03] Completed 12500 out of 250000 steps (5%)
[20:56:38] Completed 15000 out of 250000 steps (6%)
[21:30:13] Completed 17500 out of 250000 steps (7%)
[22:03:25] Completed 20000 out of 250000 steps (8%)
[22:36:18] Completed 22500 out of 250000 steps (9%)
[23:09:12] Completed 25000 out of 250000 steps (10%)
[23:42:10] Completed 27500 out of 250000 steps (11%)
[00:15:14] Completed 30000 out of 250000 steps (12%)
[00:48:07] Completed 32500 out of 250000 steps (13%)
[01:21:18] Completed 35000 out of 250000 steps (14%)
[01:55:09] Completed 37500 out of 250000 steps (15%)
[02:28:16] Completed 40000 out of 250000 steps (16%)
[03:01:28] Completed 42500 out of 250000 steps (17%)
[03:34:32] Completed 45000 out of 250000 steps (18%)
[04:07:32] Completed 47500 out of 250000 steps (19%)
[04:40:39] Completed 50000 out of 250000 steps (20%)
[05:14:21] Completed 52500 out of 250000 steps (21%)
[05:47:58] Completed 55000 out of 250000 steps (22%)
[06:23:18] Completed 57500 out of 250000 steps (23%)
[06:56:19] Completed 60000 out of 250000 steps (24%)
[07:29:16] Completed 62500 out of 250000 steps (25%)
[08:02:13] Completed 65000 out of 250000 steps (26%)
[08:35:07] Completed 67500 out of 250000 steps (27%)
[09:08:02] Completed 70000 out of 250000 steps (28%)
[09:41:28] Completed 72500 out of 250000 steps (29%)
[10:14:27] Completed 75000 out of 250000 steps (30%)
[10:47:19] Completed 77500 out of 250000 steps (31%)
[11:20:10] Completed 80000 out of 250000 steps (32%)
[11:53:12] Completed 82500 out of 250000 steps (33%)
[12:26:13] Completed 85000 out of 250000 steps (34%)
[12:59:32] Completed 87500 out of 250000 steps (35%)
[13:32:49] Completed 90000 out of 250000 steps (36%)
[14:08:00] Completed 92500 out of 250000 steps (37%)
[14:46:28] Completed 95000 out of 250000 steps (38%)
[15:20:23] Completed 97500 out of 250000 steps (39%)
[15:53:25] Completed 100000 out of 250000 steps (40%)
[16:26:35] Completed 102500 out of 250000 steps (41%)
[17:00:48] Completed 105000 out of 250000 steps (42%)
[17:34:51] Completed 107500 out of 250000 steps (43%)
[18:09:15] Completed 110000 out of 250000 steps (44%)
[18:42:51] Completed 112500 out of 250000 steps (45%)
[19:16:20] Completed 115000 out of 250000 steps (46%)
[19:49:17] Completed 117500 out of 250000 steps (47%)
[20:22:01] Completed 120000 out of 250000 steps (48%)
[20:54:49] Completed 122500 out of 250000 steps (49%)
[21:27:28] Completed 125000 out of 250000 steps (50%)
[22:00:18] Completed 127500 out of 250000 steps (51%)
[22:33:03] Completed 130000 out of 250000 steps (52%)
[23:05:46] Completed 132500 out of 250000 steps (53%)
[23:38:32] Completed 135000 out of 250000 steps (54%)
[00:12:30] Completed 137500 out of 250000 steps (55%)
Folding@Home Client Shutdown at user request.
Folding@Home Client Shutdown.
--- Opening Log file [December 6 03:21:26 UTC]
# Windows SMP Console Edition #################################################
###############################################################################
Folding@Home Client Version 6.29
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: c:\FAH
Executable: fah
Arguments: -smp -bigadv
[03:21:26] - Ask before connecting: No
[03:21:26] - User name: Brian (Team 61483)
[03:21:26] - User ID: 75FC0D3265561F5D
[03:21:26] - Machine ID: 1
[03:21:26]
[03:21:26] Loaded queue successfully.
[03:21:26]
[03:21:26] + Processing work unit
[03:21:26] Core required: FahCore_a3.exe
[03:21:26] Core found.
[03:21:26] Working on queue slot 05 [December 6 03:21:26 UTC]
[03:21:26] + Working ...
[03:21:26]
[03:21:26] *------------------------------*
[03:21:26] Folding@Home Gromacs SMP Core
[03:21:26] Version 2.22 (Mar 12, 2010)
[03:21:26]
[03:21:26] Preparing to commence simulation
[03:21:26] - Ensuring status. Please wait.
[03:21:35] - Looking at optimizations...
[03:21:35] - Working with standard loops on this execution.
[03:21:35] - Previous termination of core was improper.
[03:21:35] - Files status OK
[03:21:40] - Expanded 24861664 -> 30796293 (decompressed 123.8 percent)
[03:21:40] Called DecompressByteArray: compressed_data_size=24861664 data_size=30796293, decompressed_data_size=30796293 diff=0
[03:21:40] - Digital signature verified
[03:21:40]
[03:21:40] Project: 6900 (Run 9, Clone 13, Gen 8)
[03:21:40]
[03:21:40] Entering M.D.
[03:21:46] Using Gromacs checkpoints
[03:21:52] Resuming from checkpoint
[03:21:52] Verified work/wudata_05.log
[03:21:55] Verified work/wudata_05.trr
[03:21:55] Verified work/wudata_05.xtc
[03:21:55] Verified work/wudata_05.edr
[03:21:56] Completed 137205 out of 250000 steps (54%)
[03:25:38] Completed 137500 out of 250000 steps (55%)
[03:57:21] Completed 140000 out of 250000 steps (56%)
[04:29:07] Completed 142500 out of 250000 steps (57%)
[05:00:48] Completed 145000 out of 250000 steps (58%)
[05:32:34] Completed 147500 out of 250000 steps (59%)
[06:05:46] Completed 150000 out of 250000 steps (60%)
[06:37:41] Completed 152500 out of 250000 steps (61%)
[07:09:22] Completed 155000 out of 250000 steps (62%)
[07:41:05] Completed 157500 out of 250000 steps (63%)
[08:12:42] Completed 160000 out of 250000 steps (64%)
[08:44:45] Completed 162500 out of 250000 steps (65%)
[09:16:27] Completed 165000 out of 250000 steps (66%)
[09:48:14] Completed 167500 out of 250000 steps (67%)
[10:19:58] Completed 170000 out of 250000 steps (68%)
[10:51:40] Completed 172500 out of 250000 steps (69%)
[11:23:26] Completed 175000 out of 250000 steps (70%)
[11:55:10] Completed 177500 out of 250000 steps (71%)
[12:27:03] Completed 180000 out of 250000 steps (72%)
[12:58:49] Completed 182500 out of 250000 steps (73%)
[13:30:30] Completed 185000 out of 250000 steps (74%)
[14:02:25] Completed 187500 out of 250000 steps (75%)
[14:35:07] Completed 190000 out of 250000 steps (76%)
[15:06:46] Completed 192500 out of 250000 steps (77%)
[15:38:32] Completed 195000 out of 250000 steps (78%)
[16:10:17] Completed 197500 out of 250000 steps (79%)
[16:42:08] Completed 200000 out of 250000 steps (80%)
[17:14:07] Completed 202500 out of 250000 steps (81%)
[17:46:02] Completed 205000 out of 250000 steps (82%)
[18:18:55] Completed 207500 out of 250000 steps (83%)
[18:51:26] Completed 210000 out of 250000 steps (84%)
[19:23:48] Completed 212500 out of 250000 steps (85%)
[19:55:46] Completed 215000 out of 250000 steps (86%)
[20:27:41] Completed 217500 out of 250000 steps (87%)
[20:59:35] Completed 220000 out of 250000 steps (88%)
[21:31:36] Completed 222500 out of 250000 steps (89%)
[22:03:37] Completed 225000 out of 250000 steps (90%)
[22:35:42] Completed 227500 out of 250000 steps (91%)
[23:07:44] Completed 230000 out of 250000 steps (92%)
[23:39:39] Completed 232500 out of 250000 steps (93%)
[00:11:38] Completed 235000 out of 250000 steps (94%)
[00:43:58] Completed 237500 out of 250000 steps (95%)
[01:16:14] Completed 240000 out of 250000 steps (96%)
[01:48:30] Completed 242500 out of 250000 steps (97%)
[02:20:27] Completed 245000 out of 250000 steps (98%)
[02:54:29] Completed 247500 out of 250000 steps (99%)
[03:29:18] Completed 250000 out of 250000 steps (100%)
[03:29:30] DynamicWrapper: Finished Work Unit: sleep=10000
[03:29:40]
[03:29:40] Finished Work Unit:
[03:29:40] - Reading up to 52713120 from "work/wudata_05.trr": Read 52713120
[03:29:40] trr file hash check passed.
[03:29:40] - Reading up to 47145736 from "work/wudata_05.xtc": Read 47145736
[03:29:40] xtc file hash check passed.
[03:29:40] edr file hash check passed.
[03:29:40] logfile size: 203554
[03:29:40] Leaving Run
[03:29:40] - Writing 100230350 bytes of core data to disk...
[03:29:41] ... Done.
[03:30:02] - Shutting down core
[03:30:04]
[03:30:04] Folding@home Core Shutdown: FINISHED_UNIT
[03:30:11] CoreStatus = 64 (100)
[03:30:11] Sending work to server
[03:30:11] Project: 6900 (Run 9, Clone 13, Gen 8)
[03:30:11] + Attempting to send results [December 7 03:30:11 UTC]
[04:00:14] - Couldn't send HTTP request to server
[04:00:14] + Could not connect to Work Server (results)
[04:00:14] (130.237.232.141:8080)
[04:00:14] + Retrying using alternative port
[04:06:23] + Results successfully sent
[04:06:23] Thank you for your contribution to Folding@Home.
[04:06:23] + Number of Units Completed: 133
[04:06:29] - Preparing to get new work unit...
[04:06:29] Cleaning up work directory
[04:06:35] + Attempting to get work packet
[04:06:35] Passkey found
[04:06:35] - Connecting to assignment server
[04:06:36] - Successful: assigned to (130.237.232.141).
[04:06:36] + News From Folding@Home: Welcome to Folding@Home
[04:06:36] Loaded queue successfully.
[04:07:33] + Closed connections
[04:07:33]
[04:07:33] + Processing work unit
[04:07:33] Core required: FahCore_a3.exe
[04:07:33] Core found.
[04:07:33] Working on queue slot 06 [December 7 04:07:33 UTC]
[04:07:33] + Working ...
[04:07:33]
[04:07:33] *------------------------------*
[04:07:33] Folding@Home Gromacs SMP Core
[04:07:33] Version 2.22 (Mar 12, 2010)
Re: Project: 6900 (Run 9, Clone 13, Gen 8)
Posted: Fri Dec 10, 2010 6:24 pm
by bruce
Servers commonly reassign the same WU after an error, if only to rule out hardware issues.
Communications errors mean that some essential process died. There are many causes, so they're hard to diagnose. A marginally stable overclock often gives this error, and since it's obviously intermittent (and rare) that might be the cause. FAH is particularly sensitive to memory errors, so if your RAM settings are too aggressive or there's too much heat near the RAM, that's something you can do something about. That's just a guess, though.
Re: Project: 6900 (Run 9, Clone 13, Gen 8)
Posted: Fri Dec 10, 2010 6:28 pm
by sortofageek
campbbri wrote: my team is 61483, username is "Brian"
Thanks, I can now confirm that was your result.