Re: P3906 Gen 1 all failing

Posted: Fri Jan 18, 2008 6:17 pm
by MDCRL
Gotcha, I didn't realize it would figure them that way. So when is the FahMon revision due out?

Re: P3906 Gen 1 all failing

Posted: Fri Jan 18, 2008 6:38 pm
by 7im
Uncle Fungus is aware of the issue. You can follow along with the progress of FahMon in this thread: http://foldingforum.org/viewtopic.php?f=14&t=40

Re: P3906 Gen 1 all failing

Posted: Fri Jan 18, 2008 7:34 pm
by Ren02
An interesting side note:
Thanks to this error report and this one, it is possible to assess how long the first generation took to process: slightly under 2 weeks.
I believe the 3906 had the same 59/86 deadline as the 3907. Yet within 2 weeks it had run out of gen 0 WUs to issue. If some of the gen 0 WUs don't get returned, it will take another 1.5 months before they are reissued. No wonder the generation skew can become long...
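The 1.5 month figure above can be reproduced with a little arithmetic. This is only a sketch under two assumptions that are not confirmed in the thread: that "59/86" means a preferred deadline of 59 days and a final deadline of 86 days, and that an unreturned WU is reissued once its preferred deadline passes. The function name is made up for illustration.

```python
# Assumption: "59/86" is preferred/final deadline in days for the project.
PREFERRED_DEADLINE_DAYS = 59
FINAL_DEADLINE_DAYS = 86


def days_until_reissue(days_since_issue, preferred_deadline_days=PREFERRED_DEADLINE_DAYS):
    """Days left before an unreturned WU would be reissued.

    Assumes reissue happens at the preferred deadline (hypothetical rule).
    """
    return max(0, preferred_deadline_days - days_since_issue)


# Gen 0 ran dry after roughly 2 weeks, so the oldest WUs were issued about 14 days ago.
remaining = days_until_reissue(14)
print(remaining, "days, about", round(remaining / 30, 1), "months")
```

With 14 days elapsed this gives 45 days, which is where "another 1.5 months" comes from.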

Re: P3906 Gen 1 all failing

Posted: Sat Jan 19, 2008 12:53 am
by MDCRL
7im wrote: Uncle Fungus is aware of the issue. You can follow along with the progress of FahMon in this thread: http://foldingforum.org/viewtopic.php?f=14&t=40

Thanks... I hopefully will have all those #'s for you in the morning - got tied up w/ other things 2nite....

Re: P3906 Gen 1 all failing

Posted: Sat Jan 19, 2008 6:02 am
by 7im
MDCRL wrote:
7im wrote: Uncle Fungus is aware of the issue. You can follow along with the progress of FahMon in this thread: http://foldingforum.org/viewtopic.php?f=14&t=40

Thanks... I hopefully will have all those #'s for you in the morning - got tied up w/ other things 2nite....

Don't worry about it. We know the problem, and it is getting corrected.

Re: P3906 Gen 1 all failing

Posted: Sun Jan 20, 2008 3:32 am
by MDCRL
OK, cool. I see 3907 is having the same issues too, eh? Good luck.

Re: P3906 Gen 1 all failing

Posted: Sun Jan 20, 2008 3:46 am
by sortofageek
See this update from kasson ---> viewtopic.php?f=19&t=818&p=6735#p6735

Re: P3906 Gen 1 all failing

Posted: Tue Jan 22, 2008 7:22 am
by bruce
According to this thread, error C000000D may be caused by ZoneAlarm blocking communication between the FahCore and the client.

Does it go away if you disable ZA (or if you create an appropriate exception)?

Re: P3906 Gen 1 all failing

Posted: Sat Jan 26, 2008 3:52 am
by Cprossu
Just thought I should post this. I got one that did something very odd:

Code:

[18:46:33] 
[18:46:26] Loaded queue successfully.
[18:46:26] Connecting to http://171.64.122.88:8080/
[18:46:27] Posted data.
[18:46:27] Initial: 0000; - Receiving payload (expected size: 347100)
[18:46:28] - Downloaded at ~338 kB/s
[18:46:28] - Averaged speed for that direction ~383 kB/s
[18:46:28] + Received work.
[18:46:28] + Closed connections
[18:46:33] 
[18:46:33] + Processing work unit
[18:46:33] Core required: FahCore_7b.exe
[18:46:33] Core found.
[18:46:33] Working on Unit 00 [January 24 18:46:33]
[18:46:33] + Working ...
[18:46:33] - Calling 'FahCore_7b.exe -dir work/ -suffix 00 -checkpoint 15 -verbose -lifeline 1504 -version 600'

[18:46:33] 
[18:46:33] *------------------------------*
[18:46:33] Folding@Home Double Gromacs Core B
[18:46:33] Version 1.04 (Fri Aug 10 16:46:39 PDT 2007)
[18:46:33] 
[18:46:33] Preparing to commence simulation
[18:46:33] - Files status OK
[18:46:34] - Expanded 346588 -> 1205157 (decompressed 347.7 percent)
[18:46:34] 
[18:46:34] Project: 3906 (Run 28, Clone 1, Gen 1)
[18:46:34] 
[18:46:34] Assembly optimizations on if available.
[18:46:34] Entering M.D.
[18:46:41] Working on Lig in water
[18:46:41] Completed 0 out of 500000 steps  (0)
[18:46:41] Extra SSE2 boost OK
[18:52:53] Completed 280000 out of 500000 steps  (56)
[18:55:40] Writing checkpoint files
[19:10:48] Writing checkpoint files
[19:25:56] Writing checkpoint files
[19:35:34] Writing local files
[19:35:34] Completed 285000 out of 500000 steps  (57)
[19:41:07] Writing checkpoint files
[19:56:16] Writing checkpoint files
[20:11:27] Writing checkpoint files
[20:18:17] Writing local files
[20:18:17] Completed 290000 out of 500000 steps  (58)
[20:26:36] Writing checkpoint files
[20:41:47] Writing checkpoint files
[20:56:57] Writing checkpoint files
[21:01:03] Writing local files
[21:01:03] Completed 295000 out of 500000 steps  (59)
[21:12:09] Writing checkpoint files
[21:26:14] - Autosending finished units...
[21:26:14] Trying to send all finished work units
[21:26:14] + No unsent completed units remaining.
[21:26:14] - Autosend completed
[21:27:18] Writing checkpoint files
[21:42:27] Writing checkpoint files
[21:43:44] Writing local files
[21:43:44] Completed 300000 out of 500000 steps  (60)
[21:43:44] Writing checkpoint files
[21:58:54] Writing checkpoint files
[22:14:02] Writing checkpoint files
[22:26:27] Writing local files
[22:26:27] Completed 305000 out of 500000 steps  (61)
[22:29:12] Writing checkpoint files
[22:44:21] Writing checkpoint files
[22:59:31] Writing checkpoint files
[23:09:09] Writing local files
[23:09:09] Completed 310000 out of 500000 steps  (62)
[23:14:43] Writing checkpoint files
[23:29:51] Writing checkpoint files
[23:45:01] Writing checkpoint files
[23:51:51] Writing local files
[23:51:51] Completed 315000 out of 500000 steps  (63)
[00:00:10] Writing checkpoint files
[00:15:18] Writing checkpoint files
[00:30:27] Writing checkpoint files
[00:34:30] Writing local files
[00:34:30] Completed 320000 out of 500000 steps  (64)
[00:45:35] Writing checkpoint files
[01:00:44] Writing checkpoint files
[01:15:53] Writing checkpoint files
[01:17:09] Writing local files
[01:17:09] Completed 325000 out of 500000 steps  (65)
[01:31:04] Writing checkpoint files
[01:46:13] Writing checkpoint files
[01:59:52] Writing local files
[01:59:52] Completed 330000 out of 500000 steps  (66)
[02:01:22] Writing checkpoint files
[02:16:32] Writing checkpoint files
[02:31:41] Writing checkpoint files
[02:42:33] Writing local files
[02:42:33] Completed 335000 out of 500000 steps  (67)
[02:46:48] Writing checkpoint files
[03:01:58] Writing checkpoint files
[03:17:06] Writing checkpoint files
[03:25:12] Writing local files
[03:25:12] Completed 340000 out of 500000 steps  (68)
[03:26:14] - Autosending finished units...
[03:26:14] Trying to send all finished work units
[03:26:14] + No unsent completed units remaining.
[03:26:14] - Autosend completed
[03:32:17] Writing checkpoint files
[03:47:25] Writing checkpoint files
[04:02:34] Writing checkpoint files
[04:07:56] Writing local files
[04:07:56] Completed 345000 out of 500000 steps  (69)
[04:17:45] Writing checkpoint files
[04:32:54] Writing checkpoint files
[04:48:04] Writing checkpoint files
[04:50:39] Writing local files
[04:50:39] Completed 350000 out of 500000 steps  (70)
[04:50:39] Writing checkpoint files
[05:05:49] Writing checkpoint files
[05:20:57] Writing checkpoint files
[05:33:20] Writing local files
[05:33:20] Completed 355000 out of 500000 steps  (71)
[05:36:07] Writing checkpoint files
[05:51:14] Writing checkpoint files
[06:06:21] Writing checkpoint files
[06:15:56] Writing local files
[06:15:56] Completed 360000 out of 500000 steps  (72)
[06:21:28] Writing checkpoint files
[06:36:36] Writing checkpoint files
[06:51:46] Writing checkpoint files
[06:58:35] Writing local files
[06:58:35] Completed 365000 out of 500000 steps  (73)
[07:06:55] Writing checkpoint files
[07:22:01] Writing checkpoint files
[07:37:11] Writing checkpoint files
[07:41:13] Writing local files
[07:41:13] Completed 370000 out of 500000 steps  (74)
[07:52:19] Writing checkpoint files
[08:07:24] Writing checkpoint files
[08:22:31] Writing checkpoint files
[08:23:47] Writing local files
[08:23:47] Completed 375000 out of 500000 steps  (75)
[08:37:38] Writing checkpoint files
[08:52:47] Writing checkpoint files
[09:06:26] Writing local files
[09:06:26] Completed 380000 out of 500000 steps  (76)
[09:07:55] Writing checkpoint files
[09:23:03] Writing checkpoint files
[09:26:14] - Autosending finished units...
[09:26:14] Trying to send all finished work units
[09:26:14] + No unsent completed units remaining.
[09:26:14] - Autosend completed
[09:38:14] Writing checkpoint files
[09:49:11] Writing local files
[09:49:11] Completed 385000 out of 500000 steps  (77)
[09:53:26] Writing checkpoint files
[10:08:35] Writing checkpoint files
[10:23:41] Writing checkpoint files
[10:31:47] Writing local files
[10:31:47] Completed 390000 out of 500000 steps  (78)
[10:38:49] Writing checkpoint files
[10:54:01] Writing checkpoint files
[11:09:08] Writing checkpoint files
[11:14:28] Writing local files
[11:14:28] Completed 395000 out of 500000 steps  (79)
[11:24:15] Writing checkpoint files
[11:39:25] Writing checkpoint files
[11:54:35] Writing checkpoint files
[11:57:08] Writing local files
[11:57:08] Completed 400000 out of 500000 steps  (80)
[11:57:09] Writing checkpoint files
[12:12:18] Writing checkpoint files
[12:27:27] Writing checkpoint files
[12:39:47] Writing local files
[12:39:47] Completed 405000 out of 500000 steps  (81)
[12:42:32] Writing checkpoint files
[12:57:39] Writing checkpoint files
[13:12:47] Writing checkpoint files
[13:22:24] Writing local files
[13:22:24] Completed 410000 out of 500000 steps  (82)
[13:27:56] Writing checkpoint files
[13:43:01] Writing checkpoint files
[13:58:11] Writing checkpoint files
[14:05:02] Writing local files
[14:05:02] Completed 415000 out of 500000 steps  (83)
[14:13:21] Writing checkpoint files
[14:28:32] Writing checkpoint files
[14:43:42] Writing checkpoint files
[14:47:46] Writing local files
[14:47:46] Completed 420000 out of 500000 steps  (84)
[14:58:54] Writing checkpoint files
[15:14:04] Writing checkpoint files
[15:26:14] - Autosending finished units...
[15:26:14] Trying to send all finished work units
[15:26:14] + No unsent completed units remaining.
[15:26:14] - Autosend completed
[15:29:13] Writing checkpoint files
[15:30:29] Writing local files
[15:30:29] Completed 425000 out of 500000 steps  (85)
[15:44:20] Writing checkpoint files
[15:59:31] Writing checkpoint files
[16:13:11] Writing local files
[16:13:11] Completed 430000 out of 500000 steps  (86)
[16:14:40] Writing checkpoint files
[16:29:51] Writing checkpoint files
[16:45:00] Writing checkpoint files
[16:55:53] Writing local files
[16:55:53] Completed 435000 out of 500000 steps  (87)
[17:00:09] Writing checkpoint files
[17:15:16] Writing checkpoint files
[17:30:23] Writing checkpoint files
[17:38:31] Writing local files
[17:38:31] Completed 440000 out of 500000 steps  (88)
[17:45:31] Writing checkpoint files
[18:00:40] Writing checkpoint files
[18:15:49] Writing checkpoint files
[18:21:11] Writing local files
[18:21:11] Completed 445000 out of 500000 steps  (89)
[18:30:59] Writing checkpoint files
[18:46:08] Writing checkpoint files
[19:01:20] Writing checkpoint files
[19:03:54] Writing local files
[19:03:54] Completed 450000 out of 500000 steps  (90)
[19:03:54] Writing checkpoint files
[19:19:06] Writing checkpoint files
[19:34:15] Writing checkpoint files
[19:46:40] Writing local files
[19:46:40] Completed 455000 out of 500000 steps  (91)
[19:49:28] Writing checkpoint files
[20:04:37] Writing checkpoint files
[20:19:43] Writing checkpoint files
[20:29:17] Writing local files
[20:29:17] Completed 460000 out of 500000 steps  (92)
[20:34:53] Writing checkpoint files
[20:50:05] Writing checkpoint files
[21:05:14] Writing checkpoint files
[21:12:03] Writing local files
[21:12:03] Completed 465000 out of 500000 steps  (93)
[21:20:23] Writing checkpoint files
[21:26:14] - Autosending finished units...
[21:26:14] Trying to send all finished work units
[21:26:14] + No unsent completed units remaining.
[21:26:14] - Autosend completed
[21:35:31] Writing checkpoint files
[21:50:41] Writing checkpoint files
[21:54:44] Writing local files
[21:54:44] Completed 470000 out of 500000 steps  (94)
[22:05:53] Writing checkpoint files
[22:21:01] Writing checkpoint files
[22:36:09] Writing checkpoint files
[22:37:26] Writing local files
[22:37:26] Completed 475000 out of 500000 steps  (95)
[22:51:16] Writing checkpoint files
[23:06:27] Writing checkpoint files
[23:20:08] Writing local files
[23:20:08] Completed 480000 out of 500000 steps  (96)
[23:21:37] Writing checkpoint files
[23:36:48] Writing checkpoint files
[23:51:58] Writing checkpoint files
[00:02:51] Writing local files
[00:02:51] Completed 485000 out of 500000 steps  (97)
[00:07:07] Writing checkpoint files
[00:22:13] Writing checkpoint files
[00:37:22] Writing checkpoint files
[00:45:30] Writing local files
[00:45:30] Completed 490000 out of 500000 steps  (98)
[00:52:32] Writing checkpoint files
[01:07:42] Writing checkpoint files
[01:22:42] Writing checkpoint files
[01:28:16] Writing local files
[01:28:16] Completed 495000 out of 500000 steps  (99)
[01:37:51] Writing checkpoint files
[01:53:00] Writing checkpoint files
[02:08:09] Writing checkpoint files
[02:10:55] Writing local files
[02:10:55] Completed 500000 out of 500000 steps  (100)
[02:10:55] Writing checkpoint files
[02:11:55] 
[02:11:55] Finished Work Unit:
[02:11:55] Leaving Run
[02:12:00] - Writing 1986304 bytes of core data to disk...
[02:12:01] Done: 1985792 -> 644048 (compressed to 32.4 percent)
[02:12:01]   ... Done.
[02:12:01] - Shutting down core
[02:12:01] 
[02:12:01] Folding@home Core Shutdown: FINISHED_UNIT
[03:26:14] - Autosending finished units...
[03:26:14] Trying to send all finished work units
[03:26:14] + No unsent completed units remaining.
[03:26:14] - Autosend completed
And it's been sitting there...
Should I intervene in any way?
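For anyone wanting to check a log like the one above without reading every line, here is a rough Python sketch. It is not part of the client or of FahMon; the function names are made up, and it only assumes the `[HH:MM:SS] message` line format seen in the paste. It reports the time between "Completed ... steps" lines and how long the log keeps running after `FINISHED_UNIT`. Timestamps carry no date, so a backwards jump of more than 12 hours is treated as midnight.

```python
import re

STAMP_RE = re.compile(r"^\[(\d{2}):(\d{2}):(\d{2})\]\s?(.*)$")
STEPS_RE = re.compile(r"Completed (\d+) out of (\d+) steps\s+\((\d+)\)")
HALF_DAY = 12 * 3600


def timeline(lines):
    """Yield (seconds, message), adding 24 h each time the clock wraps past midnight."""
    offset, prev = 0, None
    for line in lines:
        m = STAMP_RE.match(line.strip())
        if not m:
            continue  # not a timestamped log line
        h, mi, s = (int(m.group(i)) for i in (1, 2, 3))
        clock = h * 3600 + mi * 60 + s
        # Only a large backwards jump counts as midnight; small ones
        # (like 18:46:33 followed by 18:46:26) are left alone.
        if prev is not None and prev - clock > HALF_DAY:
            offset += 86400
        prev = clock
        yield clock + offset, m.group(4)


def frame_times(lines):
    """Return [(percent, seconds since the previous progress line), ...]."""
    out, last = [], None
    for t, msg in timeline(lines):
        m = STEPS_RE.search(msg)
        if m:
            if last is not None:
                out.append((int(m.group(3)), t - last))
            last = t
    return out


def idle_after_finish(lines):
    """Seconds between the FINISHED_UNIT line and the last log line, or None."""
    finished = last = None
    for t, msg in timeline(lines):
        if "FINISHED_UNIT" in msg:
            finished = t
        last = t
    return None if finished is None else last - finished


sample = [
    "[23:51:51] Completed 315000 out of 500000 steps  (63)",
    "[00:34:30] Completed 320000 out of 500000 steps  (64)",
    "[02:12:01] Folding@home Core Shutdown: FINISHED_UNIT",
    "[03:26:14] - Autosend completed",
]
print(frame_times(sample))
print(idle_after_finish(sample))
```

Run over the full paste, `frame_times` would also show the jump from 0 to 280000 steps in about six minutes at the start of the log, and `idle_after_finish` would give the time the client sat idle after the core shut down.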

Re: P3906 Gen 1 all failing

Posted: Sat Jan 26, 2008 6:24 am
by bruce
I suspect that if you kill the client and restart it, it'll say that the WU isn't finished and you may lose it. Make a backup before proceeding. Then, if necessary, run qfix (in the text window so you can see the messages) and you should be able to recover the WU.

Re: P3906 Gen 1 all failing

Posted: Sat Jan 26, 2008 12:39 pm
by Cprossu
Well, for whatever reason, it finished and sent the results later:

Code:

[09:26:14] - Autosending finished units...
[09:26:14] Trying to send all finished work units
[09:26:14] + No unsent completed units remaining.
[09:26:14] - Autosend completed
[10:49:55] CoreStatus = 64 (100)
[10:49:55] Unit 0 finished with 98 percent of time to deadline remaining.
[10:49:55] Updated performance fraction: 0.951619
[10:49:55] Sending work to server


[10:49:55] + Attempting to send results
[10:49:55] - Reading file work/wuresults_00.dat from core
[10:49:55]   (Read 9605368 bytes from disk)
[10:49:55] Connecting to http://171.64.122.88:8080/
[10:52:13] Posted data.
[10:52:13] Initial: 0000; - Uploaded at ~67 kB/s
[10:52:14] - Averaged speed for that direction ~65 kB/s
[10:52:14] + Results successfully sent
[10:52:14] Thank you for your contribution to Folding@Home.
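As a rough sanity check on the "98 percent of time to deadline remaining" line, the elapsed time can be compared against the two deadline figures mentioned earlier in the thread. This is a sketch under several assumptions: the start time is the "Working on Unit 00 [January 24 18:46:33]" line, the finish is 10:49:55 two days later (inferred from the two midnight rollovers in the log), the year is taken from the post dates, and the client's actual formula is not known here.

```python
from datetime import datetime


def percent_deadline_remaining(start, finish, deadline_days):
    """Share of the deadline window still unused at finish, in percent."""
    elapsed = (finish - start).total_seconds()
    return 100.0 * (1.0 - elapsed / (deadline_days * 86400))


# Dates inferred from the log and the post dates (assumption, see above).
start = datetime(2008, 1, 24, 18, 46, 33)
finish = datetime(2008, 1, 26, 10, 49, 55)

for days in (59, 86):  # the "59/86" figures quoted earlier in the thread
    print(days, "day deadline:", round(percent_deadline_remaining(start, finish, days), 1))
```

Under these assumptions, only the 86 day figure lands on 98 percent, which would suggest the client measures against the longer deadline, but that is a guess rather than something the log confirms.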