Page 1 of 2

128.143.199.96:8080

Posted: Wed Dec 21, 2011 7:38 am
by Wreck3r
This morning I found out that the client wasn't receiving any WU's and also it wasn't able to send the completed WU.

Attempting to get work:

Code: Select all

[05:27:33] + Attempting to get work packet
[05:27:33] Passkey found
[05:27:33] - Will indicate memory of 4890 MB
[05:27:33] - Connecting to assignment server
[05:27:33] Connecting to http://assign.stanford.edu:8080/
[05:27:34] Posted data.
[05:27:34] Initial: 8F80; - Successful: assigned to (128.143.199.96).
[05:27:34] + News From Folding@Home: Welcome to Folding@Home
[05:27:35] Loaded queue successfully.
[05:27:35] Sent data
[05:27:35] Connecting to http://128.143.199.96:8080/
[05:27:46] Posted data.
[05:27:50] Initial: 0000; - Receiving payload (expected size: 1771069)
[05:33:07] + Could not get Work unit data from Work Server
[05:33:07] - Attempt #1  to get work failed, and no other work to do.
Waiting before retry.
[05:33:19] + Attempting to get work packet
[05:33:19] Passkey found
[05:33:19] - Will indicate memory of 4890 MB
[05:33:19] - Connecting to assignment server
[05:33:19] Connecting to http://assign.stanford.edu:8080/
[05:33:20] Posted data.
[05:33:20] Initial: 8F80; - Successful: assigned to (128.143.199.96).
[05:33:20] + News From Folding@Home: Welcome to Folding@Home
[05:33:20] Loaded queue successfully.
[05:33:20] Sent data
[05:33:20] Connecting to http://128.143.199.96:8080/
[05:33:30] Posted data.
[05:33:30] Initial: 0000; - Receiving payload (expected size: 1767463)
[05:38:06] + Could not get Work unit data from Work Server
[05:38:06] - Attempt #2  to get work failed, and no other work to do.
Waiting before retry.
[05:38:24] + Attempting to get work packet
[05:38:24] Passkey found
[05:38:24] - Will indicate memory of 4890 MB
[05:38:24] - Connecting to assignment server
[05:38:24] Connecting to http://assign.stanford.edu:8080/
[05:38:25] Posted data.
[05:38:25] Initial: 8F80; - Successful: assigned to (128.143.199.96).
[05:38:25] + News From Folding@Home: Welcome to Folding@Home
[05:38:25] Loaded queue successfully.
[05:38:25] Sent data
[05:38:25] Connecting to http://128.143.199.96:8080/
[05:38:26] Posted data.
[05:38:26] Initial: 0000; - Receiving payload (expected size: 1772031)
[05:44:24] + Could not get Work unit data from Work Server
[05:44:24] - Attempt #3  to get work failed, and no other work to do.
Waiting before retry.
[05:44:55] + Attempting to get work packet
[05:44:55] Passkey found
[05:44:55] - Will indicate memory of 4890 MB
[05:44:55] - Connecting to assignment server
[05:44:55] Connecting to http://assign.stanford.edu:8080/
[05:44:56] Posted data.
[05:44:56] Initial: 8F80; - Successful: assigned to (128.143.199.96).
[05:44:56] + News From Folding@Home: Welcome to Folding@Home
[05:44:56] Loaded queue successfully.
[05:44:56] Sent data
[05:44:56] Connecting to http://128.143.199.96:8080/
[05:45:01] Posted data.
[05:45:01] Initial: 0000; - Receiving payload (expected size: 1764919)
[06:09:53] ***** Got an Activate signal (2)
[06:09:53] Killing all core threads

Folding@Home Client Shutdown.
Attempting to send:

Code: Select all

[06:10:57] + Attempting to send results [December 21 06:10:57 UTC]
[06:10:57] - Reading file work/wuresults_02.dat from core
[06:10:57]   (Read 3520998 bytes from disk)
[06:10:57] Connecting to http://128.143.199.96:8080/
[06:12:43] - Couldn't send HTTP request to server
[06:12:43] + Could not connect to Work Server (results)
[06:12:43]     (128.143.199.96:8080)
[06:12:43] + Retrying using alternative port
[06:12:43] Connecting to http://128.143.199.96:80/
[06:13:45] - Couldn't send HTTP request to server
[06:13:45] + Could not connect to Work Server (results)
[06:13:45]     (128.143.199.96:80)
[06:13:45] - Error: Could not transmit unit 02 (completed December 21) to work server.
[06:13:45] - 3 failed uploads of this unit.


[06:13:45] + Attempting to send results [December 21 06:13:45 UTC]
[06:13:45] - Reading file work/wuresults_02.dat from core
[06:13:45]   (Read 3520998 bytes from disk)
[06:13:45] Connecting to http://130.237.165.141:8080/
[06:13:49] Posted data.
[06:13:49] Initial: 0000; - Uploaded at ~859 kB/s
[06:13:49] - Averaged speed for that direction ~388 kB/s
[06:13:49] - Server does not have record of this unit. Will try again later.
[06:13:49]   Could not transmit unit 02 to Collection server; keeping in queue.
[06:13:49] - Failed to send unit 02 to server
[06:13:49] ***** Got a SIGTERM signal (15)
[06:13:49] Killing all core threads
Pinging 128.143.199.96 showed about 30% failed pings.

Eventually, after 6:30, things went to normal (upload and new WU), but after checking the stats after 7:00AM it seems that 2 units got sent, but from what I can tell they weren't credited.

Re: 128.143.199.96:8080

Posted: Wed Dec 21, 2011 9:20 am
by bruce
From time to time we see reports of poor connections across the Atlantic. If pinging shows errors, running tracert (traceroute) gives a lot more information about where the problem lies.

If you need to know about certain WUs that might not have been credited, you have to supply the Project/Run/Clone/Gen numbers. You'll find a sequence something like the one shown below either in fahlog.txt or fahlog-prev.txt. (I generally just search for "thank you")

Code: Select all

[hh:mm:ss] Project: xxxx (Run x, Clone xx, Gen x)
[hh:mm:ss] + Attempting to send results [July xx hh:mm:ss UTC]
[hh:mm:ss] + Results successfully sent
[hh:mm:ss] Thank you for your contribution to Folding@Home.
That server sometimes does take longer to credit the points and I'm guessing it's somehow related to the same problems with the internet's transatlantic hop.

Re: 128.143.199.96:8080

Posted: Wed Dec 21, 2011 10:05 am
by Wreck3r
Hi Bruce,

The projects that were last sent are:
Project: 7200 (Run 31, Clone 28, Gen 24)
Project: 6972 (Run 0, Clone 42, Gen 310)
Project: 6985 (Run 0, Clone 79, Gen 174)

Thank you for the assistance.

Re: 128.143.199.96:8080

Posted: Wed Dec 21, 2011 7:05 pm
by bruce
Hi Wreck3r (team 111065),
Your WU (P7200 R31 C28 G24) was added to the stats database on 2011-12-21 00:11:00 for 1662.06 points of credit.

Hi Wreck3r (team 111065),
Your WU (P6972 R0 C42 G310) was added to the stats database on 2011-12-20 18:08:22 for 5021.54 points of credit.

Hi Wreck3r (team 111065),
Your WU (P6985 R0 C79 G174) was added to the stats database on 2011-12-21 00:10:37 for 4183.06 points of credit.

Re: 128.143.199.96:8080

Posted: Wed Dec 21, 2011 11:37 pm
by stanzlad
I also returned two completed units to this server this morning and seem to have only been credited for one.

I fold under the name call2 for team734.

First:

Code: Select all

[09:57:19] Completed 500000 out of 500000 steps  (100%)



 Average load imbalance: 3.2 %
 Part of the total run time spent waiting due to load imbalance: 1.3 %

	Parallel run - timing based on wallclock.

               NODE (s)   Real (s)      (%)
       Time:  30177.487  30177.487    100.0
                       8h22:57
               (Mnbf/s)   (GFlops)   (ns/day)  (hour/ns)
Performance:    200.020     13.199      2.863      8.383

Thanx for Using GROMACS - Have a Nice Day

[09:57:19] DynamicWrapper: Finished Work Unit: sleep=10000
[09:57:29] 
[09:57:29] Finished Work Unit:
[09:57:29] - Reading up to 3701664 from "work/wudata_06.trr": Read 3701664
[09:57:29] trr file hash check passed.
[09:57:29] edr file hash check passed.
[09:57:29] logfile size: 62386
[09:57:29] Leaving Run
[09:57:33] - Writing 3799378 bytes of core data to disk...
[09:57:34] Done: 3798866 -> 3524133 (compressed to 92.7 percent)
[09:57:34]   ... Done.
[09:59:33] - Shutting down core
[09:59:33] 
[09:59:33] Folding@home Core Shutdown: FINISHED_UNIT
[09:59:50] CoreStatus = 64 (100)
[09:59:50] Unit 6 finished with 91 percent of time to deadline remaining.
[09:59:50] Updated performance fraction: 0.912308
[09:59:50] Sending work to server
[09:59:50] Project: 6973 (Run 0, Clone 62, Gen 370)


[09:59:50] + Attempting to send results [December 21 09:59:50 UTC]
[09:59:50] - Reading file work/wuresults_06.dat from core
[09:59:50]   (Read 3524645 bytes from disk)
[09:59:50] Connecting to http://128.143.199.96:8080/
[10:00:29] Posted data.
[10:00:29] Initial: 0000; - Uploaded at ~88 kB/s
[10:00:29] - Averaged speed for that direction ~87 kB/s
[10:00:29] + Results successfully sent
[10:00:29] Thank you for your contribution to Folding@Home.

Second

Code: Select all

[10:38:15] Completed 500000 out of 500000 steps  (100%)

Writing final coordinates.

 Average load imbalance: 7.5 %
 Part of the total run time spent waiting due to load imbalance: 3.1 %


	Parallel run - timing based on wallclock.

               NODE (s)   Real (s)      (%)
       Time:  30230.381  30230.381    100.0
                       8h23:50
               (Mnbf/s)   (GFlops)   (ns/day)  (hour/ns)
Performance:    203.482     13.566      2.858      8.397

Thanx for Using GROMACS - Have a Nice Day

[10:38:15] DynamicWrapper: Finished Work Unit: sleep=10000
[10:38:25] 
[10:38:25] Finished Work Unit:
[10:38:25] - Reading up to 3711648 from "work/wudata_09.trr": Read 3711648
[10:38:25] trr file hash check passed.
[10:38:25] edr file hash check passed.
[10:38:25] logfile size: 62986
[10:38:25] Leaving Run
[10:38:26] - Writing 3810594 bytes of core data to disk...
[10:38:27] Done: 3810082 -> 3528304 (compressed to 92.6 percent)
[10:38:27]   ... Done.
[10:38:27] - Shutting down core
[10:38:27] 
[10:38:27] Folding@home Core Shutdown: FINISHED_UNIT
[10:38:28] CoreStatus = 64 (100)
[10:38:28] Unit 9 finished with 91 percent of time to deadline remaining.
[10:38:28] Updated performance fraction: 0.911720
[10:38:28] Sending work to server
[10:38:28] Project: 7131 (Run 0, Clone 22, Gen 272)


[10:38:28] + Attempting to send results [December 21 10:38:28 UTC]
[10:38:28] - Reading file work/wuresults_09.dat from core
[10:38:28]   (Read 3528816 bytes from disk)
[10:38:28] Connecting to http://128.143.199.96:8080/
[10:39:13] Posted data.
[10:39:13] Initial: 0000; - Uploaded at ~76 kB/s
[10:39:13] - Averaged speed for that direction ~87 kB/s
[10:39:13] + Results successfully sent
[10:39:13] Thank you for your contribution to Folding@Home.


It seems strange that this occurred about the same time as the other poster Wreck3r.

Re: 128.143.199.96:8080

Posted: Thu Dec 22, 2011 12:51 am
by sortofageek
Hi stanzlad, welcome to the site. :) What is your folding ID? I can mark your post for followup, but there wouldn't be much point if we don't know what name to watch for.

There is currently a result for Project: 6973 (Run 0, Clone 62, Gen 370), but not yet for Project: 7131 (Run 0, Clone 22, Gen 272).

Re: 128.143.199.96:8080

Posted: Thu Dec 22, 2011 7:52 am
by stanzlad
I already stated that I fold for team 734 under the name call2. My user number is 22638 - I started folding in 2001. I haven't posted on here for years and I had to re-register as I couldn't remember all my details. :oops: It seems that if you've forgotten the Email address that you used to register your name - in my case call2 - then you cannot login. I can't remember what Email address I used 10 months ago, let alone 10 years. :roll:

Anyway, I see that yesterdays stats only show one credit of 3,382 when two credits of around 6,500 should be there. I just wonder how many times this situation occurs. If I hadn't noticed the post by Wreck3r I might have just ignored the stats and accepted the loss.

Re: 128.143.199.96:8080

Posted: Thu Dec 22, 2011 12:55 pm
by PantherX
Project: 6973 (Run 0, Clone 62, Gen 370):
Hi call2 (team 734),
Your WU (P6973 R0 C62 G370) was added to the stats database on 2011-12-21 02:09:12 for 3382.1 points of credit.

Re: 128.143.199.96:8080

Posted: Thu Dec 22, 2011 4:12 pm
by stanzlad
Yes PantherX, I can see that the first WU was credited. That's not in doubt. What about the second WU, sent at 10:38 Project: 7131 Run 0, Clone 22, Gen 272?


2-22-2012 Mod Note:
The WU (P7131,R0,C22,G272) has been reported as a bad WU. Note that the list of reported WUs are stopped daily at 8am pacific time. ~sorto'

Re: 128.143.199.96:8080

Posted: Thu Dec 22, 2011 6:50 pm
by PantherX
stanzlad wrote:... What about the second WU, sent at 10:38 Project: 7131 Run 0, Clone 22, Gen 272?
Unfortunatly, there's nothing in the WU Database yet :( It will be followed-up later in the future.

Re: 128.143.199.96:8080

Posted: Thu Dec 29, 2011 2:06 pm
by stanzlad
Still not credited for the WU, (call2 team 734). How long does an update to the WU database take?

Re: 128.143.199.96:8080

Posted: Sat Dec 31, 2011 9:51 am
by stanzlad
Come on Stanford. It's the last day of 2011 and it would be nice to get these points added to my overall points score for the year.

call2
team 734
21Dec2011
7131 R0 C22 G272

Still not credited.
[10:38:28] Sending work to server
[10:38:28] Project: 7131 (Run 0, Clone 22, Gen 272)


[10:38:28] + Attempting to send results [December 21 10:38:28 UTC]
[10:38:28] - Reading file work/wuresults_09.dat from core
[10:38:28] (Read 3528816 bytes from disk)
[10:38:28] Connecting to http://128.143.199.96:8080/
[10:39:13] Posted data.
[10:39:13] Initial: 0000; - Uploaded at ~76 kB/s
[10:39:13] - Averaged speed for that direction ~87 kB/s
[10:39:13] + Results successfully sent
[10:39:13] Thank you for your contribution to Folding@Home.

Re: 128.143.199.96:8080

Posted: Tue Jan 03, 2012 11:54 am
by Kunkel
I have about 10 WU sitting in my queue and I can't seem to connect to any of the available work servers on my SMP Ubuntu box for about a week.

Here is a trace route from my mac. My windows box seems to be able to connect just fine with SMP and GPU, I live in Seoul, South Korea.

Code: Select all

traceroute 128.143.199.96
traceroute to 128.143.199.96 (128.143.199.96), 64 hops max, 52 byte packets
 1  192.168.1.1 (192.168.1.1)  1.182 ms  0.695 ms  0.505 ms
 2  222.110.178.126 (222.110.178.126)  1.779 ms  1.127 ms  0.902 ms
 3  * * *
 4  222.110.178.124 (222.110.178.124)  3.398 ms  1.201 ms  0.968 ms
 5  112.188.53.9 (112.188.53.9)  1.133 ms  1.243 ms  1.420 ms
 6  220.73.149.37 (220.73.149.37)  14.566 ms  2.112 ms  1.830 ms
 7  112.174.86.18 (112.174.86.18)  1.602 ms  1.885 ms  1.542 ms
 8  112.174.84.206 (112.174.84.206)  1.805 ms  1.757 ms  1.580 ms
 9  112.174.88.242 (112.174.88.242)  171.728 ms  171.626 ms  171.775 ms
10  paix-px1--kt-ge.cenic.net (198.32.251.49)  219.183 ms  220.381 ms  211.905 ms
11  xe-1-0-0.0.sttl0.tr-cps.internet2.edu (64.57.20.223)  353.051 ms  232.639 ms  506.555 ms
12  137.164.129.3 (137.164.129.3)  259.159 ms  307.017 ms  307.125 ms
13  137.164.129.10 (137.164.129.10)  306.996 ms  307.057 ms  274.131 ms
14  137.164.131.114 (137.164.131.114)  339.626 ms  614.294 ms  614.294 ms
15  192.35.48.25 (192.35.48.25)  614.191 ms  614.405 ms  614.806 ms
16  carruthers-6509a-x.misc.virginia.edu (128.143.222.54)  307.054 ms  306.565 ms  307.517 ms
17  gilmer-6509a-x.misc.virginia.edu (128.143.222.45)  306.665 ms  307.350 ms  307.201 ms
18  fontaine-6506-x.misc.virginia.edu (128.143.222.77)  307.030 ms  282.533 ms  441.867 ms
19  * * *
20  * * *
21  * * *
22  * * *
23  * * *
24  * * *
25  * * *
26  * * *
27  * * *
28  * * *
29  * * *
30  * * *
31  * * *
32  * * *
33  * * *
34  * * *
35  * * *
36  * * *
37  * * *
38  * * *
39  * * *
40  * * *
41  * * *
42  * * *
43  * * *
44  * * *
45  * * *
46  * * *
47  * * *
48  * * *
49  * * *
50  * * *
51  * * *
52  * * *
53  * * *
54  * * *
55  * * *
56  * * *
57  * * *
58  * * *
59  * * *
60  * * *
61  * * *
62  * * *
63  * * *
64  * * *

Re: 128.143.199.96:8080

Posted: Tue Jan 03, 2012 2:11 pm
by kasson
The tail end of the traceroute isn't going to be informative because of our security setup. But you're reaching the University of Virginia just fine with the traceroute (although somewhat long times even for the distance).

Re: 128.143.199.96:8080

Posted: Wed Jan 04, 2012 11:51 am
by Kunkel
Thanks, I figured that was the case. So it appears to be an Ubuntu 11.10 issue? The issue seemed to start around Christmas day, unfortunately the previous logs were overwritten. It looks like I installed some updates on the 24th of December, but with the ExtremeFolding stats site down I didn't catch it until a few days ago (I don't use FHM as I only have 2 systems folding). It's a purely folding/linux hobby box and it just sits in the corner, but I never remember it having network issues a few weeks ago when I put it together and downloaded some updates (I should have just left it alone). Now anything I do it seems to drop 40% of packets to include internal network ping tests and plugging straight into my modem, occasionally unable to connect for Ubuntu updates, and timeout while connecting to websites. I tried -send all a few times and have it running the -oneunit flag tonight, hopefully it will be able to off some work units. Looks like I'm gonna try a clean wipe and stay away from any updates, seems many people have had issues with 11.10; shame as it's my first real try with Ubuntu.