Work server: 128.104.69.82, Collection server: 128.174.73.74
Posted: Thu Oct 03, 2024 7:28 pm
by PaulTV
Hiya,
It looks like job results for these servers aren't picked up:
Work server: 128.104.69.82
Collection server: 128.174.73.74
In the logs I see:
14:51:33:WU01:FS01:Sending unit results: id:01 state:SEND error:NO_ERROR project:16770 run:35 clone:0 gen:55 core:0x23 unit:0x37000000000000002300000082410000
14:51:33:WU01:FS01:Uploading 24.21MiB to 128.104.69.82
14:51:33:WU00:FS01:Starting
14:51:33:WU01:FS01:Connecting to 128.104.69.82:8080
...
14:51:45:WU01:FS01:Upload 92.68%
14:51:46:WU01:FS01:Upload complete
14:51:46:WU01:FS01:Server responded WORK_ACK (400)
14:51:46:WU01:FS01:Final credit estimate, 849631.00 points
14:51:46:WU01:FS01:Cleaning up
But when checking the results they can't be found:
https://apps.foldingathome.org/wu#proje ... e=0&gen=55
Another job for the same project has the same problem, but I can find results for other jobs I handed in later than that.
Cheers,
Paul
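For anyone who wants to check their own work units, the project, run, clone and gen values needed for a lookup appear on the "Sending unit results" line of the client log. Below is a minimal sketch of pulling them out with a regular expression. The function name `extract_prcg` and the pattern are illustrative assumptions based only on the log line quoted above, not part of the client or any official tool.

```python
import re

# Pattern for the project/run/clone/gen fields, written against the
# "Sending unit results" line quoted in this thread (an assumption about
# the format, not an official specification).
PRCG_PATTERN = re.compile(
    r"project:(?P<project>\d+)\s+run:(?P<run>\d+)\s+"
    r"clone:(?P<clone>\d+)\s+gen:(?P<gen>\d+)"
)


def extract_prcg(line):
    """Return (project, run, clone, gen) as ints, or None if the fields are absent."""
    match = PRCG_PATTERN.search(line)
    if match is None:
        return None
    return tuple(int(match.group(name)) for name in ("project", "run", "clone", "gen"))


line = (
    "14:51:33:WU01:FS01:Sending unit results: id:01 state:SEND error:NO_ERROR "
    "project:16770 run:35 clone:0 gen:55 core:0x23 "
    "unit:0x37000000000000002300000082410000"
)
print(extract_prcg(line))  # (16770, 35, 0, 55)
```

Lines without those fields, such as the "Cleaning up" line, return None, so the function can be run over a whole log file to list every returned work unit.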
Re: Work server: 128.104.69.82, Collection server: 128.174.73.74
Posted: Fri Oct 04, 2024 3:16 am
by Andre_Ti
Similarly for 16780 and 16781
Re: Work server: 128.104.69.82, Collection server: 128.174.73.74
Posted: Fri Oct 04, 2024 8:51 am
by Andre_Ti
The server still doesn't take points into account.
Re: Work server: 128.104.69.82, Collection server: 128.174.73.74
Posted: Fri Oct 04, 2024 9:31 am
by PaulTV
The job results are most likely logged, just not collected by the central server. It'll be a matter of restoring that connection, and then the points will be assigned. It has happened before. Someone just has to notify the people involved...
As I understand it, there is monitoring, but different parties are involved (F@h is a consortium), and sometimes we are part of the monitoring system.
Re: Work server: 128.104.69.82, Collection server: 128.174.73.74
Posted: Fri Oct 04, 2024 3:39 pm
by Joe_H
As PaulTV has written, at times people folding WUs and reporting issues like this are part of the monitoring. I have looked up the information about the researchers involved and am contacting them and the manager for the stats database to get the connection for entering the stats logs fixed.
Re: Work server: 128.104.69.82, Collection server: 128.174.73.74
Posted: Sat Oct 05, 2024 7:03 am
by Andre_Ti
The server was rebooted 7 hours ago, but the problem remains unresolved.
Re: Work server: 128.104.69.82, Collection server: 128.174.73.74
Posted: Sat Oct 05, 2024 2:43 pm
by laoyc
Thank you all for reporting this issue. We are working on resolving it.
Re: Work server: 128.104.69.82, Collection server: 128.174.73.74
Posted: Mon Oct 07, 2024 4:23 am
by BobWilliams757
Thanks for the reply.
I've found a 16770 (37,0,178) that doesn't show up as well.
I can wait on or lose a few points... I mean points are points. But I'm wondering if the next work unit assigns in a case such as this?
Re: Work server: 128.104.69.82, Collection server: 128.174.73.74
Posted: Mon Oct 07, 2024 4:44 am
by Joe_H
BobWilliams757 wrote: ↑Mon Oct 07, 2024 4:23 am
... But I'm wondering if the next work unit assigns in a case such as this?
The next Gen WU should be created and assigned; that is a function of the WS and separate from logging the points into the stats database. When a WU is returned, it is validated, its points are calculated and logged to a file, and the next Gen is generated from its results. Periodically the points logs are uploaded over a network connection to the stats database and entered into it.
There might be a delay in generating the next Gen for a WU returned to a Collection Server instead of its WS, but so far no examples of that have been seen in connection with the missing points from the projects assigned from WS 128.104.69.82. It may be a certificate that needs updating or some other setting that was changed.
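The separation described above can be illustrated with a toy model. This is only a sketch of the idea, not the actual work server code: the class `WorkServerSketch`, its fields, and its methods are all hypothetical names, and validation and points calculation are skipped. It shows why a broken stats connection delays credit without blocking the next Gen or losing points.

```python
from dataclasses import dataclass, field


@dataclass
class WorkServerSketch:
    """Hypothetical stand-in for a work server; not the real F@h software."""

    points_log: list = field(default_factory=list)  # stands in for the local points log file
    stats_db: list = field(default_factory=list)    # stands in for the central stats database
    stats_link_up: bool = True                      # whether the stats upload connection works

    def return_wu(self, project, run, clone, gen, points):
        """Log the points locally and return the identifier of the next Gen."""
        # Validation and points calculation are omitted in this sketch.
        self.points_log.append((project, run, clone, gen, points))
        # The next Gen is generated regardless of the stats connection.
        return (project, run, clone, gen + 1)

    def upload_stats(self):
        """Periodic upload: move logged points to the stats database if reachable."""
        if not self.stats_link_up:
            return 0  # nothing is lost; the entries stay in the local log
        uploaded = len(self.points_log)
        self.stats_db.extend(self.points_log)
        self.points_log.clear()
        return uploaded


ws = WorkServerSketch(stats_link_up=False)
next_wu = ws.return_wu(16770, 35, 0, 55, 849631.0)
pending = ws.upload_stats()   # connection down: credit is delayed
ws.stats_link_up = True
credited = ws.upload_stats()  # connection restored: credit appears
```

In this model the next Gen is returned immediately even while the upload fails, and the logged points are entered once the connection is back.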
Re: Work server: 128.104.69.82, Collection server: 128.174.73.74
Posted: Mon Oct 07, 2024 6:54 am
by Zac67
I have begun tracking my WUs and have found numerous ones not being credited, not even after days. Involved Projects are 16770, 16771, 16722 - all from the Huang lab.
Is there a way to reject those WUs to start with? Otherwise I'm forced to accept them but will dump them right away afterwards.
Re: Work server: 128.104.69.82, Collection server: 128.174.73.74
Posted: Mon Oct 07, 2024 7:09 am
by Joe_H
They will still be credited; it is just a delay until the stats logs get entered into the database. There is no valid reason to dump them.
Re: Work server: 128.104.69.82, Collection server: 128.174.73.74
Posted: Mon Oct 07, 2024 7:48 am
by Zac67
Thx @Joe_H - I'm holding my breath then...
Re: Work server: 128.104.69.82, Collection server: 128.174.73.74
Posted: Mon Oct 07, 2024 10:38 pm
by Yue
I believe the connection problem has been resolved because I can find the credit record on the stats system now. Could you check the credits and tell me if any of the WUs still haven't received credit? Thanks all for your patience and help!
Re: Work server: 128.104.69.82, Collection server: 128.174.73.74
Posted: Tue Oct 08, 2024 12:11 am
by PaulTV
All credits are in now
Many thanks, guys!
Re: Work server: 128.104.69.82, Collection server: 128.174.73.74
Posted: Tue Oct 08, 2024 4:17 am
by DK-tastrophe
Thxs