Page 1 of 1

Project: 8105 (Run 0, Clone 12, Gen 84)

Posted: Tue Jul 23, 2013 4:24 pm
by Nathan_P
Not sure what has happened here as both machines pulled the same WU but hours apart...

Machine 1

Code: Select all

17:04:24] Initial: 0000; - Receiving payload (expected size: 30301287)
[17:06:45] - Downloaded at ~209 kB/s
[17:06:45] - Averaged speed for that direction ~211 kB/s
[17:06:45] + Received work.
[17:06:45] + Closed connections
[17:06:45] 
[17:06:45] + Processing work unit
[17:06:45] Core required: FahCore_a5.exe
[17:06:45] Core found.
[17:06:45] Working on queue slot 09 [July 22 17:06:45 UTC]
[17:06:45] + Working ...
[17:06:45] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 09 -np 24 -checkpoint 15 -verbose -lifeline 1901 -version 634'

[17:06:45] 
[17:06:45] *------------------------------*
[17:06:45] Folding@Home Gromacs SMP Core
[17:06:45] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[17:06:45] 
[17:06:45] Preparing to commence simulation
[17:06:45] - Looking at optimizations...
[17:06:45] - Created dyn
[17:06:45] - Files status OK
[17:06:48] - Expanded 30300775 -> 33130012 (decompressed 109.3 percent)
[17:06:48] Called DecompressByteArray: compressed_data_size=30300775 data_size=33130012, decompressed_data_size=33130012 diff=0
[17:06:48] - Digital signature verified
[17:06:48] 
[17:06:48] Project: 8105 (Run 0, Clone 12, Gen 84)
[17:06:48] 
[17:06:48] Assembly optimizations on if available.
[17:06:48] Entering M.D.
[17:06:55] Mapping NT from 24 to 24 
[17:06:59] Completed 0 out of 250000 steps  (0%)
[17:28:34] Completed 2500 out of 250000 steps  (1%)
[17:49:38] Completed 5000 out of 250000 steps  (2%)
[18:10:44] Completed 7500 out of 250000 steps  (3%)
[18:31:48] Completed 10000 out of 250000 steps  (4%)
[18:52:54] Completed 12500 out of 250000 steps  (5%)
[19:13:57] Completed 15000 out of 250000 steps  (6%)
[19:35:02] Completed 17500 out of 250000 steps  (7%)
[19:56:07] Completed 20000 out of 250000 steps  (8%)
[20:17:11] Completed 22500 out of 250000 steps  (9%)
[20:38:15] Completed 25000 out of 250000 steps  (10%)
[20:59:17] Completed 27500 out of 250000 steps  (11%)
[21:20:20] Completed 30000 out of 250000 steps  (12%)
[21:41:25] Completed 32500 out of 250000 steps  (13%)
[22:02:29] Completed 35000 out of 250000 steps  (14%)
[22:23:34] Completed 37500 out of 250000 steps  (15%)
[22:44:37] Completed 40000 out of 250000 steps  (16%)
[23:04:14] - Autosending finished units... [July 22 23:04:14 UTC]
[23:04:14] Trying to send all finished work units
[23:04:14] + No unsent completed units remaining.
[23:04:14] - Autosend completed
[23:05:41] Completed 42500 out of 250000 steps  (17%)
[23:26:47] Completed 45000 out of 250000 steps  (18%)
[23:47:50] Completed 47500 out of 250000 steps  (19%)
[00:08:57] Completed 50000 out of 250000 steps  (20%)
[00:30:02] Completed 52500 out of 250000 steps  (21%)
[00:51:06] Completed 55000 out of 250000 steps  (22%)
[01:12:12] Completed 57500 out of 250000 steps  (23%)
[01:33:16] Completed 60000 out of 250000 steps  (24%)
[01:54:20] Completed 62500 out of 250000 steps  (25%)
[02:15:25] Completed 65000 out of 250000 steps  (26%)
[02:36:29] Completed 67500 out of 250000 steps  (27%)
[02:57:35] Completed 70000 out of 250000 steps  (28%)
[03:18:39] Completed 72500 out of 250000 steps  (29%)
[03:39:45] Completed 75000 out of 250000 steps  (30%)
[04:00:50] Completed 77500 out of 250000 steps  (31%)
[04:21:55] Completed 80000 out of 250000 steps  (32%)
[04:43:03] Completed 82500 out of 250000 steps  (33%)
[05:04:08] Completed 85000 out of 250000 steps  (34%)
[05:04:14] - Autosending finished units... [July 23 05:04:14 UTC]
[05:04:14] Trying to send all finished work units
[05:04:14] + No unsent completed units remaining.
[05:04:14] - Autosend completed
[05:25:15] Completed 87500 out of 250000 steps  (35%)
[05:46:21] Completed 90000 out of 250000 steps  (36%)
[06:07:28] Completed 92500 out of 250000 steps  (37%)
[06:28:33] Completed 95000 out of 250000 steps  (38%)
[06:49:37] Completed 97500 out of 250000 steps  (39%)
[07:10:44] Completed 100000 out of 250000 steps  (40%)
[07:31:49] Completed 102500 out of 250000 steps  (41%)
[07:52:57] Completed 105000 out of 250000 steps  (42%)
[08:14:02] Completed 107500 out of 250000 steps  (43%)
[08:35:07] Completed 110000 out of 250000 steps  (44%)
[08:56:15] Completed 112500 out of 250000 steps  (45%)
[09:17:20] Completed 115000 out of 250000 steps  (46%)
[09:38:27] Completed 117500 out of 250000 steps  (47%)
[09:59:32] Completed 120000 out of 250000 steps  (48%)
[10:20:37] Completed 122500 out of 250000 steps  (49%)
[10:41:45] Completed 125000 out of 250000 steps  (50%)
[11:02:49] Completed 127500 out of 250000 steps  (51%)
[11:04:14] - Autosending finished units... [July 23 11:04:14 UTC]
[11:04:14] Trying to send all finished work units
[11:04:14] + No unsent completed units remaining.
[11:04:14] - Autosend completed
[11:23:57] Completed 130000 out of 250000 steps  (52%)
[11:45:02] Completed 132500 out of 250000 steps  (53%)
[12:06:07] Completed 135000 out of 250000 steps  (54%)
[12:27:15] Completed 137500 out of 250000 steps  (55%)
[12:48:20] Completed 140000 out of 250000 steps  (56%)
[13:09:29] Completed 142500 out of 250000 steps  (57%)
[13:30:34] Completed 145000 out of 250000 steps  (58%)
[13:51:39] Completed 147500 out of 250000 steps  (59%)
[14:12:47] Completed 150000 out of 250000 steps  (60%)
[14:33:53] Completed 152500 out of 250000 steps  (61%)
[14:55:01] Completed 155000 out of 250000 steps  (62%)
[15:16:06] Completed 157500 out of 250000 steps  (63%)
[15:37:15] Completed 160000 out of 250000 steps  (64%)
[15:58:22] Completed 162500 out of 250000 steps  (65%)
[16:19:29] Completed 165000 out of 250000 steps  (66%)

Machine 2

Code: Select all

[07:07:54] Initial: 0000; - Receiving payload (expected size: 30301287)
[07:10:15] - Downloaded at ~209 kB/s
[07:10:15] - Averaged speed for that direction ~211 kB/s
[07:10:15] + Received work.
[07:10:15] Trying to send all finished work units
[07:10:15] + No unsent completed units remaining.
[07:10:15] + Closed connections
[07:10:15] 
[07:10:15] + Processing work unit
[07:10:15] Core required: FahCore_a5.exe
[07:10:15] Core found.
[07:10:15] Working on queue slot 03 [July 23 07:10:15 UTC]
[07:10:15] + Working ...
[07:10:15] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 03 -np 32 -checkpoint 15 -verbose -lifeline 10552 -version 634'

[07:10:15] 
[07:10:15] *------------------------------*
[07:10:15] Folding@Home Gromacs SMP Core
[07:10:15] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[07:10:15] 
[07:10:15] Preparing to commence simulation
[07:10:15] - Looking at optimizations...
[07:10:15] - Created dyn
[07:10:15] - Files status OK
[07:10:18] - Expanded 30300775 -> 33130012 (decompressed 109.3 percent)
[07:10:18] Called DecompressByteArray: compressed_data_size=30300775 data_size=33130012, decompressed_data_size=33130012 diff=0
[07:10:18] - Digital signature verified
[07:10:18] 
[07:10:18] Project: 8105 (Run 0, Clone 12, Gen 84)
[07:10:18] 
[07:10:18] Assembly optimizations on if available.
[07:10:18] Entering M.D.
[07:10:25] Mapping NT from 32 to 32 
[07:10:29] Completed 0 out of 250000 steps  (0%)
[07:27:39] Completed 2500 out of 250000 steps  (1%)
[07:44:15] Completed 5000 out of 250000 steps  (2%)
[08:00:51] Completed 7500 out of 250000 steps  (3%)
[08:17:31] Completed 10000 out of 250000 steps  (4%)
[08:34:07] Completed 12500 out of 250000 steps  (5%)
[08:50:43] Completed 15000 out of 250000 steps  (6%)
[09:07:19] Completed 17500 out of 250000 steps  (7%)
[09:24:00] Completed 20000 out of 250000 steps  (8%)
[09:28:54] - Autosending finished units... [July 23 09:28:54 UTC]
[09:28:54] Trying to send all finished work units
[09:28:54] + No unsent completed units remaining.
[09:28:54] - Autosend completed
[09:40:38] Completed 22500 out of 250000 steps  (9%)
[09:57:16] Completed 25000 out of 250000 steps  (10%)
[10:13:56] Completed 27500 out of 250000 steps  (11%)
[10:30:33] Completed 30000 out of 250000 steps  (12%)
[10:47:10] Completed 32500 out of 250000 steps  (13%)
[11:03:47] Completed 35000 out of 250000 steps  (14%)
[11:20:31] Completed 37500 out of 250000 steps  (15%)
[11:37:08] Completed 40000 out of 250000 steps  (16%)
[11:53:44] Completed 42500 out of 250000 steps  (17%)
[12:10:26] Completed 45000 out of 250000 steps  (18%)
[12:27:06] Completed 47500 out of 250000 steps  (19%)
[12:43:43] Completed 50000 out of 250000 steps  (20%)
[13:00:20] Completed 52500 out of 250000 steps  (21%)
[13:17:01] Completed 55000 out of 250000 steps  (22%)
[13:33:38] Completed 57500 out of 250000 steps  (23%)
[13:50:15] Completed 60000 out of 250000 steps  (24%)
[14:06:51] Completed 62500 out of 250000 steps  (25%)
[14:23:31] Completed 65000 out of 250000 steps  (26%)
[14:40:06] Completed 67500 out of 250000 steps  (27%)
[14:56:45] Completed 70000 out of 250000 steps  (28%)
[15:13:25] Completed 72500 out of 250000 steps  (29%)
[15:28:54] - Autosending finished units... [July 23 15:28:54 UTC]
[15:28:54] Trying to send all finished work units
[15:28:54] + No unsent completed units remaining.
[15:28:54] - Autosend completed
[15:30:01] Completed 75000 out of 250000 steps  (30%)
[15:46:37] Completed 77500 out of 250000 steps  (31%)
[16:03:13] Completed 80000 out of 250000 steps  (32%)
[16:19:53] Completed 82500 out of 250000 steps  (33%)
Any idea's as to why this has happened? I do know that if you download units at the same time you can get a duplicate but the downloads were a good 14 hours apart.

2nd question - will I get credit for both units?

Re: Project: 8105 (Run 0, Clone 12, Gen 84)

Posted: Tue Jul 23, 2013 4:28 pm
by rickoic
Work units are assigned to several machines across the spectrum. Your two machines just happened to catch the same work unit. You will receive credit for both of them when finished.
Nothing unusual, just the luck of the draw.

Rick

Re: Project: 8105 (Run 0, Clone 12, Gen 84)

Posted: Tue Jul 23, 2013 4:34 pm
by Nathan_P
Its very unusual, hence the post. WU are usually only resent out if a machine has sent a partial WU in, the WU is corrupted or has missed a deadline - Unless something has changed recently?

Re: Project: 8105 (Run 0, Clone 12, Gen 84)

Posted: Tue Jul 23, 2013 4:44 pm
by 7im
It has not changed, and is a rare occurrance. WUs are sent only once unless a problem comes up. Then the server generates a copy. If there are multiple failures, like with a bad WU, multiple copies can be sent.

And yes, you will get points for both, if you wanted to let the 2nd copy keep running.

A mod will be along shortly to check on the WU and mark it as bad if needed.

Re: Project: 8105 (Run 0, Clone 12, Gen 84)

Posted: Tue Jul 23, 2013 5:13 pm
by Joe_H
From the format of the log files, both machines are running the V6 client. As long as you did not clone the installation so both machines have the same ID's assigned, this happens occasionally and is okay. If you did clone the F@H installation, then you need to completely uninstall it from one of the machines, including data files, and reinstall. On connecting for the first time the client will be given an unique ID. Without unique ID's a WU request from the second system a short time after a request from the first can result in getting a duplicate download, and on return the duplicate upload is discarded if you have a successful return from the first. If the returns are from two different machine ID's they are not considered duplicates.

Normally a WU is sent out to only one machine. If it does not come back within the preferred deadline, then the WU will be sent out again. Usually that is to two other machines. This is repeated until there is a successful return or enough failures occur that the WU is suspended from assignment as a possibly bad WU. In this case the WU has no reports in the database, will mark this for rechecking later.

Re: Project: 8105 (Run 0, Clone 12, Gen 84)

Posted: Tue Jul 23, 2013 5:27 pm
by bruce
Look carefully at the first page of the logs. Do they show the same UserID/MachineID? (Not to be confused with UserName, which should be the same.)

Re: Project: 8105 (Run 0, Clone 12, Gen 84)

Posted: Tue Jul 23, 2013 5:38 pm
by Nathan_P
No cloning, both machines were done from a fresh install months apart and the user ID's are different. - and having chewed through several kilowatts of leccy i'm not about to dump a WU.

Re: Project: 8105 (Run 0, Clone 12, Gen 84)

Posted: Wed Jul 24, 2013 3:49 am
by P5-133XL
You will only every get credit for one WU (If they are the same). You might as well get rid of a 2nd copy and start a new WU.

Re: Project: 8105 (Run 0, Clone 12, Gen 84)

Posted: Wed Jul 24, 2013 4:09 am
by Joe_H
P5-133XL wrote:You will only every get credit for one WU (If they are the same). You might as well get rid of a 2nd copy and start a new WU.
This is not actually true. As long as the machines have unique ID's, two copies of a WU that happened to be assigned to the same user will get credit. Where persons have not gotten credit for working on two of the same WU has been when they cloned a F@H install to another machine after it had connected to Stanford and the folding servers at least once and been assigned an User ID.

Re: Project: 8105 (Run 0, Clone 12, Gen 84)

Posted: Wed Jul 24, 2013 1:12 pm
by Joe_H
To follow up, both have finished and been turned in:
Hi Nathan_P (team 33),
Your WU (P8105 R0 C12 G84) was added to the stats database on 2013-07-23 22:05:03 for 253933 points of credit.

Hi Nathan_P (team 33),
Your WU (P8105 R0 C12 G84) was added to the stats database on 2013-07-24 05:05:07 for 285356 points of credit.

Re: Project: 8105 (Run 0, Clone 12, Gen 84)

Posted: Thu Jul 25, 2013 7:27 am
by Nathan_P
Thanks, things are now back to normal and both machines have pulled different WU.