Page 2 of 2

Re: Project: 2682 (Run 1, Clone 26, Gen 21)

Posted: Sun Aug 15, 2010 2:35 pm
by noorman
bollix47 wrote:FYI

This WU completed successfully, uploaded and credit, including bonus, has been received. The system immediately download a new WU and is processing it now. :wink:

This doesn't explain why another volunteer had problems with the WU but does suggest that there's nothing wrong with it, at least not on the system here.

Were they using a 64-bit o/s? Do they have more than 4gig of ram?
.

If I 'm informed correctly, they use a 32-bit Windows, a 64-bit able hardware (system) and 6 GB of RAM (dual boot with Unix)
No other problems before the reported problems with P2682 and the supposedly alternative/replacement P2692 !

.

Re: Project: 2682 (Run 1, Clone 26, Gen 21)

Posted: Sun Aug 15, 2010 2:41 pm
by bollix47
If they are using 32-bit Windows then that may have been the problem as it would not be able to use the 6 gig but rather be limited to an amount that was less than the WU was taking on my system. They might try to increase their page file but I'm not sure how much it would help, if at all.

Re: Project: 2682 (Run 1, Clone 26, Gen 21)

Posted: Sun Aug 15, 2010 2:47 pm
by noorman
bollix47 wrote:If they are using 32-bit Windows then that may have been the problem as it would not see the 6 gig but rather be limited to an amount that was less than the WU was taking on my system. They might try to increase their page file but I'm not sure how much it would help, if at all.
.

They employed a trick which encompasses lying to the Client (telling it there is a bit less than 6 GB) and the 64-bit able hardware and BIOS mapping the 6GB so that it can be used by the 32-bit system anyway.
That did the trick ...

Now they just can't get enough -bigadv work for all of their systems, which makes for idle systems, which still eat very costly energy !
Which they hate, as anyone would.
I reported on the bad availability and low WU numbers available before the start of the weekend to Vijay Pande, but I didn't get a reply and it wasn't taken care of in time, it seems ...


.

Re: Project: 2682 (Run 1, Clone 26, Gen 21)

Posted: Sun Aug 15, 2010 5:16 pm
by shdbcamping
noorman wrote:
bollix47 wrote:If they are using 32-bit Windows then that may have been the problem as it would not see the 6 gig but rather be limited to an amount that was less than the WU was taking on my system. They might try to increase their page file but I'm not sure how much it would help, if at all.
.

They employed a trick which encompasses lying to the Client (telling it there is a bit less than 6 GB) and the 64-bit able hardware and BIOS mapping the 6GB so that it can be used by the 32-bit system anyway.
That did the trick ...

Now they just can't get enough -bigadv work for all of their systems, which makes for idle systems, which still eat very costly energy !
Which they hate, as anyone would.
I reported on the bad availability and low WU numbers available before the start of the weekend to Vijay Pande, but I didn't get a reply and it wasn't taken care of in time, it seems ...


.
Sounds much more like an ISP issue than an FAH issue. I've had this problem when My cable ISP was having server issues in the past. Good luck having the ISP sign on to the problem, however. When a backbone drops, the rerouting can cause interrupt issues. Tell them to check with their ISP's and hang in there to see if the problem clears :ewink:

Re: Project: 2682 (Run 1, Clone 26, Gen 21)

Posted: Sun Aug 15, 2010 6:15 pm
by noorman
shdbcamping wrote:
noorman wrote:
bollix47 wrote:If they are using 32-bit Windows then that may have been the problem as it would not see the 6 gig but rather be limited to an amount that was less than the WU was taking on my system. They might try to increase their page file but I'm not sure how much it would help, if at all.
.

They employed a trick which encompasses lying to the Client (telling it there is a bit less than 6 GB) and the 64-bit able hardware and BIOS mapping the 6GB so that it can be used by the 32-bit system anyway.
That did the trick ...

Now they just can't get enough -bigadv work for all of their systems, which makes for idle systems, which still eat very costly energy !
Which they hate, as anyone would.
I reported on the bad availability and low WU numbers available before the start of the weekend to Vijay Pande, but I didn't get a reply and it wasn't taken care of in time, it seems ...


.
Sounds much more like an ISP issue than an FAH issue. I've had this problem when My cable ISP was having server issues in the past. Good luck having the ISP sign on to the problem, however. When a backbone drops, the rerouting can cause interrupt issues. Tell them to check with their ISP's and hang in there to see if the problem clears :ewink:
.


No, can't be that; they are running 7 identical systems and at the moment, only one still hasn't got a new -bigadv WU (which it did get repeatedly before) ...
The others and that one before got Work and the other 6 are still getting Work (with difficulty) and contrary to what has been reported to usn they are P2682 and not the re-done P2692's !??
Changed the machine ID trick too; didn't help (in this case)


.

Re: Project: 2682 (Run 1, Clone 26, Gen 21)

Posted: Sun Aug 15, 2010 7:08 pm
by bruce
noorman wrote:. . . and contrary to what has been reported to us they are P2682 and not the re-done P2692's !??
See my response here: viewtopic.php?p=155520#p155520

Re: Project: 2682 (Run 1, Clone 26, Gen 21)

Posted: Mon Aug 16, 2010 1:10 am
by Amaruk
Regarding 2682 WUs in general:
noorman wrote:. . . and contrary to what has been reported to us they are P2682 and not the re-done P2692's !??
http://foldingforum.org/viewtopic.php?p=155536#p155536
kasson wrote:The work is definitely still useful. As we announced previously (viewtopic.php?f=55&t=15563&p=154823#p154823), we've downweighted 2682 in favor of 2692, but we still have some 2682's around. If they prove problematic enough that people would rather have no bigadv at all than 2682, then we can turn them off.
Hopefully that clears that up. And for the record, I have no problem with 2682s.


Regarding the WU which is the subject of this thread:
bollix47 wrote:FYI

This WU completed successfully, uploaded and credit, including bonus, has been received. The system immediately download a new WU and is processing it now. :wink:
I see your running dual xeons. Unfortunately I don't believe that matchs the OP-by-proxy's system, but it's good to know it ran OK.
noorman wrote:If I 'm informed correctly, they use a 32-bit Windows, a 64-bit able hardware (system) and 6 GB of RAM (dual boot with Unix)

...they are running 7 identical systems and at the moment, only one still hasn't got a new -bigadv WU
This is not much to go on, and none of the other 15 posts or so I've read thus far contain any more info so I'll have to guess. Please correct any details I get wrong.

CPU is i7 930

OS is XP x32 running on 64 bit hardware

Installed ram is 6 GB - would it be possible to verify this? Dual boot with Unix and the following quote:
I may have found a workaround. Manually setting memory to something just under 6 gigs
in the config file is letting 2682's come in and start up just fine. Now its a waiting game
to see if they keep running and don't bomb out. I'd advise anyone wanting to try this method
to resize their page file size accordingly in the system properties and profile stuff.
Make me wonder if there is more than 6 GB installed. This may mean nothing, but you never know...



Now given the above I have a machine very similar to the 'OP' (apart from the OS):

i7 930, 6 GB ram, 7 Pro x64

Several days ago the 'OP' had reported issues getting bigadv work. Because my system is so similar, I decided to interrupt the SMP WU that was running to test the availability of that server. (171.67.108.22) I didn't want to mess with the unfinished SMP, so I did a fresh install in a separate folder, set the oneunit flag, and firing it up. Had no problems getting work, and reported such.

I did not realize it at the time, but by sheer luck I picked up the same WU referred to in this thread.
Here is a screenshot showing memory usage. Quite a bit, actually.

Image

I have since completed it successfully. Although I can not positively prove it was received, recent point/WU strongly suggest it was.

I also saved a copy of this problem WU, and plan to test how it behaves on startup later.

But first I have to finish that SMP WU. Unfortunately, it looks like I will just miss preferred... :(

~edit~

Here is logfile

Code: Select all


--- Opening Log file [August 13 05:50:04 UTC] 


# Windows SMP Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.30

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Users\Fuzzy\SMP630
Executable: C:\Users\Fuzzy\SMP630\Folding@home-Win32-x86.exe
Arguments: -configonly -smp 

[05:50:04] Configuring Folding@Home...


[05:53:08] - Ask before connecting: No
[05:53:08] - User name: Amaruk (Team 50625)
[05:53:08] - User ID: 425D5ECEXXXXXXXX
[05:53:08] - Machine ID: 1
[05:53:08] 
[05:53:08] -configonly flag given, so exiting.


--- Opening Log file [August 13 05:53:35 UTC] 


# Windows SMP Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.30

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Users\Fuzzy\SMP630
Executable: C:\Users\Fuzzy\SMP630\Folding@home-Win32-x86.exe
Arguments: -oneunit -smp -verbosity 9 -bigadv 

[05:53:35] - Ask before connecting: No
[05:53:35] - User name: Amaruk (Team 50625)
[05:53:35] - User ID: 425D5ECEXXXXXXXX
[05:53:35] - Machine ID: 1
[05:53:35] 
[05:53:35] Work directory not found. Creating...
[05:53:35] Could not open work queue, generating new queue...
[05:53:35] - Preparing to get new work unit...
[05:53:35] - Autosending finished units... [August 13 05:53:35 UTC]
[05:53:35] Cleaning up work directory
[05:53:35] Trying to send all finished work units
[05:53:35] + Attempting to get work packet
[05:53:35] + No unsent completed units remaining.
[05:53:35] Passkey found
[05:53:35] - Will indicate memory of 6135 MB
[05:53:35] - Autosend completed
[05:53:35] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 10, Stepping: 5
[05:53:35] - Connecting to assignment server
[05:53:35] Connecting to http://assign.stanford.edu:8080/
[05:53:41] Posted data.
[05:53:41] Initial: 43AB; - Successful: assigned to (171.67.108.22).
[05:53:41] + News From Folding@Home: Welcome to Folding@Home
[05:53:41] Loaded queue successfully.
[05:53:41] Sent data
[05:53:41] Connecting to http://171.67.108.22:8080/
[05:54:17] Posted data.
[05:54:17] Initial: 0000; - Receiving payload (expected size: 29580657)
[05:54:44] - Downloaded at ~1069 kB/s
[05:54:44] - Averaged speed for that direction ~1069 kB/s
[05:54:44] + Received work.
[05:54:44] + Closed connections
[05:54:44] 
[05:54:44] + Processing work unit
[05:54:44] Core required: FahCore_a3.exe
[05:54:44] Core not found.
[05:54:44] - Core is not present or corrupted.
[05:54:44] - Attempting to download new core...
[05:54:44] + Downloading new core: FahCore_a3.exe
[05:54:44] Downloading core (/~pande/Win32/x86/Core_a3.fah from www.stanford.edu)
[05:54:44] Initial: AFDE; + 10240 bytes downloaded

<snip>

[05:54:46] Initial: 3ABE; + 2711113 bytes downloaded
[05:54:46] Verifying core Core_a3.fah...
[05:54:46] Signature is VALID
[05:54:46] 
[05:54:46] Trying to unzip core FahCore_a3.exe
[05:54:47] Decompressed FahCore_a3.exe (9325056 bytes) successfully
[05:54:52] + Core successfully engaged
[05:54:57] 
[05:54:57] + Processing work unit
[05:54:57] Core required: FahCore_a3.exe
[05:54:57] Core found.
[05:54:57] Working on queue slot 01 [August 13 05:54:57 UTC]
[05:54:57] + Working ...
[05:54:57] - Calling '.\FahCore_a3.exe -dir work/ -nice 19 -suffix 01 -np 8 -checkpoint 15 -verbose -lifeline 8196 -version 630'

[05:54:57] 
[05:54:57] *------------------------------*
[05:54:57] Folding@Home Gromacs SMP Core
[05:54:57] Version 2.22 (Mar 12, 2010)
[05:54:57] 
[05:54:57] Preparing to commence simulation
[05:54:57] - Looking at optimizations...
[05:54:57] - Created dyn
[05:54:57] - Files status OK
[05:56:03] - Expanded 29580145 -> 156321221 (decompressed 92.8 percent)
[05:56:03] Called DecompressByteArray: compressed_data_size=29580145 data_size=156321221, decompressed_data_size=156321221 diff=0
[05:56:04] - Digital signature verified
[05:56:04] 
[05:56:04] Project: 2682 (Run 1, Clone 26, Gen 21)
[05:56:04] 
[05:56:06] Assembly optimizations on if available.
[05:56:06] Entering M.D.
[05:56:26] Completed 0 out of 249999 steps  (0%)
[06:34:28] Completed 2500 out of 249999 steps  (1%)
[07:12:16] Completed 5000 out of 249999 steps  (2%)
[07:50:23] Completed 7500 out of 249999 steps  (3%)
[08:28:26] Completed 10000 out of 249999 steps  (4%)
[09:06:42] Completed 12500 out of 249999 steps  (5%)
[09:44:54] Completed 15000 out of 249999 steps  (6%)
[10:23:01] Completed 17500 out of 249999 steps  (7%)
[11:01:11] Completed 20000 out of 249999 steps  (8%)
[11:39:16] Completed 22500 out of 249999 steps  (9%)
[11:53:35] - Autosending finished units... [August 13 11:53:35 UTC]
[11:53:35] Trying to send all finished work units
[11:53:35] + No unsent completed units remaining.
[11:53:35] - Autosend completed
[12:17:25] Completed 25000 out of 249999 steps  (10%)
[12:55:26] Completed 27500 out of 249999 steps  (11%)
[13:33:40] Completed 30000 out of 249999 steps  (12%)
[14:11:46] Completed 32500 out of 249999 steps  (13%)
[14:49:54] Completed 35000 out of 249999 steps  (14%)
[15:28:16] Completed 37500 out of 249999 steps  (15%)
[16:06:20] Completed 40000 out of 249999 steps  (16%)
[16:44:28] Completed 42500 out of 249999 steps  (17%)
[17:22:34] Completed 45000 out of 249999 steps  (18%)
[17:53:35] - Autosending finished units... [August 13 17:53:35 UTC]
[17:53:35] Trying to send all finished work units
[17:53:35] + No unsent completed units remaining.
[17:53:35] - Autosend completed
[18:00:44] Completed 47500 out of 249999 steps  (19%)
[18:38:55] Completed 50000 out of 249999 steps  (20%)
[19:17:09] Completed 52500 out of 249999 steps  (21%)
[19:55:12] Completed 55000 out of 249999 steps  (22%)
[20:33:23] Completed 57500 out of 249999 steps  (23%)
[21:11:38] Completed 60000 out of 249999 steps  (24%)
[21:48:37] Completed 62500 out of 249999 steps  (25%)
[22:24:23] Completed 65000 out of 249999 steps  (26%)
[23:00:06] Completed 67500 out of 249999 steps  (27%)
[23:35:49] Completed 70000 out of 249999 steps  (28%)
[23:53:35] - Autosending finished units... [August 13 23:53:35 UTC]
[23:53:35] Trying to send all finished work units
[23:53:35] + No unsent completed units remaining.
[23:53:35] - Autosend completed
[00:11:37] Completed 72500 out of 249999 steps  (29%)
[00:47:18] Completed 75000 out of 249999 steps  (30%)
[01:22:59] Completed 77500 out of 249999 steps  (31%)
[01:59:08] Completed 80000 out of 249999 steps  (32%)
[02:34:57] Completed 82500 out of 249999 steps  (33%)
[03:10:48] Completed 85000 out of 249999 steps  (34%)
[03:46:31] Completed 87500 out of 249999 steps  (35%)
[04:22:13] Completed 90000 out of 249999 steps  (36%)
[04:57:58] Completed 92500 out of 249999 steps  (37%)
[05:33:40] Completed 95000 out of 249999 steps  (38%)
[05:53:35] - Autosending finished units... [August 14 05:53:35 UTC]
[05:53:35] Trying to send all finished work units
[05:53:35] + No unsent completed units remaining.
[05:53:35] - Autosend completed
[06:09:26] Completed 97500 out of 249999 steps  (39%)
[06:45:06] Completed 100000 out of 249999 steps  (40%)
[07:20:44] Completed 102500 out of 249999 steps  (41%)
[07:56:20] Completed 105000 out of 249999 steps  (42%)
[08:31:51] Completed 107500 out of 249999 steps  (43%)
[09:07:29] Completed 110000 out of 249999 steps  (44%)
[09:43:14] Completed 112500 out of 249999 steps  (45%)
[10:19:05] Completed 115000 out of 249999 steps  (46%)
[10:54:51] Completed 117500 out of 249999 steps  (47%)
[11:30:39] Completed 120000 out of 249999 steps  (48%)
[11:53:35] - Autosending finished units... [August 14 11:53:35 UTC]
[11:53:35] Trying to send all finished work units
[11:53:35] + No unsent completed units remaining.
[11:53:35] - Autosend completed
[12:06:23] Completed 122500 out of 249999 steps  (49%)
[12:42:03] Completed 125000 out of 249999 steps  (50%)
[13:17:47] Completed 127500 out of 249999 steps  (51%)
[13:53:34] Completed 130000 out of 249999 steps  (52%)
[14:29:19] Completed 132500 out of 249999 steps  (53%)
[15:04:54] Completed 135000 out of 249999 steps  (54%)
[15:40:48] Completed 137500 out of 249999 steps  (55%)
[16:16:18] Completed 140000 out of 249999 steps  (56%)
[16:51:51] Completed 142500 out of 249999 steps  (57%)
[17:27:27] Completed 145000 out of 249999 steps  (58%)
[17:53:35] - Autosending finished units... [August 14 17:53:35 UTC]
[17:53:35] Trying to send all finished work units
[17:53:35] + No unsent completed units remaining.
[17:53:35] - Autosend completed
[18:03:02] Completed 147500 out of 249999 steps  (59%)
[18:38:34] Completed 150000 out of 249999 steps  (60%)
[19:14:16] Completed 152500 out of 249999 steps  (61%)
[19:49:45] Completed 155000 out of 249999 steps  (62%)
[20:25:18] Completed 157500 out of 249999 steps  (63%)
[21:00:53] Completed 160000 out of 249999 steps  (64%)
[21:36:30] Completed 162500 out of 249999 steps  (65%)
[22:12:02] Completed 165000 out of 249999 steps  (66%)
[22:47:43] Completed 167500 out of 249999 steps  (67%)
[23:23:30] Completed 170000 out of 249999 steps  (68%)
[23:53:35] - Autosending finished units... [August 14 23:53:35 UTC]
[23:53:35] Trying to send all finished work units
[23:53:35] + No unsent completed units remaining.
[23:53:35] - Autosend completed
[23:59:22] Completed 172500 out of 249999 steps  (69%)
[00:35:13] Completed 175000 out of 249999 steps  (70%)
[01:11:02] Completed 177500 out of 249999 steps  (71%)
[01:46:49] Completed 180000 out of 249999 steps  (72%)
[02:22:47] Completed 182500 out of 249999 steps  (73%)
[02:58:32] Completed 185000 out of 249999 steps  (74%)
[03:34:15] Completed 187500 out of 249999 steps  (75%)
[04:09:51] Completed 190000 out of 249999 steps  (76%)
[04:46:16] Completed 192500 out of 249999 steps  (77%)
[05:25:52] Completed 195000 out of 249999 steps  (78%)
[05:53:35] - Autosending finished units... [August 15 05:53:35 UTC]
[05:53:35] Trying to send all finished work units
[05:53:35] + No unsent completed units remaining.
[05:53:35] - Autosend completed
[06:01:36] Completed 197500 out of 249999 steps  (79%)
[06:37:12] Completed 200000 out of 249999 steps  (80%)
[07:12:53] Completed 202500 out of 249999 steps  (81%)
[07:48:38] Completed 205000 out of 249999 steps  (82%)
[08:23:38] Completed 207500 out of 249999 steps  (83%)
[09:01:14] Completed 210000 out of 249999 steps  (84%)
[09:39:23] Completed 212500 out of 249999 steps  (85%)
[10:17:43] Completed 215000 out of 249999 steps  (86%)
[10:54:05] Completed 217500 out of 249999 steps  (87%)
[11:29:36] Completed 220000 out of 249999 steps  (88%)
[11:52:54] - Autosending finished units... [August 15 11:52:54 UTC]
[11:52:54] Trying to send all finished work units
[11:52:54] + No unsent completed units remaining.
[11:52:54] - Autosend completed
[12:07:03] Completed 222500 out of 249999 steps  (89%)
[12:45:18] Completed 225000 out of 249999 steps  (90%)
[13:23:26] Completed 227500 out of 249999 steps  (91%)
[14:01:32] Completed 230000 out of 249999 steps  (92%)
[14:39:41] Completed 232500 out of 249999 steps  (93%)
[15:18:12] Completed 235000 out of 249999 steps  (94%)
[15:56:20] Completed 237500 out of 249999 steps  (95%)
[16:34:30] Completed 240000 out of 249999 steps  (96%)
[17:12:46] Completed 242500 out of 249999 steps  (97%)
[17:50:51] Completed 245000 out of 249999 steps  (98%)
[17:52:54] - Autosending finished units... [August 15 17:52:54 UTC]
[17:52:54] Trying to send all finished work units
[17:52:54] + No unsent completed units remaining.
[17:52:54] - Autosend completed
[18:29:00] Completed 247500 out of 249999 steps  (99%)
[19:07:18] Completed 249999 out of 249999 steps  (100%)
[19:07:31] DynamicWrapper: Finished Work Unit: sleep=10000
[19:07:41] 
[19:07:41] Finished Work Unit:
[19:07:41] - Reading up to 26356560 from "work/wudata_01.trr": Read 26356560
[19:07:41] trr file hash check passed.
[19:07:41] - Reading up to 42756488 from "work/wudata_01.xtc": Read 42756488
[19:07:42] xtc file hash check passed.
[19:07:42] edr file hash check passed.
[19:07:42] logfile size: 210131
[19:07:42] Leaving Run
[19:07:44] - Writing 69490859 bytes of core data to disk...
[19:07:46]   ... Done.
[19:08:29] - Shutting down core
[19:08:29] 
[19:08:29] Folding@home Core Shutdown: FINISHED_UNIT
[19:08:39] CoreStatus = 64 (100)
[19:08:39] Unit 1 finished with 57 percent of time to deadline remaining.
[19:08:39] Updated performance fraction: 0.574778
[19:08:39] Sending work to server
[19:08:39] Project: 2682 (Run 1, Clone 26, Gen 21)


[19:08:39] + Attempting to send results [August 15 19:08:39 UTC]
[19:08:39] - Reading file work/wuresults_01.dat from core
[19:08:39]   (Read 69490859 bytes from disk)
[19:08:39] Connecting to http://171.67.108.22:8080/
[19:18:56] Posted data.
[19:18:56] Initial: 0000; - Uploaded at ~109 kB/s
[19:18:56] - Averaged speed for that direction ~109 kB/s
[19:18:56] + Results successfully sent
[19:18:56] Thank you for your contribution to Folding@Home.
[19:18:56] + Starting local stats count at 1
[19:19:01] Trying to send all finished work units
[19:19:01] + No unsent completed units remaining.
[19:19:01] + -oneunit flag given and have now finished a unit. Exiting.***** Got a SIGTERM signal (2)
[19:19:01] Killing all core threads
[19:19:01] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown.

Re: Project: 2682 (Run 1, Clone 26, Gen 21)

Posted: Mon Aug 16, 2010 5:40 am
by Amaruk
Made preferred. :)

Started Project: 2682 (Run 1, Clone 26, Gen 21) twice - no spikes in memory usage at startup. So I restarted the client with bigadv and got another 2682

Image

P2682, R11, C14, G0 memory usage is less than half that of P2682, R1, C26, G21. :shock:

I'm not exactly sure, but in Windows isn't a 32 bit process limited to 2GB on a 32 bit OS, and 4GB on 64 bit?

If so that would explain why bollix47 and I (both running 7 Pro, and I suspect he's running 64 bit too) were able to run it, but it would be impossible to run 2682, Run1, C26 ,G21 on 32 bit XP. :(

I still have a copy of this WU if anyone at PG is interested in it.