Project: 7809 (Run 1, Clone 188, Gen 6) Server reports probl

Moderators: Site Moderators, FAHC Science Team

Post Reply
DrBB1
Posts: 136
Joined: Wed Mar 26, 2008 12:30 am
Location: SE PA

Project: 7809 (Run 1, Clone 188, Gen 6) Server reports probl

Post by DrBB1 »

... problem after 10 days of folding 7809 (Run 1, Clone 188, Gen 6).
Key line from log is, "[04:41:27] - Server reports problem with unit." Larger excerpt follows:

Code: Select all

[11:18:23] Completed 1395000 out of 1500000 steps  (93%)
[13:47:12] Completed 1410000 out of 1500000 steps  (94%)
[16:16:00] Completed 1425000 out of 1500000 steps  (95%)
[16:49:32] - Autosending finished units... [November 7 16:49:32 UTC]
[16:49:32] Trying to send all finished work units
[16:49:32] + No unsent completed units remaining.
[16:49:32] - Autosend completed
[16:49:32] + Working...
[18:44:55] Completed 1440000 out of 1500000 steps  (96%)
[21:13:36] Completed 1455000 out of 1500000 steps  (97%)
[22:49:32] - Autosending finished units... [November 7 22:49:32 UTC]
[22:49:32] Trying to send all finished work units
[22:49:32] + No unsent completed units remaining.
[22:49:32] - Autosend completed
[22:49:32] + Working...
[23:42:15] Completed 1470000 out of 1500000 steps  (98%)
[02:10:55] Completed 1485000 out of 1500000 steps  (99%)
[04:40:52] Completed 1500000 out of 1500000 steps  (100%)
[04:40:55] DynamicWrapper: Finished Work Unit: sleep=10000
[04:41:05] 
[04:41:05] Finished Work Unit:
[04:41:05] - Reading up to 2908800 from "work/wudata_00.trr": Read 2908800
[04:41:05] trr file hash check passed.
[04:41:05] - Reading up to 1554392 from "work/wudata_00.xtc": Read 1554392
[04:41:05] xtc file hash check passed.
[04:41:05] edr file hash check passed.
[04:41:05] logfile size: 53174
[04:41:05] Leaving Run
[04:41:08] - Writing 4521378 bytes of core data to disk...
[04:41:11] Done: 4520866 -> 4325658 (compressed to 95.6 percent)
[04:41:11]   ... Done.
[04:41:12] - Shutting down core
[04:41:12] 
[04:41:12] Folding@home Core Shutdown: FINISHED_UNIT
[04:41:16] CoreStatus = 64 (100)
[04:41:16] Unit 0 finished with 78 percent of time to deadline remaining.
[04:41:16] Updated performance fraction: 0.802056
[04:41:16] Sending work to server
[04:41:16] Project: 7809 (Run 1, Clone 188, Gen 6)


[04:41:16] + Attempting to send results [November 8 04:41:16 UTC]
[04:41:16] - Reading file work/wuresults_00.dat from core
[04:41:16]   (Read 4326170 bytes from disk)
[04:41:16] Connecting to http://171.64.65.99:8080/
[04:41:27] Posted data.
[04:41:27] Initial: 0000; - Uploaded at ~384 kB/s
[04:41:27] - Averaged speed for that direction ~321 kB/s
[04:41:27] - Server reports problem with unit.
[04:41:27] Trying to send all finished work units
[04:41:27] + No unsent completed units remaining.
[04:41:27] - Preparing to get new work unit...
[04:41:27] + Attempting to get work packet
[04:41:27] - Will indicate memory of 510 MB
[04:41:27] - Detect CPU. Vendor: GenuineIntel, Family: 15, Model: 4, Stepping: 1
[04:41:27] - Connecting to assignment server
[04:41:27] Connecting to http://assign.stanford.edu:8080/
[04:41:28] Posted data.
[04:41:28] Initial: 598F; - Successful: assigned to (143.89.28.72).
[04:41:28] + News From Folding@Home: Welcome to Folding@Home
[04:41:28] Loaded queue successfully.
[04:41:28] Connecting to http://143.89.28.72:8080/
[04:41:32] Posted data.
[04:41:33] Initial: 0000; - Receiving payload (expected size: 791120)
[04:41:45] - Downloaded at ~64 kB/s
[04:41:45] - Averaged speed for that direction ~139 kB/s
[04:41:45] + Received work.
[04:41:45] Trying to send all finished work units
[04:41:45] + No unsent completed units remaining.
[04:41:45] + Closed connections
[04:41:45] 
[04:41:45] + Processing work unit
[04:41:45] Core required: FahCore_a4.exe
[04:41:45] Core found.
[04:41:45] Working on queue slot 01 [November 8 04:41:45 UTC]
[04:41:45] + Working ...
[04:41:45] - Calling '.\FahCore_a4.exe -dir work/ -suffix 01 -priority 96 -nocpulock -checkpoint 30 -verbose -lifeline 828 -version 620'

[04:41:45] 
[04:41:45] *------------------------------*
[04:41:45] Folding@Home Gromacs GB Core
[04:41:45] Version 2.27 (Dec. 15, 2010)
[04:41:45] 
[04:41:45] Preparing to commence simulation
[04:41:45] - Looking at optimizations...
[04:41:45] - Created dyn
[04:41:45] - Files status OK
[04:41:46] - Expanded 790608 -> 1490352 (decompressed 188.5 percent)
[04:41:46] Called DecompressByteArray: compressed_data_size=790608 data_size=1490352, decompressed_data_size=1490352 diff=0
[04:41:46] - Digital signature verified
[04:41:46] 
[04:41:46] Project: 2975 (Run 249, Clone 1, Gen 6)
[04:41:46] 
[04:41:46] Assembly optimizations on if available.
[04:41:46] Entering M.D.
[04:41:52] Mapping NT from 1 to 1 
[04:41:52] Completed 0 out of 2500000 steps  (0%)
[04:49:32] - Autosending finished units... [November 8 04:49:32 UTC]
[04:49:32] Trying to send all finished work units
[04:49:32] + No unsent completed units remaining.
========
DrBB1
sortofageek
Site Admin
Posts: 3110
Joined: Fri Nov 30, 2007 8:06 pm
Location: Team Helix
Contact:

Re: Project: 7809 (Run 1, Clone 188, Gen 6) Server reports p

Post by sortofageek »

I marked this one for followup along with the others.

I am wondering if the problems we have been seeing might be related to the server upgrade. Please see this post ---> viewtopic.php?f=18&t=19982
7im
Posts: 10179
Joined: Thu Nov 29, 2007 4:30 pm
Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
Location: Arizona
Contact:

Re: Project: 7809 (Run 1, Clone 188, Gen 6) Server reports p

Post by 7im »

How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
DrBB1
Posts: 136
Joined: Wed Mar 26, 2008 12:30 am
Location: SE PA

Re: Project: 7809 (Run 1, Clone 188, Gen 6) Server reports p

Post by DrBB1 »

Since my log stated that I have no "unsent completed units," will those of us affected receive credit for the completed WUs or have ten+ days of folding been a waste for both folders and the researchers alike?
========
DrBB1
7im
Posts: 10179
Joined: Thu Nov 29, 2007 4:30 pm
Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
Location: Arizona
Contact:

Re: Project: 7809 (Run 1, Clone 188, Gen 6) Server reports p

Post by 7im »

When the server says it has a problem, it won't accept the WU. No data, no points. So likely wasted time for both the Researchers and you, and me, and a few 100 others. If you want confirmation from PG, feel free to ask in the thread I linked... ;)

Mechanical things break. That's just the way it is. PG works hard to minimize that with multiple redundancies and multiple servers, etc. Even so, an occasional glitch is unavoidable.
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
Post Reply