Collection server dumping WUs
Posted: Mon May 07, 2012 12:28 pm
One of my team mates reported this problem. He completed a WU and had the collection server dump it, then successfully uploaded the next, and once more had the server dump the third WU. All are P7903.
Code: Select all
14:26:30:WU01:FS00:0xa4:Completed 2400000 out of 2500000 steps (96%)
14:30:35:WU01:FS00:0xa4:Completed 2425000 out of 2500000 steps (97%)
14:34:40:WU01:FS00:0xa4:Completed 2450000 out of 2500000 steps (98%)
14:38:46:WU01:FS00:0xa4:Completed 2475000 out of 2500000 steps (99%)
14:38:47:WU00:FS00:Connecting to assign3.stanford.edu:8080
14:38:47:WU00:FS00:News: Welcome to Folding@Home
14:38:47:WU00:FS00:Assigned to work server 128.113.12.161
14:38:47:WU00:FS00:Requesting new work unit for slot 00: RUNNING smp:8 from 128.113.12.161
14:38:47:WU00:FS00:Connecting to 128.113.12.161:8080
14:38:48:WU00:FS00:Downloading 646.38KiB
14:38:48:WU00:FS00:Download complete
14:38:48:WU00:FS00:Received Unit: id:00 state:DOWNLOAD error:OK project:7903 run:210 clone:9 gen:24 core:0xa4 unit:0x0000001900ac9c214eca68d81525fe45
14:42:51:WU01:FS00:0xa4:Completed 2500000 out of 2500000 steps (100%)
14:42:52:WU01:FS00:0xa4:DynamicWrapper: Finished Work Unit: sleep=10000
14:43:02:WU01:FS00:0xa4:
14:43:02:WU01:FS00:0xa4:Finished Work Unit:
14:43:02:WU01:FS00:0xa4:- Reading up to 35910936 from "01/wudata_01.trr": Read 35910936
14:43:02:WU01:FS00:0xa4:trr file hash check passed.
14:43:02:WU01:FS00:0xa4:edr file hash check passed.
14:43:02:WU01:FS00:0xa4:logfile size: 56875
14:43:02:WU01:FS00:0xa4:Leaving Run
14:43:05:WU01:FS00:0xa4:- Writing 35997727 bytes of core data to disk...
14:43:10:WU01:FS00:0xa4:Done: 35997215 -> 30222790 (compressed to 83.9 percent)
14:43:11:WU01:FS00:0xa4: ... Done.
14:43:14:WU01:FS00:0xa4:- Shutting down core
14:43:14:WU01:FS00:0xa4:
14:43:14:WU01:FS00:0xa4:Folding@home Core Shutdown: FINISHED_UNIT
14:43:14:WU01:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
14:43:14:WU01:FS00:Sending unit results: id:01 state:SEND error:OK project:7903 run:216 clone:8 gen:16 core:0xa4 unit:0x0000001400ac9c214eca68e095e47aac
14:43:14:WU01:FS00:Uploading 28.82MiB to 128.113.12.161
14:43:14:WU01:FS00:Connecting to 128.113.12.161:8080
14:43:14:WU00:FS00:Starting
14:43:14:WU00:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" "C:/Users/User/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe" -dir 00 -suffix 01 -version 701 -lifeline 1312 -checkpoint 15 -np 8
14:43:14:WU00:FS00:Started FahCore on PID 1340
14:43:14:WU00:FS00:Core PID:3120
14:43:14:WU00:FS00:FahCore 0xa4 started
14:43:15:WU00:FS00:0xa4:
14:43:15:WU00:FS00:0xa4:*------------------------------*
14:43:15:WU00:FS00:0xa4:Folding@Home Gromacs GB Core
14:43:15:WU00:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
14:43:15:WU00:FS00:0xa4:
14:43:15:WU00:FS00:0xa4:Preparing to commence simulation
14:43:15:WU00:FS00:0xa4:- Looking at optimizations...
14:43:15:WU00:FS00:0xa4:- Created dyn
14:43:15:WU00:FS00:0xa4:- Files status OK
14:43:15:WU00:FS00:0xa4:- Expanded 661380 -> 1008860 (decompressed 152.5 percent)
14:43:15:WU00:FS00:0xa4:Called DecompressByteArray: compressed_data_size=661380 data_size=1008860, decompressed_data_size=1008860 diff=0
14:43:15:WU00:FS00:0xa4:- Digital signature verified
14:43:15:WU00:FS00:0xa4:
14:43:15:WU00:FS00:0xa4:Project: 7903 (Run 210, Clone 9, Gen 24)
14:43:15:WU00:FS00:0xa4:
14:43:15:WU00:FS00:0xa4:Assembly optimizations on if available.
14:43:15:WU00:FS00:0xa4:Entering M.D.
14:43:21:WU00:FS00:0xa4:Mapping NT from 8 to 8
14:43:21:WU00:FS00:0xa4:Completed 0 out of 2500000 steps (0%)
14:43:57:WU01:FS00:Upload 76.98%
14:43:57:WARNING:WU01:FS00:Exception: Failed to send results to work server: Transfer failed
14:43:57:WU01:FS00:Trying to send results to collection server
14:43:57:WU01:FS00:Uploading 28.82MiB to 129.74.85.16
14:43:57:WU01:FS00:Connecting to 129.74.85.16:8080
14:44:03:WU01:FS00:Upload 30.79%
14:44:09:WU01:FS00:Upload 62.45%
14:44:15:WU01:FS00:Upload 94.54%
14:44:16:WU01:FS00:Upload complete
14:44:16:WU01:FS00:Server responded WORK_QUIT (404)
14:44:16:WARNING:WU01:FS00:Server did not like results, dumping
14:44:16:WU01:FS00:Cleaning up