Page 1 of 1

Project: 11243 (Run 1, Clone 24, Gen 34)

Posted: Sat Oct 30, 2010 8:26 pm
by Stewart1
"Server reports problem with unit" - that's a new one on me.

I've only recently installed a GTS450 and the new client so I'm unfamiliar with what can go wrong, and whether this actually is a problem with the WU.

Grateful for any suggestions...

Code: Select all

# Windows GPU Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.40r1

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: E:\Non-SSD Programs\GTS450
Executable: fahgpu.exe
Arguments: -gpu 1 -local -verbosity 9 -advmethods -forcegpu nvidia_fermi 

[19:21:52] - Ask before connecting: No
[19:21:52] - User name: Stewart1 (Team 163049)
[19:21:52] - User ID: 7AEA76E22AF32B83
[19:21:52] - Machine ID: 16
[19:21:52] 
[19:21:52] Gpu type=3 species=20.
[19:21:52] Loaded queue successfully.
[19:21:52] 
[19:21:52] + Processing work unit
[19:21:52] Core required: FahCore_15.exe
[19:21:52] Core found.
[19:21:52] - Autosending finished units... [October 30 19:21:52 UTC]
[19:21:52] Trying to send all finished work units
[19:21:52] + No unsent completed units remaining.
[19:21:52] - Autosend completed
[19:21:52] Working on queue slot 02 [October 30 19:21:52 UTC]
[19:21:52] + Working ...
[19:21:52] - Calling '.\FahCore_15.exe -dir work/ -suffix 02 -nice 19 -nocpulock -checkpoint 15 -verbose -lifeline 2416 -version 640'

[19:21:52] 
[19:21:52] *------------------------------*
[19:21:52] Folding@Home GPU Core -- Beta
[19:21:52] Version 2.09 (Thu May 20 11:58:42 PDT 2010)
[19:21:52] 
[19:21:52] Build host: SimbiosNvdWin7
[19:21:52] Board Type: Nvidia
[19:21:52] Core      : 
[19:21:52] Preparing to commence simulation
[19:21:52] - Looking at optimizations...
[19:21:52] - Files status OK
[19:21:53] sizeof(CORE_PACKET_HDR) = 512 file=<>
[19:21:53] - Expanded 18945 -> 76495 (decompressed 403.7 percent)
[19:21:53] Called DecompressByteArray: compressed_data_size=18945 data_size=76495, decompressed_data_size=76495 diff=0
[19:21:53] - Digital signature verified
[19:21:53] 
[19:21:53] Project: 11243 (Run 1, Clone 24, Gen 34)
[19:21:53] 
[19:21:53] Assembly optimizations on if available.
[19:21:53] Entering M.D.
[19:21:59] Will resume from checkpoint file work/wudata_02.ckp
[19:21:59] Tpr hash work/wudata_02.tpr:  3480890183 2147084907 293159923 4220870017 848096843
[19:21:59] Working on 264 Fs_coil
[19:21:59] Client config found, loading data.
[19:21:59] Starting GUI Server
[19:21:59] Resuming from checkpoint
[19:21:59] fcCheckPointResume: retreived and current tpr file hash:
[19:21:59]    0   3480890183   3480890183
[19:21:59]    1   2147084907   2147084907
[19:21:59]    2    293159923    293159923
[19:21:59]    3   4220870017   4220870017
[19:21:59]    4    848096843    848096843
[19:21:59] fcCheckPointResume: file hashes same.
[19:21:59] fcCheckPointResume: state restored.
[19:21:59] fcCheckPointResume: name work/wudata_02.log Verified work/wudata_02.log
[19:21:59] fcCheckPointResume: name work/wudata_02.trr Verified work/wudata_02.trr
[19:21:59] fcCheckPointResume: name work/wudata_02.xtc Verified work/wudata_02.xtc
[19:21:59] fcCheckPointResume: name work/wudata_02.edr Verified work/wudata_02.edr
[19:21:59] fcCheckPointResume: state restored 2
[19:21:59] Resumed from checkpoint
[19:21:59] Completed 53%
[19:23:01] Completed 54%
[19:24:02] Completed 55%
[19:25:03] Completed 56%
[19:26:06] Completed 57%
[19:27:07] Completed 58%
[19:28:07] Completed 59%
[19:29:07] Completed 60%
[19:30:07] Completed 61%
[19:31:08] Completed 62%
[19:32:08] Completed 63%
[19:33:10] Completed 64%
[19:34:10] Completed 65%
[19:35:10] Completed 66%
[19:36:10] Completed 67%
[19:37:10] Completed 68%
[19:38:11] Completed 69%
[19:39:11] Completed 70%
[19:40:11] Completed 71%
[19:41:11] Completed 72%
[19:42:11] Completed 73%
[19:43:10] Completed 74%
[19:44:10] Completed 75%
[19:45:10] Completed 76%
[19:46:11] Completed 77%
[19:47:11] Completed 78%
[19:48:11] Completed 79%
[19:49:12] Completed 80%
[19:50:20] Completed 81%
[19:51:21] Completed 82%
[19:52:21] Completed 83%
[19:53:20] Completed 84%
[19:54:20] Completed 85%
[19:55:19] Completed 86%
[19:56:19] Completed 87%
[19:57:18] Completed 88%
[19:58:18] Completed 89%
[19:59:16] Completed 90%
[20:00:15] Completed 91%
[20:01:14] Completed 92%
[20:02:13] Completed 93%
[20:03:11] Completed 94%
[20:04:10] Completed 95%
[20:05:08] Completed 96%
[20:06:07] Completed 97%
[20:07:05] Completed 98%
[20:08:04] Completed 99%
[20:09:02] Completed 100%
[20:09:02] Finished fah_main
[20:09:02] 
[20:09:02] Successful run
[20:09:02] DynamicWrapper: Finished Work Unit: sleep=10000
[20:09:12] Reserved 1081576 bytes for xtc file; Cosm status=0
[20:09:12] Allocated 1081576 bytes for xtc file
[20:09:12] - Reading up to 1081576 from "work/wudata_02.xtc": Read 1081576
[20:09:12] Read 1081576 bytes from xtc file; available packet space=785348888
[20:09:12] xtc file hash check passed.
[20:09:12] Reserved 32280 32280 785348888 bytes for arc file=<work/wudata_02.trr> Cosm status=0
[20:09:12] Allocated 32280 bytes for arc file
[20:09:12] - Reading up to 32280 from "work/wudata_02.trr": Read 32280
[20:09:12] Read 32280 bytes from arc file; available packet space=785316608
[20:09:12] trr file hash check passed.
[20:09:12] Allocated 544 bytes for edr file
[20:09:12] Read bedfile
[20:09:12] edr file hash check passed.
[20:09:12] Allocated 121820 bytes for logfile
[20:09:12] Read logfile
[20:09:12] GuardedRun: success in DynamicWrapper
[20:09:12] GuardedRun: done
[20:09:12] Run: GuardedRun completed.
[20:09:13] + Opened results file
[20:09:13] - Writing 1236732 bytes of core data to disk...
[20:09:13] Done: 1236220 -> 1070686 (compressed to 86.6 percent)
[20:09:13]   ... Done.
[20:09:13] DeleteFrameFiles: successfully deleted file=work/wudata_02.ckp
[20:09:14] Shutting down core 
[20:09:14] 
[20:09:14] Folding@home Core Shutdown: FINISHED_UNIT
[20:09:17] CoreStatus = 64 (100)
[20:09:17] Unit 2 finished with 100 percent of time to deadline remaining.
[20:09:17] Updated performance fraction: 0.996437
[20:09:17] Sending work to server
[20:09:17] Project: 11243 (Run 1, Clone 24, Gen 34)
[20:09:17] - Read packet limit of 540015616... Set to 524286976.


[20:09:17] + Attempting to send results [October 30 20:09:17 UTC]
[20:09:17] - Reading file work/wuresults_02.dat from core
[20:09:17]   (Read 1071198 bytes from disk)
[20:09:17] Gpu type=3 species=20.
[20:09:17] Connecting to http://171.67.108.32:8080/
[20:09:45] Posted data.
[20:09:45] Initial: 0000; - Uploaded at ~37 kB/s
[20:09:45] - Averaged speed for that direction ~21 kB/s
[20:09:45] - Server reports problem with unit.
[20:09:45] Trying to send all finished work units
[20:09:45] + No unsent completed units remaining.
[20:09:45] - Preparing to get new work unit...

Re: Project: 11243 (Run 1, Clone 24, Gen 34)

Posted: Sat Oct 30, 2010 10:31 pm
by Stewart1
Hmm, it happened again with the next unit. Maybe its an issue with the client. Turning off -advmethods, maybe that might help.

Re: Project: 11243 (Run 1, Clone 24, Gen 34)

Posted: Sat Oct 30, 2010 11:55 pm
by sortofageek
Looking at the error message and other reports, we may be seeing a problem with the server itself.

IP: 171.67.108.32

Re: Project: 11243 (Run 1, Clone 24, Gen 34)

Posted: Sun Oct 31, 2010 12:03 am
by sortofageek
That server is down. I think all three of these reports are probably related to problems being reported here.

I'll flag these reports for followup to see if credits are received when the servers come back up.

Re: Project: 11243 (Run 1, Clone 24, Gen 34)

Posted: Sun Oct 31, 2010 12:38 am
by sortofageek
For anyone interested, please note Professor Pande has responded here.

Re: Project: 11243 (Run 1, Clone 24, Gen 34)

Posted: Sun Oct 31, 2010 3:32 am
by sortofageek
You may also want to read this topic.

Re: Project: 11243 (Run 1, Clone 24, Gen 34)

Posted: Sun Oct 31, 2010 10:42 pm
by sortofageek
The database now says Project: 11243 (Run 1, Clone 24, Gen 34) has been completed and returned for full credit by another folder, so we can't call this a bad WU.