I now have 3 Wu's that continuously attempt but fail to upload. They share work/collection servers 155.247.166.219; 155.247.166.220; and 128.252.203.4. I am able to receive and upload other WU's while these WU's remain uncollected.
Log file is attached. Suggestions would be appreciated.
*********************** Log Started 2019-09-24T15:27:30Z ***********************
15:27:30:************************* Folding@home Client *************************
15:27:30: Website: http://folding.stanford.edu/
15:27:30: Copyright: (c) 2009-2014 Stanford University
15:27:30: Author: Joseph Coffland <joseph@cauldrondevelopment.com>
15:27:30: Args: --open-web-control
15:27:30: Config: C:/Users/Ron/AppData/Roaming/FAHClient/config.xml
15:27:30:******************************** Build ********************************
15:27:30: Version: 7.4.4
15:27:30: Date: Mar 4 2014
15:27:30: Time: 20:26:54
15:27:30: SVN Rev: 4130
15:27:30: Branch: fah/trunk/client
15:27:30: Compiler: Intel(R) C++ MSVC 1500 mode 1200
15:27:30: Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
15:27:30: /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
15:27:30: Platform: win32 XP
15:27:30: Bits: 32
15:27:30: Mode: Release
15:27:30:******************************* System ********************************
15:27:30: CPU: Intel(R) Core(TM) i3-3227U CPU @ 1.90GHz
15:27:30: CPU ID: GenuineIntel Family 6 Model 58 Stepping 9
15:27:30: CPUs: 4
15:27:30: Memory: 3.89GiB
15:27:30: Free Memory: 3.19GiB
15:27:30: Threads: WINDOWS_THREADS
15:27:30: OS Version: 6.1
15:27:30: Has Battery: false
15:27:30: On Battery: false
15:27:30: UTC Offset: -4
15:27:30: PID: 2384
15:27:30: CWD: C:/Users/Ron/AppData/Roaming/FAHClient
15:27:30: OS: Windows 7 Home Premium
15:27:30: OS Arch: AMD64
15:27:30: GPUs: 0
15:27:30: CUDA: Not detected
15:27:30:Win32 Service: false
15:27:30:***********************************************************************
15:27:30:<config>
15:27:30: <!-- Network -->
15:27:30: <proxy v=':8080'/>
15:27:30:
15:27:30: <!-- Slot Control -->
15:27:30: <power v='FULL'/>
15:27:30:
15:27:30: <!-- User Information -->
15:27:30: <passkey v='********************************'/>
15:27:30: <team v='4'/>
15:27:30: <user v='rewron'/>
15:27:30:
15:27:30: <!-- Folding Slots -->
15:27:30: <slot id='0' type='CPU'>
15:27:30: <paused v='true'/>
15:27:30: </slot>
15:27:30:</config>
15:27:30:Trying to access database...
15:27:30:Successfully acquired database lock
15:27:30:Enabled folding slot 00: PAUSED cpu:4 (by user)
15:27:32:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:14175 run:0 clone:389 gen:4 core:0xa7 unit:0x000000050002894b5d65700e9d4ea3b4
15:27:35:WU01:FS00:Uploading 378.91MiB to 155.247.166.219
15:27:35:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:14189 run:6 clone:134 gen:2 core:0xa7 unit:0x000000040002894b5d543dafa435313a
15:27:35:WU01:FS00:Connecting to 155.247.166.219:8080
15:27:35:WU00:FS00:Uploading 172.91MiB to 155.247.166.219
15:27:35:WU00:FS00:Connecting to 155.247.166.219:8080
15:27:35:WU02:FS00:Sending unit results: id:02 state:SEND error:NO_ERROR project:14189 run:1 clone:275 gen:2 core:0xa7 unit:0x000000040002894b5d77e3f99e5d8b7b
15:27:36:WU02:FS00:Uploading 159.71MiB to 155.247.166.219
15:27:36:WU02:FS00:Connecting to 155.247.166.219:8080
15:27:36:WARNING:WU00:FS00:Exception: Failed to send results to work server: Transfer failed
15:27:36:WU00:FS00:Trying to send results to collection server
15:27:37:WU00:FS00:Uploading 172.91MiB to 155.247.166.220
15:27:37:WARNING:WU01:FS00:Exception: Failed to send results to work server: Transfer failed
15:27:37:WU01:FS00:Trying to send results to collection server
15:27:37:WU00:FS00:Connecting to 155.247.166.220:8080
15:27:37:WU01:FS00:Uploading 378.91MiB to 155.247.166.220
15:27:37:WU01:FS00:Connecting to 155.247.166.220:8080
15:27:38:WARNING:WU02:FS00:Exception: Failed to send results to work server: Transfer failed
15:27:38:WU02:FS00:Trying to send results to collection server
15:27:38:WU02:FS00:Uploading 159.71MiB to 128.252.203.4
15:27:38:WU02:FS00:Connecting to 128.252.203.4:8080
15:27:38:ERROR:WU00:FS00:Exception: Transfer failed
15:27:39:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:14189 run:6 clone:134 gen:2 core:0xa7 unit:0x000000040002894b5d543dafa435313a
15:27:39:WU00:FS00:Uploading 172.91MiB to 155.247.166.219
15:27:39:ERROR:WU01:FS00:Exception: Transfer failed
15:27:39:WU00:FS00:Connecting to 155.247.166.219:8080
15:27:39:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:14175 run:0 clone:389 gen:4 core:0xa7 unit:0x000000050002894b5d65700e9d4ea3b4
15:27:39:WU01:FS00:Uploading 378.91MiB to 155.247.166.219
15:27:39:WU01:FS00:Connecting to 155.247.166.219:8080
15:27:40:ERROR:WU02:FS00:Exception: Transfer failed
15:27:40:WU02:FS00:Sending unit results: id:02 state:SEND error:NO_ERROR project:14189 run:1 clone:275 gen:2 core:0xa7 unit:0x000000040002894b5d77e3f99e5d8b7b
15:27:40:WU02:FS00:Uploading 159.71MiB to 155.247.166.219
15:27:40:WU02:FS00:Connecting to 155.247.166.219:8080
15:27:40:WARNING:WU00:FS00:Exception: Failed to send results to work server: Transfer failed
15:27:40:WU00:FS00:Trying to send results to collection server
15:27:41:WU00:FS00:Uploading 172.91MiB to 155.247.166.220
15:27:41:WU00:FS00:Connecting to 155.247.166.220:8080
15:27:41:WARNING:WU01:FS00:Exception: Failed to send results to work server: Transfer failed
15:27:41:WU01:FS00:Trying to send results to collection server
15:27:41:WU01:FS00:Uploading 378.91MiB to 155.247.166.220
15:27:41:WU01:FS00:Connecting to 155.247.166.220:8080
15:27:42:WARNING:WU02:FS00:Exception: Failed to send results to work server: Transfer failed
15:27:42:WU02:FS00:Trying to send results to collection server
15:27:42:WU02:FS00:Uploading 159.71MiB to 128.252.203.4
15:27:42:WU02:FS00:Connecting to 128.252.203.4:8080
15:27:42:ERROR:WU00:FS00:Exception: Transfer failed
15:27:43:ERROR:WU01:FS00:Exception: Transfer failed
15:27:43:ERROR:WU02:FS00:Exception: Transfer failed
15:28:39:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:14189 run:6 clone:134 gen:2 core:0xa7 unit:0x000000040002894b5d543dafa435313a
15:28:39:WU00:FS00:Uploading 172.91MiB to 155.247.166.219
15:28:39:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:14175 run:0 clone:389 gen:4 core:0xa7 unit:0x000000050002894b5d65700e9d4ea3b4
15:28:39:WU00:FS00:Connecting to 155.247.166.219:8080
15:28:40:WU01:FS00:Uploading 378.91MiB to 155.247.166.219
15:28:40:WU02:FS00:Sending unit results: id:02 state:SEND error:NO_ERROR project:14189 run:1 clone:275 gen:2 core:0xa7 unit:0x000000040002894b5d77e3f99e5d8b7b
15:28:40:WU01:FS00:Connecting to 155.247.166.219:8080
15:28:41:WU02:FS00:Uploading 159.71MiB to 155.247.166.219
15:28:41:WARNING:WU00:FS00:Exception: Failed to send results to work server: Transfer failed
15:28:41:WU00:FS00:Trying to send results to collection server
15:28:41:WU02:FS00:Connecting to 155.247.166.219:8080
15:28:41:WU00:FS00:Uploading 172.91MiB to 155.247.166.220
15:28:41:WU00:FS00:Connecting to 155.247.166.220:8080
15:28:42:WARNING:WU01:FS00:Exception: Failed to send results to work server: Transfer failed
15:28:42:WU01:FS00:Trying to send results to collection server
15:28:42:WU01:FS00:Uploading 378.91MiB to 155.247.166.220
15:28:42:WU01:FS00:Connecting to 155.247.166.220:8080
15:28:42:WARNING:WU02:FS00:Exception: Failed to send results to work server: Transfer failed
15:28:42:WU02:FS00:Trying to send results to collection server
15:28:43:WU02:FS00:Uploading 159.71MiB to 128.252.203.4
15:28:43:ERROR:WU00:FS00:Exception: Transfer failed
15:28:43:WU02:FS00:Connecting to 128.252.203.4:8080
15:28:43:ERROR:WU01:FS00:Exception: Transfer failed
15:28:44:ERROR:WU02:FS00:Exception: Transfer failed
15:28:47:FS00:Unpaused
15:28:47:WU03:FS00:Starting
15:28:47:WU03:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Ron/AppData/Roaming/FAHClient/cores/cores.foldingathome.org/Win32/x86/Core_a7.fah/FahCore_a7.exe -dir 03 -suffix 01 -version 704 -lifeline 2384 -checkpoint 15 -np 4
15:28:47:WU03:FS00:Started FahCore on PID 2936
15:28:48:WU03:FS00:Core PID:2948
15:28:48:WU03:FS00:FahCore 0xa7 started
15:28:52:WU03:FS00:0xa7:*********************** Log Started 2019-09-24T15:28:51Z ***********************
15:28:52:WU03:FS00:0xa7:************************** Gromacs Folding@home Core ***************************
15:28:52:WU03:FS00:0xa7: Type: 0xa7
15:28:52:WU03:FS00:0xa7: Core: Gromacs
15:28:52:WU03:FS00:0xa7: Website: https://foldingathome.org/
15:28:52:WU03:FS00:0xa7: Copyright: (c) 2009-2018 foldingathome.org
15:28:52:WU03:FS00:0xa7: Author: Joseph Coffland <joseph@cauldrondevelopment.com>
15:28:52:WU03:FS00:0xa7: Args: -dir 03 -suffix 01 -version 704 -lifeline 2936 -checkpoint 15 -np 4
15:28:52:WU03:FS00:0xa7: Config: <none>
15:28:52:WU03:FS00:0xa7:************************************ Build *************************************
15:28:52:WU03:FS00:0xa7: Version: 0.0.17
15:28:52:WU03:FS00:0xa7: Date: Apr 25 2018
15:28:52:WU03:FS00:0xa7: Time: 11:02:26
15:28:52:WU03:FS00:0xa7: Repository: Git
15:28:52:WU03:FS00:0xa7: Revision: fd11abfb405c921e66db1226933e9dd2d18d2acc
15:28:52:WU03:FS00:0xa7: Branch: master
15:28:52:WU03:FS00:0xa7: Compiler: Visual C++ 2008
15:28:52:WU03:FS00:0xa7: Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
15:28:52:WU03:FS00:0xa7: Platform: win32 10
15:28:52:WU03:FS00:0xa7: Bits: 32
15:28:52:WU03:FS00:0xa7: Mode: Release
15:28:52:WU03:FS00:0xa7: SIMD: sse2
15:28:52:WU03:FS00:0xa7:************************************ System ************************************
15:28:52:WU03:FS00:0xa7: CPU: Unknown
15:28:52:WU03:FS00:0xa7: CPU ID:
15:28:52:WU03:FS00:0xa7: CPUs: 4
15:28:52:WU03:FS00:0xa7: Memory: 3.89GiB
15:28:52:WU03:FS00:0xa7:Free Memory: 3.12GiB
15:28:52:WU03:FS00:0xa7: Threads: WINDOWS_THREADS
15:28:52:WU03:FS00:0xa7: OS Version: 6.1
15:28:52:WU03:FS00:0xa7:Has Battery: false
15:28:52:WU03:FS00:0xa7: On Battery: false
15:28:52:WU03:FS00:0xa7: UTC Offset: -4
15:28:52:WU03:FS00:0xa7: PID: 2948
15:28:52:WU03:FS00:0xa7: CWD: C:\Users\Ron\AppData\Roaming\FAHClient\work
15:28:52:WU03:FS00:0xa7: OS: Windows 7 Home Premium
15:28:52:WU03:FS00:0xa7: OS Arch: AMD64
15:28:52:WU03:FS00:0xa7:********************************************************************************
15:28:52:WU03:FS00:0xa7:Project: 13823 (Run 270, Clone 0, Gen 98)
15:28:52:WU03:FS00:0xa7:Unit: 0x0000008080fccb095c8ff668f76f5418
15:28:52:WU03:FS00:0xa7:Digital signatures verified
15:28:52:WU03:FS00:0xa7:Calling: mdrun -s frame98.tpr -o frame98.trr -x frame98.xtc -cpi state.cpt -cpt 15 -nt 4
15:29:14:WU03:FS00:0xa7:Steps: first=12250000 total=125000
15:29:25:WU03:FS00:0xa7:Completed 53312 out of 125000 steps (42%)
15:29:32:Removing old file 'configs/config-20190705-171545.xml'
15:29:32:Saving configuration to config.xml
15:29:32:<config>
15:29:32: <!-- Network -->
15:29:32: <proxy v=':8080'/>
15:29:32:
15:29:32: <!-- Slot Control -->
15:29:32: <power v='FULL'/>
15:29:32:
15:29:32: <!-- User Information -->
15:29:32: <passkey v='********************************'/>
15:29:32: <team v='4'/>
15:29:32: <user v='rewron'/>
15:29:32:
15:29:32: <!-- Folding Slots -->
15:29:32: <slot id='0' type='CPU'/>
15:29:32:</config>
15:29:39:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:14189 run:6 clone:134 gen:2 core:0xa7 unit:0x000000040002894b5d543dafa435313a
15:29:39:WU00:FS00:Uploading 172.91MiB to 155.247.166.219
15:29:39:WU00:FS00:Connecting to 155.247.166.219:8080
15:29:39:WARNING:WU00:FS00:Exception: Failed to send results to work server: Transfer failed
15:29:39:WU00:FS00:Trying to send results to collection server
15:29:40:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:14175 run:0 clone:389 gen:4 core:0xa7 unit:0x000000050002894b5d65700e9d4ea3b4
15:29:40:WU00:FS00:Uploading 172.91MiB to 155.247.166.220
15:29:40:WU00:FS00:Connecting to 155.247.166.220:8080
15:29:40:WU01:FS00:Uploading 378.91MiB to 155.247.166.219
15:29:40:WU01:FS00:Connecting to 155.247.166.219:8080
15:29:40:ERROR:WU00:FS00:Exception: Transfer failed
15:29:40:WU02:FS00:Sending unit results: id:02 state:SEND error:NO_ERROR project:14189 run:1 clone:275 gen:2 core:0xa7 unit:0x000000040002894b5d77e3f99e5d8b7b
15:29:40:WU02:FS00:Uploading 159.71MiB to 155.247.166.219
15:29:40:WU02:FS00:Connecting to 155.247.166.219:8080
15:29:41:WARNING:WU01:FS00:Exception: Failed to send results to work server: Transfer failed
15:29:41:WU01:FS00:Trying to send results to collection server
15:29:41:WU01:FS00:Uploading 378.91MiB to 155.247.166.220
15:29:41:WU01:FS00:Connecting to 155.247.166.220:8080
15:29:42:WARNING:WU02:FS00:Exception: Failed to send results to work server: Transfer failed
15:29:42:WU02:FS00:Trying to send results to collection server
15:29:42:WU02:FS00:Uploading 159.71MiB to 128.252.203.4
15:29:42:WU02:FS00:Connecting to 128.252.203.4:8080
15:29:42:ERROR:WU01:FS00:Exception: Transfer failed
15:30:01:WU02:FS00:Upload 0.04%
15:30:01:ERROR:WU02:FS00:Exception: Transfer failed
15:31:16:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:14189 run:6 clone:134 gen:2 core:0xa7 unit:0x000000040002894b5d543dafa435313a
15:31:16:WU00:FS00:Uploading 172.91MiB to 155.247.166.219
15:31:16:WU00:FS00:Connecting to 155.247.166.219:8080
15:31:17:WARNING:WU00:FS00:Exception: Failed to send results to work server: Transfer failed
15:31:17:WU00:FS00:Trying to send results to collection server
15:31:17:WU00:FS00:Uploading 172.91MiB to 155.247.166.220
15:31:17:WU00:FS00:Connecting to 155.247.166.220:8080
15:31:17:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:14175 run:0 clone:389 gen:4 core:0xa7 unit:0x000000050002894b5d65700e9d4ea3b4
15:31:17:WU01:FS00:Uploading 378.91MiB to 155.247.166.219
15:31:17:WU01:FS00:Connecting to 155.247.166.219:8080
15:31:17:WU02:FS00:Sending unit results: id:02 state:SEND error:NO_ERROR project:14189 run:1 clone:275 gen:2 core:0xa7 unit:0x000000040002894b5d77e3f99e5d8b7b
15:31:18:WU02:FS00:Uploading 159.71MiB to 155.247.166.219
15:31:18:WU02:FS00:Connecting to 155.247.166.219:8080
15:31:18:WARNING:WU01:FS00:Exception: Failed to send results to work server: Transfer failed
15:31:18:WU01:FS00:Trying to send results to collection server
15:31:18:WU01:FS00:Uploading 378.91MiB to 155.247.166.220
15:31:18:WU01:FS00:Connecting to 155.247.166.220:8080
15:31:18:ERROR:WU00:FS00:Exception: Transfer failed
15:31:19:WARNING:WU02:FS00:Exception: Failed to send results to work server: Transfer failed
15:31:19:WU02:FS00:Trying to send results to collection server
15:31:19:WU02:FS00:Uploading 159.71MiB to 128.252.203.4
15:31:19:WU02:FS00:Connecting to 128.252.203.4:8080
15:31:19:ERROR:WU01:FS00:Exception: Transfer failed
15:31:38:WU02:FS00:Upload 0.04%
15:31:38:ERROR:WU02:FS00:Exception: Transfer failed
15:32:52:WU03:FS00:0xa7:Completed 53750 out of 125000 steps (43%)
15:33:53:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:14189 run:6 clone:134 gen:2 core:0xa7 unit:0x000000040002894b5d543dafa435313a
15:33:54:WU00:FS00:Uploading 172.91MiB to 155.247.166.219
15:33:54:WU00:FS00:Connecting to 155.247.166.219:8080
15:33:54:WARNING:WU00:FS00:Exception: Failed to send results to work server: Transfer failed
15:33:54:WU00:FS00:Trying to send results to collection server
15:33:54:WU00:FS00:Uploading 172.91MiB to 155.247.166.220
15:33:54:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:14175 run:0 clone:389 gen:4 core:0xa7 unit:0x000000050002894b5d65700e9d4ea3b4
15:33:54:WU00:FS00:Connecting to 155.247.166.220:8080
15:33:55:WU01:FS00:Uploading 378.91MiB to 155.247.166.219
15:33:55:WU01:FS00:Connecting to 155.247.166.219:8080
15:33:55:WU02:FS00:Sending unit results: id:02 state:SEND error:NO_ERROR project:14189 run:1 clone:275 gen:2 core:0xa7 unit:0x000000040002894b5d77e3f99e5d8b7b
15:33:55:WU02:FS00:Uploading 159.71MiB to 155.247.166.219
15:33:55:WU02:FS00:Connecting to 155.247.166.219:8080
15:33:55:WARNING:WU01:FS00:Exception: Failed to send results to work server: Transfer failed
15:33:55:WU01:FS00:Trying to send results to collection server
15:33:55:WU01:FS00:Uploading 378.91MiB to 155.247.166.220
15:33:55:ERROR:WU00:FS00:Exception: Transfer failed
15:33:55:WU01:FS00:Connecting to 155.247.166.220:8080
15:33:56:WARNING:WU02:FS00:Exception: Failed to send results to work server: Transfer failed
15:33:56:WU02:FS00:Trying to send results to collection server
15:33:56:WU02:FS00:Uploading 159.71MiB to 128.252.203.4
15:33:56:WU02:FS00:Connecting to 128.252.203.4:8080
15:33:57:ERROR:WU01:FS00:Exception: Transfer failed
15:33:58:ERROR:WU02:FS00:Exception: Transfer failed
I've had two Linux and two Windows 10 clients hang on different days in the last week while downloading a new work unit. Looking at the log (at normal verbosity), the work unit just hangs at some point beyond 50% completion. The download never completes so the client slot sits ready and awaiting work.
That got my attention. All my machines have two or more GPUs, so it's not likely to be an obvious comms problem. Assume all the FAH and OS are up to date. So far, it's a small percentage of the 125-145 units a day my farm completes.
I noticed that recent work units are approaching 70MB in size -- far greater than I recall.
Is it possible that a more robust download failure recovery routine is needed given the many more packets that need to be received and acknowledged?
In Linux I see the same issues.
Either a down or upload fail.
Most of the time, this happens over wifi (with me), and when multiple cards are downloading / uploading WUs at the same time; one of them gets stalled.
Did you try pausing/unpausing, or were you forced to remove and re-add the slot without restart? (I know you can restart the service as well, but if you don't want to pause the other WUs it's not recommended).
When were these WUs assigned to your machine? How long have these WUs been on your system?
All WUs have a deadline and normally a WU which expires is deleted by the client. If, for some reason, the client was unable to delete them at that time, they will never upload. In fact, the first two of those three WUs was completed some time ago.
My current theory is that these are copies that have expired and simply are no longer accepted -- but I'd need more information from your logs to confirm that suspicion.
project:14189 run:6 clone:134 gen:2 was completed 2019-09-11 11:45:21.
project:14175 run:0 clone:389 gen:4 was completed 2019-09-10 02:15:18
project:14189 run:1 clone:275 gen:2 is a bit strange because I can find no record of it. In fact, the preceding WU, project:14189 run:1 clone:275 gen:1, was returned 2019-09-21 07:15:16 so gen:2 would have been issued soon after that and it apparently has not been returned, nor has it expired. I'll have to dig deeper.
Besides what Bruce has mentioned, I would suggest upgrading to the current version of the folding client, 7.5.1. It does have some improvements in the network connection code over the 7.4.4 version you are running.
iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
01:08:38:WU00:FS02:Download 29.44%
01:08:48:WU00:FS02:Download 31.47%
01:09:33:WU00:FS02:Download 33.50%
01:10:57:WU01:FS01:0x21:Completed 7875000 out of 12500000 steps (63%)
01:14:14:WU01:FS01:0x21:Completed 8000000 out of 12500000 steps (64%)
01:17:32:WU01:FS01:0x21:Completed 8125000 out of 12500000 steps (65%)
Okay, I see where the download of a WU for your CPU slot, FS02, stalled and never completed. The usual fix if the client doesn't detect the stall, is a restart of the FAHClient process after pausing the slots. In my experience, 7.5.1 often detects such a stall, but it can take 15-30 minutes before trying again.
iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
Yes; rebooting is the usual workaround, but it has not worked for 4x reboots over 12+ hours. My CPU slot is still currently idle. Over the last week there's been slow uploads to this server, as well. A data set that is approx 15 Mb is taking 30+ minutes to upload.
To anyone having problems with uploading and/or downloading and restarting/rebooting has not helped please reboot your router (if you're using one) or your modem. I noticed after a stuck download that my speeds were drastically reduced (75/10 to 13/2) and turning off my router for ~30 seconds brought all my speeds back to 'normal'. I've notified the 'owner' of vav3 & vav4 (155.247.166.219/155.247.166.220) that he should also reboot his equipment. It appears that whenever a download/upload gets 'stuck' the router/modem suffers some temporary 'confusion' and rebooting clears that up.
Overnight, five of seven client machines stalled. Stop/Start Linux clients did not fix the stall; a restart fixed the stall.
The problem occurs on wireless (rebooted) and wired (router rebooted).
If I had a server log to examine at Stanford, it would be at 155.247.166.220, which seems to be awfully slow at downloads (2.5MB/min. to a 200Mbps router).
A Speedtest to San Jose (from Savannah) indicates 14 Mbps down and 10 Mbps up ... that's a problem. To Jacksonville, 16 up, 10 down. Next stop, Comcast.