I have a WU, Project 10501 (Run 182, Clone 0, Gen 1016), which completed execution over an hour ago. It appears to be stuck in "Send" mode. Here are the details:
Send ID (slot?) = 01 (There is another WU in "Download" status in slot 00, but it has not begun executing. It appears to be waiting for slot 01 to finish before it begins running.)
Progress = 100.00%
ETA = "Unknown"
Credit = "Unknown"
I also have a work unit with a 2.39 day ETA, Project 7809 (9, 307, 43), that is currently running in slot 02. Its progress currently shows 81.56% completion.
It appears that WU 10501 (182, 0, 1016) is hung and is not transmitting back to the server. What is the procedure for killing or deleting a WU that appears to be stuck in a particular slot?
Completed WU (Apparently) Hung In "Send" Mode
- Posts: 97
- Joined: Thu Dec 20, 2012 3:58 am
- Site Admin
- Posts: 7937
- Joined: Tue Apr 21, 2009 4:41 pm
- Hardware configuration: Mac Pro 2.8 quad 12 GB smp4, MacBook Pro 2.9 i7 8 GB smp2
- Location: W. MA
Re: Completed WU (Apparently) Hung In "Send" Mode
Please post your log. Include the beginning that shows the system configuration and the section that shows the end of processing on the WU. The WU might be hung, or it could just be slow in wrapping up its work files to send back. There is also a known bug in some versions of the client where it does not recover from a network error and retry sending a WU. But we need more information to do more than guess.
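If the log is long, a short script can pull out only the lines for one WU before you post them. The following is a minimal sketch, not a FAH tool: it assumes v7-style log lines of the form HH:MM:SS:WUnn:FSnn:message, with an optional WARNING: or ERROR: level in front of the WU tag, and the names LINE_RE and lines_for_wu are hypothetical. The system configuration lines at the start of the log carry no WU tag, so they still have to be copied separately.

```python
import re

# Assumed line shape (an assumption, not a documented format):
#   HH:MM:SS:[WARNING:|ERROR:]WUnn:FSnn:message
LINE_RE = re.compile(
    r"^(?P<time>\d{2}:\d{2}:\d{2}):"
    r"(?:(?P<level>WARNING|ERROR):)?"
    r"WU(?P<wu>\d{2}):FS(?P<fs>\d{2}):"
    r"(?P<msg>.*)$"
)


def lines_for_wu(log_lines, wu_id):
    """Return the log lines tagged with queue id `wu_id`, e.g. "01"."""
    selected = []
    for line in log_lines:
        text = line.strip()
        match = LINE_RE.match(text)
        # Keep only lines that parse and carry the requested WU tag.
        if match and match.group("wu") == wu_id:
            selected.append(text)
    return selected
```

Calling lines_for_wu(open("log.txt").read().splitlines(), "01") would return the lines for the WU in queue slot 01, where log.txt is a placeholder for wherever your client writes its log.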
iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
- Posts: 887
- Joined: Wed May 26, 2010 2:31 pm
- Hardware configuration: Atom330 (overclocked):
Windows 7 Ultimate 64bit
Intel Atom330 dualcore (4 HyperThreads)
NVidia GT430, core_15 work
2x2GB Kingston KVR1333D3N9K2/4G 1333MHz memory kit
Asus AT3IONT-I Deluxe motherboard
- Location: Finland
Re: Completed WU (Apparently) Hung In "Send" Mode
Just wait a while. Some servers are down at the moment; see viewtopic.php?f=18&t=23759.
Win7 64bit, FAH v7, OC'd
2C/4T Atom330 3x667MHz - GT430 2x832.5MHz - ION iGPU 3x466.7MHz
NaCl - Core_15 - display
- Posts: 97
- Joined: Thu Dec 20, 2012 3:58 am
Re: Completed WU (Apparently) Hung In "Send" Mode
Napoleon:
Yep, that appears to be the hangup - a downed server. Thanks for the info.
01:11:37:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:10501 run:182 clone:0 gen:1016 core:0x11 unit:0x000008266652eda54b6ea7d300004719
01:11:37:WU01:FS00:Uploading 128.13KiB to 171.67.108.21
01:11:37:WU01:FS00:Connecting to 171.67.108.21:8080
01:11:58:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
01:11:58:WU01:FS00:Connecting to 171.67.108.21:80
01:12:19:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 171.67.108.21:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
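The log shows the client trying port 8080 first and then falling back to port 80. Anyone who wants to confirm from their own machine that a work server is unreachable, rather than the WU being hung, could run a check along these lines. This is only a sketch of the same try-in-order idea, not the client's actual upload code, and first_reachable_port is a hypothetical helper name.

```python
import socket


def first_reachable_port(host, ports=(8080, 80), timeout=5.0):
    """Try each port in order; return the first that accepts a TCP
    connection, or None if every attempt fails or times out."""
    for port in ports:
        try:
            # create_connection raises OSError (including timeouts) on failure.
            with socket.create_connection((host, port), timeout=timeout):
                return port
        except OSError:
            continue
    return None
```

Calling first_reachable_port("171.67.108.21") returns None while neither port answers. A result of None only says that a TCP connection could not be opened from your machine; it does not say why.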
Re: Completed WU (Apparently) Hung In "Send" Mode
I'm getting the same problem on 171.67.108.11
Server status shows both 171.67.108.11 and 171.67.108.21 down (vsp07v and vsp07b)
Last edited by Ripper36 on Fri Feb 22, 2013 3:49 am, edited 1 time in total.
Re: Completed WU (Apparently) Hung In "Send" Mode
All of the vsp07* servers are down. The Pande Group was notified much earlier.
I'm not sure of the nature of the problem, but they'll fix it whenever they can.
Posting FAH's log:
How to provide enough info to get helpful support.
- Posts: 97
- Joined: Thu Dec 20, 2012 3:58 am
Re: Completed WU (Apparently) Hung In "Send" Mode
I usually have two slots simultaneously executing WUs on my machine - one is a GPU slot and the other is an SMP slot. It's now been close to 18 hours with my machine "hung" and getting no production out of one of the slots. Can someone please tell me how to dump a non-executing slot so that I can (hopefully) get back to crunching work units on both slots?
Thanks!
- Site Admin
- Posts: 7937
- Joined: Tue Apr 21, 2009 4:41 pm
- Hardware configuration: Mac Pro 2.8 quad 12 GB smp4, MacBook Pro 2.9 i7 8 GB smp2
- Location: W. MA
Re: Completed WU (Apparently) Hung In "Send" Mode
You are misunderstanding the situation. The WU waiting to be uploaded is not keeping the GPU slot from processing work. The servers that have suitable WUs for your model of GPU are down, so your client cannot download a new WU to process in that slot. PG has been notified. Follow the other topic Napoleon linked to for any updates.
iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3