Waiting on WS 54.157.202.86 mskcc1.foldingathome.org

Moderators: Site Moderators, FAHC Science Team

Post Reply
gordonbb
Posts: 511
Joined: Mon May 21, 2018 4:12 pm
Hardware configuration: Ubuntu 22.04.2 LTS; NVidia 525.60.11; 2 x 4070ti; 4070; 4060ti; 3x 3080; 3070ti; 3070
Location: Great White North

Waiting on WS 54.157.202.86 mskcc1.foldingathome.org

Post by gordonbb »

Had to work overtime today so I made the effort to come home on my dinner break and restart my slots folding after "finishing" them at the start of the expensive Time-of-use electricity rates this morning.

When I returned home this evening it was to 7/8 slots stuck: Waiting on "WS Assignment" Rebooted the stuck systems but still no joy.

All the stuck slots are all getting sent to WS 54.157.202.86 mskcc1.foldingathome.org

Here is the log from one of the systems:

Code: Select all

*********************** Log Started 2021-07-27T03:40:05Z ***********************
03:40:05:******************************* libFAH ********************************
03:40:05:           Date: Oct 8 2020
03:40:05:           Time: 19:34:47
03:40:05:       Revision: 06b99f7701e0d3f883dd14a78b459ad27da23809
03:40:05:         Branch: master
03:40:05:       Compiler: GNU 8.3.0
03:40:05:        Options: -std=c++11 -fsigned-char -ffunction-sections -fdata-sections
03:40:05:                 -O3 -funroll-loops -fno-pie
03:40:05:       Platform: linux2 5.8.0-1-amd64
03:40:05:           Bits: 64
03:40:05:           Mode: Release
03:40:05:****************************** FAHClient ******************************
03:40:05:        Version: 7.6.20
03:40:05:         Author: Joseph Coffland <joseph@cauldrondevelopment.com>
03:40:05:      Copyright: 2020 foldingathome.org
03:40:05:       Homepage: https://foldingathome.org/
03:40:05:           Date: Oct 12 2020
03:40:05:           Time: 22:00:41
03:40:05:       Revision: c858fe2a8342bfa3e116e00b394d8dfa322ecd18
03:40:05:         Branch: master
03:40:05:       Compiler: GNU 8.3.0
03:40:05:        Options: -std=c++11 -fsigned-char -ffunction-sections -fdata-sections
03:40:05:                 -O3 -funroll-loops -fno-pie
03:40:05:       Platform: linux2 5.8.0-1-amd64
03:40:05:           Bits: 64
03:40:05:           Mode: Release
03:40:05:           Args: --child /etc/fahclient/config.xml --run-as fahclient
03:40:05:                 --pid-file=/var/run/fahclient.pid --daemon
03:40:05:         Config: /etc/fahclient/config.xml
03:40:05:******************************** CBang ********************************
03:40:05:           Date: Oct 8 2020
03:40:05:           Time: 19:34:20
03:40:05:       Revision: ab0a6d9e35982b831a74cb2706c569fe46bac2af
03:40:05:         Branch: master
03:40:05:       Compiler: GNU 8.3.0
03:40:05:        Options: -std=c++11 -fsigned-char -ffunction-sections -fdata-sections
03:40:05:                 -O3 -funroll-loops -fno-pie -fPIC
03:40:05:       Platform: linux2 5.8.0-1-amd64
03:40:05:           Bits: 64
03:40:05:           Mode: Release
03:40:05:******************************* System ********************************
03:40:05:            CPU: AMD Ryzen 9 3950X 16-Core Processor
03:40:05:         CPU ID: AuthenticAMD Family 23 Model 113 Stepping 0
03:40:05:           CPUs: 32
03:40:05:         Memory: 31.37GiB
03:40:05:    Free Memory: 27.07GiB
03:40:05:        Threads: POSIX_THREADS
03:40:05:     OS Version: 5.4
03:40:05:    Has Battery: false
03:40:05:     On Battery: false
03:40:05:     UTC Offset: -4
03:40:05:            PID: 2062
03:40:05:            CWD: /var/lib/fahclient
03:40:05:             OS: Linux 5.4.0-74-generic x86_64
03:40:05:        OS Arch: AMD64
03:40:05:           GPUs: 2
03:40:05:          GPU 0: Bus:9 Slot:0 Func:0 NVIDIA:7 TU106 [GeForce RTX 2060 SUPER]
03:40:05:          GPU 1: Bus:10 Slot:0 Func:0 NVIDIA:8 TU104 [GeForce RTX 2070 SUPER]
03:40:05:                 8218
03:40:05:  CUDA Device 0: Platform:0 Device:0 Bus:10 Slot:0 Compute:7.5 Driver:11.2
03:40:05:  CUDA Device 1: Platform:0 Device:1 Bus:9 Slot:0 Compute:7.5 Driver:11.2
03:40:05:OpenCL Device 0: Platform:0 Device:0 Bus:10 Slot:0 Compute:1.2 Driver:460.84
03:40:05:OpenCL Device 1: Platform:0 Device:1 Bus:9 Slot:0 Compute:1.2 Driver:460.84
03:40:05:***********************************************************************
03:40:05:<config>
03:40:05:  <!-- Client Control -->
03:40:05:  <fold-anon v='true'/>
03:40:05:
03:40:05:  <!-- Folding Slot Configuration -->
03:40:05:  <cause v='COVID_19'/>
03:40:05:  <gpu v='false'/>
03:40:05:
03:40:05:  <!-- HTTP Server -->
03:40:05:  <allow v='127.0.0.1 *****************'/>
03:40:05:
03:40:05:  <!-- Network -->
03:40:05:  <proxy v=':8080'/>
03:40:05:
03:40:05:  <!-- Remote Command Server -->
03:40:05:  <command-allow-no-pass v='127.0.0.1 ********************'/>
03:40:05:
03:40:05:  <!-- Slot Control -->
03:40:05:  <pause-on-battery v='false'/>
03:40:05:  <pause-on-start v='true'/>
03:40:05:  <power v='full'/>
03:40:05:
03:40:05:  <!-- User Information -->
03:40:05:  <passkey v='*****'/>
03:40:05:  <team v='*********'/>
03:40:05:  <user v='************'/>
03:40:05:
03:40:05:  <!-- Folding Slots -->
03:40:05:  <slot id='1' type='GPU'>
03:40:05:    <pci-bus v='9'/>
03:40:05:    <pci-slot v='0'/>
03:40:05:  </slot>
03:40:05:  <slot id='0' type='GPU'>
03:40:05:    <pci-bus v='10'/>
03:40:05:    <pci-slot v='0'/>
03:40:05:  </slot>
03:40:05:</config>
03:40:05:Trying to access database...
03:40:05:Successfully acquired database lock
03:40:05:FS01:Initialized folding slot 01: gpu:9:0 TU106 [GeForce RTX 2060 SUPER] - PAUSED by user
03:40:05:FS00:Initialized folding slot 00: gpu:10:0 TU104 [GeForce RTX 2070 SUPER] 8218 - PAUSED by user
03:40:54:FS01:Unpaused
03:40:54:FS00:Unpaused
03:40:54:WU00:FS01:Connecting to assign1.foldingathome.org:80
03:40:54:WU00:FS01:Assigned to work server 54.157.202.86
03:40:54:WU00:FS01:Requesting new work unit for slot 01: gpu:9:0 TU106 [GeForce RTX 2060 SUPER] - READY from 54.157.202.86
03:40:54:WU00:FS01:Connecting to 54.157.202.86:8080
03:40:54:WU01:FS00:Connecting to assign1.foldingathome.org:80
03:40:54:WU01:FS00:Assigned to work server 54.157.202.86
03:40:54:WU01:FS00:Requesting new work unit for slot 00: gpu:10:0 TU104 [GeForce RTX 2070 SUPER] 8218 - READY from 54.157.202.86
03:40:54:WU01:FS00:Connecting to 54.157.202.86:8080
03:40:54:ERROR:WU00:FS01:Exception: Server did not assign work unit
03:40:55:WU00:FS01:Connecting to assign1.foldingathome.org:80
03:40:55:WU00:FS01:Assigned to work server 54.157.202.86
03:40:55:WU00:FS01:Requesting new work unit for slot 01: gpu:9:0 TU106 [GeForce RTX 2060 SUPER] - READY from 54.157.202.86
03:40:55:WU00:FS01:Connecting to 54.157.202.86:8080
03:40:55:ERROR:WU01:FS00:Exception: Server did not assign work unit
03:40:55:WU01:FS00:Connecting to assign1.foldingathome.org:80
03:40:55:WU01:FS00:Assigned to work server 54.157.202.86
03:40:55:WU01:FS00:Requesting new work unit for slot 00: gpu:10:0 TU104 [GeForce RTX 2070 SUPER] 8218 - READY from 54.157.202.86
03:40:55:WU01:FS00:Connecting to 54.157.202.86:8080
03:40:55:ERROR:WU00:FS01:Exception: Server did not assign work unit
03:40:56:ERROR:WU01:FS00:Exception: Server did not assign work unit
03:41:55:WU00:FS01:Connecting to assign1.foldingathome.org:80
03:41:55:WU00:FS01:Assigned to work server 54.157.202.86
03:41:55:WU00:FS01:Requesting new work unit for slot 01: gpu:9:0 TU106 [GeForce RTX 2060 SUPER] - READY from 54.157.202.86
03:41:55:WU00:FS01:Connecting to 54.157.202.86:8080
03:41:55:WU01:FS00:Connecting to assign1.foldingathome.org:80
03:41:55:WU01:FS00:Assigned to work server 54.157.202.86
03:41:55:WU01:FS00:Requesting new work unit for slot 00: gpu:10:0 TU104 [GeForce RTX 2070 SUPER] 8218 - READY from 54.157.202.86
03:41:55:WU01:FS00:Connecting to 54.157.202.86:8080
03:41:56:ERROR:WU00:FS01:Exception: Server did not assign work unit
03:41:56:ERROR:WU01:FS00:Exception: Server did not assign work unit
03:43:32:WU00:FS01:Connecting to assign1.foldingathome.org:80
03:43:32:WU00:FS01:Assigned to work server 54.157.202.86
03:43:32:WU00:FS01:Requesting new work unit for slot 01: gpu:9:0 TU106 [GeForce RTX 2060 SUPER] - READY from 54.157.202.86
03:43:32:WU00:FS01:Connecting to 54.157.202.86:8080
03:43:32:WU01:FS00:Connecting to assign1.foldingathome.org:80
03:43:33:WU01:FS00:Assigned to work server 54.157.202.86
03:43:33:WU01:FS00:Requesting new work unit for slot 00: gpu:10:0 TU104 [GeForce RTX 2070 SUPER] 8218 - READY from 54.157.202.86
03:43:33:WU01:FS00:Connecting to 54.157.202.86:8080
03:43:33:ERROR:WU00:FS01:Exception: Server did not assign work unit
03:43:33:ERROR:WU01:FS00:Exception: Server did not assign work unit
03:46:09:WU00:FS01:Connecting to assign1.foldingathome.org:80
03:46:09:WU00:FS01:Assigned to work server 54.157.202.86
03:46:09:WU00:FS01:Requesting new work unit for slot 01: gpu:9:0 TU106 [GeForce RTX 2060 SUPER] - READY from 54.157.202.86
03:46:09:WU00:FS01:Connecting to 54.157.202.86:8080
03:46:10:WU01:FS00:Connecting to assign1.foldingathome.org:80
03:46:10:ERROR:WU00:FS01:Exception: Server did not assign work unit
03:46:10:WU01:FS00:Assigned to work server 54.157.202.86
03:46:10:WU01:FS00:Requesting new work unit for slot 00: gpu:10:0 TU104 [GeForce RTX 2070 SUPER] 8218 - READY from 54.157.202.86
03:46:10:WU01:FS00:Connecting to 54.157.202.86:8080
03:46:10:ERROR:WU01:FS00:Exception: Server did not assign work unit
Image
gordonbb
Posts: 511
Joined: Mon May 21, 2018 4:12 pm
Hardware configuration: Ubuntu 22.04.2 LTS; NVidia 525.60.11; 2 x 4070ti; 4070; 4060ti; 3x 3080; 3070ti; 3070
Location: Great White North

Re: Waiting on WS 54.157.202.86 mskcc1.foldingathome.org

Post by gordonbb »

gordonbb wrote:Had to work overtime today so I made the effort to come home on my dinner break and restart my slots folding after "finishing" them at the start of the expensive Time-of-use electricity rates this morning.

When I returned home this evening it was to 7/8 slots stuck: Waiting on "WS Assignment" Rebooted the stuck systems but still no joy.

All the stuck slots are all getting sent to WS 54.157.202.86 mskcc1.foldingathome.org

Here is the log from one of the systems:

Code: Select all

*********************** Log Started 2021-07-27T03:40:05Z ***********************
03:40:05:******************************* libFAH ********************************
03:40:05:           Date: Oct 8 2020
03:40:05:           Time: 19:34:47
03:40:05:       Revision: 06b99f7701e0d3f883dd14a78b459ad27da23809
03:40:05:         Branch: master
03:40:05:       Compiler: GNU 8.3.0
03:40:05:        Options: -std=c++11 -fsigned-char -ffunction-sections -fdata-sections
03:40:05:                 -O3 -funroll-loops -fno-pie
03:40:05:       Platform: linux2 5.8.0-1-amd64
03:40:05:           Bits: 64
03:40:05:           Mode: Release
03:40:05:****************************** FAHClient ******************************
03:40:05:        Version: 7.6.20
03:40:05:         Author: Joseph Coffland <joseph@cauldrondevelopment.com>
03:40:05:      Copyright: 2020 foldingathome.org
03:40:05:       Homepage: https://foldingathome.org/
03:40:05:           Date: Oct 12 2020
03:40:05:           Time: 22:00:41
03:40:05:       Revision: c858fe2a8342bfa3e116e00b394d8dfa322ecd18
03:40:05:         Branch: master
03:40:05:       Compiler: GNU 8.3.0
03:40:05:        Options: -std=c++11 -fsigned-char -ffunction-sections -fdata-sections
03:40:05:                 -O3 -funroll-loops -fno-pie
03:40:05:       Platform: linux2 5.8.0-1-amd64
03:40:05:           Bits: 64
03:40:05:           Mode: Release
03:40:05:           Args: --child /etc/fahclient/config.xml --run-as fahclient
03:40:05:                 --pid-file=/var/run/fahclient.pid --daemon
03:40:05:         Config: /etc/fahclient/config.xml
03:40:05:******************************** CBang ********************************
03:40:05:           Date: Oct 8 2020
03:40:05:           Time: 19:34:20
03:40:05:       Revision: ab0a6d9e35982b831a74cb2706c569fe46bac2af
03:40:05:         Branch: master
03:40:05:       Compiler: GNU 8.3.0
03:40:05:        Options: -std=c++11 -fsigned-char -ffunction-sections -fdata-sections
03:40:05:                 -O3 -funroll-loops -fno-pie -fPIC
03:40:05:       Platform: linux2 5.8.0-1-amd64
03:40:05:           Bits: 64
03:40:05:           Mode: Release
03:40:05:******************************* System ********************************
03:40:05:            CPU: AMD Ryzen 9 3950X 16-Core Processor
03:40:05:         CPU ID: AuthenticAMD Family 23 Model 113 Stepping 0
03:40:05:           CPUs: 32
03:40:05:         Memory: 31.37GiB
03:40:05:    Free Memory: 27.07GiB
03:40:05:        Threads: POSIX_THREADS
03:40:05:     OS Version: 5.4
03:40:05:    Has Battery: false
03:40:05:     On Battery: false
03:40:05:     UTC Offset: -4
03:40:05:            PID: 2062
03:40:05:            CWD: /var/lib/fahclient
03:40:05:             OS: Linux 5.4.0-74-generic x86_64
03:40:05:        OS Arch: AMD64
03:40:05:           GPUs: 2
03:40:05:          GPU 0: Bus:9 Slot:0 Func:0 NVIDIA:7 TU106 [GeForce RTX 2060 SUPER]
03:40:05:          GPU 1: Bus:10 Slot:0 Func:0 NVIDIA:8 TU104 [GeForce RTX 2070 SUPER]
03:40:05:                 8218
03:40:05:  CUDA Device 0: Platform:0 Device:0 Bus:10 Slot:0 Compute:7.5 Driver:11.2
03:40:05:  CUDA Device 1: Platform:0 Device:1 Bus:9 Slot:0 Compute:7.5 Driver:11.2
03:40:05:OpenCL Device 0: Platform:0 Device:0 Bus:10 Slot:0 Compute:1.2 Driver:460.84
03:40:05:OpenCL Device 1: Platform:0 Device:1 Bus:9 Slot:0 Compute:1.2 Driver:460.84
03:40:05:***********************************************************************
03:40:05:<config>
03:40:05:  <!-- Client Control -->
03:40:05:  <fold-anon v='true'/>
03:40:05:
03:40:05:  <!-- Folding Slot Configuration -->
03:40:05:  <cause v='COVID_19'/>
03:40:05:  <gpu v='false'/>
03:40:05:
03:40:05:  <!-- HTTP Server -->
03:40:05:  <allow v='127.0.0.1 *****************'/>
03:40:05:
03:40:05:  <!-- Network -->
03:40:05:  <proxy v=':8080'/>
03:40:05:
03:40:05:  <!-- Remote Command Server -->
03:40:05:  <command-allow-no-pass v='127.0.0.1 ********************'/>
03:40:05:
03:40:05:  <!-- Slot Control -->
03:40:05:  <pause-on-battery v='false'/>
03:40:05:  <pause-on-start v='true'/>
03:40:05:  <power v='full'/>
03:40:05:
03:40:05:  <!-- User Information -->
03:40:05:  <passkey v='*****'/>
03:40:05:  <team v='*********'/>
03:40:05:  <user v='************'/>
03:40:05:
03:40:05:  <!-- Folding Slots -->
03:40:05:  <slot id='1' type='GPU'>
03:40:05:    <pci-bus v='9'/>
03:40:05:    <pci-slot v='0'/>
03:40:05:  </slot>
03:40:05:  <slot id='0' type='GPU'>
03:40:05:    <pci-bus v='10'/>
03:40:05:    <pci-slot v='0'/>
03:40:05:  </slot>
03:40:05:</config>
03:40:05:Trying to access database...
03:40:05:Successfully acquired database lock
03:40:05:FS01:Initialized folding slot 01: gpu:9:0 TU106 [GeForce RTX 2060 SUPER] - PAUSED by user
03:40:05:FS00:Initialized folding slot 00: gpu:10:0 TU104 [GeForce RTX 2070 SUPER] 8218 - PAUSED by user
03:40:54:FS01:Unpaused
03:40:54:FS00:Unpaused
03:40:54:WU00:FS01:Connecting to assign1.foldingathome.org:80
03:40:54:WU00:FS01:Assigned to work server 54.157.202.86
03:40:54:WU00:FS01:Requesting new work unit for slot 01: gpu:9:0 TU106 [GeForce RTX 2060 SUPER] - READY from 54.157.202.86
03:40:54:WU00:FS01:Connecting to 54.157.202.86:8080
03:40:54:WU01:FS00:Connecting to assign1.foldingathome.org:80
03:40:54:WU01:FS00:Assigned to work server 54.157.202.86
03:40:54:WU01:FS00:Requesting new work unit for slot 00: gpu:10:0 TU104 [GeForce RTX 2070 SUPER] 8218 - READY from 54.157.202.86
03:40:54:WU01:FS00:Connecting to 54.157.202.86:8080
03:40:54:ERROR:WU00:FS01:Exception: Server did not assign work unit
03:40:55:WU00:FS01:Connecting to assign1.foldingathome.org:80
03:40:55:WU00:FS01:Assigned to work server 54.157.202.86
03:40:55:WU00:FS01:Requesting new work unit for slot 01: gpu:9:0 TU106 [GeForce RTX 2060 SUPER] - READY from 54.157.202.86
03:40:55:WU00:FS01:Connecting to 54.157.202.86:8080
03:40:55:ERROR:WU01:FS00:Exception: Server did not assign work unit
03:40:55:WU01:FS00:Connecting to assign1.foldingathome.org:80
03:40:55:WU01:FS00:Assigned to work server 54.157.202.86
03:40:55:WU01:FS00:Requesting new work unit for slot 00: gpu:10:0 TU104 [GeForce RTX 2070 SUPER] 8218 - READY from 54.157.202.86
03:40:55:WU01:FS00:Connecting to 54.157.202.86:8080
03:40:55:ERROR:WU00:FS01:Exception: Server did not assign work unit
03:40:56:ERROR:WU01:FS00:Exception: Server did not assign work unit
03:41:55:WU00:FS01:Connecting to assign1.foldingathome.org:80
03:41:55:WU00:FS01:Assigned to work server 54.157.202.86
03:41:55:WU00:FS01:Requesting new work unit for slot 01: gpu:9:0 TU106 [GeForce RTX 2060 SUPER] - READY from 54.157.202.86
03:41:55:WU00:FS01:Connecting to 54.157.202.86:8080
03:41:55:WU01:FS00:Connecting to assign1.foldingathome.org:80
03:41:55:WU01:FS00:Assigned to work server 54.157.202.86
03:41:55:WU01:FS00:Requesting new work unit for slot 00: gpu:10:0 TU104 [GeForce RTX 2070 SUPER] 8218 - READY from 54.157.202.86
03:41:55:WU01:FS00:Connecting to 54.157.202.86:8080
03:41:56:ERROR:WU00:FS01:Exception: Server did not assign work unit
03:41:56:ERROR:WU01:FS00:Exception: Server did not assign work unit
03:43:32:WU00:FS01:Connecting to assign1.foldingathome.org:80
03:43:32:WU00:FS01:Assigned to work server 54.157.202.86
03:43:32:WU00:FS01:Requesting new work unit for slot 01: gpu:9:0 TU106 [GeForce RTX 2060 SUPER] - READY from 54.157.202.86
03:43:32:WU00:FS01:Connecting to 54.157.202.86:8080
03:43:32:WU01:FS00:Connecting to assign1.foldingathome.org:80
03:43:33:WU01:FS00:Assigned to work server 54.157.202.86
03:43:33:WU01:FS00:Requesting new work unit for slot 00: gpu:10:0 TU104 [GeForce RTX 2070 SUPER] 8218 - READY from 54.157.202.86
03:43:33:WU01:FS00:Connecting to 54.157.202.86:8080
03:43:33:ERROR:WU00:FS01:Exception: Server did not assign work unit
03:43:33:ERROR:WU01:FS00:Exception: Server did not assign work unit
03:46:09:WU00:FS01:Connecting to assign1.foldingathome.org:80
03:46:09:WU00:FS01:Assigned to work server 54.157.202.86
03:46:09:WU00:FS01:Requesting new work unit for slot 01: gpu:9:0 TU106 [GeForce RTX 2060 SUPER] - READY from 54.157.202.86
03:46:09:WU00:FS01:Connecting to 54.157.202.86:8080
03:46:10:WU01:FS00:Connecting to assign1.foldingathome.org:80
03:46:10:ERROR:WU00:FS01:Exception: Server did not assign work unit
03:46:10:WU01:FS00:Assigned to work server 54.157.202.86
03:46:10:WU01:FS00:Requesting new work unit for slot 00: gpu:10:0 TU104 [GeForce RTX 2070 SUPER] 8218 - READY from 54.157.202.86
03:46:10:WU01:FS00:Connecting to 54.157.202.86:8080
03:46:10:ERROR:WU01:FS00:Exception: Server did not assign work unit
The fix was to change my Client Preference from "COVID" to "Any" and reboot. 7/8 slots now on other Work Servers but one assigned to 54.157.202.86 mskcc1.foldingathome.org and still stuck waiting
Image
Neil-B
Posts: 1996
Joined: Sun Mar 22, 2020 5:52 pm
Hardware configuration: 1: 2x Xeon E5-2697v3@2.60GHz, 512GB DDR4 LRDIMM, SSD Raid, Win10 Ent 20H2, Quadro K420 1GB, FAH 7.6.21
2: Xeon E3-1505Mv5@2.80GHz, 32GB DDR4, NVME, Win10 Pro 20H2, Quadro M1000M 2GB, FAH 7.6.21 (actually have two of these)
3: i7-960@3.20GHz, 12GB DDR3, SSD, Win10 Pro 20H2, GTX 750Ti 2GB, GTX 1080Ti 11GB, FAH 7.6.21
Location: UK

Re: Waiting on WS 54.157.202.86 mskcc1.foldingathome.org

Post by Neil-B »

Seeing the same issue ... guess something needs kicking (AS/WS)
2x Xeon E5-2697v3, 512GB DDR4 LRDIMM, SSD Raid, W10-Ent, Quadro K420
Xeon E3-1505Mv5, 32GB DDR4, NVME, W10-Pro, Quadro M1000M
i7-960, 12GB DDR3, SSD, W10-Pro, GTX1080Ti
i9-10850K, 64GB DDR4, NVME, W11-Pro, RTX3070

(Green/Bold = Active)
prcowley
Posts: 28
Joined: Thu Jan 03, 2019 11:03 pm
Hardware configuration: Op Sys: Linux Ubuntu Studio 24.04 LTS
Kernal: 6.8.0-45-lowlatency (64-bit)
Proc: 16x AMD Ryzen 7 7800X3D 8-Core Processor
Mem: 32 GB
GPU: NVIDIA GeForce RTX 4080 SUPER/PCIe/SSE2
Location: Gisborne, New Zealand
Contact:

Re: Waiting on WS 54.157.202.86 mskcc1.foldingathome.org

Post by prcowley »

Seeing the same issue.

Checked the foldingathome stats and:

GRO_A8 753,710 0 753,710 46,056.00/hr
GRO_A7 141,425 6,122 147,547 20,964.00/hr
OPENMM_22 445,051 11,137 446,046 9,960.00/hr
Totals 1,340,177 17,259 1,347,294 80,639.99/hr

Although there are 446,046 OpenMM_22 jobs available they dont seem to be being given out.
However CPU jobs are being served from another server.
Pete Cowley, Gisborne, New Zealand. The first city to see the light of the new day. :D
Image
alien88
Posts: 10
Joined: Mon Apr 13, 2020 1:37 am

Re: Waiting on WS 54.157.202.86 mskcc1.foldingathome.org

Post by alien88 »

All my machines are failing to get GPU workunits for 12 hours now because of this server :(
firedfly
Posts: 3
Joined: Tue May 31, 2011 7:47 pm

Re: Waiting on WS 54.157.202.86 mskcc1.foldingathome.org

Post by firedfly »

I've had a few GPUs stuck waiting for work from this work server as well. I was able to get work by changing the preferred cause for the affected slots. That changed the work server the slot was assigned to and the GPUs are now folding again.
gordonbb
Posts: 511
Joined: Mon May 21, 2018 4:12 pm
Hardware configuration: Ubuntu 22.04.2 LTS; NVidia 525.60.11; 2 x 4070ti; 4070; 4060ti; 3x 3080; 3070ti; 3070
Location: Great White North

Re: Waiting on WS 54.157.202.86 mskcc1.foldingathome.org

Post by gordonbb »

My Slots continued on happily overnight after I changed the Preferences to "Any" from "COVID"

One possible hint is that this server in the Stats shows JUST 1.34TB free considerably lower than all the other active servers so perhaps it has some Resource Constraints preventing it from assigning tasks.

I also noted that both the F@H main page and Post Era show a 0% completion rate for the latest Moonshot so perhaps some more tasty COVID WUs are being prepped for our GPUs to work on!
Image
gordonbb
Posts: 511
Joined: Mon May 21, 2018 4:12 pm
Hardware configuration: Ubuntu 22.04.2 LTS; NVidia 525.60.11; 2 x 4070ti; 4070; 4060ti; 3x 3080; 3070ti; 3070
Location: Great White North

Re: Waiting on WS 54.157.202.86 mskcc1.foldingathome.org

Post by gordonbb »

P.S. - My systems' production started dropping at almost exactly 22:00 EST yesterday evening if that helps isolate the issue.
Image
alien88
Posts: 10
Joined: Mon Apr 13, 2020 1:37 am

Re: Waiting on WS 54.157.202.86 mskcc1.foldingathome.org

Post by alien88 »

I got WUs from other servers, but now back to no GPU units being handed out in the last 4 hours for my machines by this WS...
Post Reply