Search found 12 matches

by wesgeorge
Sun May 11, 2025 12:33 pm
Forum: v8.4.9 Public Release for Windows / Linux / macOS
Topic: possible memory leak then 'resources not found'
Replies: 12
Views: 17804

Re: possible memory leak then 'resources not found'

Additional data from someone else experiencing this if it's helpful: there doesn't appear to be a way to attach an image, only link one hosted elsewhere, so no screenshot, but in task manager right now, running win11 pro 24H2, client 8.4.9, CPU only, I am seeing: Name PID Status User name CPU Memory...
by wesgeorge
Wed May 07, 2025 12:48 pm
Forum: GPU Projects and FahCores
Topic: client v8.4.9 GPU WUs not completing properly
Replies: 5
Views: 1425

Re: client v8.4.9 GPU WUs not completing properly

Is this happening every GPU WU? Are you behind some firewall, or network infrastructure which would block uploads? Is CPU doing anything during that time? Is there any network activity during the "waiting" period? That looks like some sort of compute instance running some docker, no? Thos...
by wesgeorge
Tue May 06, 2025 1:29 pm
Forum: GPU Projects and FahCores
Topic: client v8.4.9 GPU WUs not completing properly
Replies: 5
Views: 1425

client v8.4.9 GPU WUs not completing properly

I'm not sure if it's specific to one core/project or not, but I've been seeing this issue where the GPU will run the WU to completion, do the shutdown, but then not actually do the steps to finish, unload that WU and get a new one. In other words, it runs at full speed to completion, we get the foll...
by wesgeorge
Mon May 05, 2025 2:10 pm
Forum: v8.4.9 Public Release for Windows / Linux / macOS
Topic: My client cannot detect the GPU after reboot, and I need to manually restart the client
Replies: 7
Views: 1730

Re: My client cannot detect the GPU after reboot, and I need to manually restart the client

/lib/systemd/system/fah-client.service currently lists After=network.target nss-lookup.target WantedBy=multi-user.target I was looking at logs to try to figure out the race condition, saw this: -- Boot f99ecd5cd53542eabfb6e776bad81f18 -- May 05 09:01:17 deathstar systemd[1]: Started fah-client.servi...
by wesgeorge
Tue Mar 18, 2025 10:05 pm
Forum: Discussions of General-FAH topics
Topic: docker containers supported?
Replies: 1
Views: 3464

docker containers supported?

Is https://github.com/FoldingAtHome/containers still considered the right/mostly-official way to fold in docker? Hasn't seen an update in several years and it's starting to have issues with newer cores not working properly, especially for fah-gpu - kinda looking abandoned. I unfortunately don't have...
by wesgeorge
Mon Mar 17, 2025 3:52 pm
Forum: GPU Projects and FahCores
Topic: Core 0x24 WU_STALLED
Replies: 16
Views: 22728

Re: Core 0x24 WU_STALLED

ok, that is something that I can push via the support for fah-gpu since this is very likely a docker issue (I'm running Ubuntu22). I'd already asked about them rolling a rev that includes client v8 but if there's a more pressing issue they may need to update even for v7, I can try to handle that sep...
by wesgeorge
Mon Mar 17, 2025 2:44 pm
Forum: GPU Projects and FahCores
Topic: Core 0x24 WU_STALLED
Replies: 16
Views: 22728

Re: Core 0x24 WU_STALLED

ok, I did the link, file is showing up as expected: 2d0be0932f23:/fah$ cd /fah/cores/cores.foldingathome.org/openmm-core-24/centos-7.9.2009-64bit/release/0x24-8.1.4/Core_24.fah/ 2d0be0932f23:/fah/cores/cores.foldingathome.org/openmm-core-24/centos-7.9.2009-64bit/release/0x24-8.1.4/Core_24.fah$ ls -l...
by wesgeorge
Mon Mar 17, 2025 12:41 pm
Forum: GPU Projects and FahCores
Topic: Core 0x24 WU_STALLED
Replies: 16
Views: 22728

Re: Core 0x24 WU_STALLED

I appear to have multiple copies of it, including one that appears inside of docker. Best to use the one in /usr/lib (first result) ? ./usr/lib/x86_64-linux-gnu/libexpat.so.1 ./snap/core20/2496/usr/lib/x86_64-linux-gnu/libexpat.so.1 ./snap/core20/2434/usr/lib/x86_64-linux-gnu/libexpat.so.1 ./snap/sn...
by wesgeorge
Mon Mar 17, 2025 11:53 am
Forum: GPU Projects and FahCores
Topic: Core 0x24 WU_STALLED
Replies: 16
Views: 22728

Re: Core 0x24 WU_STALLED

libexpat1 seems to already be installed (edit for clarification: on the host OS), and the only link I'm aware of to make this work inside of docker is to link nvidia-container-runtime, which I've done. sudo apt install libexpat1 [sudo] password for wes: Reading package lists... Done Building depende...
by wesgeorge
Sun Mar 16, 2025 3:54 pm
Forum: GPU Projects and FahCores
Topic: Core 0x24 WU_STALLED
Replies: 16
Views: 22728

Re: Core 0x24 WU_STALLED

I dropped inside the docker container to run the requested commands. It looks like it doesn't respond with any output to the second command, just gives me back the prompt. wes@tantive-iv:~$ sudo docker exec -it 2d0be0932f23 /bin/bash 2d0be0932f23:/fah$ cd /fah/cores/cores.foldingathome.org/openmm-co...
by wesgeorge
Fri Mar 14, 2025 4:15 pm
Forum: GPU Projects and FahCores
Topic: Core 0x24 WU_STALLED
Replies: 16
Views: 22728

Re: Core 0x24 WU_STALLED

You see how you provide full info when WU is working. Please do so when WU is stalling, as your snippet of stalled WU log shows nothing informative :) I agree it's not very informative. I'd love to provide more info, but that's all it gives me. Either way, below is the whole log for multiple attemp...
by wesgeorge
Fri Mar 14, 2025 12:30 pm
Forum: GPU Projects and FahCores
Topic: Core 0x24 WU_STALLED
Replies: 16
Views: 22728

Core 0x24 WU_STALLED

Running a fah-gpu docker container (which I believe is built around fairly old NVIDIA driver/cuda libraries (460/11.2)), client 7.6.21 on Ubuntu 22.04.5 LTS (GNU/Linux 5.15.0-134-generic x86_64) with a Quadro K2200, Driver Version: 570.124.06, CUDA Version: 12.8. I've tried older drivers too. Some c...