Search found 73 matches

by Sparkly
Mon Aug 03, 2020 3:30 pm
Forum: Issues with a specific WU
Topic: 13421 gets to 100% then crashes
Replies: 16
Views: 4110

Re: 13421 gets to 100% then crashes

mwroggenbuck wrote:Unfortunately, the one thread limit does not work 100%. I have other posts on this forum about this suggestion. I ran it and still had issues. :(
The one core workaround is for this error only 0xc0000374, so if you have other error codes then it is something else.
by Sparkly
Mon Aug 03, 2020 10:11 am
Forum: Issues with a specific WU
Topic: 13421 gets to 100% then crashes
Replies: 16
Views: 4110

Re: 13421 gets to 100% then crashes

As far as I have seen this 0xc0000374 only happens on rigs (in Windows) where the CPU cores are multithreaded, so typically 4 core 8 thread, 8 core 16 thread etc., which indicate heap issues in the end process of the software and not memory issues. A workaround is to limit each FAHCore_22 process to...
by Sparkly
Fri Jul 31, 2020 6:49 pm
Forum: Issues with a specific WU
Topic: 13421 WUs with abnormally long runtime
Replies: 8
Views: 2271

Re: 13421 WUs with abnormally long runtime

How are you giving the gpu work units more cores? I am using a process manager https://www.bill2-software.com/processmanager/download.shtml to automatically reduce the number cores a FAHcore_22 process gets access to in the first place, so unless you have reduced the amount of cores the process get...
by Sparkly
Wed Jul 29, 2020 9:06 pm
Forum: Issues with a specific WU
Topic: 13421 WUs with abnormally long runtime
Replies: 8
Views: 2271

Re: 13421 WUs with abnormally long runtime

Both WUs ran on its own RX580 GPU for like 8-10 hours, reaching like 3%, when the remaining ETA was recorded, before giving each of the WUs more cores to play with, resulting in a more normal runtime and ETA of like 3h, and this was the only two WUs running at the time, since everything else was tur...
by Sparkly
Wed Jul 29, 2020 9:20 am
Forum: Issues with a specific WU
Topic: 13421 WUs with abnormally long runtime
Replies: 8
Views: 2271

13421 WUs with abnormally long runtime

PRCG numbers for some abnormally long runtime WUs In my case this is based on 1 x CPU core running 1 x FAHCore_22 13421 - 4444, 0, 0 - ETA 13 days 13421 – 3142, 11, 0 – ETA 11 days Tested what makes them move forward at some normal speed, and giving each of them 2 x CPU cores makes them rather happy...
by Sparkly
Sat Jul 25, 2020 1:37 am
Forum: Issues with a specific server
Topic: Ellesmere XT Cards Directed to Work Server 192.0.2.1
Replies: 23
Views: 5862

Re: 192.0.2.1 Work Server

GPUSpecies 4 is supposed to be supported. I don't know what happened. It'll take be a bit to undo a lot of today's work. Well, according to your list in the other thread the Radeon RX 470/480/570/580/590 is technically more Species 5, since all but the RX 470 is above 5 TFLOPS, while the RX 470 is ...
by Sparkly
Sat Jul 25, 2020 12:37 am
Forum: Issues with a specific server
Topic: Ellesmere XT Cards Directed to Work Server 192.0.2.1
Replies: 23
Views: 5862

Re: 192.0.2.1 Work Server

The fact that it just started to happen on several already running systems should probably tell you something. 23:27:43:Trying to access database... 23:27:43:Successfully acquired database lock 23:27:43:Read GPUs.txt 23:27:46:Enabled folding slot 01: READY gpu:0:Ellesmere XT [Radeon RX 470/480/570/5...
by Sparkly
Sat Jul 25, 2020 12:26 am
Forum: Issues with a specific server
Topic: Ellesmere XT Cards Directed to Work Server 192.0.2.1
Replies: 23
Views: 5862

Re: 192.0.2.1 Work Server

The log tells you where it is failing, the slots asks for a new WU assignment after finishing the previous one, then gets a WU assignment for 192.0.2.1, makes the directory in the work folder, but there is nothing to download, since the IP is invalid.
by Sparkly
Sat Jul 25, 2020 12:15 am
Forum: Issues with a specific server
Topic: Ellesmere XT Cards Directed to Work Server 192.0.2.1
Replies: 23
Views: 5862

Re: 192.0.2.1 Work Server

These setups have been running for months, and all the other, exactly the same GPUs, as can be seen in the log, are folding fine on the same machines at the same time in the other slots, so since this 192 thing suddenly happened on several different machines when requesting a new WU in the last few ...
by Sparkly
Fri Jul 24, 2020 11:37 pm
Forum: Issues with a specific server
Topic: Ellesmere XT Cards Directed to Work Server 192.0.2.1
Replies: 23
Views: 5862

192.0.2.1 Work Server

The Assignment Server is sending out Work Server requests for 192.0.2.1, which is for obvious reasons going to be an issue. 23:19:08:WU02:FS01:Connecting to assign1.foldingathome.org:80 23:19:08:WU02:FS01:Assigned to work server 192.0.2.1 23:19:08:WU02:FS01:Requesting new work unit for slot 01: READ...
by Sparkly
Thu Jul 23, 2020 8:43 am
Forum: Issues with a specific WU
Topic: 13416 WUs with abnormally long runtime
Replies: 1
Views: 1065

13416 WUs with abnormally long runtime

Let us just collect the PRCG numbers in one thread, since these abnormally long runtime WUs seem to arrive more and more over the last day. In my case this is 1 x CPU core running 1 x FAHCore_22 13416 – 807, 270, 1 – ETA 14 days 13416 - 1289, 20, 4 - ETA 9 days 13416 - 1226, 216, 1 - ETA 11 days 134...
by Sparkly
Tue Jul 21, 2020 10:04 am
Forum: V7.6.x Public Release Windows/Linux/MacOS X
Topic: Error handling in FAH
Replies: 19
Views: 4568

Re: Error handling in FAH

bruce wrote:We need to be talking about a single issue.
I am only talking about FAHCore_22 issues in Windows, so I don’t know if it is the same with Linux.

The 100% issue is a rather obvious heap thing, since doing the following affinity changes removes the issue completely:

Image
by Sparkly
Sun Jul 19, 2020 10:54 am
Forum: Issues with a specific WU
Topic: WU 13416 low ppd long run time
Replies: 44
Views: 13004

Re: WU 13416 low ppd long run time

FAHCore_22 does sometimes use more than one CPU thread per GPU. Studies have shown that FAHCore_22 can speed up the throughput of it's processing by using more CPU resources. Nobody has yet studied how much a CPU thread that's freed up from CPU assignments will add to the GPU PPD compared to the lo...
by Sparkly
Sun Jul 19, 2020 9:59 am
Forum: V7.6.x Public Release Windows/Linux/MacOS X
Topic: Error handling in FAH
Replies: 19
Views: 4568

Re: Error handling in FAH

It depends on whay you mean by "issues" What I mean by issues is that the software isn’t handling the multithread distribution over several different cores very gracefully, so when releasing heap addresses on 1 core, the software thread running on another core, since there are several sof...
by Sparkly
Sat Jul 18, 2020 12:20 pm
Forum: Issues with a specific WU
Topic: WU 13416 low ppd long run time
Replies: 44
Views: 13004

Re: WU 13416 low ppd long run time

Something is seriously fishy with the 13416 WUs and how they run on GPU systems, since one FULL CORE isn’t enough to keep them happy and running smoothly, they want to grab even more. The following is a combination of 4 WUs from a configuration with Affinity set, to show what is going on for each WU...