Get New GPU Task After Failed GPU Task

Jesse_V
Site Moderator
Posts: 2850
Joined: Mon Jul 18, 2011 4:44 am
Hardware configuration: OS: Windows 10, Kubuntu 19.04
CPU: i7-6700k
GPU: GTX 970, GTX 1080 TI
RAM: 24 GB DDR4
Location: Western Washington

Re: Get New GPU Task After Failed GPU Task

Post by Jesse_V »

Space45 wrote:I've updated the driver and a GPU task is running. Hopefully, I'll know whether it's succeeded soon.
I recommend that you keep an eye on the log. That way if something goes wrong you can pause the GPU slot to prevent many WUs from failing.
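As a rough illustration of what "keeping an eye on the log" could look like if you wanted to automate it, here is a minimal Python sketch. It assumes only that failures show up in the log as lines containing the GPU_MEMTEST_ERROR string reported in this thread. The function name, the thresholds, and the log file name in the usage note are hypothetical and not part of the FAH client.

```python
from collections import deque

# Error string as reported in this thread; everything else here is illustrative.
ERROR_MARKER = "GPU_MEMTEST_ERROR"


def should_pause(log_lines, max_failures=3, window=200):
    """Return True if the marker appears at least `max_failures` times
    within the last `window` lines of the log."""
    recent = deque(log_lines, maxlen=window)  # keeps only the newest lines
    failures = sum(1 for line in recent if ERROR_MARKER in line)
    return failures >= max_failures
```

You could feed it the lines of your client log (for example `open("log.txt").readlines()`, where the file name is an assumption) and pause the GPU slot by hand whenever it returns True.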
F@h is now the top computing platform on the planet and nothing unites people like a dedicated fight against a common enemy. This virus affects all of us. Let's end it together.
Space45
Posts: 17
Joined: Fri Mar 01, 2013 11:18 pm
Hardware configuration: Intel Core i7 950 3.04GHz (4 Physical Core, 8 Virtual Core)
ASUS Rampage III Extreme Motherboard
EVGA GeForce GTX 470

Re: Get New GPU Task After Failed GPU Task

Post by Space45 »

It successfully completed a GPU task, so it appears to be working now. Updating the driver appeared to fix everything.

Thanks,

Space45
Space45
Posts: 17
Joined: Fri Mar 01, 2013 11:18 pm
Hardware configuration: Intel Core i7 950 3.04GHz (4 Physical Core, 8 Virtual Core)
ASUS Rampage III Extreme Motherboard
EVGA GeForce GTX 470

Re: Get New GPU Task After Failed GPU Task

Post by Space45 »

Unfortunately, after a few GPU tasks completed successfully, I'm now getting GPU_MEMTEST_ERRORs again. Any suggestions?

Thanks,

Space45
Jesse_V
Site Moderator
Posts: 2850
Joined: Mon Jul 18, 2011 4:44 am
Hardware configuration: OS: Windows 10, Kubuntu 19.04
CPU: i7-6700k
GPU: GTX 970, GTX 1080 TI
RAM: 24 GB DDR4
Location: Western Washington

Re: Get New GPU Task After Failed GPU Task

Post by Jesse_V »

Space45 wrote:Unfortunately, after a few GPU tasks completed successfully, I'm now getting GPU_MEMTEST_ERRORs again. Any suggestions?
Per the documentation:
http://fah-web.stanford.edu/MemtestG80/ ... readme.txt
If you suspect that your graphics card is having issues (for example, it fails running Folding@home work units), we strongly recommend that you test as large a memory region as is practical, and run thousands of test iterations. In our testing, we have found that even "problematic" cards may only fail sporadically (e.g., once every 50,000 test iterations). Like other stress testing tools, to properly verify stability MemtestG80 should be run for an extended period of time.
It wouldn't hurt to try again and let it run for much longer. If MemtestG80 identifies a problem, then there may be something wrong with the card itself. Its detailed testing has helped me out before.
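To put a rough number on why a longer run helps, here is a small Python sketch. It takes the readme's example of a card that fails about once every 50,000 iterations and assumes, as a simplification, that each iteration fails independently at that constant rate. Real hardware faults may not behave that way, so treat the result as an order-of-magnitude guide only. The function name is made up for this example.

```python
def chance_of_clean_run(failure_rate, iterations):
    """Probability that a faulty card shows zero errors over `iterations`,
    assuming independent iterations with a constant per-iteration failure rate."""
    return (1.0 - failure_rate) ** iterations


# Readme example: roughly one failure per 50,000 iterations.
rate = 1 / 50_000
print(f"{chance_of_clean_run(rate, 50_000):.3f}")   # prints 0.368
print(f"{chance_of_clean_run(rate, 500_000):.6f}")  # ten times as many iterations
```

Under those assumptions, a card that really is faulty would still pass a 50,000-iteration run a little over a third of the time, which is why a single clean run is not conclusive.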
Space45
Posts: 17
Joined: Fri Mar 01, 2013 11:18 pm
Hardware configuration: Intel Core i7 950 3.04GHz (4 Physical Core, 8 Virtual Core)
ASUS Rampage III Extreme Motherboard
EVGA GeForce GTX 470

Re: Get New GPU Task After Failed GPU Task

Post by Space45 »

"Final error count after 50000 iterations over 100 MiB of GPU memory: 0 errors"
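If you save several MemtestG80 runs and want to compare them, the summary line quoted above can be picked apart with a short Python sketch like the one below. The pattern is based only on the single line shown in this post, so other MemtestG80 versions may word it differently. The function name is hypothetical.

```python
import re

# Pattern modelled on the one summary line quoted in this thread.
SUMMARY_RE = re.compile(
    r"Final error count after (\d+) iterations over (\d+) MiB of GPU memory: (\d+) errors"
)


def parse_summary(line):
    """Return iterations, tested MiB and error count as a dict,
    or None if the line does not match the expected wording."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    iterations, mib, errors = (int(group) for group in match.groups())
    return {"iterations": iterations, "mib": mib, "errors": errors}
```

For the line above this gives 50000 iterations, 100 MiB and 0 errors. Since the readme recommends testing as large a memory region as is practical, the tested size is worth watching alongside the error count.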
Bleeder
Posts: 69
Joined: Sat May 05, 2012 1:21 am

Re: Get New GPU Task After Failed GPU Task

Post by Bleeder »

A few months ago I was also getting the GPU_MEMTEST_ERROR failure. It turned out to be because I had Remote Desktop enabled: whenever I actually connected to the machine via RDP, the video driver reset and the F@H core died.

Are you using RDP to connect to the failing machine? If you are, the fix is very easy: disable Remote Desktop, reboot the machine, and never connect via RDP again.
Space45
Posts: 17
Joined: Fri Mar 01, 2013 11:18 pm
Hardware configuration: Intel Core i7 950 3.04GHz (4 Physical Core, 8 Virtual Core)
ASUS Rampage III Extreme Motherboard
EVGA GeForce GTX 470

Re: Get New GPU Task After Failed GPU Task

Post by Space45 »

Bleeder wrote:A few months ago I was also getting the GPU_MEMTEST_ERROR failure. It turned out to be because I had Remote Desktop enabled: whenever I actually connected to the machine via RDP, the video driver reset and the F@H core died.

Are you using RDP to connect to the failing machine? If you are, the fix is very easy: disable Remote Desktop, reboot the machine, and never connect via RDP again.
No, I'm using the computer itself. However, if those or similar services were enabled, would they affect Folding@Home?
P5-133XL
Posts: 2948
Joined: Sun Dec 02, 2007 4:36 am
Hardware configuration: Machine #1:

Intel Q9450; 2x2GB=8GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460; Windows Server 2008 X64 (SP1).

Machine #2:

Intel Q6600; 2x2GB=4GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460 video card; Windows 7 X64.

Machine 3:

Dell Dimension 8400, 3.2GHz P4 4x512MB Ram, Video card GTX 460, Windows 7 X32

I am currently folding just on the 5x GTX 460's for approx. 70K PPD
Location: Salem. OR USA

Re: Get New GPU Task After Failed GPU Task

Post by P5-133XL »

Microsoft Remote Desktop causes folding to fail when you connect. Alternatives like VNC do not have that issue.
Bleeder
Posts: 69
Joined: Sat May 05, 2012 1:21 am

Re: Get New GPU Task After Failed GPU Task

Post by Bleeder »

Space45 wrote:No, I'm using the computer itself. However, if those or similar services were enabled, would they affect Folding@Home?
Having it enabled is fine; just don't connect. But if you are not connecting via RDP, then unfortunately this is not the problem.
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Get New GPU Task After Failed GPU Task

Post by bruce »

Some screensavers have also been reported to reset the GPU, just like RDP. It can also happen if the computer sleeps.
Space45
Posts: 17
Joined: Fri Mar 01, 2013 11:18 pm
Hardware configuration: Intel Core i7 950 3.04GHz (4 Physical Core, 8 Virtual Core)
ASUS Rampage III Extreme Motherboard
EVGA GeForce GTX 470

Re: Get New GPU Task After Failed GPU Task

Post by Space45 »

bruce wrote:Some screensavers have also been reported to reset the GPU, just like RDP. It can also happen if the computer sleeps.
So could my computer sleeping due to inactivity cause the GPU tasks to fail?
mmonnin
Posts: 324
Joined: Wed Dec 05, 2007 1:27 am

Re: Get New GPU Task After Failed GPU Task

Post by mmonnin »

It very well could.
Space45
Posts: 17
Joined: Fri Mar 01, 2013 11:18 pm
Hardware configuration: Intel Core i7 950 3.04GHz (4 Physical Core, 8 Virtual Core)
ASUS Rampage III Extreme Motherboard
EVGA GeForce GTX 470

Re: Get New GPU Task After Failed GPU Task

Post by Space45 »

I looked at my power settings and my computer is set to never sleep.
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Get New GPU Task After Failed GPU Task

Post by bruce »

Space45 wrote:
bruce wrote:Some screensavers have also been reported to reset the GPU, just like RDP. It can also happen if the computer sleeps.
So could my computer sleeping due to inactivity cause the GPU tasks to fail?
Yes, but not always.

That's one reason why V7.3.6 disables sleeping. An enhancement is currently being discussed in ticket #988.
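For anyone curious how a program can ask Windows not to sleep while work is running, here is a rough Python sketch using the Win32 SetThreadExecutionState call through ctypes. This is not a claim about how the V7.3.6 client does it; it only illustrates the general mechanism. The function name keep_awake is invented for this example, and on anything other than Windows it simply does nothing.

```python
import ctypes
import sys

# Flag values from the Win32 SetThreadExecutionState documentation.
ES_CONTINUOUS = 0x80000000
ES_SYSTEM_REQUIRED = 0x00000001


def keep_awake(enable=True):
    """Ask Windows to keep the system awake (or release that request).
    Returns True if the call succeeded, False on failure or non-Windows systems."""
    if sys.platform != "win32":
        return False
    set_state = ctypes.windll.kernel32.SetThreadExecutionState
    set_state.argtypes = [ctypes.c_uint]
    set_state.restype = ctypes.c_uint
    flags = ES_CONTINUOUS | (ES_SYSTEM_REQUIRED if enable else 0)
    # The call returns the previous state, or 0 if it failed.
    return set_state(flags) != 0
```

The request applies to the calling thread and lapses when the process exits, so a program would call keep_awake(True) when work starts and keep_awake(False) when it stops.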
Jesse_V
Site Moderator
Posts: 2850
Joined: Mon Jul 18, 2011 4:44 am
Hardware configuration: OS: Windows 10, Kubuntu 19.04
CPU: i7-6700k
GPU: GTX 970, GTX 1080 TI
RAM: 24 GB DDR4
Location: Western Washington

Re: Get New GPU Task After Failed GPU Task

Post by Jesse_V »

Hmm. As Bruce mentioned earlier (viewtopic.php?f=61&t=23820#p238166), are you overclocking? What temperatures does the GPU reach?
If you don't have temperature monitoring software, there are a number of programs you could use. Common ones include GPU-Z and my personal favorite, Speccy.