Page 2 of 2
Re: My Maxwells are getting 13000 & 13001, & failing
Posted: Fri Feb 27, 2015 9:53 pm
by Gary480six
runpaint,
You say it is Maxwell cards - but you do not say if it's Maxwell 1 (GTX750/750Ti) or Maxwell 2 (GTX970/980) hardware.
I just posted about this yesterday
here in another section of the forum.
And what Bruce is telling you was the exact solution for me - though I only have GTX750 and GTX750Ti cards.
Update your Nvidia video card drivers to the latest version. (I cannot confirm if this will help with GTX970/980 cards - but I assume so)
What I found, when looking back over the last two Months of my log files... was that this had been happening often. That I would get a P13000 or P13001 work unit and it would fail at 0%. The only difference was that I would get one 'bad' p13000, followed by five successful core 18 work units - so I never noticed.
It was only after getting 5+ bad P13000 work units in a row (and the FAILED warning), that the problem was revealed.
I updated my drivers and my PCs are now happily crunching away on P13000 and P13001 work units.
Re: My Maxwells are getting 13000 & 13001, & failing
Posted: Fri Feb 27, 2015 9:56 pm
by 7im
runpaint wrote:It's 0.0.52, I thought it updated automatically.
Yes, and no.
The work units that you fold each contain a "minimum required core version" setting. If you haven't folded any work units that require the newer .55 FAHCore version, then no upgrade was done. However, when you do fold your first work unit that requires the newer version, the client will download the newer version automatically. There are ways to induce the update, but that goes even further off the current topic.
Re: My Maxwells are getting 13000 & 13001, & failing
Posted: Fri Feb 27, 2015 10:02 pm
by bruce
FAH can automatically update the FahCore to a new version but that's not the issue here. The nVidia Driver version needs to be update and you have to do that.
Re: My Maxwells are getting 13000 & 13001, & failing
Posted: Fri Feb 27, 2015 10:23 pm
by Breach
7im wrote:runpaint wrote:It's 0.0.52, I thought it updated automatically.
Yes, and no.
The work units that you fold each contain a "minimum required core version" setting. If you haven't folded any work units that require the newer .55 FAHCore version, then no upgrade was done. However, when you do fold your first work unit that requires the newer version, the client will download the newer version automatically. There are ways to induce the update, but that goes even further off the current topic.
Perhaps it's worthwhile to also mention that the core version is also client-type specific - 0.0.55 is *not* an upgrade to 0.0.52 right now, but the beta version of the core.
With no flags 0.0.52 would be downloaded and installed, e.g. in my case here: C:\ProgramData\FAHClient\cores\web.stanford.edu\~pande\Win32\AMD64\NVIDIA\Fermi\Core_17.fah
Even if you switch to beta, 0.0.55 would be downloaded and installed: C:\ProgramData\FAHClient\cores\web.stanford.edu\~pande\Win32\AMD64\NVIDIA\Fermi\beta\Core_17.fah
However, if you remove the beta flag then the non-beta (0.0.52) version will be used - not the beta one, even if it's newer and still there.
Re: My Maxwells are getting 13000 & 13001, & failing
Posted: Fri Feb 27, 2015 10:41 pm
by kyleb
I've reverted projects 13000 and 13001 to beta only for now.
*EDIT* no longer in beta, see next post
Re: My Maxwells are getting 13000 & 13001, & failing
Posted: Sat Feb 28, 2015 6:16 pm
by kyleb
OK, after looking further I've restricted these projects to non-maxwell cards.
Re: My Maxwells are getting 13000 & 13001, & failing
Posted: Sat Feb 28, 2015 6:35 pm
by Breach
kyleb wrote:OK, after looking further I've restricted these projects to non-maxwell cards.
Aren't they supposed to work with Maxwells with 347.xx drivers?
Re: My Maxwells are getting 13000 & 13001, & failing
Posted: Sat Feb 28, 2015 7:22 pm
by kyleb
For now, I want to be conservative and eliminate any issues. We can figure out looser restrictions if needed in the future.
Re: My Maxwells are getting 13000 & 13001, & failing
Posted: Sat Feb 28, 2015 7:29 pm
by Gary480six
kyleb wrote:OK, after looking further I've restricted these projects to non-maxwell cards.
Noooooo.........
As others have stated, there is no problem with the P13000 and P13001 work units. It's just that many of us are still using older Nvidia drivers with our GPU Folding.
It was happening to me. My version 7 client had the dreaded FAILED message from crashing too many P13000 work units.
But on advice from Bruce and 7im, I updated my drivers to the latest available from Nvidia - and the failures Stopped.
Rather than pull the P13000s from the Maxwell cards, why not make an announcement about updating the drivers instead?
Re: My Maxwells are getting 13000 & 13001, & failing
Posted: Sun Mar 01, 2015 2:11 am
by HayesK
completed 72x-p13000/13001 on my GTX750Ti since january 25. not aware of any failures.
linux client 7.44, nvidia 346.22 (cuda 5.0, cuda driver 7000), Ubuntu 14.04,
hfm benchmark data below
Code: Select all
Project ID: 13000
Core: ZETA
Credit: 17123
Frames: 100
Name: F63-P67A-i2600K-6C-4.6-1866+2xGTX750Ti-U1404-V744 Slot 01
Number of Frames Observed: 300
Min. Time / Frame : 00:12:20 - 67,970 PPD
Avg. Time / Frame : 00:12:24 - 67,423 PPD
Name: F63-P67A-i2600K-6C-4.6-1866+2xGTX750Ti-U1404-V744 Slot 02
Number of Frames Observed: 166
Min. Time / Frame : 00:12:38 - 65,564 PPD
Avg. Time / Frame : 00:12:42 - 65,048 PPD
Name: F64-P8P67-i2600K-8C-4.3+2x750Ti-1600-U1404-V744 Slot 01
Number of Frames Observed: 300
Min. Time / Frame : 00:12:20 - 67,970 PPD
Avg. Time / Frame : 00:12:26 - 67,152 PPD
Name: F64-P8P67-i2600K-8C-4.3+2x750Ti-1600-U1404-V744 Slot 02
Number of Frames Observed: 300
Min. Time / Frame : 00:12:27 - 67,017 PPD
Avg. Time / Frame : 00:12:34 - 66,086 PPD
Name: F65-P8P67-i2600K-6C-4.5+2x750Ti-1600-U1404-V744 Slot 01
Number of Frames Observed: 300
Min. Time / Frame : 00:12:17 - 68,386 PPD
Avg. Time / Frame : 00:12:23 - 67,559 PPD
Name: F65-P8P67-i2600K-6C-4.5+2x750Ti-1600-U1404-V744 Slot 02
Number of Frames Observed: 300
Min. Time / Frame : 00:12:20 - 67,970 PPD
Avg. Time / Frame : 00:12:25 - 67,287 PPD
Code: Select all
Project ID: 13001
Core: ZETA
Credit: 17123
Frames: 100
Name: F63-P67A-i2600K-6C-4.6-1866+2xGTX750Ti-U1404-V744 Slot 01
Number of Frames Observed: 300
Min. Time / Frame : 00:12:19 - 68,109 PPD
Avg. Time / Frame : 00:12:34 - 66,086 PPD
Name: F63-P67A-i2600K-6C-4.6-1866+2xGTX750Ti-U1404-V744 Slot 02
Number of Frames Observed: 300
Min. Time / Frame : 00:12:37 - 65,694 PPD
Avg. Time / Frame : 00:12:42 - 65,048 PPD
Name: F64-P8P67-i2600K-8C-4.3+2x750Ti-1600-U1404-V744 Slot 01
Number of Frames Observed: 256
Min. Time / Frame : 00:12:21 - 67,833 PPD
Avg. Time / Frame : 00:12:26 - 67,152 PPD
Name: F64-P8P67-i2600K-8C-4.3+2x750Ti-1600-U1404-V744 Slot 02
Number of Frames Observed: 300
Min. Time / Frame : 00:12:28 - 66,883 PPD
Avg. Time / Frame : 00:12:37 - 65,694 PPD
Name: F65-P8P67-i2600K-6C-4.5+2x750Ti-1600-U1404-V744 Slot 01
Number of Frames Observed: 300
Min. Time / Frame : 00:12:17 - 68,386 PPD
Avg. Time / Frame : 00:12:22 - 67,696 PPD
Name: F65-P8P67-i2600K-6C-4.5+2x750Ti-1600-U1404-V744 Slot 02
Number of Frames Observed: 300
Min. Time / Frame : 00:12:19 - 68,109 PPD
Avg. Time / Frame : 00:12:25 - 67,287 PPD