Now using Core x16 vs. Core x17 Killing PPD

If you think it might be a driver problem, see viewforum.php?f=79

Moderators: Site Moderators, FAHC Science Team

LonePalm
Posts: 98
Joined: Thu Feb 26, 2009 7:27 pm
Location: Saint Marys, Georgia

Now using Core x16 vs. Core x17 Killing PPD

Post by LonePalm »

All of a sudden my GPU is downloading Core 16 (0x16) WUs instead of Core 17 WUs.

This has dropped my expected points per WU from either 28K or 65K to about 5K.

What is going on? Any way to stop this? Will it go back on its own soon?


*********************** Log Started 2014-09-25T15:12:36Z ***********************
15:12:36:************************* Folding@home Client *************************
15:12:36:      Website: http://folding.stanford.edu/
15:12:36:    Copyright: (c) 2009-2013 Stanford University
15:12:36:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
15:12:36:         Args: 
15:12:36:       Config: C:/Users/Edward Rodman/AppData/Roaming/FAHClient/config.xml
15:12:36:******************************** Build ********************************
15:12:36:      Version: 7.3.6
15:12:36:         Date: Feb 18 2013
15:12:36:         Time: 15:25:17
15:12:36:      SVN Rev: 3923
15:12:36:       Branch: fah/trunk/client
15:12:36:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
15:12:36:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
15:12:36:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
15:12:36:     Platform: win32 XP
15:12:36:         Bits: 32
15:12:36:         Mode: Release
15:12:36:******************************* System ********************************
15:12:36:          CPU: Intel(R) Core(TM) i7-3770 CPU @ 3.40GHz
15:12:36:       CPU ID: GenuineIntel Family 6 Model 58 Stepping 9
15:12:36:         CPUs: 8
15:12:36:       Memory: 7.96GiB
15:12:36:  Free Memory: 5.70GiB
15:12:36:      Threads: WINDOWS_THREADS
15:12:36:  Has Battery: false
15:12:36:   On Battery: false
15:12:36:   UTC offset: -4
15:12:36:          PID: 6684
15:12:36:          CWD: C:/Users/Edward Rodman/AppData/Roaming/FAHClient
15:12:36:           OS: Windows 7 Professional
15:12:36:      OS Arch: AMD64
15:12:36:         GPUs: 1
15:12:36:        GPU 0: ATI:5 Tahiti XT [Radeon HD 7970]
15:12:36:         CUDA: Not detected
15:12:36:Win32 Service: false
15:12:36:***********************************************************************
15:12:36:<config>
15:12:36:  <!-- Folding Core -->
15:12:36:  <checkpoint v='9'/>
15:12:36:  <core-priority v='low'/>
15:12:36:
15:12:36:  <!-- Folding Slot Configuration -->
15:12:36:  <power v='full'/>
15:12:36:
15:12:36:  <!-- Network -->
15:12:36:  <proxy v=':8080'/>
15:12:36:
15:12:36:  <!-- User Information -->
15:12:36:  <passkey v='********************************'/>
15:12:36:  <team v='36120'/>
15:12:36:  <user v='LonePalm'/>
15:12:36:
15:12:36:  <!-- Folding Slots -->
15:12:36:  <slot id='0' type='GPU'>
15:12:36:    <next-unit-percentage v='98'/>
15:12:36:  </slot>
15:12:36:  <slot id='1' type='CPU'>
15:12:36:    <cpus v='8'/>
15:12:36:    <next-unit-percentage v='98'/>
15:12:36:  </slot>
15:12:36:</config>
15:12:36:Trying to access database...
15:12:36:Successfully acquired database lock
15:12:36:Enabled folding slot 00: READY gpu:0:Tahiti XT [Radeon HD 7970]
15:12:36:Enabled folding slot 01: READY cpu:8
15:12:36:WARNING:WU01:Missing data files, dumping
15:12:39:WU02:FS01:Starting
15:12:39:WU02:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" "C:/Users/Edward Rodman/AppData/Roaming/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe" -dir 02 -suffix 01 -version 703 -lifeline 6684 -checkpoint 9 -np 8
15:12:50:WU02:FS01:Started FahCore on PID 7980
15:13:32:WU02:FS01:Core PID:7288
15:13:32:WU02:FS01:FahCore 0xa4 started
15:13:34:WU01:FS00:Cleaning up
15:13:34:WU02:FS01:0xa4:
15:13:34:WU02:FS01:0xa4:*------------------------------*
15:13:34:WU02:FS01:0xa4:Folding@Home Gromacs GB Core
15:13:34:WU02:FS01:0xa4:Version 2.27 (Dec. 15, 2010)
15:13:34:WU02:FS01:0xa4:
15:13:34:WU02:FS01:0xa4:Preparing to commence simulation
15:13:34:WU02:FS01:0xa4:- Ensuring status. Please wait.
15:13:35:WU00:FS00:Connecting to assign-GPU.stanford.edu:80
15:13:35:WU00:FS00:News: 
15:13:35:WU00:FS00:Assigned to work server 171.67.108.44
15:13:36:WU00:FS00:Requesting new work unit for slot 00: READY gpu:0:Tahiti XT [Radeon HD 7970] from 171.67.108.44
15:13:36:WU00:FS00:Connecting to 171.67.108.44:8080
15:13:38:WU00:FS00:Downloading 44.61KiB
15:13:38:WU00:FS00:Download complete
15:13:38:WU00:FS00:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:11293 run:19 clone:63 gen:49 core:0x16 unit:0x0000009e6652edbc4d94b8a401a35b8c
15:13:38:WU00:FS00:Starting
15:13:38:WU00:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" "C:/Users/Edward Rodman/AppData/Roaming/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/ATI/R600/Core_16.fah/FahCore_16.exe" -dir 00 -suffix 01 -version 703 -lifeline 6684 -checkpoint 9 -gpu 0 -gpu-vendor ati
15:13:39:WU00:FS00:Started FahCore on PID 6088
15:13:42:WU00:FS00:Core PID:7320
15:13:42:WU00:FS00:FahCore 0x16 started
15:13:43:WU00:FS00:0x16:
15:13:43:WU00:FS00:0x16:*------------------------------*
15:13:43:WU00:FS00:0x16:Folding@Home GPU Core
15:13:43:WU00:FS00:0x16:Version 2.11 (Thu Dec 9 15:00:14 PST 2010)
15:13:43:WU00:FS00:0x16:
15:13:43:WU00:FS00:0x16:Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 15.00.30729.01 for 80x86 
15:13:43:WU00:FS00:0x16:Build host: user-f6d030f24f
15:13:43:WU00:FS00:0x16:Board Type: AMD/OpenCL
15:13:43:WU00:FS00:0x16:Core      : x=16
15:13:43:WU00:FS00:0x16: Window's signal control handler registered.
15:13:43:WU00:FS00:0x16:Preparing to commence simulation
15:13:43:WU00:FS00:0x16:- Looking at optimizations...
15:13:43:WU00:FS00:0x16:- Created dyn
15:13:43:WU00:FS00:0x16:- Files status OK
15:13:43:WU00:FS00:0x16:sizeof(CORE_PACKET_HDR) = 512 file=<>
15:13:43:WU00:FS00:0x16:- Expanded 45169 -> 171163 (decompressed 378.9 percent)
15:13:43:WU00:FS00:0x16:Called DecompressByteArray: compressed_data_size=45169 data_size=171163, decompressed_data_size=171163 diff=0
15:13:43:WU00:FS00:0x16:- Digital signature verified
15:13:43:WU00:FS00:0x16:
15:13:43:WU00:FS00:0x16:Project: 11293 (Run 19, Clone 63, Gen 49)
15:13:43:WU00:FS00:0x16:
15:13:43:WU00:FS00:0x16:Assembly optimizations on if available.
15:13:43:WU00:FS00:0x16:Entering M.D.
15:13:43:WU02:FS01:0xa4:- Looking at optimizations...
15:13:43:WU02:FS01:0xa4:- Working with standard loops on this execution.
15:13:43:WU02:FS01:0xa4:- Previous termination of core was improper.
15:13:43:WU02:FS01:0xa4:- Files status OK
15:13:44:WU02:FS01:0xa4:- Expanded 118310 -> 268472 (decompressed 226.9 percent)
15:13:44:WU02:FS01:0xa4:Called DecompressByteArray: compressed_data_size=118310 data_size=268472, decompressed_data_size=268472 diff=0
15:13:44:WU02:FS01:0xa4:- Digital signature verified
15:13:44:WU02:FS01:0xa4:
15:13:44:WU02:FS01:0xa4:Project: 6367 (Run 25, Clone 33, Gen 112)
15:13:44:WU02:FS01:0xa4:
15:13:44:WU02:FS01:0xa4:Entering M.D.
15:13:44:WU00:FS00:0x16:Tpr hash 00/wudata_01.tpr:  1172127439 3714781304 2400098967 3334685826 2232997586
15:13:44:WU00:FS00:0x16:Working on ALZHEIMER DISEASE AMYLOID
15:13:44:WU00:FS00:0x16:Client config unavailable.
15:13:44:WU00:FS00:0x16:Starting GUI Server
15:13:48:WU00:FS00:0x16:Finished fah_main
15:13:48:WU00:FS00:0x16:
15:13:48:WU00:FS00:0x16:Successful run
15:13:48:WU00:FS00:0x16:DynamicWrapper: Finished Work Unit: sleep=10000
15:13:50:WU02:FS01:0xa4:Using Gromacs checkpoints
15:13:50:WU02:FS01:0xa4:Mapping NT from 8 to 8 
15:13:50:WU02:FS01:0xa4:Resuming from checkpoint
15:13:50:WU02:FS01:0xa4:Verified 02/wudata_01.log
15:13:51:WU02:FS01:0xa4:Verified 02/wudata_01.trr
15:13:51:WU02:FS01:0xa4:Verified 02/wudata_01.xtc
15:13:51:WU02:FS01:0xa4:Verified 02/wudata_01.edr
15:13:51:WU02:FS01:0xa4:Completed 3719350 out of 5000000 steps  (74%)
15:13:58:WU00:FS00:0x16:Reserved 0 bytes for xtc file; Cosm status=0
15:13:58:WU00:FS00:0x16:Reserved 0 0 786430464 bytes for arc file=<00/wudata_01.trr> Cosm status=0
15:13:58:WU00:FS00:0x16:Allocated 0 bytes for edr file
15:13:58:WU00:FS00:0x16:Error: could not open bedfile, but going on anyway
15:13:58:WU00:FS00:0x16:- Checksum of file (00/wudata_01.edr) read from disk doesn't match
15:13:58:WU00:FS00:0x16:edrfile file hash check failed.
15:13:58:WU00:FS00:0x16:
15:13:58:WU00:FS00:0x16:Folding@home Core Shutdown: FILE_IO_ERROR
15:13:59:WARNING:WU00:FS00:FahCore returned: FILE_IO_ERROR (117 = 0x75)
15:13:59:WARNING:WU00:FS00:Fatal error, dumping
15:13:59:WU00:FS00:Sending unit results: id:00 state:SEND error:DUMPED project:11293 run:19 clone:63 gen:49 core:0x16 unit:0x0000009e6652edbc4d94b8a401a35b8c
15:13:59:WU00:FS00:Connecting to 171.67.108.44:8080
15:13:59:WU01:FS00:Connecting to assign-GPU.stanford.edu:80
15:13:59:WU00:FS00:Server responded WORK_ACK (400)
15:14:00:WU00:FS00:Cleaning up
15:14:01:WU01:FS00:News: 
15:14:01:WU01:FS00:Assigned to work server 171.67.108.44
15:14:01:WU01:FS00:Requesting new work unit for slot 00: READY gpu:0:Tahiti XT [Radeon HD 7970] from 171.67.108.44
15:14:01:WU01:FS00:Connecting to 171.67.108.44:8080
15:14:02:WU01:FS00:Downloading 44.35KiB
15:14:02:WU01:FS00:Download complete
15:14:02:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:11292 run:0 clone:59 gen:35 core:0x16 unit:0x000000686652edbc4d096d93c22d7713
15:14:02:WU01:FS00:Starting
15:14:02:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" "C:/Users/Edward Rodman/AppData/Roaming/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/ATI/R600/Core_16.fah/FahCore_16.exe" -dir 01 -suffix 01 -version 703 -lifeline 6684 -checkpoint 9 -gpu 0 -gpu-vendor ati
15:14:04:WU01:FS00:Started FahCore on PID 9140
15:14:05:WU01:FS00:Core PID:8668
15:14:05:WU01:FS00:FahCore 0x16 started
15:14:05:WU01:FS00:0x16:
15:14:05:WU01:FS00:0x16:*------------------------------*
15:14:05:WU01:FS00:0x16:Folding@Home GPU Core
15:14:05:WU01:FS00:0x16:Version 2.11 (Thu Dec 9 15:00:14 PST 2010)
15:14:05:WU01:FS00:0x16:
15:14:05:WU01:FS00:0x16:Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 15.00.30729.01 for 80x86 
15:14:05:WU01:FS00:0x16:Build host: user-f6d030f24f
15:14:05:WU01:FS00:0x16:Board Type: AMD/OpenCL
15:14:05:WU01:FS00:0x16:Core      : x=16
15:14:05:WU01:FS00:0x16: Window's signal control handler registered.
15:14:05:WU01:FS00:0x16:Preparing to commence simulation
15:14:05:WU01:FS00:0x16:- Looking at optimizations...
15:14:05:WU01:FS00:0x16:DeleteFrameFiles: successfully deleted file=01/wudata_01.ckp
15:14:05:WU01:FS00:0x16:- Created dyn
15:14:05:WU01:FS00:0x16:- Files status OK
15:14:05:WU01:FS00:0x16:sizeof(CORE_PACKET_HDR) = 512 file=<>
15:14:05:WU01:FS00:0x16:- Expanded 44906 -> 171163 (decompressed 381.1 percent)
15:14:05:WU01:FS00:0x16:Called DecompressByteArray: compressed_data_size=44906 data_size=171163, decompressed_data_size=171163 diff=0
15:14:05:WU01:FS00:0x16:- Digital signature verified
15:14:05:WU01:FS00:0x16:
15:14:05:WU01:FS00:0x16:Project: 11292 (Run 0, Clone 59, Gen 35)
15:14:05:WU01:FS00:0x16:
15:14:05:WU01:FS00:0x16:Assembly optimizations on if available.
15:14:05:WU01:FS00:0x16:Entering M.D.
15:14:07:WU01:FS00:0x16:Tpr hash 01/wudata_01.tpr:  3480939337 538556079 1257896124 3923774950 2418347131
15:14:07:WU01:FS00:0x16:Working on ALZHEIMER DISEASE AMYLOID
15:14:07:WU01:FS00:0x16:Client config unavailable.
15:14:07:WU01:FS00:0x16:Starting GUI Server
15:14:10:WU01:FS00:0x16:Setting checkpoint frequency: 599998
15:14:10:WU01:FS00:0x16:Completed         0 out of 59999872 steps (0%).
15:18:25:WU02:FS01:0xa4:Completed 3750000 out of 5000000 steps  (75%)
15:20:26:WU01:FS00:0x16:Completed    599999 out of 59999872 steps (1%).
15:24:01:WU02:FS01:0xa4:Completed 3800000 out of 5000000 steps  (76%)
15:26:38:WU01:FS00:0x16:Completed   1199998 out of 59999872 steps (2%).
15:28:19:WU02:FS01:0xa4:Completed 3850000 out of 5000000 steps  (77%)
Mod edit: Changed quote tags to Code tags on log
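
For anyone who wants to check which core their slots have been assigned without reading the whole log, here is a minimal sketch that pulls the project and core out of the "Received Unit" lines. It assumes the line layout seen in the log above; the function name `received_units` and the `SAMPLE` list are made up for illustration and are not part of the FAH client.

```python
import re

# Pattern for the client's "Received Unit" lines, based only on the
# layout visible in the log above (an assumption, not an official format).
RECEIVED = re.compile(
    r"^(?P<time>\d\d:\d\d:\d\d):WU(?P<wu>\d+):FS(?P<slot>\d+):Received Unit:.*?"
    r"project:(?P<project>\d+) run:(?P<run>\d+) clone:(?P<clone>\d+) "
    r"gen:(?P<gen>\d+) core:(?P<core>0x[0-9a-fA-F]+)"
)


def received_units(lines):
    """Return one dict per 'Received Unit' line found in the given log lines."""
    units = []
    for line in lines:
        match = RECEIVED.match(line)
        if match:
            units.append(match.groupdict())
    return units


# Three lines copied from the log above; the middle one is deliberately
# not a "Received Unit" line and should be skipped.
SAMPLE = [
    "15:13:38:WU00:FS00:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR "
    "project:11293 run:19 clone:63 gen:49 core:0x16 "
    "unit:0x0000009e6652edbc4d94b8a401a35b8c",
    "15:13:59:WARNING:WU00:FS00:FahCore returned: FILE_IO_ERROR (117 = 0x75)",
    "15:14:02:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR "
    "project:11292 run:0 clone:59 gen:35 core:0x16 "
    "unit:0x000000686652edbc4d096d93c22d7713",
]

if __name__ == "__main__":
    for unit in received_units(SAMPLE):
        print(f"slot {unit['slot']}: project {unit['project']} uses core {unit['core']}")
```

To run it against a real log, pass the lines of the file instead of `SAMPLE`, for example `received_units(open("log.txt", encoding="utf-8"))`, where the path is whatever your client writes to.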
ChasingTheDream
Posts: 56
Joined: Mon Jun 02, 2014 10:56 pm

Core 16 and R9 290X TRI-X issues

Post by ChasingTheDream »

Hello all,

I've noticed over the last few days that all my machines are picking up Core 16 projects. I've only been folding for 4-5 months, so I had no idea what a Core 16 project was, but I quickly realized that my GPUs (R9 290X TRI-X) don't like them: they crash constantly. From what I found while looking around, Core 16 projects were supposed to be phased out last year. So my question is: why am I getting them now, and is there any way to stop getting them, since they are essentially crashing my computers?

Core 17 projects are fine with the latest AMD beta drivers. Core 16, not so much...
PS3EdOlkkola
Posts: 177
Joined: Tue Aug 26, 2014 9:48 pm
Hardware configuration: 10 SMP folding slots on Intel Phi "Knights Landing" system, configured as 24 CPUs/slot
9 AMD GPU folding slots
31 Nvidia GPU folding slots
50 total folding slots
Average PPD/slot = 459,500
Location: Dallas, TX

Re: Now using Core x16 vice Core x17 Killing PPD

Post by PS3EdOlkkola »

I'm picking up a bunch of Core 16 projects as well, but they aren't crashing on my 7970s and R9-290s using Catalyst 14.4 drivers. You might want to roll back from the beta drivers to 14.4, if that's feasible. Core 16 work units also seem to push temps up compared to Core 17 units, so backing off GPU clocks is also a good idea. The same goes for Nvidia GPUs: the Core 15 work units run those GPUs 3 to 5 degrees C hotter, so I've backed those clocks down to 100 MHz or so under stock.

It seems that every few months the Core 16 projects on whatever server they're hosted on decide to come out of hibernation and humble us. I can't see any rhyme or reason why that happens, it just does. The situation seems relatively democratic (small D), in that it affects everyone relatively equally. Apparently there is still work to be done on Core 16, so the faster we get those work units out of the way, the less there is to be concerned about when a block of new GPU work units is upon us.
Hardware config viewtopic.php?f=66&t=17997&p=277235#p277235
Joe_H
Site Admin
Posts: 7989
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: Now using Core x16 vice Core x17 Killing PPD

Post by Joe_H »

The one server with Core_16 work is set at a very low assignment priority compared to the servers with Core_17 projects, so WUs from this server should only go out when no other server has work that can be assigned to you. You should receive Core_17 work again once that temporary shortage ends.

Possibly related to this, PG (the Pande Group) is looking into some issues with the assignment servers (AS) not assigning work to some systems. If that is part of the reason you are being assigned Core_16 WUs, then once the AS problems are fixed you should again get different work assigned.
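
To make the priority idea concrete, here is a toy model of it. This is only a sketch of the behavior described above, not the assignment server's actual logic: the server labels, the priority numbers, and the `assign` function are all invented for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class WorkServer:
    name: str       # hypothetical label, not a real hostname
    core: str       # FahCore the server's projects use
    priority: int   # higher means preferred; the values are invented
    has_work: bool  # whether it has WUs this client can run right now


def assign(servers: List[WorkServer]) -> Optional[WorkServer]:
    """Pick the highest-priority server that currently has work, or None."""
    candidates = [server for server in servers if server.has_work]
    if not candidates:
        return None
    return max(candidates, key=lambda server: server.priority)


if __name__ == "__main__":
    normal = [
        WorkServer("core17-a", "0x17", priority=100, has_work=True),
        WorkServer("core16-legacy", "0x16", priority=1, has_work=True),
    ]
    shortage = [
        WorkServer("core17-a", "0x17", priority=100, has_work=False),
        WorkServer("core16-legacy", "0x16", priority=1, has_work=True),
    ]
    print(assign(normal).core)    # the Core_17 server wins while it has work
    print(assign(shortage).core)  # only the low-priority server is left
```

In this model the low-priority server is never chosen while a Core_17 server has work, which matches the symptom in this thread: Core_16 WUs only show up during a shortage or an assignment problem elsewhere.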

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
7im
Posts: 10179
Joined: Thu Nov 29, 2007 4:30 pm
Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengeance (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
Location: Arizona
Contact:

Re: Now using Core x16 vice Core x17 Killing PPD

Post by 7im »

When Core 17 WUs run low, your client gets routed to other servers. It all needs folding anyway.
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
ChasingTheDream
Posts: 56
Joined: Mon Jun 02, 2014 10:56 pm

Re: Now using Core x16 vice Core x17 Killing PPD

Post by ChasingTheDream »

PS3EdOlkkola wrote:You might want to roll-back from the Beta drivers to 14.4, if that's feasible.

Unfortunately I can't roll the AMD drivers back, because Core 17 wouldn't run on all my GPUs until I installed the latest AMD beta drivers, which I believe was 14.7RC3. In any event, I'm between a rock and a hard place, so to speak. With Core 16 the machines will literally not run for more than 15 minutes; I've restarted them 5 times this morning alone. There's no way to keep doing that, so I may just have to stay down for a bit, which is unfortunate.
Last edited by ChasingTheDream on Fri Sep 26, 2014 12:33 am, edited 1 time in total.
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Now using Core x16 vice Core x17 Killing PPD

Post by bruce »

Yes, Core_16 does not get QRB (Quick Return Bonus) points, so your PPD will go down. Note that this shouldn't alter your competitive position in the points race, because apparently Core_17 work is currently not being assigned to anyone, probably because of the issues already mentioned above.

This seems to be true for both AMD and NVidia.
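
As a rough illustration of why losing QRB matters so much, here is a sketch of the bonus formula as I understand it from the Folding@home points FAQ: final points = base points * max(1, sqrt(k * deadline / elapsed)). The base points, k factor, deadline, and return time below are made-up numbers, not the values of any real project.

```python
import math


def qrb_points(base_points, k, deadline_days, elapsed_days):
    """Points for a WU that earns the Quick Return Bonus.

    The bonus multiplier never drops below 1, so a slow return still
    earns the base points.
    """
    bonus = max(1.0, math.sqrt(k * deadline_days / elapsed_days))
    return base_points * bonus


def flat_points(base_points):
    """Points for a WU without QRB (as described above for Core_16)."""
    return base_points


if __name__ == "__main__":
    # Hypothetical project values, chosen only to make the arithmetic easy.
    base, k, deadline = 1000.0, 0.75, 12.0
    print(qrb_points(base, k, deadline, elapsed_days=1.0))
    print(flat_points(base))
```

With these invented numbers, returning the WU in one day gives a multiplier of sqrt(0.75 * 12 / 1) = 3, so the QRB unit is worth three times the flat one even though the base points are identical.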
LonePalm
Posts: 98
Joined: Thu Feb 26, 2009 7:27 pm
Location: Saint Marys, Georgia

Re: Now using Core x16 vice Core x17 Killing PPD

Post by LonePalm »

Thank you all for the explanations.

I guess my chance at a best-ever month of folding is shot for this month. This was a competition with myself. I am already the #1 folder on my team.
Oletymer
Posts: 2
Joined: Tue Nov 27, 2012 6:20 pm

Re: Now using Core x16 vice Core x17 Killing PPD

Post by Oletymer »

LonePalm wrote:
I guess my chance at a best-ever month of folding is shot for this month. This was a competition with myself. I am already the #1 folder on my team.



Well, not a competition with yourself for all of September, I beg to differ. ;) I was #1 for part of the month. :D I fold for three different teams (over 150 million points combined).
I have moved on to one of my other teams for now, doing just NaCl. I will be back to do battle with you when the weather turns colder again.
Keep up the good work, you are really a dedicated folder. :)
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Now using Core x16 vs. Core x17 Killing PPD

Post by bruce »

Welcome to Foldingforum.org, Oletymer.

Either way, thanks to both of you for folding.

If you have a uniprocessor or if your machine folds only a few hours per week, then NaCl is highly recommended. If you have a GPU and you're suspending that due to heat or whatever, you can bring up FAHControl (aka Advanced Control) and suspend the GPU slot while still folding with as many of your CPU threads as you choose to allocate to FAH. If you need help with those options, please ask.
Oletymer
Posts: 2
Joined: Tue Nov 27, 2012 6:20 pm

Re: Now using Core x16 vs. Core x17 Killing PPD

Post by Oletymer »

Thanks for the welcome, bruce. I fold NaCl on an Asus gaming laptop with an Ivy Bridge quad-core i7 and good cooling on the CPU.
It gets a little over 30K PPD folding 24/7.
My GPU folding is done with two naked budget builds running two GTX 680s each (4 total), with third-gen Pentium dual-cores just to run the GPUs.
I adjust how many GPUs I run, and how hard, by outdoor temps.
Thanks again for the kind words.