Project 16575 Chrushing GPU 18hrs low points

Moderators: Site Moderators, FAHC Science Team

SandyG
Posts: 108
Joined: Mon Apr 13, 2020 11:15 pm
Hardware configuration: 2 Shuttle i9's with RTX3060, Old server mobo, Mint Linux with 2 RTX3090's and 2 RTX 4090. Dell 7920 RTX3060, RTX4070

[img]https://folding.extremeoverclocking.com/sigs/sigimage.php?u=1172112[/img]
Contact:

Project 16575 Chrushing GPU 18hrs low points

Post by SandyG »

Just picked up project/WU 16575, show about 18 hours to process on my RTX 3070, the GPU is going full till, but when looking at points for the processing it shows just about 379,490 for hammering the GPU for 18 hours! This is after about 22% of processing so I'm assuming estimates are good.

Feels like something is way off here, especially given how much processing this is taking. Is something up with this WU?

Sandy
2 Shuttle i9's with RTX3060, Old server mobo, Mint Linux with 2 RTX3090's and 2 RTX 4090. Dell 7920 RTX3060, RTX4070

Image
BobWilliams757
Posts: 523
Joined: Fri Apr 03, 2020 2:22 pm
Hardware configuration: ASRock X370M PRO4
Ryzen 2400G APU
16 GB DDR4-3200
MSI GTX 1660 Super Gaming X

Re: Project 16575 Chrushing GPU 18hrs low points

Post by BobWilliams757 »

Right now only one GPU shows up on this project at LAR systems, that being a 3090. Points for that GPU seem much more in line with regular averages for the GPU. If the project in fact brings your 3070 down to the 500K PPD range, there is certainly a chance something is strange with the project.

It might be worth checking your logs for the number of steps. It's my understanding that recently another project went out and some of them were misconfigured for lesser steps than was the target, and it tossed the PPD way up. I'm assuming the inverse could happen as well. Once finished you can also check for trends on the work unit status page and see if the trend seems to impact a lot of generations before and after the one you have. Noting the specific Run, Clone, and Generation here might also help anyone who looks into it.


BTW with your newly added GPU's working, your contribution has become quite impressive to say the least. :eo
Fold them if you get them!
SandyG
Posts: 108
Joined: Mon Apr 13, 2020 11:15 pm
Hardware configuration: 2 Shuttle i9's with RTX3060, Old server mobo, Mint Linux with 2 RTX3090's and 2 RTX 4090. Dell 7920 RTX3060, RTX4070

[img]https://folding.extremeoverclocking.com/sigs/sigimage.php?u=1172112[/img]
Contact:

Re: Project 16575 Chrushing GPU 18hrs low points

Post by SandyG »

I took a look at the log, and forgot to mention that I saw one older result from my 3060 card on one machine that looked about 500k points but not 18 hours, can't remember what the time was but something like 8 hours.

In looking at the logs (windows machine) it looks like this as the last entry, still processing this morning

Code: Select all

16:12:55:WU01:FS01:0x22:Completed 987500 out of 1250000 steps (79%)
Possibly the laptop 3070m is causing a huge difference, but I think in all cases the laptop is above or at par with the rtx3060's

And yeah, once moving the GPU's off a mining motherboard with the since lane adapters, things started to move with processing. I think moving them to a server motherboard and linux really helped. I have once more slot with a 3060 to move over from the mining board to a full lane PCI on the other, was waiting for a cable that I just got, and some time. Will swap that out for a 4090 once prices dip at some point. Finally shut down all the smaller intel Linux Brix computers as they were not doing much but making heat ;)

Having fun at it all too ;)

Sandy
2 Shuttle i9's with RTX3060, Old server mobo, Mint Linux with 2 RTX3090's and 2 RTX 4090. Dell 7920 RTX3060, RTX4070

Image
SandyG
Posts: 108
Joined: Mon Apr 13, 2020 11:15 pm
Hardware configuration: 2 Shuttle i9's with RTX3060, Old server mobo, Mint Linux with 2 RTX3090's and 2 RTX 4090. Dell 7920 RTX3060, RTX4070

[img]https://folding.extremeoverclocking.com/sigs/sigimage.php?u=1172112[/img]
Contact:

Re: Project 16575 Chrushing GPU 18hrs low points

Post by SandyG »

More, the Linux box with the 4090 just completed a WU from project 16575

User Team CPUID Credit Assigned Returned Credited Days Code
SandyGanz 0 D25A1764DC342198 1,396,880 2023-04-08 17:32:05 2023-04-08 18:58:06 2023-04-08 18:57:06 0.06 Ok

So the 4090 cranked this out in 1.44 hours with huge points, doesn't make sense that it's that much faster. Points might be high because of bonus. Maybe it's all good, but seems off on the 3070. Maybe GPU memory has something to do with it since the 4090 has a ton more, not sure.

From the logs on the linux box (rtx4090 got the wu) (not sure if I captured anything helpful

Code: Select all

17:32:52:WU01:FS02:0x22:Project: 16575 (Run 2, Clone 140, Gen 1)
17:32:52:WU01:FS02:0x22:Reading tar file core.xml
17:32:52:WU01:FS02:0x22:Reading tar file integrator.xml
17:32:52:WU01:FS02:0x22:Reading tar file state.xml
17:32:54:WU01:FS02:0x22:Reading tar file system.xml
17:32:55:WU01:FS02:0x22:Digital signatures verified
17:32:55:WU01:FS02:0x22:Folding@home GPU Core22 Folding@home Core
17:32:55:WU01:FS02:0x22:Version 0.0.20
17:32:55:WU01:FS02:0x22:  Checkpoint write interval: 62500 steps (5%) [20 total]
17:32:55:WU01:FS02:0x22:  JSON viewer frame write interval: 12500 steps (1%) [100 total]
17:32:55:WU01:FS02:0x22:  XTC frame write interval: 10000 steps (0.8%) [125 total]
17:32:55:WU01:FS02:0x22:  Global context and integrator variables write interval: disabled
17:32:55:WU01:FS02:0x22:No -opencl-device specified; using deprecated -gpu argument as an alias for -opencl-device.
17:32:55:WU01:FS02:0x22:Please consider upgrading your client version.
17:32:55:WU01:FS02:0x22:There are 4 platforms available.
17:32:55:WU01:FS02:0x22:Platform 0: Reference
17:32:55:WU01:FS02:0x22:Platform 1: CPU
17:32:55:WU01:FS02:0x22:Platform 2: OpenCL
17:32:55:WU01:FS02:0x22:  opencl-device -1 specified
17:32:55:WU01:FS02:0x22:Platform 3: CUDA
17:32:55:WU01:FS02:0x22:  cuda-device 0 specified
17:32:57:WU02:FS01:0x22:Completed 2600000 out of 5000000 steps (52%)
17:32:58:WU04:FS02:Upload 56.27%
17:33:11:WU00:FS03:0x22:Completed 3800000 out of 5000000 steps (76%)
2 Shuttle i9's with RTX3060, Old server mobo, Mint Linux with 2 RTX3090's and 2 RTX 4090. Dell 7920 RTX3060, RTX4070

Image
BobWilliams757
Posts: 523
Joined: Fri Apr 03, 2020 2:22 pm
Hardware configuration: ASRock X370M PRO4
Ryzen 2400G APU
16 GB DDR4-3200
MSI GTX 1660 Super Gaming X

Re: Project 16575 Chrushing GPU 18hrs low points

Post by BobWilliams757 »

Hard to say what is going on with the 3070M. None are showing up for that project on LARS website, but all other GPU's listed seem to be in line with normal PPD returns.

https://folding.lar.systems/projects/f ... e/16575


The slowest 3070 mobile they list in the GPU rankings has an average of 2.8M PPD. Looking at the specs, it doesn't seem it should have any real bottlenecks through memory speed or such. It might be worth using GPU-Z logging on that GPU to see if it shows any strange behavior out of line with similar hardware.

At least it appears for now to just be the one GPU. Still a mystery though.
Fold them if you get them!
SandyG
Posts: 108
Joined: Mon Apr 13, 2020 11:15 pm
Hardware configuration: 2 Shuttle i9's with RTX3060, Old server mobo, Mint Linux with 2 RTX3090's and 2 RTX 4090. Dell 7920 RTX3060, RTX4070

[img]https://folding.extremeoverclocking.com/sigs/sigimage.php?u=1172112[/img]
Contact:

Re: Project 16575 Chrushing GPU 18hrs low points

Post by SandyG »

Completed log from the Windows 3070 machine -

Code: Select all

20:02:51:WU01:FS01:0x22:Completed 1250000 out of 1250000 steps (100%)
20:02:51:WU01:FS01:0x22:Average performance: 3.20095 ns/day
20:02:58:WU01:FS01:0x22:Checkpoint completed at step 1250000
20:03:13:WU01:FS01:0x22:Saving result file ..\logfile_01.txt
20:03:13:WU01:FS01:0x22:Saving result file checkpointIntegrator.xml
20:03:13:WU01:FS01:0x22:Saving result file checkpointState.xml
20:03:19:WU01:FS01:0x22:Saving result file positions.xtc
20:03:19:WU01:FS01:0x22:Saving result file science.log
20:03:19:WU01:FS01:0x22:Saving result file xtcAtoms.csv.bz2
20:03:19:WU01:FS01:0x22:Folding@home Core Shutdown: FINISHED_UNIT
20:03:23:WU01:FS01:FahCore returned: FINISHED_UNIT (100 = 0x64)
20:03:23:WU01:FS01:Sending unit results: id:01 state:SEND error:NO_ERROR project:16575 run:4 clone:133 gen:0 core:0x22 unit:0x0000008500000000000040bf00000004
20:03:23:WU01:FS01:Uploading 38.26MiB to 66.170.111.50
2 Shuttle i9's with RTX3060, Old server mobo, Mint Linux with 2 RTX3090's and 2 RTX 4090. Dell 7920 RTX3060, RTX4070

Image
bollix47
Posts: 2963
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: Project 16575 Chrushing GPU 18hrs low points

Post by bollix47 »

17:32:55:WU01:FS02:0x22: Checkpoint write interval: 62500 steps (5%) [20 total]
17:32:55:WU01:FS02:0x22: JSON viewer frame write interval: 12500 steps (1%) [100 total]
17:32:55:WU01:FS02:0x22: XTC frame write interval: 10000 steps (0.8%) [125 total]
17:32:55:WU01:FS02:0x22: Global context and integrator variables write interval: disabled
17:32:55:WU01:FS02:0x22:No -opencl-device specified; using deprecated -gpu argument as an alias for -opencl-device.
17:32:55:WU01:FS02:0x22:Please consider upgrading your client version.
17:32:55:WU01:FS02:0x22:There are 4 platforms available.
17:32:55:WU01:FS02:0x22:Platform 0: Reference
17:32:55:WU01:FS02:0x22:Platform 1: CPU
17:32:55:WU01:FS02:0x22:Platform 2: OpenCL
17:32:55:WU01:FS02:0x22: opencl-device -1 specified
17:32:55:WU01:FS02:0x22:Platform 3: CUDA
17:32:55:WU01:FS02:0x22: cuda-device 0 specified
Something doesn''t look quite right in the above.
Please post the System & Config sections of your log. (you can x-out your passkey or other personal data but leave the description)
You may have to refresh the log to see the info requested and then scoll up to the top (either pause the slot or uncheck "follow")
More info on posting logs is located at:
viewtopic.php?t=26036
SandyG
Posts: 108
Joined: Mon Apr 13, 2020 11:15 pm
Hardware configuration: 2 Shuttle i9's with RTX3060, Old server mobo, Mint Linux with 2 RTX3090's and 2 RTX 4090. Dell 7920 RTX3060, RTX4070

[img]https://folding.extremeoverclocking.com/sigs/sigimage.php?u=1172112[/img]
Contact:

Re: Project 16575 Chrushing GPU 18hrs low points

Post by SandyG »

Here is the start of the log copied out of the FAHClient on Windoze. I can't seem to copy out the 'System Info' tab for some reason, only the log. You may have been looking for the linux configs. That I can get. Next post...

Code: Select all

*********************** Log Started 2023-04-08T02:50:57Z ***********************
02:50:57:******************************* libFAH ********************************
02:50:57:           Date: Oct 20 2020
02:50:57:           Time: 13:36:55
02:50:57:       Revision: 5ca109d295a6245e2a2f590b3d0085ad5e567aeb
02:50:57:         Branch: master
02:50:57:       Compiler: Visual C++ 2015
02:50:57:        Options: /TP /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
02:50:57:       Platform: win32 10
02:50:57:           Bits: 32
02:50:57:           Mode: Release
02:50:57:****************************** FAHClient ******************************
02:50:57:        Version: 7.6.21
02:50:57:         Author: Joseph Coffland <joseph@cauldrondevelopment.com>
02:50:57:      Copyright: 2020 foldingathome.org
02:50:57:       Homepage: https://foldingathome.org/
02:50:57:           Date: Oct 20 2020
02:50:57:           Time: 13:41:04
02:50:57:       Revision: 6efbf0e138e22d3963e6a291f78dcb9c6422a278
02:50:57:         Branch: master
02:50:57:       Compiler: Visual C++ 2015
02:50:57:        Options: /TP /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
02:50:57:       Platform: win32 10
02:50:57:           Bits: 32
02:50:57:           Mode: Release
02:50:57:           Args: --open-web-control
02:50:57:         Config: C:\ProgramData\FAHClient\config.xml
02:50:57:******************************** CBang ********************************
02:50:57:           Date: Oct 20 2020
02:50:57:           Time: 11:36:18
02:50:57:       Revision: 7e4ce85225d7eaeb775e87c31740181ca603de60
02:50:57:         Branch: master
02:50:57:       Compiler: Visual C++ 2015
02:50:57:        Options: /TP /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
02:50:57:       Platform: win32 10
02:50:57:           Bits: 32
02:50:57:           Mode: Release
02:50:57:******************************* System ********************************
02:50:57:            CPU: AMD Ryzen 9 5900HX with Radeon Graphics
02:50:57:         CPU ID: AuthenticAMD Family 25 Model 80 Stepping 0
02:50:57:           CPUs: 16
02:50:57:         Memory: 15.35GiB
02:50:57:    Free Memory: 9.74GiB
02:50:57:        Threads: WINDOWS_THREADS
02:50:57:     OS Version: 6.2
02:50:57:    Has Battery: true
02:50:57:     On Battery: false
02:50:57:     UTC Offset: -7
02:50:57:            PID: 11380
02:50:57:            CWD: C:\ProgramData\FAHClient
02:50:57:  Win32 Service: false
02:50:57:             OS: Windows 10 Home
02:50:57:        OS Arch: AMD64
02:50:57:           GPUs: 2
02:50:57:          GPU 0: Bus:1 Slot:0 Func:0 NVIDIA:8 GA104M [GeForce RTX 3070 Mobile /
02:50:57:                 Max-Q]
02:50:57:          GPU 1: Bus:5 Slot:0 Func:0 AMD:3 Cezanne [Vega Mobile 5000 series APU]
02:50:57:  CUDA Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:8.6 Driver:12.0
02:50:57:OpenCL Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:3.0 Driver:526.98
02:50:57:OpenCL Device 1: Platform:1 Device:0 Bus:5 Slot:0 Compute:1.2 Driver:3444.0
02:50:57:***********************************************************************
02:50:57:<config>
02:50:57:  <!-- Folding Core -->
02:50:57:  <checkpoint v='6'/>
02:50:57:
02:50:57:  <!-- HTTP Server -->
02:50:57:  <allow v='0.0.0.0/0'/>
02:50:57:  <deny v=''/>
02:50:57:
02:50:57:  <!-- Network -->
02:50:57:  <proxy v=':8080'/>
02:50:57:
02:50:57:  <!-- Remote Command Server -->
02:50:57:  <password v='*****'/>
02:50:57:
02:50:57:  <!-- Slot Control -->
02:50:57:  <power v='MEDIUM'/>
02:50:57:
02:50:57:  <!-- User Information -->
02:50:57:  <passkey v='*****'/>
02:50:57:  <user v='SandyGanz'/>
02:50:57:
02:50:57:  <!-- Folding Slots -->
02:50:57:  <slot id='0' type='CPU'>
02:50:57:    <cpus v='8'/>
02:50:57:  </slot>
02:50:57:  <slot id='1' type='GPU'>
02:50:57:    <pci-bus v='1'/>
02:50:57:    <pci-slot v='0'/>
02:50:57:  </slot>
02:50:57:</config>
02:50:57:Trying to access database...
02:50:57:Successfully acquired database lock
02:50:57:FS00:Initialized folding slot 00: cpu:8
02:50:57:FS01:Initialized folding slot 01: gpu:1:0 GA104M [GeForce RTX 3070 Mobile / Max-Q]
02:50:57:FS02:Initialized folding slot 02: gpu:5:0 Cezanne [Vega Mobile 5000 series APU]
02:50:57:WU00:FS00:Starting
02:50:57:WU00:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\ProgramData\FAHClient\cores/cores.foldingathome.org/win/64bit-avx2-256/a8-0.0.12/Core_a8.fah/FahCore_a8.exe -dir 00 -suffix 01 -version 706 -lifeline
Last edited by SandyG on Sat Apr 08, 2023 9:25 pm, edited 1 time in total.
2 Shuttle i9's with RTX3060, Old server mobo, Mint Linux with 2 RTX3090's and 2 RTX 4090. Dell 7920 RTX3060, RTX4070

Image
SandyG
Posts: 108
Joined: Mon Apr 13, 2020 11:15 pm
Hardware configuration: 2 Shuttle i9's with RTX3060, Old server mobo, Mint Linux with 2 RTX3090's and 2 RTX 4090. Dell 7920 RTX3060, RTX4070

[img]https://folding.extremeoverclocking.com/sigs/sigimage.php?u=1172112[/img]
Contact:

Re: Project 16575 Chrushing GPU 18hrs low points

Post by SandyG »

Also note from above, I do not process on the AMD GPU, I think that is slot 2. I delete that slot after start up so it does not do any processing, just the CPU and 3070m

Sandy
2 Shuttle i9's with RTX3060, Old server mobo, Mint Linux with 2 RTX3090's and 2 RTX 4090. Dell 7920 RTX3060, RTX4070

Image
SandyG
Posts: 108
Joined: Mon Apr 13, 2020 11:15 pm
Hardware configuration: 2 Shuttle i9's with RTX3060, Old server mobo, Mint Linux with 2 RTX3090's and 2 RTX 4090. Dell 7920 RTX3060, RTX4070

[img]https://folding.extremeoverclocking.com/sigs/sigimage.php?u=1172112[/img]
Contact:

Re: Project 16575 Chrushing GPU 18hrs low points

Post by SandyG »

This is the Linux config, this has the following GPU's
Slot - CPU
Slot 1 - RTX3090
Slot 2- RTX 4090
Slot 3 - RTX 3090


This is the config from the linux box, seems to be doing just fine, it's the Window's that seems to be showing issues with this WU and possibly others from the 165XX group. But really not sure. Since I could not get the FAHClient to run on the newest version of Mint, I had to hand assemble this from the startup information in the log. Again, I have not seen any issues running on the linux box, only the Window's 11 with this particular patch of WU in the 165XX projects that all seem to come from one researcher. May not even be a problem, but seems like something is off.

Code: Select all

<config>
  <!-- Client Control -->
  <fold-anon v='true'/>

  <!-- Folding Slot Configuration -->
  <gpu v='false'/>

  <!-- User Information -->
  <passkey v='*****'/>
  <user v='SandyGanz'/>

  <!-- Folding Slots -->
  <slot id='0' type='CPU'>
    <cpus v='12'/>
  </slot>
  <slot id='1' type='GPU'>
    <pci-bus v='101'/>
    <pci-slot v='0'/>
  </slot>
  <slot id='2' type='GPU'>
    <pci-bus v='182'/>
    <pci-slot v='0'/>
  </slot>
  <slot id='3' type='GPU'>
    <pci-bus v='23'/>
    <pci-slot v='0'/>
  </slot>
</config>
Start of the log from the Linux box (running Mint)

Code: Select all

22:20:39:******************************** CBang ********************************
22:20:39:         Date: Oct 20 2020
22:20:39:         Time: 18:37:59
22:20:39:     Revision: 7e4ce85225d7eaeb775e87c31740181ca603de60
22:20:39:       Branch: master
22:20:39:     Compiler: GNU 8.3.0
22:20:39:      Options: -faligned-new -std=c++11 -fsigned-char -ffunction-sections
22:20:39:               -fdata-sections -O3 -funroll-loops -fno-pie -fPIC
22:20:39:     Platform: linux2 5.8.0-1-amd64
22:20:39:         Bits: 64
22:20:39:         Mode: Release
22:20:39:******************************* System ********************************
22:20:39:          CPU: Intel(R) Xeon(R) W-2155 CPU @ 3.30GHz
22:20:39:       CPU ID: GenuineIntel Family 6 Model 85 Stepping 4
22:20:39:         CPUs: 20
22:20:39:       Memory: 125.50GiB
22:20:39:  Free Memory: 124.29GiB
22:20:39:      Threads: POSIX_THREADS
22:20:39:   OS Version: 5.15
22:20:39:  Has Battery: false
22:20:39:   On Battery: false
22:20:39:   UTC Offset: -7
22:20:39:          PID: 1496
22:20:39:          CWD: /var/lib/fahclient
22:20:39:           OS: Linux 5.15.0-67-generic x86_64
22:20:39:      OS Arch: AMD64
22:20:39:         GPUs: 3
22:20:39:        GPU 0: Bus:23 Slot:0 Func:0 NVIDIA:8 GA102 [GeForce RTX 3090]
22:20:39:        GPU 1: Bus:101 Slot:0 Func:0 NVIDIA:8 GA102 [GeForce RTX 3090]
22:20:39:        GPU 2: Bus:182 Slot:0 Func:0 NVIDIA:9 AD102 [GeForce RTX 4090]
22:20:39:CUDA Device 0: Platform:0 Device:0 Bus:182 Slot:0 Compute:8.9 Driver:12.0
22:20:39:CUDA Device 1: Platform:0 Device:1 Bus:23 Slot:0 Compute:8.6 Driver:12.0
22:20:39:CUDA Device 2: Platform:0 Device:2 Bus:101 Slot:0 Compute:8.6 Driver:12.0
22:20:39:       OpenCL: Not detected: Failed to open dynamic library 'libOpenCL.so':
22:20:39:               libOpenCL.so: cannot open shared object file: No such file or
22:20:39:               directory
22:20:39:***********************************************************************
2 Shuttle i9's with RTX3060, Old server mobo, Mint Linux with 2 RTX3090's and 2 RTX 4090. Dell 7920 RTX3060, RTX4070

Image
bollix47
Posts: 2963
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: Project 16575 Chrushing GPU 18hrs low points

Post by bollix47 »

22:20:39: OpenCL: Not detected: Failed to open dynamic library 'libOpenCL.so':
22:20:39: libOpenCL.so: cannot open shared object file: No such file or
22:20:39: directory
Okay that's the first error that needs to be fixed.

Open Software & Updates and proceed to the Additional Drivers tab > (wait for the choices to populate) highlight the drivers you want to install. If they are already installed then proceed to the next step .... otherwise Install them (usually says "tested" but I've not used Mint, just ubuntu", & reboot
Open a Terminal and copy/paste the following:

Code: Select all

sudo apt install ocl-icd-opencl-dev
You should reboot afterwards so the kernal recognizes the change.
Now copy/paste the lines from 22:20:39 starting with OpenCL until the config section(ie the lines I quoted above).

There may be more changes to make but let's start by fixing opencl for now .... even though you're using cuda opencl must be installed correctly.

Your windows log does show opencl is installed so the above is all about the linux setup. Also, a 3070m is not going to perform as well as a desktop 3070. It will be slower, maybe 90 % of the desktop assuming it's not overheating.

On my 3070m I'm seeing PPD of ~1.1 Million on a 16574 .... unfortunately I have no history for 16575.

It appears there's a bit of confusion in this thread .... perhaps you could make a post summarizing the setup you're having a problem with & include the System & Config sections for that system.
SandyG
Posts: 108
Joined: Mon Apr 13, 2020 11:15 pm
Hardware configuration: 2 Shuttle i9's with RTX3060, Old server mobo, Mint Linux with 2 RTX3090's and 2 RTX 4090. Dell 7920 RTX3060, RTX4070

[img]https://folding.extremeoverclocking.com/sigs/sigimage.php?u=1172112[/img]
Contact:

Re: Project 16575 Chrushing GPU 18hrs low points

Post by SandyG »

Thanks for the help cleaning up some of the Linux issues. here is the new log after the update above -

Code: Select all

00:09:55:WU04:FS02:0x22:    Version: 0.0.20
00:09:55:WU04:FS02:0x22:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
00:09:55:WU04:FS02:0x22:  Copyright: 2020 foldingathome.org
00:09:55:WU04:FS02:0x22:   Homepage: https://foldingathome.org/
00:09:55:WU04:FS02:0x22:       Date: Jan 20 2022
00:09:55:WU04:FS02:0x22:       Time: 00:57:52
00:09:55:WU04:FS02:0x22:   Revision: 3f211b8a4346514edbff34e3cb1c0e0ec951373c
00:09:55:WU04:FS02:0x22:     Branch: HEAD
00:09:55:WU04:FS02:0x22:   Compiler: GNU 9.4.0
00:09:55:WU04:FS02:0x22:    Options: -faligned-new -std=c++11 -fsigned-char -ffunction-sections
00:09:55:WU04:FS02:0x22:             -fdata-sections -O3 -funroll-loops -fno-pie
00:09:55:WU04:FS02:0x22:             -DOPENMM_VERSION="\"7.7.0\""
00:09:55:WU04:FS02:0x22:   Platform: linux 5.11.0-1025-azure
00:09:55:WU04:FS02:0x22:       Bits: 64
00:09:55:WU04:FS02:0x22:       Mode: Release
00:09:55:WU04:FS02:0x22:Maintainers: John Chodera <john.chodera@choderalab.org> and Peter Eastman
00:09:55:WU04:FS02:0x22:             <peastman@stanford.edu>
00:09:55:WU04:FS02:0x22:       Args: -dir 04 -suffix 01 -version 706 -lifeline 1560 -checkpoint 15
00:09:55:WU04:FS02:0x22:             -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu-vendor
00:09:55:WU04:FS02:0x22:             nvidia -gpu 0 -gpu-usage 100
00:09:55:WU04:FS02:0x22:************************************ libFAH ************************************
00:09:55:WU04:FS02:0x22:       Date: Jan 20 2022
00:09:55:WU04:FS02:0x22:       Time: 00:57:22
00:09:55:WU04:FS02:0x22:   Revision: 9f4ad694e75c2350d4bb6b8b5b769ba27e483a2f
00:09:55:WU04:FS02:0x22:     Branch: HEAD
00:09:55:WU04:FS02:0x22:   Compiler: GNU 9.4.0
00:09:55:WU04:FS02:0x22:    Options: -faligned-new -std=c++11 -fsigned-char -ffunction-sections
00:09:55:WU04:FS02:0x22:             -fdata-sections -O3 -funroll-loops -fno-pie
00:09:55:WU04:FS02:0x22:   Platform: linux 5.11.0-1025-azure
00:09:55:WU04:FS02:0x22:       Bits: 64
00:09:55:WU04:FS02:0x22:       Mode: Release
00:09:55:WU04:FS02:0x22:************************************ CBang *************************************
00:09:55:WU04:FS02:0x22:       Date: Jan 20 2022
00:09:55:WU04:FS02:0x22:       Time: 00:57:00
00:09:55:WU04:FS02:0x22:   Revision: ab023d155b446906d55b0f6c9a1eedeea04f7a1a
00:09:55:WU04:FS02:0x22:     Branch: HEAD
00:09:55:WU04:FS02:0x22:   Compiler: GNU 9.4.0
00:09:55:WU04:FS02:0x22:    Options: -faligned-new -std=c++11 -fsigned-char -ffunction-sections
00:09:55:WU04:FS02:0x22:             -fdata-sections -O3 -funroll-loops -fno-pie -fPIC
00:09:55:WU04:FS02:0x22:   Platform: linux 5.11.0-1025-azure
00:09:55:WU04:FS02:0x22:       Bits: 64
00:09:55:WU04:FS02:0x22:       Mode: Release
00:09:55:WU04:FS02:0x22:************************************ System ************************************
00:09:55:WU04:FS02:0x22:        CPU: Intel(R) Xeon(R) W-2155 CPU @ 3.30GHz
00:09:55:WU04:FS02:0x22:     CPU ID: GenuineIntel Family 6 Model 85 Stepping 4
00:09:55:WU04:FS02:0x22:       CPUs: 20
00:09:55:WU04:FS02:0x22:     Memory: 125.50GiB
00:09:55:WU04:FS02:0x22:Free Memory: 124.14GiB
00:09:55:WU04:FS02:0x22:    Threads: POSIX_THREADS
00:09:55:WU04:FS02:0x22: OS Version: 5.15
00:09:55:WU04:FS02:0x22:Has Battery: false
00:09:55:WU04:FS02:0x22: On Battery: false
00:09:55:WU04:FS02:0x22: UTC Offset: -7
00:09:55:WU04:FS02:0x22:        PID: 1564
00:09:55:WU04:FS02:0x22:        CWD: /var/lib/fahclient/work
00:09:55:WU04:FS02:0x22:************************************ OpenMM ************************************
00:09:55:WU04:FS02:0x22:    Version: 7.7.0
00:09:55:WU04:FS02:0x22:********************************************************************************
00:09:55:WU04:FS02:0x22:Project: 18717 (Run 6, Clone 74, Gen 17)
00:09:55:WU04:FS02:0x22:Digital signatures verified
00:09:55:WU04:FS02:0x22:Folding@home GPU Core22 Folding@home Core
00:09:55:WU04:FS02:0x22:Version 0.0.20
00:09:55:WU04:FS02:0x22:  Checkpoint write interval: 250000 steps (5%) [20 total]
00:09:55:WU04:FS02:0x22:  JSON viewer frame write interval: 50000 steps (1%) [100 total]
00:09:55:WU04:FS02:0x22:  XTC frame write interval: 50000 steps (1%) [100 total]
00:09:55:WU04:FS02:0x22:  Global context and integrator variables write interval: disabled
00:09:55:WU04:FS02:0x22:There are 4 platforms available.
00:09:55:WU04:FS02:0x22:Platform 0: Reference
00:09:55:WU04:FS02:0x22:Platform 1: CPU
00:09:55:WU04:FS02:0x22:Platform 2: OpenCL
00:09:55:WU04:FS02:0x22:  opencl-device 0 specified
00:09:55:WU04:FS02:0x22:Platform 3: CUDA
00:09:55:WU04:FS02:0x22:  cuda-device 0 specified
00:09:57:WU03:FS00:0xa8:Completed 6110148 out of 10000000 steps (61%)
00:09:58:WU01:FS01:0x22:Attempting to create CUDA context:
00:09:58:WU01:FS01:0x22:  Configuring platform CUDA
00:09:58:WU04:FS02:0x22:Attempting to create CUDA context:
00:09:58:WU04:FS02:0x22:  Configuring platform CUDA
00:10:05:WU01:FS01:0x22:  Using CUDA and gpu 2
00:10:05:WU01:FS01:0x22:Completed 0 out of 5000000 steps (0%)
00:10:05:WU04:FS02:0x22:  Using CUDA and gpu 0
00:10:05:WU04:FS02:0x22:Completed 4250000 out of 5000000 steps (85%)
00:10:06:WU01:FS01:0x22:Checkpoint completed at step 0
00:10:08:WU02:FS03:0x22:Attempting to create CUDA context:
00:10:08:WU02:FS03:0x22:  Configuring platform CUDA
00:10:27:WU02:FS03:0x22:  Using CUDA and gpu 1
00:10:27:WU02:FS03:0x22:Completed 687500 out of 1250000 steps (55%)
00:10:46:WU04:FS02:0x22:Completed 4300000 out of 5000000 steps (86%)
00:11:11:WU01:FS01:0x22:Completed 50000 out of 5000000 steps (1%)
00:11:25:WU04:FS02:0x22:Completed 4350000 out of 5000000 steps (87%)
00:12:03:WU04:FS02:0x22:Completed 4400000 out of 5000000 steps (88%)
00:12:03:WU02:FS03:0x22:Completed 700000 out of 1250000 steps (56%)
00:12:13:WU01:FS01:0x22:Completed 100000 out of 5000000 steps (2%)
00:12:42:WU04:FS02:0x22:Completed 4450000 out of 5000000 steps (89%)
00:13:16:WU01:FS01:0x22:Completed 150000 out of 5000000 steps (3%)
00:13:20:WU04:FS02:0x22:Completed 4500000 out of 5000000 steps (90%)
00:13:20:WU04:FS02:0x22:Checkpoint completed at step 4500000
00:13:39:WU02:FS03:0x22:Completed 712500 out of 1250000 steps (57%)
00:13:59:WU04:FS02:0x22:Completed 4550000 out of 5000000 steps (91%)
00:14:19:WU01:FS01:0x22:Completed 200000 out of 5000000 steps (4%)
fah@fahlinux:/var/lib/fahclient$ tail log.txt -n 100
2 Shuttle i9's with RTX3060, Old server mobo, Mint Linux with 2 RTX3090's and 2 RTX 4090. Dell 7920 RTX3060, RTX4070

Image
SandyG
Posts: 108
Joined: Mon Apr 13, 2020 11:15 pm
Hardware configuration: 2 Shuttle i9's with RTX3060, Old server mobo, Mint Linux with 2 RTX3090's and 2 RTX 4090. Dell 7920 RTX3060, RTX4070

[img]https://folding.extremeoverclocking.com/sigs/sigimage.php?u=1172112[/img]
Contact:

Re: Project 16575 Chrushing GPU 18hrs low points

Post by SandyG »

And as to the confusion on the thread, the Linux thing was something not really related to why I made the orig post.

The thread was started because I saw a really long time estimate with a really low points for the 16575 WU on the 3070M on my windows laptop. I saw 18hrs to complete, where a recent completion on another machine with a RTX3060 card did the run in about 8 hours, not 18 like the 3070m. Seemed odd like something was off. The 3070m I'm running is in an Alienware AMD laptop with the CPU only running 8 cores for FAH. 3070m in the laptop always seemed to perform around or better then the RTX3060's.

Otherwise, more of a heads up with odd numbers on these 165XX WU's. I see some crazy numbers posting on the 4090 and 3090 cards but not sure if they work OK with these WU's. Like over a million in a very short period of time as mentioned above.

Sandy
2 Shuttle i9's with RTX3060, Old server mobo, Mint Linux with 2 RTX3090's and 2 RTX 4090. Dell 7920 RTX3060, RTX4070

Image
bollix47
Posts: 2963
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: Project 16575 Chrushing GPU 18hrs low points

Post by bollix47 »

Okay, it looks like the opencl issue is looking better now.

When looking at the PPD estimate on FAHControl don't bother looking until the percentage is at least 3% as the software may not have enough info to give an accurate prediction before then.

GL and feel free to post your questions .... remember we are just folders trying to help so we may need a bit of time to get to the root of your problem. :wink:
SandyG
Posts: 108
Joined: Mon Apr 13, 2020 11:15 pm
Hardware configuration: 2 Shuttle i9's with RTX3060, Old server mobo, Mint Linux with 2 RTX3090's and 2 RTX 4090. Dell 7920 RTX3060, RTX4070

[img]https://folding.extremeoverclocking.com/sigs/sigimage.php?u=1172112[/img]
Contact:

Re: Project 16575 Chrushing GPU 18hrs low points

Post by SandyG »

Thanks again for taking a look. I always wait until way past 5% before looking for the number, these 165XX ones are just odd.

Great support group here too, helped me get off the mining motherboard onto something that works the cards a lot harder then the way I thought about how I was putting the computer together.

Just need some better prices on the 4090''s, which have not really budged...

Sandy
2 Shuttle i9's with RTX3060, Old server mobo, Mint Linux with 2 RTX3090's and 2 RTX 4090. Dell 7920 RTX3060, RTX4070

Image
Post Reply