Please Add/Whitelist Nvidia Tesla T4G

Post requests to add new GPUs to the official whitelist here.

Moderators: Site Moderators, FAHC Science Team

Post Reply
jad0083
Posts: 1
Joined: Fri Jan 21, 2022 2:29 am

Please Add/Whitelist Nvidia Tesla T4G

Post by jad0083 »

I think it's an awesome card, should be better than the regular Tesla T4 :)

looks like its also a TU104GL: http://pci-ids.ucw.cz/v2.2/pci.ids:
1eb4 TU104GL [T4G]

/usr/bin/FAHClient --lspci seem to fail,
so here's an output of nvidia-smi -q

Code: Select all

==============NVSMI LOG==============

Timestamp                                 : Fri Jan 21 02:33:33 2022
Driver Version                            : 470.82.01
CUDA Version                              : 11.4

Attached GPUs                             : 1
GPU 00000000:00:1F.0
    Product Name                          : NVIDIA T4G
    Product Brand                         : NVIDIA
    Display Mode                          : Enabled
    Display Active                        : Disabled
    Persistence Mode                      : Disabled
    MIG Mode
        Current                           : N/A
        Pending                           : N/A
    Accounting Mode                       : Disabled
    Accounting Mode Buffer Size           : 4000
    Driver Model
        Current                           : N/A
        Pending                           : N/A
    Serial Number                         : 1322821013691
    GPU UUID                              : GPU-de31deba-cd97-52d3-175a-0d19dfad1256
    Minor Number                          : 0
    VBIOS Version                         : 90.04.AF.00.02
    MultiGPU Board                        : No
    Board ID                              : 0x1f
    GPU Part Number                       : 900-2G183-A820-001
    Module ID                             : 0
    Inforom Version
        Image Version                     : G183.0205.00.02
        OEM Object                        : 1.1
        ECC Object                        : 5.0
        Power Management Object           : N/A
    GPU Operation Mode
        Current                           : N/A
        Pending                           : N/A
    GSP Firmware Version                  : N/A
    GPU Virtualization Mode
        Virtualization Mode               : None
        Host VGPU Mode                    : N/A
    IBMNPU
        Relaxed Ordering Mode             : N/A
    PCI
        Bus                               : 0x00
        Device                            : 0x1F
        Domain                            : 0x0000
        Device Id                         : 0x1EB410DE
        Bus Id                            : 00000000:00:1F.0
        Sub System Id                     : 0x157D10DE
        GPU Link Info
            PCIe Generation
                Max                       : 3
                Current                   : 1
            Link Width
                Max                       : 16x
                Current                   : 8x
        Bridge Chip
            Type                          : N/A
            Firmware                      : N/A
        Replays Since Reset               : 0
        Replay Number Rollovers           : 0
        Tx Throughput                     : 0 KB/s
        Rx Throughput                     : 0 KB/s
    Fan Speed                             : N/A
    Performance State                     : P8
Here's the log:

Code: Select all

********************** Log Started 2022-01-21T02:02:25Z ***********************
02:02:25:******************************* libFAH ********************************
02:02:25:           Date: Oct 20 2020
02:02:25:           Time: 20:36:48
02:02:25:       Revision: 5ca109d295a6245e2a2f590b3d0085ad5e567aeb
02:02:25:         Branch: master
02:02:25:       Compiler: GNU 8.3.0
02:02:25:        Options: -faligned-new -std=c++11 -fsigned-char -ffunction-sections
02:02:25:                 -fdata-sections -O3 -funroll-loops -fno-pie
02:02:25:       Platform: linux2 4.19.0-9-arm64
02:02:25:           Bits: 64
02:02:25:           Mode: Release
02:02:25:****************************** FAHClient ******************************
02:02:25:        Version: 7.6.21
02:02:25:         Author: Joseph Coffland <joseph@cauldrondevelopment.com>
02:02:25:      Copyright: 2020 foldingathome.org
02:02:25:       Homepage: https://foldingathome.org/
02:02:25:           Date: Oct 20 2020
02:02:25:           Time: 20:39:10
02:02:25:       Revision: 6efbf0e138e22d3963e6a291f78dcb9c6422a278
02:02:25:         Branch: master
02:02:25:       Compiler: GNU 8.3.0
02:02:25:        Options: -faligned-new -std=c++11 -fsigned-char -ffunction-sections
02:02:25:                 -fdata-sections -O3 -funroll-loops -fno-pie
02:02:25:       Platform: linux2 4.19.0-9-arm64
02:02:25:           Bits: 64
02:02:25:           Mode: Release
02:02:25:           Args: --child /etc/fahclient/config.xml --run-as fahclient
02:02:25:                 --pid-file=/var/run/fahclient.pid --daemon
02:02:25:         Config: /etc/fahclient/config.xml
02:02:25:******************************** CBang ********************************
02:02:25:           Date: Oct 20 2020
02:02:25:           Time: 18:38:03
02:02:25:       Revision: 7e4ce85225d7eaeb775e87c31740181ca603de60
02:02:25:         Branch: master
02:02:25:       Compiler: GNU 8.3.0
02:02:25:        Options: -faligned-new -std=c++11 -fsigned-char -ffunction-sections
02:02:25:                 -fdata-sections -O3 -funroll-loops -fno-pie -fPIC
02:02:25:       Platform: linux2 4.19.0-9-arm64
02:02:25:           Bits: 64
02:02:25:           Mode: Release
02:02:25:******************************* System ********************************
02:02:25:            CPU: Unknown
02:02:25:         CPU ID:
02:02:25:           CPUs: 4
02:02:25:         Memory: 7.61GiB
02:02:25:    Free Memory: 7.09GiB
02:02:25:        Threads: POSIX_THREADS
02:02:25:     OS Version: 5.11
02:02:25:    Has Battery: false
02:02:25:     On Battery: false
02:02:25:     UTC Offset: 0
02:02:25:            PID: 721
02:02:25:            CWD: /var/lib/fahclient
02:02:25:             OS: Linux 5.11.0-1027-*** aarch64
02:02:25:        OS Arch: ARM64
02:02:25:           GPUs: 0
02:02:25:  CUDA Device 0: Platform:0 Device:0 Bus:0 Slot:31 Compute:7.5 Driver:11.4
02:02:25:OpenCL Device 0: Platform:0 Device:0 Bus:0 Slot:3 Compute:3.0 Driver:470.82
02:02:25:***********************************************************************
02:02:25:<config>
02:02:25:  <!-- Client Control -->
02:02:25:  <fold-anon v='true'/>
02:02:25:
02:02:25:  <!-- Folding Slot Configuration -->
02:02:25:  <cause v='COVID_19'/>
02:02:25:
02:02:25:  <!-- Slot Control -->
02:02:25:  <power v='full'/>
02:02:25:
02:02:25:  <!-- User Information -->
02:02:25:  <passkey v='*****'/>
02:02:25:  <team v='*****'/>
02:02:25:  <user v='*****'/>
02:02:25:
02:02:25:  <!-- Folding Slots -->
02:02:25:  <slot id='0' type='CPU'/>
02:02:25:  <slot id='1' type='GPU'/>
02:02:25:</config>
02:02:25:Trying to access database...
02:02:25:Successfully acquired database lock
^[[91m02:02:25:ERROR:Exception: No unallocated GPUs found^[[0m
^[[91m02:02:25:ERROR:Deleting slot 1^[[0m
02:02:25:FS00:Initialized folding slot 00: cpu:4
^[[93m02:02:25:WARNING:FS01:No CUDA or OpenCL 1.2+ support detected for GPU slot 01: gpu:-1:-1.  Disabling.^[[0m
02:02:25:WU00:FS00:Starting
02:02:25:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/lin/64bit-aarch64/a8-0.0.12/Core_a8.fah/FahCore_a8 -dir 00 -suffix 01 -version 706 -lifeline 721 -checkpoint 15 -np 4
02:02:25:WU00:FS00:Started FahCore on PID 731
02:02:25:WU00:FS00:Core PID:735
02:02:25:WU00:FS00:FahCore 0xa8 started
02:02:25:WU00:FS00:0xa8:*********************** Log Started 2022-01-21T02:02:25Z ***********************
02:02:25:WU00:FS00:0xa8:************************** Gromacs Folding@home Core ***************************
02:02:25:WU00:FS00:0xa8:       Core: Gromacs
02:02:25:WU00:FS00:0xa8:       Type: 0xa8
02:02:25:WU00:FS00:0xa8:    Version: 0.0.12
02:02:25:WU00:FS00:0xa8:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
02:02:25:WU00:FS00:0xa8:  Copyright: 2020 foldingathome.org
02:02:25:WU00:FS00:0xa8:   Homepage: https://foldingathome.org/
02:02:25:WU00:FS00:0xa8:       Date: Jan 16 2021
02:02:25:WU00:FS00:0xa8:       Time: 19:29:29
02:02:25:WU00:FS00:0xa8:   Compiler: GNU 8.3.0
02:02:25:WU00:FS00:0xa8:    Options: -faligned-new -std=c++14 -fsigned-char -ffunction-sections
02:02:25:WU00:FS00:0xa8:             -fdata-sections -O3 -funroll-loops -fno-pie
02:02:25:WU00:FS00:0xa8:   Platform: linux2 4.15.0-128-generic
02:02:25:WU00:FS00:0xa8:       Bits: 64
02:02:25:WU00:FS00:0xa8:       Mode: Release
02:02:25:WU00:FS00:0xa8:       SIMD: arm_neon_asimd
02:02:25:WU00:FS00:0xa8:     OpenMP: ON
02:02:25:WU00:FS00:0xa8:       CUDA: OFF
02:02:25:WU00:FS00:0xa8:       Args: -dir 00 -suffix 01 -version 706 -lifeline 731 -checkpoint 15 -np 4
02:02:25:WU00:FS00:0xa8:************************************ libFAH ************************************
02:02:25:WU00:FS00:0xa8:       Date: Jan 16 2021
02:02:25:WU00:FS00:0xa8:       Time: 19:29:00
02:02:25:WU00:FS00:0xa8:   Compiler: GNU 8.3.0
02:02:25:WU00:FS00:0xa8:    Options: -faligned-new -std=c++14 -fsigned-char -ffunction-sections
02:02:25:WU00:FS00:0xa8:             -fdata-sections -O3 -funroll-loops -fno-pie
02:02:25:WU00:FS00:0xa8:   Platform: linux2 4.15.0-128-generic
02:02:25:WU00:FS00:0xa8:       Bits: 64
02:02:25:WU00:FS00:0xa8:       Mode: Release
02:02:25:WU00:FS00:0xa8:************************************ CBang *************************************
02:02:25:WU00:FS00:0xa8:       Date: Jan 16 2021
02:02:25:WU00:FS00:0xa8:       Time: 19:28:44
02:02:25:WU00:FS00:0xa8:   Compiler: GNU 8.3.0
02:02:25:WU00:FS00:0xa8:    Options: -faligned-new -std=c++14 -fsigned-char -ffunction-sections
02:02:25:WU00:FS00:0xa8:             -fdata-sections -O3 -funroll-loops -fno-pie -fPIC
02:02:25:WU00:FS00:0xa8:   Platform: linux2 4.15.0-128-generic
02:02:25:WU00:FS00:0xa8:       Bits: 64
02:02:25:WU00:FS00:0xa8:       Mode: Release
02:02:25:WU00:FS00:0xa8:************************************ System ************************************
02:02:25:WU00:FS00:0xa8:        CPU: Neoverse N
02:02:25:WU00:FS00:0xa8:     CPU ID: Arm Family 8 Model 1 Stepping 1
02:02:25:WU00:FS00:0xa8:       CPUs: 4
02:02:25:WU00:FS00:0xa8:     Memory: 7.61GiB
02:02:25:WU00:FS00:0xa8:Free Memory: 7.08GiB
02:02:25:WU00:FS00:0xa8:    Threads: POSIX_THREADS
02:02:25:WU00:FS00:0xa8: OS Version: 5.11
02:02:25:WU00:FS00:0xa8:Has Battery: false
02:02:25:WU00:FS00:0xa8: On Battery: false
02:02:25:WU00:FS00:0xa8: UTC Offset: 0
02:02:25:WU00:FS00:0xa8:        PID: 735
2:02:25:WU00:FS00:0xa8:        CWD: /var/lib/fahclient/work
02:02:25:WU00:FS00:0xa8:********************************************************************************
02:02:25:WU00:FS00:0xa8:Project: 16955 (Run 5, Clone 2771, Gen 107)
02:02:25:WU00:FS00:0xa8:Unit: 0x00000000000000000000000000000000
02:02:25:WU00:FS00:0xa8:Digital signatures verified
02:02:25:WU00:FS00:0xa8:Calling: mdrun -c frame107.gro -s frame107.tpr -x frame107.xtc -cpi state.cpt -cpt 15 -nt 4 -ntmpi 1
02:02:25:WU00:FS00:0xa8:Steps: first=53500000 total=54000000
02:03:26:Saving configuration to /etc/fahclient/config.xml
02:03:26:<config>
02:03:26:  <!-- Client Control -->
02:03:26:  <fold-anon v='true'/>
02:03:26:
02:03:26:  <!-- Folding Slot Configuration -->
02:03:26:  <cause v='COVID_19'/>
02:03:26:
02:03:26:  <!-- Slot Control -->
02:03:26:  <power v='full'/>
02:03:26:
02:03:26:  <!-- User Information -->
02:03:26:  <passkey v='*****'/>
02:03:26:  <team v='****'/>
02:03:26:  <user v='******'/>
02:03:26:
02:03:26:  <!-- Folding Slots -->
02:03:26:  <slot id='0' type='CPU'/>
02:03:26:</config>
02:04:11:WU00:FS00:0xa8:Completed 1747 out of 500000 steps (0%)
02:09:00:WU00:FS00:0xa8:Completed 5000 out of 500000 steps (1%)
02:16:23:WU00:FS00:0xa8:Completed 10000 out of 500000 steps (2%)
02:23:48:WU00:FS00:0xa8:Completed 15000 out of 500000 steps (3%)
02:31:11:WU00:FS00:0xa8:Completed 20000 out of 500000 steps (4%)
02:38:36:WU00:FS00:0xa8:Completed 25000 out of 500000 steps (5%)
02:46:00:WU00:FS00:0xa8:Completed 30000 out of 500000 steps (6%)
02:53:24:WU00:FS00:0xa8:Completed 35000 out of 500000 steps (7%)
03:00:47:WU00:FS00:0xa8:Completed 40000 out of 500000 steps (8%)
03:08:12:WU00:FS00:0xa8:Completed 45000 out of 500000 steps (9%)
03:15:35:WU00:FS00:0xa8:Completed 50000 out of 500000 steps (10%)
03:22:59:WU00:FS00:0xa8:Completed 55000 out of 500000 steps (11%)
03:30:23:WU00:FS00:0xa8:Completed 60000 out of 500000 steps (12%)
03:37:47:WU00:FS00:0xa8:Completed 65000 out of 500000 steps (13%)
03:45:11:WU00:FS00:0xa8:Completed 70000 out of 500000 steps (14%)
03:52:35:WU00:FS00:0xa8:Completed 75000 out of 500000 steps (15%)
04:00:00:WU00:FS00:0xa8:Completed 80000 out of 500000 steps (16%)
04:07:24:WU00:FS00:0xa8:Completed 85000 out of 500000 steps (17%)
04:14:48:WU00:FS00:0xa8:Completed 90000 out of 500000 steps (18%)
04:22:13:WU00:FS00:0xa8:Completed 95000 out of 500000 steps (19%)
04:29:37:WU00:FS00:0xa8:Completed 100000 out of 500000 steps (20%)
04:37:01:WU00:FS00:0xa8:Completed 105000 out of 500000 steps (21%)
04:44:26:WU00:FS00:0xa8:Completed 110000 out of 500000 steps (22%)
04:51:51:WU00:FS00:0xa8:Completed 115000 out of 500000 steps (23%)
04:59:15:WU00:FS00:0xa8:Completed 120000 out of 500000 steps (24%)
05:06:40:WU00:FS00:0xa8:Completed 125000 out of 500000 steps (25%)
05:14:05:WU00:FS00:0xa8:Completed 130000 out of 500000 steps (26%)
05:21:30:WU00:FS00:0xa8:Completed 135000 out of 500000 steps (27%)
05:28:54:WU00:FS00:0xa8:Completed 140000 out of 500000 steps (28%)
05:36:19:WU00:FS00:0xa8:Completed 145000 out of 500000 steps (29%)
05:43:43:WU00:FS00:0xa8:Completed 150000 out of 500000 steps (30%)
05:51:07:WU00:FS00:0xa8:Completed 155000 out of 500000 steps (31%)
05:58:31:WU00:FS00:0xa8:Completed 160000 out of 500000 steps (32%)
06:05:57:WU00:FS00:0xa8:Completed 165000 out of 500000 steps (33%)
06:13:20:WU00:FS00:0xa8:Completed 170000 out of 500000 steps (34%)
06:20:48:WU00:FS00:0xa8:Completed 175000 out of 500000 steps (35%)
06:28:12:WU00:FS00:0xa8:Completed 180000 out of 500000 steps (36%)
06:35:36:WU00:FS00:0xa8:Completed 185000 out of 500000 steps (37%)
06:43:00:WU00:FS00:0xa8:Completed 190000 out of 500000 steps (38%)
06:50:24:WU00:FS00:0xa8:Completed 195000 out of 500000 steps (39%)
06:57:47:WU00:FS00:0xa8:Completed 200000 out of 500000 steps (40%)
07:05:11:WU00:FS00:0xa8:Completed 205000 out of 500000 steps (41%)
07:12:34:WU00:FS00:0xa8:Completed 210000 out of 500000 steps (42%)
toTOW
Site Moderator
Posts: 6349
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France
Contact:

Re: Please Add/Whitelist Nvidia Tesla T4G

Post by toTOW »

1eb4 / TU104GL [T4G] has been added to the list of supported GPUs.
Image

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
XanderF
Posts: 42
Joined: Thu Aug 11, 2011 12:25 am

Re: Please Add/Whitelist Nvidia Tesla T4G

Post by XanderF »

This the Tesla T4G still supported?

I've had a system working okay for a week or so, but just started failing today:

Code: Select all

*********************** Log Started 2022-03-28T03:30:32Z ***********************
03:30:32:Trying to access database...
03:30:32:Successfully acquired database lock
03:30:32:Read GPUs.txt
03:30:35:Enabled folding slot 00: READY gpu:0:TU104GL [Tesla T4]
03:30:36:ERROR:No compute devices matched GPU #0 {
03:30:36:ERROR:  "vendor": 4318,
03:30:36:ERROR:  "device": 7864,
03:30:36:ERROR:  "type": 2,
03:30:36:ERROR:  "species": 7,
03:30:36:ERROR:  "description": "TU104GL [Tesla T4]"
03:30:36:ERROR:}.  You may need to update your graphics drivers.
nvidia-smi -q suggests a slightly different device ID from above:

Code: Select all

GPU UUID                              : GPU-1b2b1b2f-e16c-71c9-3f46-8a9af38a030a
    Minor Number                          : 0
    VBIOS Version                         : 90.04.96.00.01
    MultiGPU Board                        : No
    Board ID                              : 0x4
    GPU Part Number                       : 900-2G183-6300-T00
    Inforom Version
        Image Version                     : G183.0200.00.02
        OEM Object                        : 1.1
        ECC Object                        : 5.0
        Power Management Object           : N/A
    GPU Operation Mode
        Current                           : N/A
        Pending                           : N/A
    GPU Virtualization Mode
        Virtualization Mode               : Pass-Through
        Host VGPU Mode                    : N/A
    IBMNPU
        Relaxed Ordering Mode             : N/A
    PCI
        Bus                               : 0x00
        Device                            : 0x04
        Domain                            : 0x0000
        Device Id                         : 0x1EB810DE
        Bus Id                            : 00000000:00:04.0
        Sub System Id                     : 0x12A210DE
toTOW
Site Moderator
Posts: 6349
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France
Contact:

Re: Please Add/Whitelist Nvidia Tesla T4G

Post by toTOW »

Yes it is.

This happens when you get a kernel update ... you have to reinstall NV drivers to rebuild the module for the new kernel ...
Image

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
PaulTV
Posts: 208
Joined: Mon Jan 25, 2021 4:53 pm
Location: Netherlands

Re: Please Add/Whitelist Nvidia Tesla T4G

Post by PaulTV »

With something like dkms, the NV drivers will be rebuild automatically if a new kernel is installed. I've got that running on my Ubuntu folding rig, works like a charm.
Image

Ryzen 5800X / RTX 4090 / Windows 11
Ryzen 5600X / RTX 3070 Ti / Ubuntu 22.04
Ryzen 5600 / RTX 3060 Ti / Windows 11
MARSTG
Posts: 40
Joined: Fri Apr 06, 2012 4:20 pm
Hardware configuration: B450 AORUS M w 3800X & WRAITH PRISM
16 GB DELTA RGB 3600 & WD SN 770 500GB
GTX 1070 GAMING X w LG OLED C1 55"
ECLIPSE P300A TG
Location: Montreal

Re: Please Add/Whitelist Nvidia Tesla T4G

Post by MARSTG »

Tesla T4G proves to be a Quadro RTX 4000 built on the TU104 chip. Techpowerup says that performance wise, an RTX 2070 SUPER is 16% faster at gaming, and my HP 2070S pulls consistently over 3 mil PPD, on Windows, and you can look up my stats on EOC as it is the only card folding rn.
Post Reply