Projects 14158, 14163, 14166: Forces are blowing up! GPU

braiam
Posts: 16
Joined: Mon Mar 23, 2020 2:56 pm

Post by braiam »

These projects seem unable to run at all on my GPU using the ROCm libraries, whereas projects 16435, 11742-11745, and 14549 have run fine; 16435 is running right now. I'm not sure what's wrong with these projects, apart from this error:

Code:

01:27:17:WU01:FS00:0x21:*********************** Log Started 2020-05-01T01:27:16Z ***********************
01:27:17:WU01:FS00:0x21:Project: 14158 (Run 2, Clone 7672, Gen 0)
01:27:17:WU01:FS00:0x21:Unit: 0x000000000002894c5d3b22824c98d58a
01:27:17:WU01:FS00:0x21:CPU: 0x00000000000000000000000000000000
01:27:17:WU01:FS00:0x21:Machine: 0
01:27:17:WU01:FS00:0x21:Reading tar file core.xml
01:27:17:WU01:FS00:0x21:Reading tar file integrator.xml
01:27:17:WU01:FS00:0x21:Reading tar file state.xml
01:27:17:WU01:FS00:0x21:Reading tar file system.xml
01:27:17:WU01:FS00:0x21:Digital signatures verified
01:27:17:WU01:FS00:0x21:Folding@home GPU Core21 Folding@home Core
01:27:17:WU01:FS00:0x21:Version 0.0.20
01:27:24:WU01:FS00:0x21:ERROR:Discrepancy: Forces are blowing up! 1 0
These seem to be old projects, going by their ID numbers. Maybe a rebuild with the current cores is necessary. These are the most recent reports:

Code:

05:58:27:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:16435 run:1957 clone:2 gen:1 core:0x22 unit:0x0000000103854c135e9a4ef8f8dd6327
08:42:54:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:11744 run:0 clone:9865 gen:9 core:0x22 unit:0x0000000e8ca304f15e6bc3e3836f6a1a
12:08:04:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:11742 run:0 clone:6070 gen:55 core:0x22 unit:0x0000004b8ca304f15e6bc52082b2bf56
18:56:59:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:16435 run:1127 clone:0 gen:4 core:0x22 unit:0x0000000403854c135e9a4efa897e7ffc
22:14:16:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:11743 run:0 clone:853 gen:64 core:0x22 unit:0x0000005d8ca304f15e67e07a60b3bf3a
05:18:24:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:16435 run:3537 clone:1 gen:1 core:0x22 unit:0x0000000203854c135e9a4efb665c51ba
15:39:54:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:14549 run:0 clone:1421 gen:38 core:0x22 unit:0x0000002e0d5262775e863e36216cf608
19:01:04:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:11743 run:0 clone:9910 gen:13 core:0x22 unit:0x000000128ca304f15e6bc454461d1ece
21:56:55:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:11745 run:0 clone:7984 gen:22 core:0x22 unit:0x000000248ca304f15e6bc39d6666c22c
21:57:03:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:14158 run:3 clone:2643 gen:0 core:0x21 unit:0x000000000002894c5d3b23421311cafc
22:07:07:WU02:FS00:Sending unit results: id:02 state:SEND error:FAULTY project:14166 run:19 clone:99 gen:0 core:0x21 unit:0x000000000002894c5eab37719d65887a
00:57:34:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:14538 run:0 clone:954 gen:93 core:0x22 unit:0x000000810d5262775e7b9cdd11079134
00:57:43:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:14158 run:2 clone:6705 gen:0 core:0x21 unit:0x000000010002894c5d3b225b657cc0d4
01:00:42:WU02:FS00:Sending unit results: id:02 state:SEND error:FAULTY project:14158 run:2 clone:4657 gen:1 core:0x21 unit:0x000000040002894c5d3b220e01464077
01:01:36:WU00:FS00:Sending unit results: id:00 state:SEND error:FAULTY project:14158 run:2 clone:7000 gen:0 core:0x21 unit:0x000000000002894c5d3b22678425c889
01:05:12:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:14158 run:2 clone:4867 gen:1 core:0x21 unit:0x000000010002894c5d3b2217b1db8903
01:07:30:WU00:FS00:Sending unit results: id:00 state:SEND error:FAULTY project:14158 run:3 clone:7140 gen:0 core:0x21 unit:0x000000000002894c5d3b23e9d74a11ed
01:12:45:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:14158 run:3 clone:7318 gen:0 core:0x21 unit:0x000000000002894c5d3b23ef444f480a
01:13:46:WU00:FS00:Sending unit results: id:00 state:SEND error:FAULTY project:14163 run:31 clone:37 gen:2 core:0x21 unit:0x000000030002894c5eab3776e6c04dc4
01:17:40:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:14158 run:3 clone:7434 gen:0 core:0x21 unit:0x000000000002894c5d3b23f34ee48695
01:26:19:WU00:FS00:Sending unit results: id:00 state:SEND error:FAULTY project:14158 run:2 clone:4080 gen:2 core:0x21 unit:0x000000030002894c5d3b21f85dc22e44
01:27:25:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:14158 run:2 clone:7672 gen:0 core:0x21 unit:0x000000000002894c5d3b22824c98d58a
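
One way to check whether the failures track the core version rather than the individual project is to tally the report lines by core and error state. The following is a minimal Python sketch, assuming the "Sending unit results" line format shown in the reports above; `RESULT_RE` and `tally_results` are hypothetical names for illustration and are not part of the FAH client.

```python
import re
from collections import Counter

# Matches the fields of interest in a "Sending unit results" line.
# The field layout is an assumption taken from the log lines quoted above.
RESULT_RE = re.compile(
    r"error:(?P<error>\w+) project:(?P<project>\d+) .*? core:(?P<core>0x[0-9a-fA-F]+)"
)


def tally_results(lines):
    """Count results per (core, error) pair and collect the projects seen for each pair."""
    counts = Counter()
    projects = {}
    for line in lines:
        match = RESULT_RE.search(line)
        if match is None:
            continue  # skip lines that are not result reports
        key = (match["core"], match["error"])
        counts[key] += 1
        projects.setdefault(key, set()).add(int(match["project"]))
    return counts, projects


# Three lines copied from the reports above, used as sample input.
sample = [
    "05:58:27:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:16435 run:1957 clone:2 gen:1 core:0x22 unit:0x0000000103854c135e9a4ef8f8dd6327",
    "21:57:03:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:14158 run:3 clone:2643 gen:0 core:0x21 unit:0x000000000002894c5d3b23421311cafc",
    "22:07:07:WU02:FS00:Sending unit results: id:02 state:SEND error:FAULTY project:14166 run:19 clone:99 gen:0 core:0x21 unit:0x000000000002894c5eab37719d65887a",
]

counts, projects = tally_results(sample)
for (core, error), n in sorted(counts.items()):
    print(core, error, n, sorted(projects[(core, error)]))
```

In the full report list above, all 12 FAULTY results are on core 0x21 and all 10 NO_ERROR results are on core 0x22. That is consistent with a problem specific to the older core on this setup, though it does not prove it.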
schapman1978
Posts: 35
Joined: Mon Nov 19, 2012 11:12 pm

Re: Projects 14158, 14163, 14166: Forces are blowing up! GPU

Post by schapman1978 »

I can tell you my computer is not happy with these 14163 work units. They make the coils scream on my 2080 Tis, run for 90 minutes apiece, and are worth about 60k credits. It makes me nervous to hear coil whine this loud from multiple $1200 cards. Maybe there's a shortage of more complex units? Two of my GPUs are idle this morning.
PantherX
Site Moderator
Posts: 6986
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud

Re: Projects 14158, 14163, 14166: Forces are blowing up! GPU

Post by PantherX »

schapman1978 wrote: ...Maybe there's a shortage of more complex units? Two of my GPUs are idle this morning.
That's correct. Work is being done to meet the high demand for GPU WUs.
braiam
Posts: 16
Joined: Mon Mar 23, 2020 2:56 pm

Re: Projects 14158, 14163, 14166: Forces are blowing up! GPU

Post by braiam »

PantherX wrote:
schapman1978 wrote: ...Maybe there's a shortage of more complex units? Two of my GPUs are idle this morning.
That's correct. Work is being done to meet the high demand for GPU WUs.
Any suggestion for my issue?
Neil-B
Posts: 1996
Joined: Sun Mar 22, 2020 5:52 pm
Hardware configuration: 1: 2x Xeon E5-2697v3@2.60GHz, 512GB DDR4 LRDIMM, SSD Raid, Win10 Ent 20H2, Quadro K420 1GB, FAH 7.6.21
2: Xeon E3-1505Mv5@2.80GHz, 32GB DDR4, NVME, Win10 Pro 20H2, Quadro M1000M 2GB, FAH 7.6.21 (actually have two of these)
3: i7-960@3.20GHz, 12GB DDR3, SSD, Win10 Pro 20H2, GTX 750Ti 2GB, GTX 1080Ti 11GB, FAH 7.6.21
Location: UK

Re: Projects 14158, 14163, 14166: Forces are blowing up! GPU

Post by Neil-B »

For your issue, it may be that these projects just don't work well with your ROCm drivers. While some projects don't complain, it would appear these may have an issue; see viewtopic.php?f=19&t=31711#p331383