Projects 14158, 14163, 14166: Forces are blowing up! GPU

Posted: Fri May 01, 2020 2:35 am
by braiam
These projects seem unable to run at all on my GPU using the ROCm libraries, whereas projects 16435, 11742-11745, and 14549 run fine; 16435 is running right now. I'm not sure what's wrong with these projects, except:

Code:

01:27:17:WU01:FS00:0x21:*********************** Log Started 2020-05-01T01:27:16Z ***********************
01:27:17:WU01:FS00:0x21:Project: 14158 (Run 2, Clone 7672, Gen 0)
01:27:17:WU01:FS00:0x21:Unit: 0x000000000002894c5d3b22824c98d58a
01:27:17:WU01:FS00:0x21:CPU: 0x00000000000000000000000000000000
01:27:17:WU01:FS00:0x21:Machine: 0
01:27:17:WU01:FS00:0x21:Reading tar file core.xml
01:27:17:WU01:FS00:0x21:Reading tar file integrator.xml
01:27:17:WU01:FS00:0x21:Reading tar file state.xml
01:27:17:WU01:FS00:0x21:Reading tar file system.xml
01:27:17:WU01:FS00:0x21:Digital signatures verified
01:27:17:WU01:FS00:0x21:Folding@home GPU Core21 Folding@home Core
01:27:17:WU01:FS00:0x21:Version 0.0.20
01:27:24:WU01:FS00:0x21:ERROR:Discrepancy: Forces are blowing up! 1 0
These seem to be old projects, going by their ID numbers. Maybe a rebuild with current cores is necessary. These are the most recent reports:

Code:

05:58:27:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:16435 run:1957 clone:2 gen:1 core:0x22 unit:0x0000000103854c135e9a4ef8f8dd6327
08:42:54:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:11744 run:0 clone:9865 gen:9 core:0x22 unit:0x0000000e8ca304f15e6bc3e3836f6a1a
12:08:04:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:11742 run:0 clone:6070 gen:55 core:0x22 unit:0x0000004b8ca304f15e6bc52082b2bf56
18:56:59:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:16435 run:1127 clone:0 gen:4 core:0x22 unit:0x0000000403854c135e9a4efa897e7ffc
22:14:16:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:11743 run:0 clone:853 gen:64 core:0x22 unit:0x0000005d8ca304f15e67e07a60b3bf3a
05:18:24:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:16435 run:3537 clone:1 gen:1 core:0x22 unit:0x0000000203854c135e9a4efb665c51ba
15:39:54:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:14549 run:0 clone:1421 gen:38 core:0x22 unit:0x0000002e0d5262775e863e36216cf608
19:01:04:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:11743 run:0 clone:9910 gen:13 core:0x22 unit:0x000000128ca304f15e6bc454461d1ece
21:56:55:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:11745 run:0 clone:7984 gen:22 core:0x22 unit:0x000000248ca304f15e6bc39d6666c22c
21:57:03:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:14158 run:3 clone:2643 gen:0 core:0x21 unit:0x000000000002894c5d3b23421311cafc
22:07:07:WU02:FS00:Sending unit results: id:02 state:SEND error:FAULTY project:14166 run:19 clone:99 gen:0 core:0x21 unit:0x000000000002894c5eab37719d65887a
00:57:34:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:14538 run:0 clone:954 gen:93 core:0x22 unit:0x000000810d5262775e7b9cdd11079134
00:57:43:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:14158 run:2 clone:6705 gen:0 core:0x21 unit:0x000000010002894c5d3b225b657cc0d4
01:00:42:WU02:FS00:Sending unit results: id:02 state:SEND error:FAULTY project:14158 run:2 clone:4657 gen:1 core:0x21 unit:0x000000040002894c5d3b220e01464077
01:01:36:WU00:FS00:Sending unit results: id:00 state:SEND error:FAULTY project:14158 run:2 clone:7000 gen:0 core:0x21 unit:0x000000000002894c5d3b22678425c889
01:05:12:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:14158 run:2 clone:4867 gen:1 core:0x21 unit:0x000000010002894c5d3b2217b1db8903
01:07:30:WU00:FS00:Sending unit results: id:00 state:SEND error:FAULTY project:14158 run:3 clone:7140 gen:0 core:0x21 unit:0x000000000002894c5d3b23e9d74a11ed
01:12:45:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:14158 run:3 clone:7318 gen:0 core:0x21 unit:0x000000000002894c5d3b23ef444f480a
01:13:46:WU00:FS00:Sending unit results: id:00 state:SEND error:FAULTY project:14163 run:31 clone:37 gen:2 core:0x21 unit:0x000000030002894c5eab3776e6c04dc4
01:17:40:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:14158 run:3 clone:7434 gen:0 core:0x21 unit:0x000000000002894c5d3b23f34ee48695
01:26:19:WU00:FS00:Sending unit results: id:00 state:SEND error:FAULTY project:14158 run:2 clone:4080 gen:2 core:0x21 unit:0x000000030002894c5d3b21f85dc22e44
01:27:25:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:14158 run:2 clone:7672 gen:0 core:0x21 unit:0x000000000002894c5d3b22824c98d58a
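
In the reports above, every FAULTY result comes from core 0x21 and every NO_ERROR result from core 0x22. A minimal sketch for tallying this from a log, assuming the "Sending unit results" line format shown above; `RESULT_RE` and `tally_results` are names made up for this example and are not part of any Folding@home tool:

```python
import re
from collections import Counter

# Assumed format: matches the "Sending unit results" lines quoted in this post.
RESULT_RE = re.compile(
    r"error:(?P<error>\w+) project:(?P<project>\d+) .*? core:(?P<core>0x[0-9a-fA-F]+)"
)

def tally_results(log_text):
    """Count results per (core, error) pair and collect the projects seen for each."""
    counts = Counter()
    projects = {}
    for line in log_text.splitlines():
        match = RESULT_RE.search(line)
        if match is None:
            continue  # not a "Sending unit results" line
        key = (match["core"], match["error"])
        counts[key] += 1
        projects.setdefault(key, set()).add(int(match["project"]))
    return counts, projects

# Three lines copied from the reports above.
sample = """\
05:58:27:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:16435 run:1957 clone:2 gen:1 core:0x22 unit:0x0000000103854c135e9a4ef8f8dd6327
21:57:03:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:14158 run:3 clone:2643 gen:0 core:0x21 unit:0x000000000002894c5d3b23421311cafc
22:07:07:WU02:FS00:Sending unit results: id:02 state:SEND error:FAULTY project:14166 run:19 clone:99 gen:0 core:0x21 unit:0x000000000002894c5eab37719d65887a
"""

counts, projects = tally_results(sample)
for (core, error), n in sorted(counts.items()):
    print(core, error, n, sorted(projects[(core, error)]))
```

Running it over the whole log file instead of `sample` would show whether the failures track the core version or only specific projects.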

Re: Projects 14158, 14163, 14166: Forces are blowing up! GPU

Posted: Fri May 01, 2020 8:48 am
by schapman1978
I can tell you my computer is not happy with these 14163 work units. They make the coils scream on my 2080 Tis, run for 90 minutes apiece, and are worth about 60k credit. It makes me nervous to hear coil whine this loud from multiple $1200 cards. Maybe there's a shortage of more complex units? 2 of my GPUs are idle this morning.

Re: Projects 14158, 14163, 14166: Forces are blowing up! GPU

Posted: Fri May 01, 2020 9:05 am
by PantherX
schapman1978 wrote: ...Maybe there's a shortage of more complex units? 2 of my GPUs are idle this morning.
That's correct. Work is being done to meet the high demand for GPU WUs.

Re: Projects 14158, 14163, 14166: Forces are blowing up! GPU

Posted: Fri May 01, 2020 12:35 pm
by braiam
PantherX wrote:
schapman1978 wrote: ...Maybe there's a shortage of more complex units? 2 of my GPUs are idle this morning.
That's correct. Work is being done to meet the high demand for GPU WUs.
Any suggestion for my issue?

Re: Projects 14158, 14163, 14166: Forces are blowing up! GPU

Posted: Fri May 01, 2020 12:43 pm
by Neil-B
For your issue, it may be that these projects just don't work well with your ROCm drivers. While some projects don't complain, it would appear these may have an issue … see viewtopic.php?f=19&t=31711#p331383