
Unknown Enum Error

Posted: Sat Jun 16, 2018 2:36 pm
by Nert
I received a number of unknown enum errors today. The system is Linux Mint 17 with no overclocks. Everything has been very stable for a long time. I started CPU folding alongside my GPU folding about a week ago on this system. I believe this is the first A7 unit that I have received. The CPU is an i3-8100.

Here's the beginning of the log:

Code:

*********************** Log Started 2018-06-15T02:41:51Z ***********************
02:41:51:************************* Folding@home Client *************************
02:41:51:    Website: http://folding.stanford.edu/
02:41:51:  Copyright: (c) 2009-2014 Stanford University
02:41:51:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
02:41:51:       Args: --child --lifeline 1769 /etc/fahclient/config.xml --run-as
02:41:51:             fahclient --pid-file=/var/run/fahclient.pid --daemon
02:41:51:     Config: /etc/fahclient/config.xml
02:41:51:******************************** Build ********************************
02:41:51:    Version: 7.4.4
02:41:51:       Date: Mar 4 2014
02:41:51:       Time: 12:02:38
02:41:51:    SVN Rev: 4130
02:41:51:     Branch: fah/trunk/client
02:41:51:   Compiler: GNU 4.4.7
02:41:51:    Options: -std=gnu++98 -O3 -funroll-loops -mfpmath=sse -ffast-math
02:41:51:             -fno-unsafe-math-optimizations -msse2
02:41:51:   Platform: linux2 3.2.0-1-amd64
02:41:51:       Bits: 64
02:41:51:       Mode: Release
02:41:51:******************************* System ********************************
02:41:51:        CPU: Intel(R) Core(TM) i5-4590 CPU @ 3.30GHz
02:41:51:     CPU ID: GenuineIntel Family 6 Model 60 Stepping 3
02:41:51:       CPUs: 4
02:41:51:     Memory: 15.61GiB
02:41:51:Free Memory: 14.96GiB
02:41:51:    Threads: POSIX_THREADS
02:41:51: OS Version: 3.19
02:41:51:Has Battery: false
02:41:51: On Battery: false
02:41:51: UTC Offset: -5
02:41:51:        PID: 1771
02:41:51:        CWD: /var/lib/fahclient
02:41:51:         OS: Linux 3.19.0-32-generic x86_64
02:41:51:    OS Arch: AMD64
02:41:51:       GPUs: 3
02:41:51:      GPU 0: NVIDIA:7 GP104 [GeForce GTX 1080] 8873
02:41:51:      GPU 1: UNSUPPORTED: NV3 [PCI]
02:41:51:      GPU 2: NVIDIA:7 GP102 [GeForce GTX 1080 Ti] 11380
02:41:51:       CUDA: 6.1
02:41:51:CUDA Driver: 9000
02:41:51:***********************************************************************
02:41:51:<config>
02:41:51:  <!-- Client Control -->
02:41:51:  <fold-anon v='true'/>
02:41:51:
02:41:51:  <!-- Folding Slot Configuration -->
02:41:51:  <cause v='ALZHEIMERS'/>
02:41:51:  <gpu v='false'/>
02:41:51:
02:41:51:  <!-- HTTP Server -->
02:41:51:  <allow v='192.168.0.0-192.168.0.255'/>
02:41:51:
02:41:51:  <!-- Network -->
02:41:51:  <proxy v=':8080'/>
02:41:51:
02:41:51:  <!-- Remote Command Server -->
02:41:51:  <password v='****'/>
02:41:51:
02:41:51:  <!-- Slot Control -->
02:41:51:  <power v='full'/>
02:41:51:
02:41:51:  <!-- User Information -->
02:41:51:  <passkey v='********************************'/>
02:41:51:  <team v='224497'/>
02:41:51:  <user v='nert_ALL_1KqFJ6gDgARrEvTDsJFE9dXX3B4ttLsv1g'/>
02:41:51:
02:41:51:  <!-- Folding Slots -->
02:41:51:  <slot id='1' type='GPU'>
02:41:51:    <paused v='true'/>
02:41:51:  </slot>
02:41:51:  <slot id='2' type='GPU'>
02:41:51:    <paused v='true'/>
02:41:51:  </slot>
02:41:51:  <slot id='0' type='CPU'>
02:41:51:    <cpus v='2'/>
02:41:51:    <paused v='true'/>
02:41:51:  </slot>
02:41:51:</config>
02:41:51:Switching to user fahclient
02:41:51:Trying to access database...
02:41:51:Successfully acquired database lock
Here are the last few error messages:

Code:

AVX/Core_a7.fah/FahCore_a7 -dir 00 -suffix 01 -version 704 -lifeline 1771 -checkpoint 15 -np 2
14:19:39:WU00:FS00:Started FahCore on PID 11084
14:19:39:WU00:FS00:Core PID:11088
14:19:39:WU00:FS00:FahCore 0xa7 started
14:19:39:WARNING:WU00:FS00:FahCore returned: UNKNOWN_ENUM (127 = 0x7f)
14:19:40:WU03:FS01:0x21:Completed 5400000 out of 7500000 steps (72%)
14:20:39:WU00:FS00:Starting
14:20:39:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/Linux/AMD64/AVX/Core_a7.fah/FahCore_a7 -dir 00 -suffix 01 -version 704 -lifeline 1771 -checkpoint 15 -np 2
14:20:39:WU00:FS00:Started FahCore on PID 11120
14:20:39:WU00:FS00:Core PID:11124
14:20:39:WU00:FS00:FahCore 0xa7 started
14:20:39:WARNING:WU00:FS00:FahCore returned: UNKNOWN_ENUM (127 = 0x7f)
14:21:14:WU03:FS01:0x21:Completed 5475000 out of 7500000 steps (73%)
14:21:17:WU02:FS02:0x21:Completed 3450000 out of 7500000 steps (46%)
14:21:39:WU00:FS00:Starting
14:21:39:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/Linux/AMD64/AVX/Core_a7.fah/FahCore_a7 -dir 00 -suffix 01 -version 704 -lifeline 1771 -checkpoint 15 -np 2
14:21:39:WU00:FS00:Started FahCore on PID 11156
14:21:39:WU00:FS00:Core PID:11160
14:21:39:WU00:FS00:FahCore 0xa7 started
14:21:39:WARNING:WU00:FS00:FahCore returned: UNKNOWN_ENUM (127 = 0x7f)
14:21:39:WARNING:WU00:FS00:Too many errors, failing
14:21:39:WU00:FS00:Sending unit results: id:00 state:SEND error:FAILED project:13812 run:0 clone:586 gen:136 core:0xa7 unit:0x0000009880fccb025a981912d3f48e81
14:21:39:WU00:FS00:Connecting to 128.252.203.2:8080
14:21:39:WU00:FS00:Server responded WORK_ACK (400)
14:21:40:WU00:FS00:Cleaning up
14:21:40:WU00:FS00:Connecting to 65.254.110.245:8080
14:21:40:WU00:FS00:Assigned to work server 128.252.203.2
14:21:40:WU00:FS00:Requesting new work unit for slot 00: READY cpu:2 from 128.252.203.2
14:21:40:WU00:FS00:Connecting to 128.252.203.2:8080
14:21:40:WU00:FS00:Downloading 4.57MiB
14:21:42:WU00:FS00:Download complete
After this a new A4 unit downloaded and is working fine.
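
For anyone reading a long log, the repeated failures can be tallied with a small filter. This is only a sketch based on the line format shown above; the function name `summarize_core_returns` is made up for illustration, and the log path in the usage comment is an assumption. Note that status 127 is also what a POSIX shell reports when a command cannot be found, so the number alone does not identify the cause.

```shell
#!/bin/sh
# Summarize "FahCore returned:" lines from a FAHClient log read on stdin.
# Prints one line per (work unit, slot, result code) with a count.
# The function name is hypothetical, not part of any FAH tool.
summarize_core_returns() {
    # Log lines look like:
    #   14:19:39:WARNING:WU00:FS00:FahCore returned: UNKNOWN_ENUM (127 = 0x7f)
    sed -n -E 's/^.*(WU[0-9]+):(FS[0-9]+):FahCore returned: ([A-Z_]+) \(([0-9]+) = .*$/\1 \2 \3 \4/p' |
        sort | uniq -c |
        awk '{ print $2, $3, $4, "code=" $5, "count=" $1 }'
}

# Usage (the log location is an assumption; adjust for your install):
#   summarize_core_returns < /var/lib/fahclient/log.txt
```

Run against the excerpt above, this would report `WU00 FS00 UNKNOWN_ENUM code=127 count=3`, matching the three failed starts before "Too many errors, failing".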

Re: Unknown Enum Error

Posted: Sat Jun 16, 2018 4:34 pm
by JimboPalmer
I am going to tell you what I think I know; real experts will be along to point and laugh at me later.

Core_a7 can use SSE2 instructions if your CPU is older than Intel's Haswell or AMD's Excavator designs. If it is newer, it uses AVX instructions for even more speed. (Core_a4 is older and always uses SSE2)

When the operating system swaps out one task to run another, it saves all the registers and restores them when the task runs again, perhaps on another CPU. Obviously, OSes from before Haswell did not save the AVX registers, as they were new with Haswell and the authors had no idea there would be new registers. Once Haswell shipped, new releases of major OSes saved and restored the AVX registers along with all the others.

The programmers at F@H can see your CPU is a Haswell chip and so send you the AVX version of a7. I suspect Mint 17 is not saving and restoring AVX registers. You may wish to upgrade Mint.

Re: Unknown Enum Error

Posted: Sat Jun 16, 2018 5:01 pm
by Nert
JimboPalmer,

Thanks for the response. Updating to Mint 18 is something that I'm planning on doing. Since this is my main system, I've been reluctant to do that. I've got a lot of stuff on this system and I don't want to take a chance of losing any of it. Since folding recovered from the error, I'll keep running CPU folding. As nearly as I can tell Alzheimer's research is only being done on CPU. That's one of the diseases that I'm most interested in contributing to. In the meantime, I'll watch for other responses to see if there is some kind of O.S. setting or workaround that I can set to resolve this in the short term.

Re: Unknown Enum Error

Posted: Sat Jun 16, 2018 9:51 pm
by bruce
@JimboPalmer: Good guess. (I don't have anything better to offer.)

@Nert: Are you able to run a utility that tests AVX performance?

Have you searched the Mint support sites for something that might be called AVX support? (It might be considered a component of CPU drivers.)

FAH is supposed to test your system for AVX support and, based on that result, decide whether to run a version of Core_a7 that uses AVX or an alternate that uses SSE. If the answers to the previous questions are not fruitful, we may be able to FORCE your system to avoid AVX, but that's a work-around I'd prefer not to recommend, both because it's unsupported and because it would reduce FAH's performance. Still, reduced performance is certainly better than NO performance.
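
A quick first check, short of a real AVX benchmark, is whether the kernel lists the `avx` flag in `/proc/cpuinfo`. The sketch below assumes a Linux system; the function name `has_cpu_flag` is made up for illustration. A present flag does not prove FahCore_a7 will run, only that the kernel reports the feature.

```shell
#!/bin/sh
# Report whether a CPU feature flag appears in cpuinfo-style text.
# $1 = flag name (e.g. avx), $2 = file to read (defaults to /proc/cpuinfo).
# The function name is hypothetical; it is not a FAH or Mint utility.
has_cpu_flag() {
    flag=$1
    file=${2:-/proc/cpuinfo}
    # Take the first "flags" line, put each word on its own line,
    # then look for an exact whole-line match of the flag name.
    grep -m1 '^flags' "$file" | tr ' ' '\n' | grep -qx "$flag"
}

# Usage:
#   if has_cpu_flag avx; then echo "kernel reports AVX"; else echo "no AVX flag"; fi
```

The exact-match step matters because a plain substring search for `avx` would also match `avx2`.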

Re: Unknown Enum Error

Posted: Sat Jun 16, 2018 10:08 pm
by Nert
I just realized that the CPU info I posted was incorrect. :oops: The correct type is i5-4590. I forgot which system generated the error. I'll do some searching for AVX information on Linux Mint.

Re: Unknown Enum Error

Posted: Sun Jun 17, 2018 12:40 am
by SteveWillis
Just thought I'd mention:
There is an easy upgrade from Mint 17 to 17.3 in the Update Manager. From 17.3 there is an upgrade path to 18 that doesn't require a full reinstall.
Search for "How to Upgrade from Linux Mint 17.3 to Linux Mint 18" for details.

Re: Unknown Enum Error

Posted: Sun Jun 17, 2018 4:48 pm
by toTOW
Can you try the workaround that has been posted here : viewtopic.php?f=72&t=30851 ?

Re: Unknown Enum Error

Posted: Sun Jun 17, 2018 6:03 pm
by Nert
toTOW wrote:Can you try the workaround that has been posted here : viewtopic.php?f=72&t=30851 ?
I paused FAH and entered the following commands as outlined at the end of the thread:

Code:

sudo add-apt-repository ppa:ubuntu-toolchain-r/test 
sudo apt-get update
sudo apt-get upgrade
The process generated a lot more output on my terminal than I expected. I have captured and saved it for future reference. I probably should have asked about this before I entered the commands, but what exactly did this do to my system? What is the toolchain test repository?

I unpaused FAH and it seems to be working fine. Currently it is working on an A4 unit. I will monitor results to look for an A7 unit and post when I see results.

Re: Unknown Enum Error

Posted: Sun Jun 17, 2018 8:25 pm
by bollix47
Very basically, that PPA contains later versions of some software, like gcc, and of libraries, like libstdc++, that FahCore_a7 requires.

You can read about the repository at https://wiki.ubuntu.com/ToolChain .

Later versions of Ubuntu already have the newer libraries, but 14.04 does not (I believe 16.04 is the minimum). By adding the repository you've essentially added the later versions in a safe manner, and FahCore_a7 should now work.
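
One rough way to see whether the installed libstdc++ is new enough is to list the `GLIBCXX_*` version tags embedded in the library and in the core binary, then compare them. This is a sketch, not an official FAH diagnostic; the function name `list_glibcxx_versions` is made up, and the library path in the usage comment is the usual location on 64-bit Ubuntu-based systems, so treat it as an assumption.

```shell
#!/bin/sh
# List the GLIBCXX_* version tags embedded in a file, one per line.
# $1 = path to a shared library or executable.
# The function name is hypothetical, chosen for this example.
list_glibcxx_versions() {
    # -a treats the binary as text, -o prints only the matching part.
    grep -a -o 'GLIBCXX_[0-9][0-9.]*' "$1" | sort -u
}

# Usage (paths are assumptions; adjust to your system):
#   list_glibcxx_versions /usr/lib/x86_64-linux-gnu/libstdc++.so.6
#   list_glibcxx_versions FahCore_a7
```

If the core lists a tag that the library does not, the loader would be expected to refuse to start it. If every tag the core lists is also in the library, the cause probably lies elsewhere.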

Re: Unknown Enum Error

Posted: Sun Jun 17, 2018 9:15 pm
by bruce
toTOW wrote:Can you try the workaround that has been posted here : viewtopic.php?f=72&t=30851 ?
As was mentioned in that topic, what is the output of

Code:

ldd FahCore_a7
? (before you upgrade the library)

Re: Unknown Enum Error

Posted: Sun Jun 17, 2018 9:39 pm
by Nert
Here's Before:

Code:

roger@mintz97 /var/lib/fahclient/cores/cores.foldingathome.org/Linux/AMD64/AVX/Core_a7.fah $ ldd FahCore_a7
	linux-vdso.so.1 =>  (0x00007ffc945e8000)
	libpthread.so.0 => /lib/x86_64-linux-gnu/libpthread.so.0 (0x00007f39c5501000)
	libdl.so.2 => /lib/x86_64-linux-gnu/libdl.so.2 (0x00007f39c52fd000)
	libstdc++.so.6 => /usr/lib/x86_64-linux-gnu/libstdc++.so.6 (0x00007f39c4fe7000)
	libm.so.6 => /lib/x86_64-linux-gnu/libm.so.6 (0x00007f39c4ce1000)
	libgcc_s.so.1 => /lib/x86_64-linux-gnu/libgcc_s.so.1 (0x00007f39c4ac9000)
	libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007f39c4700000)
	/lib64/ld-linux-x86-64.so.2 (0x00007f39c6c53000)
and after

Code:

roger@mintz97 /var/lib/fahclient/cores/cores.foldingathome.org/Linux/AMD64/AVX/Core_a7.fah $ ldd FahCore_a7
	linux-vdso.so.1 =>  (0x00007fff73dec000)
	libpthread.so.0 => /lib/x86_64-linux-gnu/libpthread.so.0 (0x00007f361a006000)
	libdl.so.2 => /lib/x86_64-linux-gnu/libdl.so.2 (0x00007f3619e02000)
	libstdc++.so.6 => /usr/lib/x86_64-linux-gnu/libstdc++.so.6 (0x00007f3619aec000)
	libm.so.6 => /lib/x86_64-linux-gnu/libm.so.6 (0x00007f36197e6000)
	libgcc_s.so.1 => /lib/x86_64-linux-gnu/libgcc_s.so.1 (0x00007f36195ce000)
	libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007f3619205000)
	/lib64/ld-linux-x86-64.so.2 (0x00007f361b758000)
Still waiting for another A7 unit.

Re: Unknown Enum Error

Posted: Sun Jun 17, 2018 9:42 pm
by Nert
bollix47 wrote:Later versions of Ubuntu already have the newer libraries, but 14.04 does not (I believe 16.04 is the minimum). By adding the repository you've essentially added the later versions in a safe manner, and FahCore_a7 should now work.
Thanks for that. I got a little nervous when I saw all that output. I'm hopeful that this will solve the problem. ... I'll post when I get an A7 unit.

Re: Unknown Enum Error

Posted: Sun Jun 17, 2018 10:34 pm
by bollix47
Although your two ldd listings are different, the "before" listing shows no error, which I would have expected to see. The problem with your setup not being able to process a7 work units may still exist. We won't know for sure until you get one, but we may have to look elsewhere for a solution.
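
In the two listings above, the library names and resolved paths match line for line; only the hexadecimal load addresses differ, and those normally change from run to run because of address space layout randomization. A small filter makes that comparison mechanical. This is a sketch, and both function names are made up for illustration.

```shell
#!/bin/sh
# Strip the trailing load address from ldd output read on stdin, so two
# runs can be compared by library name and resolved path only.
# Both function names below are hypothetical helpers for this example.
strip_ldd_addresses() {
    sed -E 's/[[:space:]]*\(0x[0-9a-f]+\)$//'
}

# Print any dependencies ldd could not resolve (read from stdin).
# Succeeds only if no "not found" line is present.
ldd_all_resolved() {
    ! grep 'not found'
}

# Usage:
#   ldd FahCore_a7 | strip_ldd_addresses > before.txt
#   (change the libraries, then:)
#   ldd FahCore_a7 | strip_ldd_addresses | diff before.txt -
#   ldd FahCore_a7 | ldd_all_resolved && echo "all libraries resolved"
```

An empty `diff` means the same files are being loaded before and after. That shows only that the paths did not change, not whether the library's contents were upgraded.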

Re: Unknown Enum Error

Posted: Mon Jun 18, 2018 5:05 pm
by Nert
It looks like the changes did not fix the problem.

Code:

16:53:46:WU01:FS00:0xa4:Folding@home Core Shutdown: FINISHED_UNIT
16:53:56:WU01:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
16:53:56:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:14049 run:5 clone:13 gen:67 core:0xa4 unit:0x000000470002894c5aa42ca9219f9817
16:53:56:WU01:FS00:Uploading 4.62MiB to 155.247.166.220
16:53:56:WU01:FS00:Connecting to 155.247.166.220:8080
16:53:56:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
16:53:56:WU01:FS00:Connecting to 155.247.166.220:80
16:53:56:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 155.247.166.220:80: Connection refused
16:53:56:WU01:FS00:Trying to send results to collection server
16:53:56:WU01:FS00:Uploading 4.62MiB to 155.247.166.219
16:53:56:WU01:FS00:Connecting to 155.247.166.219:8080
16:53:56:WU03:FS00:Starting
16:53:56:WU03:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/Linux/AMD64/AVX/Core_a7.fah/FahCore_a7 -dir 03 -suffix 01 -version 704 -lifeline 1775 -checkpoint 15 -np 2
16:53:56:WU03:FS00:Started FahCore on PID 18860
16:53:56:WU03:FS00:Core PID:18864
16:53:56:WU03:FS00:FahCore 0xa7 started
16:53:57:WARNING:WU03:FS00:FahCore returned: UNKNOWN_ENUM (127 = 0x7f)
16:53:57:WU03:FS00:Starting
16:53:57:WU03:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/Linux/AMD64/AVX/Core_a7.fah/FahCore_a7 -dir 03 -suffix 01 -version 704 -lifeline 1775 -checkpoint 15 -np 2
16:53:57:WU03:FS00:Started FahCore on PID 18870
16:53:57:WU03:FS00:Core PID:18874
16:53:57:WU03:FS00:FahCore 0xa7 started
16:53:58:WARNING:WU03:FS00:FahCore returned: UNKNOWN_ENUM (127 = 0x7f)
16:54:02:WU01:FS00:Upload 37.89%
16:54:08:WU01:FS00:Upload 67.65%
16:54:14:WU01:FS00:Upload 98.77%
16:54:16:WU01:FS00:Upload complete
16:54:16:WU01:FS00:Server responded WORK_ACK (400)
16:54:16:WU01:FS00:Final credit estimate, 7220.00 points
16:54:16:WU01:FS00:Cleaning up
16:54:57:WU03:FS00:Starting
16:54:57:WU03:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/Linux/AMD64/AVX/Core_a7.fah/FahCore_a7 -dir 03 -suffix 01 -version 704 -lifeline 1775 -checkpoint 15 -np 2
16:54:57:WU03:FS00:Started FahCore on PID 18901
16:54:57:WU03:FS00:Core PID:18905
16:54:57:WU03:FS00:FahCore 0xa7 started
16:54:58:WARNING:WU03:FS00:FahCore returned: UNKNOWN_ENUM (127 = 0x7f)
I enabled CPU folding on a second system running Mint 18. That system has been running a7 units without problems over the past couple of days. I can provide any additional information from either system to help if there is any interest, but I will need some guidance on what to do.

In the meantime, what would be the recommended steps on the Mint 17 system?

1) Disable CPU processing.
2) Continue CPU processing on Mint 17 to process A4 units, and let A7 units fail and be returned.

Re: Unknown Enum Error

Posted: Mon Jun 18, 2018 6:29 pm
by bruce
You can try downloading a new FahCore_a7. Assuming you're not currently running an A7 WU, delete the existing FahCore:
/var/lib/fahclient/cores/cores.foldingathome.org/Linux/AMD64/AVX/Core_a7.fah/FahCore_a7

The next A7 WU will download a fresh copy.

This probably won't help, but it can't hurt.
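
A slightly more cautious variant of the step above is to move the core out of the cores directory rather than delete it, so it can be put back if needed. This is a sketch under assumptions: the function name `set_aside_core` is made up, the backup location is arbitrary, and it assumes the client treats a missing core file the same way whether it was deleted or moved.

```shell
#!/bin/sh
# Move a FahCore binary out of the way instead of deleting it.
# $1 = path to the core file, $2 = backup directory (defaults to $HOME).
# Prints the backup path on success. The function name is hypothetical.
set_aside_core() {
    core=$1
    dest_dir=${2:-$HOME}
    [ -f "$core" ] || { echo "no such file: $core" >&2; return 1; }
    backup="$dest_dir/$(basename "$core").bak"
    mv -- "$core" "$backup" && echo "$backup"
}

# Usage (pause the client first; root privileges are likely needed
# because the files belong to the fahclient user):
#   set_aside_core /var/lib/fahclient/cores/cores.foldingathome.org/Linux/AMD64/AVX/Core_a7.fah/FahCore_a7
```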