Page 1 of 1

FahCore 0xa7 fails on precise-Ubuntu

Posted: Tue Feb 21, 2017 3:07 pm
by suprleg
For several months I've had to stop and re-start my Linux FAHClient in order to acquire a new '0xa4' wu as 0xa7 units won't run and simply fail repeatedly. The frequency wasn't too bad until recently, it's become incessant. Other team members with the similar problem have upgraded their Linux versions of various distros and had success in losing the 'a7.bug', however I'm not wanting to fight with all the reporting scripts at this time.
Has anyone had experience with this error and found a simpler solution?
Thanks for any input.

Code: Select all

*********************** Log Started 2017-02-21T14:22:34Z ***********************
14:22:34:************************* Folding@home Client *************************
14:22:34:    Website: http://folding.stanford.edu/
14:22:34:  Copyright: (c) 2009-2013 Stanford University
14:22:34:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
14:22:34:       Args: --child --lifeline 26767 --run-as fahclient
14:22:34:             --pid-file=/var/run/fahclient.pid --daemon
14:22:34:     Config: /var/lib/fahclient/config.xml
14:22:34:******************************** Build ********************************
14:22:34:    Version: 7.3.6
14:22:34:       Date: Feb 18 2013
14:22:34:       Time: 07:24:08
14:22:34:    SVN Rev: 3923
14:22:34:     Branch: fah/trunk/client
14:22:34:   Compiler: GNU 4.4.7
14:22:34:    Options: -std=gnu++98 -O3 -funroll-loops -mfpmath=sse -ffast-math
14:22:34:             -fno-unsafe-math-optimizations -msse2
14:22:34:   Platform: linux2 3.2.0-1-amd64
14:22:34:       Bits: 64
14:22:34:       Mode: Release
14:22:34:******************************* System ********************************
14:22:34:        CPU: Intel(R) Core(TM) i7-4790K CPU @ 4.00GHz
14:22:34:     CPU ID: GenuineIntel Family 6 Model 60 Stepping 3
14:22:34:       CPUs: 8
14:22:34:     Memory: 7.74GiB
14:22:34:Free Memory: 4.43GiB
14:22:34:    Threads: POSIX_THREADS
14:22:34:Has Battery: false
14:22:34: On Battery: false
14:22:34: UTC offset: -8
14:22:34:        PID: 26774
14:22:34:        CWD: /var/lib/fahclient
14:22:34:         OS: Linux 3.13.0-106-generic x86_64
14:22:34:    OS Arch: AMD64
14:22:34:       GPUs: 1
14:22:34:      GPU 0: NVIDIA:5 GM206 [GeForce GTX 960]
14:22:34:       CUDA: 5.2
14:22:34:CUDA Driver: 7050
14:22:34:***********************************************************************
14:22:34:<config>
14:22:34:  <!-- Folding Slot Configuration -->
14:22:34:  <power v='full'/>
14:22:34:
14:22:34:  <!-- HTTP Server -->
14:22:34:  <allow v='127.0.0.1 192.168.1.1-192.168.1.254'/>
14:22:34:
14:22:34:  <!-- Network -->
14:22:34:  <proxy v=':0'/>
14:22:34:
14:22:34:  <!-- Remote Command Server -->
14:22:34:  <command-allow-no-pass v='127.0.0.1 192.168.1.1-192.168.1.254'/>
14:22:34:  <password v='*********'/>
14:22:34:
14:22:34:  <!-- User Information -->
14:22:34:  <passkey v='********************************'/>
14:22:34:  <team v='4'/>
14:22:34:  <user v='TH_Foldinator'/>
14:22:34:
14:22:34:  <!-- Folding Slots -->
14:22:34:  <slot id='0' type='CPU'>
14:22:34:    <client-type v='advanced'/>
14:22:34:    <cpus v='6'/>
14:22:34:  </slot>
14:22:34:  <slot id='1' type='GPU'>
14:22:34:    <client-type v='advanced'/>
14:22:34:  </slot>
14:22:34:</config>
14:22:34:Switching to user fahclient
14:22:34:Trying to access database...
14:22:34:Successfully acquired database lock
14:22:34:Enabled folding slot 00: READY cpu:6
14:22:34:Enabled folding slot 01: READY gpu:0:GM206 [GeForce GTX 960]
14:22:34:WU00:FS00:Starting
14:22:34:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/fahwebx.stanford.edu/cores/Linux/AMD64/Core_a7.fah/FahCore_a7 -dir 00 -suffix 01 -version 703 -lifeline 26774 -checkpoint 15 -np 6
14:22:34:WU00:FS00:Started FahCore on PID 26781
14:22:34:WU00:FS00:Core PID:26785
14:22:34:WU00:FS00:FahCore 0xa7 started
14:22:34:WU01:FS01:Starting
14:22:34:WU01:FS01:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/fahwebx.stanford.edu/cores/Linux/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21 -dir 01 -suffix 01 -version 703 -lifeline 26774 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
14:22:34:WU01:FS01:Started FahCore on PID 26786
14:22:34:WU01:FS01:Core PID:26790
14:22:34:WU01:FS01:FahCore 0x21 started
14:22:35:WARNING:WU00:FS00:FahCore returned: FAILED_2 (1 = 0x1)
14:22:35:WU01:FS01:0x21:*********************** Log Started 2017-02-21T14:22:34Z ***********************
14:22:35:WU01:FS01:0x21:Project: 13112 (Run 36, Clone 0, Gen 622)
14:22:35:WU01:FS01:0x21:Unit: 0x000001bdab436c65577187da8509b93a
14:22:35:WU01:FS01:0x21:CPU: 0x00000000000000000000000000000000
14:22:35:WU01:FS01:0x21:Machine: 1
14:22:35:WU01:FS01:0x21:Digital signatures verified
14:22:35:WU01:FS01:0x21:Folding@home GPU Core21 Folding@home Core
14:22:35:WU01:FS01:0x21:Version 0.0.18
14:22:35:WU00:FS00:Starting
14:22:35:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/fahwebx.stanford.edu/cores/Linux/AMD64/Core_a7.fah/FahCore_a7 -dir 00 -suffix 01 -version 703 -lifeline 26774 -checkpoint 15 -np 6
14:22:35:WU00:FS00:Started FahCore on PID 26793
14:22:35:WU00:FS00:Core PID:26797
14:22:35:WU00:FS00:FahCore 0xa7 started
14:22:35:WARNING:WU00:FS00:FahCore returned: FAILED_2 (1 = 0x1)
14:22:48:WU01:FS01:0x21:Completed 0 out of 520000 steps (0%)
14:22:48:WU01:FS01:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
14:23:35:WU00:FS00:Starting
14:23:35:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/fahwebx.stanford.edu/cores/Linux/AMD64/Core_a7.fah/FahCore_a7 -dir 00 -suffix 01 -version 703 -lifeline 26774 -checkpoint 15 -np 6
14:23:35:WU00:FS00:Started FahCore on PID 26834
14:23:35:WU00:FS00:Core PID:26838
14:23:35:WU00:FS00:FahCore 0xa7 started
14:23:36:WARNING:WU00:FS00:FahCore returned: FAILED_2 (1 = 0x1)
14:23:36:WARNING:WU00:FS00:Too many errors, failing
14:23:36:WU00:FS00:Sending unit results: id:00 state:SEND error:FAILED project:8677 run:12 clone:1 gen:24 core:0xa7 unit:0x0000001b0002894b5824db763228f23e
14:23:36:WU00:FS00:Connecting to 155.247.166.219:8080
14:23:36:WU00:FS00:Server responded WORK_ACK (400)
14:23:36:WU00:FS00:Cleaning up
14:23:36:WU02:FS00:Connecting to assign3.stanford.edu:8080
14:23:36:WU02:FS00:News: 
14:23:36:WU02:FS00:Assigned to work server 171.67.108.101
14:23:36:WU02:FS00:Requesting new work unit for slot 00: READY cpu:6 from 171.67.108.101
14:23:36:WU02:FS00:Connecting to 171.67.108.101:8080
14:23:37:WU02:FS00:Downloading 2.94MiB
14:23:38:WU02:FS00:Download complete
14:23:38:WU02:FS00:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:13124 run:44 clone:2 gen:11 core:0xa7 unit:0x0000000cab436c655898ca8dab07f72d
14:23:38:WU02:FS00:Starting
14:23:38:WU02:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/fahwebx.stanford.edu/cores/Linux/AMD64/Core_a7.fah/FahCore_a7 -dir 02 -suffix 01 -version 703 -lifeline 26774 -checkpoint 15 -np 6
14:23:38:WU02:FS00:Started FahCore on PID 26839
14:23:38:WU02:FS00:Core PID:26843
14:23:38:WU02:FS00:FahCore 0xa7 started
14:23:38:WARNING:WU02:FS00:FahCore returned: FAILED_2 (1 = 0x1)
14:23:38:WU02:FS00:Starting
14:23:38:WU02:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/fahwebx.stanford.edu/cores/Linux/AMD64/Core_a7.fah/FahCore_a7 -dir 02 -suffix 01 -version 703 -lifeline 26774 -checkpoint 15 -np 6
14:23:38:WU02:FS00:Started FahCore on PID 26844
14:23:38:WU02:FS00:Core PID:26848
14:23:38:WU02:FS00:FahCore 0xa7 started
14:23:39:WARNING:WU02:FS00:FahCore returned: FAILED_2 (1 = 0x1)
14:24:38:WU01:FS01:0x21:Completed 5200 out of 520000 steps (1%)
14:24:38:WU02:FS00:Starting
14:24:38:WU02:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/fahwebx.stanford.edu/cores/Linux/AMD64/Core_a7.fah/FahCore_a7 -dir 02 -suffix 01 -version 703 -lifeline 26774 -checkpoint 15 -np 6
14:24:38:WU02:FS00:Started FahCore on PID 27119
14:24:38:WU02:FS00:Core PID:27123
14:24:38:WU02:FS00:FahCore 0xa7 started
14:24:39:WARNING:WU02:FS00:FahCore returned: FAILED_2 (1 = 0x1)
14:25:38:WU02:FS00:Starting
14:25:38:WU02:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/fahwebx.stanford.edu/cores/Linux/AMD64/Core_a7.fah/FahCore_a7 -dir 02 -suffix 01 -version 703 -lifeline 26774 -checkpoint 15 -np 6
14:25:38:WU02:FS00:Started FahCore on PID 27133
14:25:38:WU02:FS00:Core PID:27137
14:25:38:WU02:FS00:FahCore 0xa7 started
14:25:39:WARNING:WU02:FS00:FahCore returned: FAILED_2 (1 = 0x1)

Re: FahCore 0xa7 fails on precise-Ubuntu

Posted: Tue Feb 21, 2017 6:02 pm
by Nathan_P
No, You need at least 14.04 LTS in order for it to work.

Re: FahCore 0xa7 fails on precise-Ubuntu

Posted: Wed Feb 22, 2017 10:19 pm
by jcoffland
What is the output of?:

Code: Select all

ldd /var/lib/fahclient/cores/fahwebx.stanford.edu/cores/Linux/AMD64/Core_a7.fah/FahCore_a7

Re: FahCore 0xa7 fails on precise-Ubuntu

Posted: Thu Feb 23, 2017 1:56 pm
by suprleg
jcoffland wrote:What is the output of?:

Code: Select all

ldd /var/lib/fahclient/cores/fahwebx.stanford.edu/cores/Linux/AMD64/Core_a7.fah/FahCore_a7

Code: Select all

 ldd /var/lib/fahclient/cores/fahwebx.stanford.edu/cores/Linux/AMD64/Core_a7.fah/FahCore_a7
        linux-vdso.so.1 =>  (0x00007ffeae33e000)
        libpthread.so.0 => /lib/x86_64-linux-gnu/libpthread.so.0 (0x00007fb1ce869000)
        libdl.so.2 => /lib/x86_64-linux-gnu/libdl.so.2 (0x00007fb1ce665000)
        libstdc++.so.6 => /usr/lib/x86_64-linux-gnu/libstdc++.so.6 (0x00007fb1ce361000)
        libm.so.6 => /lib/x86_64-linux-gnu/libm.so.6 (0x00007fb1ce05b000)
        libgcc_s.so.1 => /lib/x86_64-linux-gnu/libgcc_s.so.1 (0x00007fb1cde45000)
        libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007fb1cda80000)
        /lib64/ld-linux-x86-64.so.2 (0x00007fb1cea87000)
I bit the bullet and upgraded to Ubuntu 14.04.5 LTS, a7 wu's are being processed now, thanks for the responses.

Re: FahCore 0xa7 fails on precise-Ubuntu

Posted: Thu Feb 23, 2017 2:52 pm
by Joe_H
There is one recommendation I can make if you are still running the version 7.3.6 client. Upgrade to at least the current release of 7.4.4, or try the public beta. Using the 7.3.6 version your system will not be detected as having an AVX capable CPU, and any Core_A7 WU's will be run using the SSE2 version of the folding core.