FahCore 0xa7 fails on precise-Ubuntu
Posted: Tue Feb 21, 2017 3:07 pm
For several months I've had to stop and re-start my Linux FAHClient in order to acquire a new '0xa4' wu as 0xa7 units won't run and simply fail repeatedly. The frequency wasn't too bad until recently, it's become incessant. Other team members with the similar problem have upgraded their Linux versions of various distros and had success in losing the 'a7.bug', however I'm not wanting to fight with all the reporting scripts at this time.
Has anyone had experience with this error and found a simpler solution?
Thanks for any input.
Has anyone had experience with this error and found a simpler solution?
Thanks for any input.
Code: Select all
*********************** Log Started 2017-02-21T14:22:34Z ***********************
14:22:34:************************* Folding@home Client *************************
14:22:34: Website: http://folding.stanford.edu/
14:22:34: Copyright: (c) 2009-2013 Stanford University
14:22:34: Author: Joseph Coffland <joseph@cauldrondevelopment.com>
14:22:34: Args: --child --lifeline 26767 --run-as fahclient
14:22:34: --pid-file=/var/run/fahclient.pid --daemon
14:22:34: Config: /var/lib/fahclient/config.xml
14:22:34:******************************** Build ********************************
14:22:34: Version: 7.3.6
14:22:34: Date: Feb 18 2013
14:22:34: Time: 07:24:08
14:22:34: SVN Rev: 3923
14:22:34: Branch: fah/trunk/client
14:22:34: Compiler: GNU 4.4.7
14:22:34: Options: -std=gnu++98 -O3 -funroll-loops -mfpmath=sse -ffast-math
14:22:34: -fno-unsafe-math-optimizations -msse2
14:22:34: Platform: linux2 3.2.0-1-amd64
14:22:34: Bits: 64
14:22:34: Mode: Release
14:22:34:******************************* System ********************************
14:22:34: CPU: Intel(R) Core(TM) i7-4790K CPU @ 4.00GHz
14:22:34: CPU ID: GenuineIntel Family 6 Model 60 Stepping 3
14:22:34: CPUs: 8
14:22:34: Memory: 7.74GiB
14:22:34:Free Memory: 4.43GiB
14:22:34: Threads: POSIX_THREADS
14:22:34:Has Battery: false
14:22:34: On Battery: false
14:22:34: UTC offset: -8
14:22:34: PID: 26774
14:22:34: CWD: /var/lib/fahclient
14:22:34: OS: Linux 3.13.0-106-generic x86_64
14:22:34: OS Arch: AMD64
14:22:34: GPUs: 1
14:22:34: GPU 0: NVIDIA:5 GM206 [GeForce GTX 960]
14:22:34: CUDA: 5.2
14:22:34:CUDA Driver: 7050
14:22:34:***********************************************************************
14:22:34:<config>
14:22:34: <!-- Folding Slot Configuration -->
14:22:34: <power v='full'/>
14:22:34:
14:22:34: <!-- HTTP Server -->
14:22:34: <allow v='127.0.0.1 192.168.1.1-192.168.1.254'/>
14:22:34:
14:22:34: <!-- Network -->
14:22:34: <proxy v=':0'/>
14:22:34:
14:22:34: <!-- Remote Command Server -->
14:22:34: <command-allow-no-pass v='127.0.0.1 192.168.1.1-192.168.1.254'/>
14:22:34: <password v='*********'/>
14:22:34:
14:22:34: <!-- User Information -->
14:22:34: <passkey v='********************************'/>
14:22:34: <team v='4'/>
14:22:34: <user v='TH_Foldinator'/>
14:22:34:
14:22:34: <!-- Folding Slots -->
14:22:34: <slot id='0' type='CPU'>
14:22:34: <client-type v='advanced'/>
14:22:34: <cpus v='6'/>
14:22:34: </slot>
14:22:34: <slot id='1' type='GPU'>
14:22:34: <client-type v='advanced'/>
14:22:34: </slot>
14:22:34:</config>
14:22:34:Switching to user fahclient
14:22:34:Trying to access database...
14:22:34:Successfully acquired database lock
14:22:34:Enabled folding slot 00: READY cpu:6
14:22:34:Enabled folding slot 01: READY gpu:0:GM206 [GeForce GTX 960]
14:22:34:WU00:FS00:Starting
14:22:34:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/fahwebx.stanford.edu/cores/Linux/AMD64/Core_a7.fah/FahCore_a7 -dir 00 -suffix 01 -version 703 -lifeline 26774 -checkpoint 15 -np 6
14:22:34:WU00:FS00:Started FahCore on PID 26781
14:22:34:WU00:FS00:Core PID:26785
14:22:34:WU00:FS00:FahCore 0xa7 started
14:22:34:WU01:FS01:Starting
14:22:34:WU01:FS01:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/fahwebx.stanford.edu/cores/Linux/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21 -dir 01 -suffix 01 -version 703 -lifeline 26774 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
14:22:34:WU01:FS01:Started FahCore on PID 26786
14:22:34:WU01:FS01:Core PID:26790
14:22:34:WU01:FS01:FahCore 0x21 started
14:22:35:WARNING:WU00:FS00:FahCore returned: FAILED_2 (1 = 0x1)
14:22:35:WU01:FS01:0x21:*********************** Log Started 2017-02-21T14:22:34Z ***********************
14:22:35:WU01:FS01:0x21:Project: 13112 (Run 36, Clone 0, Gen 622)
14:22:35:WU01:FS01:0x21:Unit: 0x000001bdab436c65577187da8509b93a
14:22:35:WU01:FS01:0x21:CPU: 0x00000000000000000000000000000000
14:22:35:WU01:FS01:0x21:Machine: 1
14:22:35:WU01:FS01:0x21:Digital signatures verified
14:22:35:WU01:FS01:0x21:Folding@home GPU Core21 Folding@home Core
14:22:35:WU01:FS01:0x21:Version 0.0.18
14:22:35:WU00:FS00:Starting
14:22:35:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/fahwebx.stanford.edu/cores/Linux/AMD64/Core_a7.fah/FahCore_a7 -dir 00 -suffix 01 -version 703 -lifeline 26774 -checkpoint 15 -np 6
14:22:35:WU00:FS00:Started FahCore on PID 26793
14:22:35:WU00:FS00:Core PID:26797
14:22:35:WU00:FS00:FahCore 0xa7 started
14:22:35:WARNING:WU00:FS00:FahCore returned: FAILED_2 (1 = 0x1)
14:22:48:WU01:FS01:0x21:Completed 0 out of 520000 steps (0%)
14:22:48:WU01:FS01:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
14:23:35:WU00:FS00:Starting
14:23:35:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/fahwebx.stanford.edu/cores/Linux/AMD64/Core_a7.fah/FahCore_a7 -dir 00 -suffix 01 -version 703 -lifeline 26774 -checkpoint 15 -np 6
14:23:35:WU00:FS00:Started FahCore on PID 26834
14:23:35:WU00:FS00:Core PID:26838
14:23:35:WU00:FS00:FahCore 0xa7 started
14:23:36:WARNING:WU00:FS00:FahCore returned: FAILED_2 (1 = 0x1)
14:23:36:WARNING:WU00:FS00:Too many errors, failing
14:23:36:WU00:FS00:Sending unit results: id:00 state:SEND error:FAILED project:8677 run:12 clone:1 gen:24 core:0xa7 unit:0x0000001b0002894b5824db763228f23e
14:23:36:WU00:FS00:Connecting to 155.247.166.219:8080
14:23:36:WU00:FS00:Server responded WORK_ACK (400)
14:23:36:WU00:FS00:Cleaning up
14:23:36:WU02:FS00:Connecting to assign3.stanford.edu:8080
14:23:36:WU02:FS00:News:
14:23:36:WU02:FS00:Assigned to work server 171.67.108.101
14:23:36:WU02:FS00:Requesting new work unit for slot 00: READY cpu:6 from 171.67.108.101
14:23:36:WU02:FS00:Connecting to 171.67.108.101:8080
14:23:37:WU02:FS00:Downloading 2.94MiB
14:23:38:WU02:FS00:Download complete
14:23:38:WU02:FS00:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:13124 run:44 clone:2 gen:11 core:0xa7 unit:0x0000000cab436c655898ca8dab07f72d
14:23:38:WU02:FS00:Starting
14:23:38:WU02:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/fahwebx.stanford.edu/cores/Linux/AMD64/Core_a7.fah/FahCore_a7 -dir 02 -suffix 01 -version 703 -lifeline 26774 -checkpoint 15 -np 6
14:23:38:WU02:FS00:Started FahCore on PID 26839
14:23:38:WU02:FS00:Core PID:26843
14:23:38:WU02:FS00:FahCore 0xa7 started
14:23:38:WARNING:WU02:FS00:FahCore returned: FAILED_2 (1 = 0x1)
14:23:38:WU02:FS00:Starting
14:23:38:WU02:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/fahwebx.stanford.edu/cores/Linux/AMD64/Core_a7.fah/FahCore_a7 -dir 02 -suffix 01 -version 703 -lifeline 26774 -checkpoint 15 -np 6
14:23:38:WU02:FS00:Started FahCore on PID 26844
14:23:38:WU02:FS00:Core PID:26848
14:23:38:WU02:FS00:FahCore 0xa7 started
14:23:39:WARNING:WU02:FS00:FahCore returned: FAILED_2 (1 = 0x1)
14:24:38:WU01:FS01:0x21:Completed 5200 out of 520000 steps (1%)
14:24:38:WU02:FS00:Starting
14:24:38:WU02:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/fahwebx.stanford.edu/cores/Linux/AMD64/Core_a7.fah/FahCore_a7 -dir 02 -suffix 01 -version 703 -lifeline 26774 -checkpoint 15 -np 6
14:24:38:WU02:FS00:Started FahCore on PID 27119
14:24:38:WU02:FS00:Core PID:27123
14:24:38:WU02:FS00:FahCore 0xa7 started
14:24:39:WARNING:WU02:FS00:FahCore returned: FAILED_2 (1 = 0x1)
14:25:38:WU02:FS00:Starting
14:25:38:WU02:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/fahwebx.stanford.edu/cores/Linux/AMD64/Core_a7.fah/FahCore_a7 -dir 02 -suffix 01 -version 703 -lifeline 26774 -checkpoint 15 -np 6
14:25:38:WU02:FS00:Started FahCore on PID 27133
14:25:38:WU02:FS00:Core PID:27137
14:25:38:WU02:FS00:FahCore 0xa7 started
14:25:39:WARNING:WU02:FS00:FahCore returned: FAILED_2 (1 = 0x1)