Unit 17433 (0, 1812, 16)

Moderators: Site Moderators, FAHC Science Team

Post Reply
stratos412
Posts: 5
Joined: Sun Mar 29, 2020 6:36 am

Unit 17433 (0, 1812, 16)

Post by stratos412 »

Hello to the community.

I have an issue with the work unit 17433 (0, 1812, 16). I started yesterday (03/02/2021) and let it run for about 11 hours (~47% complete).
I paused the unit and resumed it today (04/02/2021). For some reason, the work unit have been lost. No progress and GPU started to download a new work unit.

What may have happened?

Best
stratos
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Unit 17433 (0, 1812, 16)

Post by bruce »

According to https://apps.foldingathome.org/wu#proje ... 812&gen=16 your system dumped that WU and awarded you zero points.

Please search back over recent copies of FAH's logs and find out what happened. You should have a significant number of previous logs in the logs subdirectory of FAH's data directory.
stratos412
Posts: 5
Joined: Sun Mar 29, 2020 6:36 am

Re: Unit 17433 (0, 1812, 16)

Post by stratos412 »

OK. I found this

Code: Select all


17:56:39:WU00:FS01:0x22:Project: 17433 (Run 0, Clone 1812, Gen 16)
17:56:39:WU00:FS01:0x22:Unit: 0x00000000000000000000000000000000
17:56:39:WU00:FS01:0x22:Reading tar file core.xml
17:56:39:WU00:FS01:0x22:Reading tar file integrator.xml.bz2
17:56:39:WU00:FS01:0x22:Reading tar file state.xml.bz2
17:56:39:WU00:FS01:0x22:Reading tar file system.xml.bz2
17:56:39:WU00:FS01:0x22:Digital signatures verified
17:56:39:WU00:FS01:0x22:Folding@home GPU Core22 Folding@home Core
17:56:39:WU00:FS01:0x22:Version 0.0.13
17:56:39:WU00:FS01:0x22:  Checkpoint write interval: 25000 steps (2%) [50 total]
17:56:39:WU00:FS01:0x22:  JSON viewer frame write interval: 12500 steps (1%) [100 total]
17:56:39:WU00:FS01:0x22:  XTC frame write interval: 10000 steps (0.8%) [125 total]
17:56:39:WU00:FS01:0x22:  Global context and integrator variables write interval: disabled
17:56:39:WU00:FS01:0x22:There are 3 platforms available.
17:56:39:WU00:FS01:0x22:Platform 0: Reference
17:56:39:WU00:FS01:0x22:Platform 1: CPU
17:56:39:WU00:FS01:0x22:Platform 2: OpenCL
17:56:39:WU00:FS01:0x22:  opencl-device 0 specified
17:56:57:WU00:FS01:0x22:Attempting to create OpenCL context:
17:56:57:WU00:FS01:0x22:  Configuring platform OpenCL
17:57:07:WU00:FS01:0x22:  Using OpenCL on platformId 1 and gpu 0
17:57:08:WU00:FS01:0x22:Completed 0 out of 1250000 steps (0%)
17:57:09:WU00:FS01:0x22:Checkpoint completed at step 0
18:14:36:WU00:FS01:0x22:Completed 12500 out of 1250000 steps (1%)
18:32:01:WU00:FS01:0x22:Completed 25000 out of 1250000 steps (2%)
18:32:04:WU00:FS01:0x22:Checkpoint completed at step 25000
18:49:30:WU00:FS01:0x22:Completed 37500 out of 1250000 steps (3%)
19:06:57:WU00:FS01:0x22:Completed 50000 out of 1250000 steps (4%)
19:06:59:WU00:FS01:0x22:Checkpoint completed at step 50000
19:24:25:WU00:FS01:0x22:Completed 62500 out of 1250000 steps (5%)
19:41:51:WU00:FS01:0x22:Completed 75000 out of 1250000 steps (6%)
19:41:54:WU00:FS01:0x22:Checkpoint completed at step 75000
19:57:33:41:127.0.0.1:New Web connection
19:59:19:WU00:FS01:0x22:Completed 87500 out of 1250000 steps (7%)
20:16:37:WU00:FS01:0x22:Completed 100000 out of 1250000 steps (8%)
20:16:39:WU00:FS01:0x22:Checkpoint completed at step 100000
20:33:54:WU00:FS01:0x22:Completed 112500 out of 1250000 steps (9%)
20:51:10:WU00:FS01:0x22:Completed 125000 out of 1250000 steps (10%)
20:51:13:WU00:FS01:0x22:Checkpoint completed at step 125000
21:03:41:70:127.0.0.1:New Web connection
21:08:41:WU00:FS01:0x22:Completed 137500 out of 1250000 steps (11%)
21:26:10:WU00:FS01:0x22:Completed 150000 out of 1250000 steps (12%)
21:26:13:WU00:FS01:0x22:Checkpoint completed at step 150000
21:43:33:WU00:FS01:0x22:Completed 162500 out of 1250000 steps (13%)
22:00:49:WU00:FS01:0x22:Completed 175000 out of 1250000 steps (14%)
22:00:52:WU00:FS01:0x22:Checkpoint completed at step 175000
22:18:10:WU00:FS01:0x22:Completed 187500 out of 1250000 steps (15%)
22:35:29:WU00:FS01:0x22:Completed 200000 out of 1250000 steps (16%)
22:35:32:WU00:FS01:0x22:Checkpoint completed at step 200000
22:52:50:WU00:FS01:0x22:Completed 212500 out of 1250000 steps (17%)
23:10:07:WU00:FS01:0x22:Completed 225000 out of 1250000 steps (18%)
23:10:09:WU00:FS01:0x22:Checkpoint completed at step 225000
23:27:28:WU00:FS01:0x22:Completed 237500 out of 1250000 steps (19%)
23:44:46:WU00:FS01:0x22:Completed 250000 out of 1250000 steps (20%)
23:44:49:WU00:FS01:0x22:Checkpoint completed at step 250000
******************************* Date: 2021-02-04 *******************************
00:02:05:WU00:FS01:0x22:Completed 262500 out of 1250000 steps (21%)
00:19:23:WU00:FS01:0x22:Completed 275000 out of 1250000 steps (22%)
00:19:26:WU00:FS01:0x22:Checkpoint completed at step 275000
00:36:44:WU00:FS01:0x22:Completed 287500 out of 1250000 steps (23%)
00:54:03:WU00:FS01:0x22:Completed 300000 out of 1250000 steps (24%)
00:54:06:WU00:FS01:0x22:Checkpoint completed at step 300000
01:11:24:WU00:FS01:0x22:Completed 312500 out of 1250000 steps (25%)
01:28:42:WU00:FS01:0x22:Completed 325000 out of 1250000 steps (26%)
01:28:45:WU00:FS01:0x22:Checkpoint completed at step 325000
01:46:03:WU00:FS01:0x22:Completed 337500 out of 1250000 steps (27%)
02:03:21:WU00:FS01:0x22:Completed 350000 out of 1250000 steps (28%)
02:03:24:WU00:FS01:0x22:Checkpoint completed at step 350000
02:20:41:WU00:FS01:0x22:Completed 362500 out of 1250000 steps (29%)
02:37:59:WU00:FS01:0x22:Completed 375000 out of 1250000 steps (30%)
02:38:01:WU00:FS01:0x22:Checkpoint completed at step 375000
02:55:20:WU00:FS01:0x22:Completed 387500 out of 1250000 steps (31%)
03:12:37:WU00:FS01:0x22:Completed 400000 out of 1250000 steps (32%)
03:12:40:WU00:FS01:0x22:Checkpoint completed at step 400000
03:29:59:WU00:FS01:0x22:Completed 412500 out of 1250000 steps (33%)
03:47:17:WU00:FS01:0x22:Completed 425000 out of 1250000 steps (34%)
03:47:19:WU00:FS01:0x22:Checkpoint completed at step 425000
04:04:38:WU00:FS01:0x22:Completed 437500 out of 1250000 steps (35%)
04:21:57:WU00:FS01:0x22:Completed 450000 out of 1250000 steps (36%)
04:21:59:WU00:FS01:0x22:Checkpoint completed at step 450000
04:39:19:WU00:FS01:0x22:Completed 462500 out of 1250000 steps (37%)
04:56:39:WU00:FS01:0x22:Completed 475000 out of 1250000 steps (38%)
04:56:41:WU00:FS01:0x22:Checkpoint completed at step 475000
05:13:59:WU00:FS01:0x22:Completed 487500 out of 1250000 steps (39%)
05:31:16:WU00:FS01:0x22:Completed 500000 out of 1250000 steps (40%)
05:31:19:WU00:FS01:0x22:Checkpoint completed at step 500000
05:34:23:103:127.0.0.1:New Web connection
05:49:29:WU00:FS01:0x22:Completed 512500 out of 1250000 steps (41%)
05:54:18:FS01:Paused
05:54:18:FS01:Shutting core down
05:54:18:WU00:FS01:0x22:WARNING:Console control signal 1 on PID 12004
05:54:18:WU00:FS01:0x22:Exiting, please wait. . .
05:54:18:WU00:FS01:0x22:Folding@home Core Shutdown: INTERRUPTED
05:54:18:WU00:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
05:54:29:ERROR:Receive error: 10054: An existing connection was forcibly closed by the remote host.
05:54:40:Removing old file 'configs/config-20201208-212747.xml'
05:54:40:Saving configuration to config.xml
05:54:40:<config>
05:54:40:  <!-- Network -->
05:54:40:  <proxy v=':8080'/>
05:54:40:
05:54:40:  <!-- Slot Control -->
05:54:40:  <pause-on-start v='true'/>
05:54:40:
05:54:40:  <!-- User Information -->
05:54:40:  <passkey v='********************************'/>
05:54:40:  <user v='stratos412'/>
05:54:40:
05:54:40:  <!-- Folding Slots -->
05:54:40:  <slot id='0' type='CPU'/>
05:54:40:  <slot id='1' type='GPU'>
05:54:40:    <paused v='true'/>
05:54:40:  </slot>
05:54:40:</config>

and this

Code: Select all


14:08:16:Trying to access database...
14:08:16:Successfully acquired database lock
14:08:16:Enabled folding slot 00: PAUSED cpu:2 (by user)
14:08:16:ERROR:Exception: No available GPUs
14:08:16:ERROR:Exception: Option 'gpu-index' has no default and is not set.
14:08:16:WARNING:WU00:No longer matches Slot 1's configuration and there are no other matching slots, dumping
14:08:16:WU00:FS01:Sending unit results: id:00 state:SEND error:DUMPED project:17433 run:0 clone:1812 gen:16 core:0x22 unit:0x00000714000000100000441900000000
14:08:16:WU00:FS01:Connecting to 206.223.170.146:8080
14:08:17:WU01:FS01:Connecting to 65.254.110.245:8080
14:08:17:WU00:FS01:Server responded WORK_ACK (400)
14:08:17:WU00:FS01:Cleaning up
14:08:18:WU01:FS01:Assigned to work server 128.252.203.9
14:08:18:WU01:FS01:Requesting new work unit for slot 
14:08:18:ERROR:WU01:FS01:Exception: Option 'gpu-index' has no default and is not set.

Mod Edit: Changed Quote Tags To Code Tags - PantherX
Knish
Posts: 222
Joined: Tue Mar 17, 2020 5:20 am

Re: Unit 17433 (0, 1812, 16)

Post by Knish »

This has happened to me for a few reasons (all of which have happened):

1). I left my pc on for a long time (over 30 days) and after rebooting, did not have an immediate internet connection b/c i like to manually connect to my wifi.
- About once a month the client will try to update the GPUs.txt file. Since the client deemed it was time to check for an update, and I did not immediately have internet at the time, it could not verify the GPU slot so dumped it.

2). something caused a change (usually windows update thinking it has a "better" driver)
- MSFT in their infinite wisdom thinks we don't need OpenCL and their 'update' doesn't have it. FAHClient then fails the gpu prerequisites check and dumps the existing data. In Win10 Pro, gpedit can set a policy to exclude updating drivers during a windows update. One will have to remember turning it off/on if they ever want to update drivers tho. gpedit is only in Pro, but there is also a way to unlock gpedit for Win10 Home so one could do the same. I found the process on youtube that involved typing things into cmd/powershell (something like that).

oh, i guess there were only 2 reasons, heh.
Post Reply