Enormous job: how did I get it, how can I fix it?

If you're new to FAH and need help getting started or you have very basic questions, start here.

Moderators: Site Moderators, FAHC Science Team

Rwolf01
Posts: 20
Joined: Mon Jun 01, 2020 8:06 pm

Enormous job: how did I get it, how can I fix it?

Post by Rwolf01 »

I just set up 6 PCs with a wide range of CPU and GPU abilities, all under the same user.

I noticed that the size of the work units is highly variable, from as little as 2903 points to 145,432 points!
The largest one is not a problem, it landed on one of my new/fast machines and will be done in 10.5 hours.

The problem is my 2nd largest job: 72,700 points. It landed on a "retired" laptop and has an ETA of 7.04 days!

I could let it run, but I noticed the job has a timeout of just 1 day 9 hours, and an expiration of 4 days 9 hours
from now so I'm afraid my result would be too late to be useful.

I see how to pause a job, but that would only delay it more.

I'd like some way to:

1: Avoid wasting CPUs on a job that won't finish in time.

2: Avoid "failing to deliver" and holding up somebody's research.


I guess I'd like to know:

1: How do I kill a job? (and properly report to the mother-ship that I am punting it, so it can get reassigned ASAP)

2: Set a limit on the job size for each PC or "slot"? (I tried editing them, but didn't see any settings that I was confident/foolish enough to alter)


Thinking big picture, is there a bug here or did I (or maybe the researcher who set the job up) just dork something up?

Any ideas how this happened and how can we prevent it from happening again? (either to me or some other enthusiastic but clueless newbie :-)
anandhanju
Posts: 522
Joined: Mon Dec 03, 2007 4:33 am
Location: Australia

Re: Enormous job: how did I get it, how can I fix it?

Post by anandhanju »

The only project dishing out 72,700 WUs at the moment is p14447, which is GPU project. Under normal circumstances, GPU WUs do not take 7 days to finish.

1) How long has this work unit been running? The estimates can be off in the beginning as the client does not have sufficient timing information to reliably estimate a completion time.
2) What GPU is this project running on?
3) Do you see your GPU being used to a large degree in a GPU monitoring tool?

It would also be helpful if you could post your Folding@home log file here.
JimboPalmer
Posts: 2522
Joined: Mon Feb 16, 2009 4:12 am
Location: Greenwood MS USA

Re: Enormous job: how did I get it, how can I fix it?

Post by JimboPalmer »

Welcome to Folding@Home!

viewtopic.php?f=24&t=26036

tells how to post a log file.

I would prefer to fix the laptop before we remove the current Work Unit, just so we don't download more large WUs.

There are old parameters designed for dial up modems that can influence the size of the downloads, but nothing to influence the amount of work.
Tsar of all the Rushers
I tried to remain childlike, all I achieved was childish.
A friend to those who want no friends
Rwolf01
Posts: 20
Joined: Mon Jun 01, 2020 8:06 pm

Re: Enormous job: how did I get it, how can I fix it?

Post by Rwolf01 »

Yup, 14447. Laptop GPU is GeForce GT 640M. GPU is hitting 100% consistantly. (I have spotted occasional lulls, where it drops to near 0% for a few seconds. Assume that is when it's writing it's every 15 minutes checkpoint files)

I'm familiar with extrapolation errors and looked for that. Plotting % vs time show a line with constant slope and no visible kinks, so that ain't it. Job is at ~19 hours run time/10% now, and ETA is still 6.85 days.

Incidentally, this LT has a sibbling with the same GPU. It's got a chunk of P14253 with an ETA of 3.46 days (between the timeout and expirations times)

complete logfile is:

Code: Select all

*********************** Log Started 2020-06-01T19:35:55Z ***********************
19:35:55:Trying to access database...
19:35:55:Successfully acquired database lock
19:35:55:Downloading GPUs.txt from assign1.foldingathome.org:80
19:35:55:Connecting to assign1.foldingathome.org:80
19:35:55:Read GPUs.txt
19:35:55:Enabled folding slot 00: PAUSED cpu:6 (not configured)
19:35:58:Enabled folding slot 01: PAUSED gpu:0:GK107 [GeForce GT 640M] (not configured)
19:35:58:****************************** FAHClient ******************************
19:35:58:        Version: 7.6.13
19:35:58:         Author: Joseph Coffland <joseph@cauldrondevelopment.com>
19:35:58:      Copyright: 2020 foldingathome.org
19:35:58:       Homepage: https://foldingathome.org/
19:35:58:           Date: Apr 27 2020
19:35:58:           Time: 21:21:01
19:35:58:       Revision: 5a652817f46116b6e135503af97f18e094414e3b
19:35:58:         Branch: master
19:35:58:       Compiler: Visual C++ 2008
19:35:58:        Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
19:35:58:       Platform: win32 10
19:35:58:           Bits: 32
19:35:58:           Mode: Release
19:35:58:           Args: --open-web-control
19:35:58:******************************** CBang ********************************
19:35:58:           Date: Apr 24 2020
19:35:58:           Time: 17:07:55
19:35:58:       Revision: ea081a3b3b0f4a37c4d0440b4f1bc184197c7797
19:35:58:         Branch: master
19:35:58:       Compiler: Visual C++ 2008
19:35:58:        Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
19:35:58:       Platform: win32 10
19:35:58:           Bits: 32
19:35:58:           Mode: Release
19:35:58:******************************* System ********************************
19:35:58:            CPU: Intel(R) Core(TM) i7-3612QM CPU @ 2.10GHz
19:35:58:         CPU ID: GenuineIntel Family 6 Model 58 Stepping 9
19:35:58:           CPUs: 8
19:35:58:         Memory: 3.90GiB
19:35:58:    Free Memory: 1.98GiB
19:35:58:        Threads: WINDOWS_THREADS
19:35:58:     OS Version: 6.2
19:35:58:    Has Battery: false
19:35:58:     On Battery: false
19:35:58:     UTC Offset: -7
19:35:58:            PID: 8728
19:35:58:            CWD: C:\Users\Sony\AppData\Roaming\FAHClient
19:35:58:  Win32 Service: false
19:35:58:             OS: Windows 10 Enterprise
19:35:58:        OS Arch: AMD64
19:35:58:           GPUs: 1
19:35:58:          GPU 0: Bus:1 Slot:0 Func:0 NVIDIA:3 GK107 [GeForce GT 640M]
19:35:58:  CUDA Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:3.0 Driver:8.0
19:35:58:OpenCL Device 1: Platform:0 Device:1 Bus:NA Slot:NA Compute:1.2 Driver:10.18
19:35:58:OpenCL Device 2: Platform:1 Device:0 Bus:1 Slot:0 Compute:1.2 Driver:382.5
19:35:58:******************************* libFAH ********************************
19:35:58:           Date: Apr 15 2020
19:35:58:           Time: 14:53:14
19:35:58:       Revision: 216968bc7025029c841ed6e36e81a03a316890d3
19:35:58:         Branch: master
19:35:58:       Compiler: Visual C++ 2008
19:35:58:        Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
19:35:58:       Platform: win32 10
19:35:58:           Bits: 32
19:35:58:           Mode: Release
19:35:58:***********************************************************************
19:35:58:<config>
19:35:58:  <!-- Folding Slots -->
19:35:58:  <slot id='0' type='CPU'/>
19:35:58:  <slot id='1' type='GPU'/>
19:35:58:</config>
19:36:06:17:127.0.0.1:New Web session
19:36:56:Saving configuration to config.xml
19:36:56:<config>
19:36:56:  <!-- Folding Slots -->
19:36:56:  <slot id='0' type='CPU'/>
19:36:56:  <slot id='1' type='GPU'/>
19:36:56:</config>
19:36:56:Set client configured
19:36:56:WU00:FS00:Connecting to assign1.foldingathome.org:80
19:36:56:WU00:FS00:Connecting to assign1.foldingathome.org:80
19:36:56:WU01:FS01:Connecting to assign1.foldingathome.org:80
19:36:56:WU00:FS00:Assigned to work server 155.247.164.214
19:36:56:WU00:FS00:Requesting new work unit for slot 00: READY cpu:6 from 155.247.164.214
19:36:56:WU00:FS00:Connecting to 155.247.164.214:8080
19:36:56:WU01:FS01:Assigned to work server 3.133.76.19
19:36:56:WU01:FS01:Requesting new work unit for slot 01: READY gpu:0:GK107 [GeForce GT 640M] from 3.133.76.19
19:36:56:WU01:FS01:Connecting to 3.133.76.19:8080
19:37:17:WARNING:WU00:FS00:WorkServer connection failed on port 8080 trying 80
19:37:17:WU00:FS00:Connecting to 155.247.164.214:80
19:37:17:WARNING:WU01:FS01:WorkServer connection failed on port 8080 trying 80
19:37:17:WU01:FS01:Connecting to 3.133.76.19:80
19:37:39:ERROR:WU00:FS00:Exception: Failed to connect to 155.247.164.214:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
19:37:39:WU00:FS00:Connecting to assign1.foldingathome.org:80
19:37:39:WU00:FS00:Assigned to work server 206.223.170.146
19:37:39:WU00:FS00:Requesting new work unit for slot 00: READY cpu:7 from 206.223.170.146
19:37:39:WU00:FS00:Connecting to 206.223.170.146:8080
19:37:40:WU00:FS00:Downloading 25.12MiB
19:37:46:WU00:FS00:Download 21.65%
19:37:47:WU01:FS01:Downloading 20.51MiB
19:37:52:WU00:FS00:Download 40.06%
19:37:53:WU01:FS01:Download 38.39%
19:37:57:Saving configuration to config.xml
19:37:57:<config>
19:37:57:  <!-- Folding Slot Configuration -->
19:37:57:  <cause v='COVID_19'/>
19:37:57:
19:37:57:  <!-- Slot Control -->
19:37:57:  <power v='FULL'/>
19:37:57:
19:37:57:  <!-- User Information -->
19:37:57:  <passkey v='*****'/>
19:37:57:  <team v='749'/>
19:37:57:  <user v='Rwolf01'/>
19:37:57:
19:37:57:  <!-- Folding Slots -->
19:37:57:  <slot id='0' type='CPU'/>
19:37:57:  <slot id='1' type='GPU'/>
19:37:57:</config>
19:37:58:WU00:FS00:Download 60.47%
19:37:59:WU01:FS01:Download 82.57%
19:38:01:WU01:FS01:Download complete
19:38:01:WU01:FS01:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:14447 run:0 clone:1479 gen:31 core:0x22 unit:0x0000003203854c135ea7b88f547e2499
19:38:01:WU01:FS01:Downloading core from http://cores.foldingathome.org/v7/win/64bit/Core_22.fah
19:38:01:WU01:FS01:Connecting to cores.foldingathome.org:80
19:38:01:WU01:FS01:FahCore 22: Downloading 4.04MiB
19:38:04:WU00:FS00:Download 78.63%
19:38:05:WU01:FS01:FahCore 22: Download complete
19:38:05:WU01:FS01:Valid core signature
19:38:05:WU01:FS01:Unpacked 13.50MiB to cores/cores.foldingathome.org/v7/win/64bit/Core_22.fah/FahCore_22.exe
19:38:05:WU01:FS01:Starting
19:38:05:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\Users\Sony\AppData\Roaming\FAHClient\cores/cores.foldingathome.org/v7/win/64bit/Core_22.fah/FahCore_22.exe -dir 01 -suffix 01 -version 706 -lifeline 8728 -checkpoint 15 -gpu-vendor nvidia -opencl-platform 1 -opencl-device 0 -cuda-device 0 -gpu 0
19:38:06:WU01:FS01:Started FahCore on PID 6068
19:38:06:WU01:FS01:Core PID:6772
19:38:06:WU01:FS01:FahCore 0x22 started
19:38:07:WU01:FS01:0x22:*********************** Log Started 2020-06-01T19:38:06Z ***********************
19:38:07:WU01:FS01:0x22:*************************** Core22 Folding@home Core ***************************
19:38:07:WU01:FS01:0x22:       Type: 0x22
19:38:07:WU01:FS01:0x22:       Core: Core22
19:38:07:WU01:FS01:0x22:    Website: https://foldingathome.org/
19:38:07:WU01:FS01:0x22:  Copyright: (c) 2009-2018 foldingathome.org
19:38:07:WU01:FS01:0x22:     Author: John Chodera <john.chodera@choderalab.org> and Rafal Wiewiora
19:38:07:WU01:FS01:0x22:             <rafal.wiewiora@choderalab.org>
19:38:07:WU01:FS01:0x22:       Args: -dir 01 -suffix 01 -version 706 -lifeline 6068 -checkpoint 15
19:38:07:WU01:FS01:0x22:             -gpu-vendor nvidia -opencl-platform 1 -opencl-device 0 -cuda-device
19:38:07:WU01:FS01:0x22:             0 -gpu 0
19:38:07:WU01:FS01:0x22:     Config: <none>
19:38:07:WU01:FS01:0x22:************************************ Build *************************************
19:38:07:WU01:FS01:0x22:    Version: 0.0.5
19:38:07:WU01:FS01:0x22:       Date: Apr 22 2020
19:38:07:WU01:FS01:0x22:       Time: 04:42:59
19:38:07:WU01:FS01:0x22: Repository: Git
19:38:07:WU01:FS01:0x22:   Revision: 2d69202c898bd9bb3e093f51cd32bf411c2a0388
19:38:07:WU01:FS01:0x22:     Branch: HEAD
19:38:07:WU01:FS01:0x22:   Compiler: Visual C++ 2008
19:38:07:WU01:FS01:0x22:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
19:38:07:WU01:FS01:0x22:   Platform: win32 10
19:38:07:WU01:FS01:0x22:       Bits: 64
19:38:07:WU01:FS01:0x22:       Mode: Release
19:38:07:WU01:FS01:0x22:************************************ System ************************************
19:38:07:WU01:FS01:0x22:        CPU: Intel(R) Core(TM) i7-3612QM CPU @ 2.10GHz
19:38:07:WU01:FS01:0x22:     CPU ID: GenuineIntel Family 6 Model 58 Stepping 9
19:38:07:WU01:FS01:0x22:       CPUs: 8
19:38:07:WU01:FS01:0x22:     Memory: 3.90GiB
19:38:07:WU01:FS01:0x22:Free Memory: 1.74GiB
19:38:07:WU01:FS01:0x22:    Threads: WINDOWS_THREADS
19:38:07:WU01:FS01:0x22: OS Version: 6.2
19:38:07:WU01:FS01:0x22:Has Battery: false
19:38:07:WU01:FS01:0x22: On Battery: false
19:38:07:WU01:FS01:0x22: UTC Offset: -7
19:38:07:WU01:FS01:0x22:        PID: 6772
19:38:07:WU01:FS01:0x22:        CWD: C:\Users\Sony\AppData\Roaming\FAHClient\work
19:38:07:WU01:FS01:0x22:         OS: Windows 10 Pro
19:38:07:WU01:FS01:0x22:    OS Arch: AMD64
19:38:07:WU01:FS01:0x22:********************************************************************************
19:38:07:WU01:FS01:0x22:Project: 14447 (Run 0, Clone 1479, Gen 31)
19:38:07:WU01:FS01:0x22:Unit: 0x0000003203854c135ea7b88f547e2499
19:38:07:WU01:FS01:0x22:Reading tar file core.xml
19:38:07:WU01:FS01:0x22:Reading tar file integrator.xml
19:38:07:WU01:FS01:0x22:Reading tar file state.xml
19:38:08:WU01:FS01:0x22:Reading tar file system.xml
19:38:08:WU00:FS00:Download complete
19:38:08:WU00:FS00:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:14236 run:557 clone:0 gen:30 core:0xa7 unit:0x00000024cedfaa925eb8d09be02a4259
19:38:09:WU00:FS00:Downloading core from http://cores.foldingathome.org/v7/win/64bit/avx/Core_a7.fah
19:38:09:WU00:FS00:Connecting to cores.foldingathome.org:80
19:38:09:WU00:FS00:FahCore a7: Downloading 6.71MiB
19:38:10:WU01:FS01:0x22:Digital signatures verified
19:38:10:WU01:FS01:0x22:Folding@home GPU Core22 Folding@home Core
19:38:10:WU01:FS01:0x22:Version 0.0.5
19:38:12:WU00:FS00:FahCore a7: Download complete
19:38:12:WU00:FS00:Valid core signature
19:38:12:WU00:FS00:Unpacked 19.85MiB to cores/cores.foldingathome.org/v7/win/64bit/avx/Core_a7.fah/FahCore_a7.exe
19:38:12:WU00:FS00:Unpacked 2.64MiB to cores/cores.foldingathome.org/v7/win/64bit/avx/Core_a7.fah/libfftw3f-3.dll
19:38:12:WU00:FS00:Starting
19:38:12:WU00:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\Users\Sony\AppData\Roaming\FAHClient\cores/cores.foldingathome.org/v7/win/64bit/avx/Core_a7.fah/FahCore_a7.exe -dir 00 -suffix 01 -version 706 -lifeline 8728 -checkpoint 15 -np 7
19:38:12:WU00:FS00:Started FahCore on PID 8548
19:38:13:WU00:FS00:Core PID:7860
19:38:13:WU00:FS00:FahCore 0xa7 started
19:38:13:WU00:FS00:0xa7:*********************** Log Started 2020-06-01T19:38:13Z ***********************
19:38:13:WU00:FS00:0xa7:************************** Gromacs Folding@home Core ***************************
19:38:13:WU00:FS00:0xa7:       Type: 0xa7
19:38:13:WU00:FS00:0xa7:       Core: Gromacs
19:38:13:WU00:FS00:0xa7:       Args: -dir 00 -suffix 01 -version 706 -lifeline 8548 -checkpoint 15 -np 7
19:38:13:WU00:FS00:0xa7:************************************ CBang *************************************
19:38:13:WU00:FS00:0xa7:       Date: Oct 26 2019
19:38:13:WU00:FS00:0xa7:       Time: 01:38:25
19:38:13:WU00:FS00:0xa7:   Revision: c46a1a011a24143739ac7218c5a435f66777f62f
19:38:13:WU00:FS00:0xa7:     Branch: master
19:38:13:WU00:FS00:0xa7:   Compiler: Visual C++ 2008
19:38:13:WU00:FS00:0xa7:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
19:38:13:WU00:FS00:0xa7:   Platform: win32 10
19:38:13:WU00:FS00:0xa7:       Bits: 64
19:38:13:WU00:FS00:0xa7:       Mode: Release
19:38:13:WU00:FS00:0xa7:************************************ System ************************************
19:38:13:WU00:FS00:0xa7:        CPU: Intel(R) Core(TM) i7-3612QM CPU @ 2.10GHz
19:38:13:WU00:FS00:0xa7:     CPU ID: GenuineIntel Family 6 Model 58 Stepping 9
19:38:13:WU00:FS00:0xa7:       CPUs: 8
19:38:13:WU00:FS00:0xa7:     Memory: 3.90GiB
19:38:13:WU00:FS00:0xa7:Free Memory: 1.69GiB
19:38:13:WU00:FS00:0xa7:    Threads: WINDOWS_THREADS
19:38:13:WU00:FS00:0xa7: OS Version: 6.2
19:38:13:WU00:FS00:0xa7:Has Battery: false
19:38:13:WU00:FS00:0xa7: On Battery: false
19:38:13:WU00:FS00:0xa7: UTC Offset: -7
19:38:13:WU00:FS00:0xa7:        PID: 7860
19:38:13:WU00:FS00:0xa7:        CWD: C:\Users\Sony\AppData\Roaming\FAHClient\work
19:38:13:WU00:FS00:0xa7:******************************** Build - libFAH ********************************
19:38:13:WU00:FS00:0xa7:    Version: 0.0.18
19:38:13:WU00:FS00:0xa7:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
19:38:13:WU00:FS00:0xa7:  Copyright: 2019 foldingathome.org
19:38:13:WU00:FS00:0xa7:   Homepage: https://foldingathome.org/
19:38:13:WU00:FS00:0xa7:       Date: Oct 26 2019
19:38:13:WU00:FS00:0xa7:       Time: 01:52:30
19:38:13:WU00:FS00:0xa7:   Revision: c1e3513b1bc0c16013668f2173ee969e5995b38e
19:38:13:WU00:FS00:0xa7:     Branch: master
19:38:13:WU00:FS00:0xa7:   Compiler: Visual C++ 2008
19:38:13:WU00:FS00:0xa7:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
19:38:13:WU00:FS00:0xa7:   Platform: win32 10
19:38:13:WU00:FS00:0xa7:       Bits: 64
19:38:13:WU00:FS00:0xa7:       Mode: Release
19:38:13:WU00:FS00:0xa7:************************************ Build *************************************
19:38:13:WU00:FS00:0xa7:       SIMD: avx_256
19:38:13:WU00:FS00:0xa7:********************************************************************************
19:38:13:WU00:FS00:0xa7:Project: 14236 (Run 557, Clone 0, Gen 30)
19:38:13:WU00:FS00:0xa7:Unit: 0x00000024cedfaa925eb8d09be02a4259
19:38:13:WU00:FS00:0xa7:Reading tar file core.xml
19:38:13:WU00:FS00:0xa7:Reading tar file frame30.tpr
19:38:13:WU00:FS00:0xa7:Digital signatures verified
19:38:13:WU00:FS00:0xa7:Reducing thread count from 7 to 6 to avoid domain decomposition by a prime number > 3
19:38:13:WU00:FS00:0xa7:Calling: mdrun -s frame30.tpr -o frame30.trr -x frame30.xtc -cpt 15 -nt 6
19:38:14:WU00:FS00:0xa7:Steps: first=1875000 total=62500
19:38:23:WU00:FS00:0xa7:Completed 1 out of 62500 steps (0%)
19:39:18:WU01:FS01:0x22:Completed 0 out of 2000000 steps (0%)
19:39:18:WU01:FS01:0x22:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
19:39:33:36:127.0.0.1:New Web session
19:44:35:WU00:FS00:0xa7:Completed 625 out of 62500 steps (1%)
19:45:04:Saving configuration to config.xml
19:45:04:<config>
19:45:04:  <!-- Folding Core -->
19:45:04:  <core-priority v='low'/>
19:45:04:
19:45:04:  <!-- Folding Slot Configuration -->
19:45:04:  <cause v='COVID_19'/>
19:45:04:
19:45:04:  <!-- Network -->
19:45:04:  <proxy v=':8080'/>
19:45:04:
19:45:04:  <!-- Slot Control -->
19:45:04:  <power v='FULL'/>
19:45:04:
19:45:04:  <!-- User Information -->
19:45:04:  <passkey v='*****'/>
19:45:04:  <team v='749'/>
19:45:04:  <user v='Rwolf01'/>
19:45:04:
19:45:04:  <!-- Folding Slots -->
19:45:04:  <slot id='0' type='CPU'/>
19:45:04:  <slot id='1' type='GPU'/>
19:45:04:</config>
19:50:31:57:127.0.0.1:New Web session
19:50:46:WU00:FS00:0xa7:Completed 1250 out of 62500 steps (2%)
19:56:58:WU00:FS00:0xa7:Completed 1875 out of 62500 steps (3%)
20:03:03:WU00:FS00:0xa7:Completed 2500 out of 62500 steps (4%)
20:09:13:WU00:FS00:0xa7:Completed 3125 out of 62500 steps (5%)
20:11:45:66:127.0.0.1:New Web session
20:15:23:WU00:FS00:0xa7:Completed 3750 out of 62500 steps (6%)
20:21:31:WU00:FS00:0xa7:Completed 4375 out of 62500 steps (7%)
20:27:40:WU00:FS00:0xa7:Completed 5000 out of 62500 steps (8%)
20:33:43:WU00:FS00:0xa7:Completed 5625 out of 62500 steps (9%)
20:39:50:WU00:FS00:0xa7:Completed 6250 out of 62500 steps (10%)
20:45:57:WU00:FS00:0xa7:Completed 6875 out of 62500 steps (11%)
20:51:59:WU00:FS00:0xa7:Completed 7500 out of 62500 steps (12%)
20:58:07:WU00:FS00:0xa7:Completed 8125 out of 62500 steps (13%)
21:04:09:WU00:FS00:0xa7:Completed 8750 out of 62500 steps (14%)
21:10:17:WU00:FS00:0xa7:Completed 9375 out of 62500 steps (15%)
21:16:19:WU00:FS00:0xa7:Completed 10000 out of 62500 steps (16%)
21:23:28:WU00:FS00:0xa7:Completed 10625 out of 62500 steps (17%)
21:29:01:WU01:FS01:0x22:Completed 20000 out of 2000000 steps (1%)
21:30:23:WU00:FS00:0xa7:Completed 11250 out of 62500 steps (18%)
21:36:25:WU00:FS00:0xa7:Completed 11875 out of 62500 steps (19%)
21:42:33:WU00:FS00:0xa7:Completed 12500 out of 62500 steps (20%)
21:48:36:WU00:FS00:0xa7:Completed 13125 out of 62500 steps (21%)
21:54:43:WU00:FS00:0xa7:Completed 13750 out of 62500 steps (22%)
22:00:46:WU00:FS00:0xa7:Completed 14375 out of 62500 steps (23%)
22:06:49:WU00:FS00:0xa7:Completed 15000 out of 62500 steps (24%)
22:12:57:WU00:FS00:0xa7:Completed 15625 out of 62500 steps (25%)
22:18:59:WU00:FS00:0xa7:Completed 16250 out of 62500 steps (26%)
22:25:07:WU00:FS00:0xa7:Completed 16875 out of 62500 steps (27%)
22:31:09:WU00:FS00:0xa7:Completed 17500 out of 62500 steps (28%)
22:37:11:WU00:FS00:0xa7:Completed 18125 out of 62500 steps (29%)
22:43:19:WU00:FS00:0xa7:Completed 18750 out of 62500 steps (30%)
22:49:21:WU00:FS00:0xa7:Completed 19375 out of 62500 steps (31%)
22:55:27:WU00:FS00:0xa7:Completed 20000 out of 62500 steps (32%)
23:01:30:WU00:FS00:0xa7:Completed 20625 out of 62500 steps (33%)
23:07:32:WU00:FS00:0xa7:Completed 21250 out of 62500 steps (34%)
23:13:40:WU00:FS00:0xa7:Completed 21875 out of 62500 steps (35%)
23:19:14:WU01:FS01:0x22:Completed 40000 out of 2000000 steps (2%)
23:19:44:WU00:FS00:0xa7:Completed 22500 out of 62500 steps (36%)
23:25:52:WU00:FS00:0xa7:Completed 23125 out of 62500 steps (37%)
23:31:54:WU00:FS00:0xa7:Completed 23750 out of 62500 steps (38%)
23:37:57:WU00:FS00:0xa7:Completed 24375 out of 62500 steps (39%)
23:44:05:WU00:FS00:0xa7:Completed 25000 out of 62500 steps (40%)
23:50:07:WU00:FS00:0xa7:Completed 25625 out of 62500 steps (41%)
23:56:15:WU00:FS00:0xa7:Completed 26250 out of 62500 steps (42%)
00:02:17:WU00:FS00:0xa7:Completed 26875 out of 62500 steps (43%)
00:08:19:WU00:FS00:0xa7:Completed 27500 out of 62500 steps (44%)
00:14:29:WU00:FS00:0xa7:Completed 28125 out of 62500 steps (45%)
00:20:33:WU00:FS00:0xa7:Completed 28750 out of 62500 steps (46%)
00:26:45:WU00:FS00:0xa7:Completed 29375 out of 62500 steps (47%)
00:32:49:WU00:FS00:0xa7:Completed 30000 out of 62500 steps (48%)
00:38:59:WU00:FS00:0xa7:Completed 30625 out of 62500 steps (49%)
00:45:04:WU00:FS00:0xa7:Completed 31250 out of 62500 steps (50%)
00:51:11:WU00:FS00:0xa7:Completed 31875 out of 62500 steps (51%)
00:57:24:WU00:FS00:0xa7:Completed 32500 out of 62500 steps (52%)
01:03:33:WU00:FS00:0xa7:Completed 33125 out of 62500 steps (53%)
01:09:39:WU01:FS01:0x22:Completed 60000 out of 2000000 steps (3%)
01:09:50:WU00:FS00:0xa7:Completed 33750 out of 62500 steps (54%)
01:15:57:WU00:FS00:0xa7:Completed 34375 out of 62500 steps (55%)
01:18:55:74:127.0.0.1:New Web session
01:22:09:WU00:FS00:0xa7:Completed 35000 out of 62500 steps (56%)
01:28:37:WU00:FS00:0xa7:Completed 35625 out of 62500 steps (57%)
01:34:46:WU00:FS00:0xa7:Completed 36250 out of 62500 steps (58%)
******************************* Date: 2020-06-02 *******************************
01:41:01:WU00:FS00:0xa7:Completed 36875 out of 62500 steps (59%)
01:47:07:WU00:FS00:0xa7:Completed 37500 out of 62500 steps (60%)
01:53:11:WU00:FS00:0xa7:Completed 38125 out of 62500 steps (61%)
01:59:22:WU00:FS00:0xa7:Completed 38750 out of 62500 steps (62%)
02:05:30:WU00:FS00:0xa7:Completed 39375 out of 62500 steps (63%)
02:11:39:WU00:FS00:0xa7:Completed 40000 out of 62500 steps (64%)
02:17:43:WU00:FS00:0xa7:Completed 40625 out of 62500 steps (65%)
02:23:51:WU00:FS00:0xa7:Completed 41250 out of 62500 steps (66%)
02:29:56:WU00:FS00:0xa7:Completed 41875 out of 62500 steps (67%)
02:36:00:WU00:FS00:0xa7:Completed 42500 out of 62500 steps (68%)
02:42:11:WU00:FS00:0xa7:Completed 43125 out of 62500 steps (69%)
02:48:14:WU00:FS00:0xa7:Completed 43750 out of 62500 steps (70%)
02:54:23:WU00:FS00:0xa7:Completed 44375 out of 62500 steps (71%)
03:00:26:WU00:FS00:0xa7:Completed 45000 out of 62500 steps (72%)
03:06:30:WU00:FS00:0xa7:Completed 45625 out of 62500 steps (73%)
03:12:40:WU00:FS00:0xa7:Completed 46250 out of 62500 steps (74%)
03:15:49:WU01:FS01:0x22:Completed 80000 out of 2000000 steps (4%)
03:18:45:WU00:FS00:0xa7:Completed 46875 out of 62500 steps (75%)
03:24:52:WU00:FS00:0xa7:Completed 47500 out of 62500 steps (76%)
03:30:54:WU00:FS00:0xa7:Completed 48125 out of 62500 steps (77%)
03:36:56:WU00:FS00:0xa7:Completed 48750 out of 62500 steps (78%)
03:43:04:WU00:FS00:0xa7:Completed 49375 out of 62500 steps (79%)
03:49:06:WU00:FS00:0xa7:Completed 50000 out of 62500 steps (80%)
03:55:13:WU00:FS00:0xa7:Completed 50625 out of 62500 steps (81%)
04:01:15:WU00:FS00:0xa7:Completed 51250 out of 62500 steps (82%)
04:07:18:WU00:FS00:0xa7:Completed 51875 out of 62500 steps (83%)
04:13:25:WU00:FS00:0xa7:Completed 52500 out of 62500 steps (84%)
04:19:27:WU00:FS00:0xa7:Completed 53125 out of 62500 steps (85%)
04:25:34:WU00:FS00:0xa7:Completed 53750 out of 62500 steps (86%)
04:31:36:WU00:FS00:0xa7:Completed 54375 out of 62500 steps (87%)
04:37:38:WU00:FS00:0xa7:Completed 55000 out of 62500 steps (88%)
04:43:46:WU00:FS00:0xa7:Completed 55625 out of 62500 steps (89%)
04:49:48:WU00:FS00:0xa7:Completed 56250 out of 62500 steps (90%)
04:55:56:WU00:FS00:0xa7:Completed 56875 out of 62500 steps (91%)
05:01:57:WU00:FS00:0xa7:Completed 57500 out of 62500 steps (92%)
05:06:22:WU01:FS01:0x22:Completed 100000 out of 2000000 steps (5%)
05:08:01:WU00:FS00:0xa7:Completed 58125 out of 62500 steps (93%)
05:14:09:WU00:FS00:0xa7:Completed 58750 out of 62500 steps (94%)
05:20:10:WU00:FS00:0xa7:Completed 59375 out of 62500 steps (95%)
05:26:17:WU00:FS00:0xa7:Completed 60000 out of 62500 steps (96%)
05:32:19:WU00:FS00:0xa7:Completed 60625 out of 62500 steps (97%)
05:38:20:WU00:FS00:0xa7:Completed 61250 out of 62500 steps (98%)
05:44:29:WU00:FS00:0xa7:Completed 61875 out of 62500 steps (99%)
05:44:29:WU02:FS00:Connecting to assign1.foldingathome.org:80
05:44:29:WU02:FS00:Assigned to work server 129.213.157.105
05:44:29:WU02:FS00:Requesting new work unit for slot 00: RUNNING cpu:7 from 129.213.157.105
05:44:29:WU02:FS00:Connecting to 129.213.157.105:8080
05:44:30:WU02:FS00:Downloading 5.52MiB
05:44:33:WU02:FS00:Download complete
05:44:33:WU02:FS00:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:16802 run:3 clone:38 gen:96 core:0xa7 unit:0x0000007181d59d695e95d8198b1566d6
05:50:30:WU00:FS00:0xa7:Completed 62500 out of 62500 steps (100%)
05:50:58:WU00:FS00:0xa7:Saving result file ..\logfile_01.txt
05:50:58:WU00:FS00:0xa7:Saving result file frame30.trr
05:50:58:WU00:FS00:0xa7:Saving result file frame30.xtc
05:50:58:WU00:FS00:0xa7:Saving result file md.log
05:50:58:WU00:FS00:0xa7:Saving result file science.log
05:50:59:WU00:FS00:0xa7:Folding@home Core Shutdown: FINISHED_UNIT
05:50:59:WU00:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
05:50:59:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:14236 run:557 clone:0 gen:30 core:0xa7 unit:0x00000024cedfaa925eb8d09be02a4259
05:50:59:WU00:FS00:Uploading 13.05MiB to 206.223.170.146
05:50:59:WU00:FS00:Connecting to 206.223.170.146:8080
05:51:00:WU02:FS00:Starting
05:51:00:WU02:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\Users\Sony\AppData\Roaming\FAHClient\cores/cores.foldingathome.org/v7/win/64bit/avx/Core_a7.fah/FahCore_a7.exe -dir 02 -suffix 01 -version 706 -lifeline 8728 -checkpoint 15 -np 7
05:51:00:WU02:FS00:Started FahCore on PID 2228
05:51:00:WU02:FS00:Core PID:1660
05:51:00:WU02:FS00:FahCore 0xa7 started
05:51:02:WU02:FS00:0xa7:*********************** Log Started 2020-06-02T05:51:02Z ***********************
05:51:02:WU02:FS00:0xa7:************************** Gromacs Folding@home Core ***************************
05:51:02:WU02:FS00:0xa7:       Type: 0xa7
05:51:02:WU02:FS00:0xa7:       Core: Gromacs
05:51:02:WU02:FS00:0xa7:       Args: -dir 02 -suffix 01 -version 706 -lifeline 2228 -checkpoint 15 -np 7
05:51:02:WU02:FS00:0xa7:************************************ CBang *************************************
05:51:02:WU02:FS00:0xa7:       Date: Oct 26 2019
05:51:02:WU02:FS00:0xa7:       Time: 01:38:25
05:51:02:WU02:FS00:0xa7:   Revision: c46a1a011a24143739ac7218c5a435f66777f62f
05:51:02:WU02:FS00:0xa7:     Branch: master
05:51:02:WU02:FS00:0xa7:   Compiler: Visual C++ 2008
05:51:02:WU02:FS00:0xa7:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
05:51:02:WU02:FS00:0xa7:   Platform: win32 10
05:51:02:WU02:FS00:0xa7:       Bits: 64
05:51:02:WU02:FS00:0xa7:       Mode: Release
05:51:02:WU02:FS00:0xa7:************************************ System ************************************
05:51:02:WU02:FS00:0xa7:        CPU: Intel(R) Core(TM) i7-3612QM CPU @ 2.10GHz
05:51:02:WU02:FS00:0xa7:     CPU ID: GenuineIntel Family 6 Model 58 Stepping 9
05:51:02:WU02:FS00:0xa7:       CPUs: 8
05:51:02:WU02:FS00:0xa7:     Memory: 3.90GiB
05:51:02:WU02:FS00:0xa7:Free Memory: 1.55GiB
05:51:02:WU02:FS00:0xa7:    Threads: WINDOWS_THREADS
05:51:02:WU02:FS00:0xa7: OS Version: 6.2
05:51:02:WU02:FS00:0xa7:Has Battery: false
05:51:02:WU02:FS00:0xa7: On Battery: false
05:51:02:WU02:FS00:0xa7: UTC Offset: -7
05:51:02:WU02:FS00:0xa7:        PID: 1660
05:51:02:WU02:FS00:0xa7:        CWD: C:\Users\Sony\AppData\Roaming\FAHClient\work
05:51:02:WU02:FS00:0xa7:******************************** Build - libFAH ********************************
05:51:02:WU02:FS00:0xa7:    Version: 0.0.18
05:51:02:WU02:FS00:0xa7:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
05:51:02:WU02:FS00:0xa7:  Copyright: 2019 foldingathome.org
05:51:02:WU02:FS00:0xa7:   Homepage: https://foldingathome.org/
05:51:02:WU02:FS00:0xa7:       Date: Oct 26 2019
05:51:02:WU02:FS00:0xa7:       Time: 01:52:30
05:51:02:WU02:FS00:0xa7:   Revision: c1e3513b1bc0c16013668f2173ee969e5995b38e
05:51:02:WU02:FS00:0xa7:     Branch: master
05:51:02:WU02:FS00:0xa7:   Compiler: Visual C++ 2008
05:51:02:WU02:FS00:0xa7:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
05:51:02:WU02:FS00:0xa7:   Platform: win32 10
05:51:02:WU02:FS00:0xa7:       Bits: 64
05:51:02:WU02:FS00:0xa7:       Mode: Release
05:51:02:WU02:FS00:0xa7:************************************ Build *************************************
05:51:02:WU02:FS00:0xa7:       SIMD: avx_256
05:51:02:WU02:FS00:0xa7:********************************************************************************
05:51:02:WU02:FS00:0xa7:Project: 16802 (Run 3, Clone 38, Gen 96)
05:51:02:WU02:FS00:0xa7:Unit: 0x0000007181d59d695e95d8198b1566d6
05:51:02:WU02:FS00:0xa7:Reading tar file core.xml
05:51:02:WU02:FS00:0xa7:Reading tar file frame96.tpr
05:51:02:WU02:FS00:0xa7:Digital signatures verified
05:51:02:WU02:FS00:0xa7:Reducing thread count from 7 to 6 to avoid domain decomposition by a prime number > 3
05:51:02:WU02:FS00:0xa7:Calling: mdrun -s frame96.tpr -o frame96.trr -cpt 15 -nt 6
05:51:02:WU02:FS00:0xa7:Steps: first=24000000 total=250000
05:51:05:WU00:FS00:Upload 12.46%
05:51:07:WU02:FS00:0xa7:Completed 1 out of 250000 steps (0%)
05:51:11:WU00:FS00:Upload 41.20%
05:51:17:WU00:FS00:Upload 70.91%
05:51:23:WU00:FS00:Upload 100.00%
05:51:23:WU00:FS00:Upload complete
05:51:23:WU00:FS00:Server responded WORK_ACK (400)
05:51:23:WU00:FS00:Final credit estimate, 5264.00 points
05:51:23:WU00:FS00:Cleaning up
06:02:56:WU02:FS00:0xa7:Completed 2500 out of 250000 steps (1%)
06:14:47:WU02:FS00:0xa7:Completed 5000 out of 250000 steps (2%)
06:26:39:WU02:FS00:0xa7:Completed 7500 out of 250000 steps (3%)
06:38:31:WU02:FS00:0xa7:Completed 10000 out of 250000 steps (4%)
06:50:22:WU02:FS00:0xa7:Completed 12500 out of 250000 steps (5%)
06:56:15:WU01:FS01:0x22:Completed 120000 out of 2000000 steps (6%)
07:02:15:WU02:FS00:0xa7:Completed 15000 out of 250000 steps (6%)
07:14:08:WU02:FS00:0xa7:Completed 17500 out of 250000 steps (7%)
07:25:59:WU02:FS00:0xa7:Completed 20000 out of 250000 steps (8%)
07:37:51:WU02:FS00:0xa7:Completed 22500 out of 250000 steps (9%)
******************************* Date: 2020-06-02 *******************************
07:49:42:WU02:FS00:0xa7:Completed 25000 out of 250000 steps (10%)
07:57:15:84:127.0.0.1:New Web session
08:01:40:WU02:FS00:0xa7:Completed 27500 out of 250000 steps (11%)
08:13:48:WU02:FS00:0xa7:Completed 30000 out of 250000 steps (12%)
08:25:51:WU02:FS00:0xa7:Completed 32500 out of 250000 steps (13%)
08:38:03:WU02:FS00:0xa7:Completed 35000 out of 250000 steps (14%)
08:41:12:ERROR:Receive error: 10054: An existing connection was forcibly closed by the remote host.
08:49:56:WU02:FS00:0xa7:Completed 37500 out of 250000 steps (15%)
08:50:20:WU01:FS01:0x22:Completed 140000 out of 2000000 steps (7%)
09:01:53:WU02:FS00:0xa7:Completed 40000 out of 250000 steps (16%)
09:13:42:WU02:FS00:0xa7:Completed 42500 out of 250000 steps (17%)
09:25:31:WU02:FS00:0xa7:Completed 45000 out of 250000 steps (18%)
09:37:23:WU02:FS00:0xa7:Completed 47500 out of 250000 steps (19%)
09:49:09:WU02:FS00:0xa7:Completed 50000 out of 250000 steps (20%)
10:01:02:WU02:FS00:0xa7:Completed 52500 out of 250000 steps (21%)
10:12:51:WU02:FS00:0xa7:Completed 55000 out of 250000 steps (22%)
10:24:40:WU02:FS00:0xa7:Completed 57500 out of 250000 steps (23%)
10:36:27:WU02:FS00:0xa7:Completed 60000 out of 250000 steps (24%)
10:40:08:WU01:FS01:0x22:Completed 160000 out of 2000000 steps (8%)
10:48:17:WU02:FS00:0xa7:Completed 62500 out of 250000 steps (25%)
11:00:10:WU02:FS00:0xa7:Completed 65000 out of 250000 steps (26%)
11:12:25:WU02:FS00:0xa7:Completed 67500 out of 250000 steps (27%)
11:25:18:WU02:FS00:0xa7:Completed 70000 out of 250000 steps (28%)
11:37:47:WU02:FS00:0xa7:Completed 72500 out of 250000 steps (29%)
11:49:35:WU02:FS00:0xa7:Completed 75000 out of 250000 steps (30%)
12:01:25:WU02:FS00:0xa7:Completed 77500 out of 250000 steps (31%)
12:13:14:WU02:FS00:0xa7:Completed 80000 out of 250000 steps (32%)
12:25:03:WU02:FS00:0xa7:Completed 82500 out of 250000 steps (33%)
12:29:51:WU01:FS01:0x22:Completed 180000 out of 2000000 steps (9%)
12:36:54:WU02:FS00:0xa7:Completed 85000 out of 250000 steps (34%)
12:48:42:WU02:FS00:0xa7:Completed 87500 out of 250000 steps (35%)
13:00:31:WU02:FS00:0xa7:Completed 90000 out of 250000 steps (36%)
13:12:19:WU02:FS00:0xa7:Completed 92500 out of 250000 steps (37%)
13:24:10:WU02:FS00:0xa7:Completed 95000 out of 250000 steps (38%)
13:35:57:WU02:FS00:0xa7:Completed 97500 out of 250000 steps (39%)
13:47:52:WU02:FS00:0xa7:Completed 100000 out of 250000 steps (40%)
******************************* Date: 2020-06-02 *******************************
13:59:41:WU02:FS00:0xa7:Completed 102500 out of 250000 steps (41%)
14:11:30:WU02:FS00:0xa7:Completed 105000 out of 250000 steps (42%)
14:19:54:WU01:FS01:0x22:Completed 200000 out of 2000000 steps (10%)
14:23:21:WU02:FS00:0xa7:Completed 107500 out of 250000 steps (43%)
Its a very humble GPU (passmark rating of 853, when state of the art is ~20,000).

I don't think there is anything to fix. We have just inadvertantly entered a hamster into a horse race.

The CPUs on these laptops are equally humble. They are rated about 5.6% of the state of the art, but they are getting approriately sized WUs that are finishing in under a day.

So they are not rocketships, but they are available to run 24/7 and "quantity has a quality all of it's own."

Is it possible to just put a min or max limit on the base credit (job size) a slot will accept? My other laptop is ~10x faster, and would be happy to pass on the small jobs. So all the work would still get done.
HugoNotte
Posts: 66
Joined: Tue Apr 07, 2020 7:09 pm

Re: Enormous job: how did I get it, how can I fix it?

Post by HugoNotte »

In the beginning I let FAH run on my GT 630M, even less capable than your GPU. I didn't receive a single WU for that GPU that would finish before time out, crunching 24/7. After realizing that, I cancelled the GPU slot on my laptop, some hardware is just too old to keep up and be of real value to science, in my opinion.
JimboPalmer
Posts: 2522
Joined: Mon Feb 16, 2009 4:12 am
Location: Greenwood MS USA

Re: Enormous job: how did I get it, how can I fix it?

Post by JimboPalmer »

First, your CPU is actually pretty good compared to mine. It is not using all 8 threads as it is saving one for a GPU that won't finish, and ignoring one that makes a Prime number. If we remove the GPU slot, then next CPU work unit should be faster yet as you have more threads.

In the taskbar to the lower right of the screen, you should see a F@H molecule icon, click it (you may need to click an Up Arrow to see it ^)

The second item in this menu is Advanced Control, click it

On this screen to the left is a Configure button, click it

Now you get a screen with a Slots tab, click it

On this white field should be a gpu item, click it and then click remove. now save.

This will let F@H know you have abandoned the Work Unit for the GPU, and soon your CPU will get more work.
Tsar of all the Rushers
I tried to remain childlike, all I achieved was childish.
A friend to those who want no friends
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Enormous job: how did I get it, how can I fix it?

Post by bruce »

The GT 6xx series GPus are, in fact, some of the slowest GPUs that we support. I'll speak to owner of project 14447 and ask him to exclude assignments to Kepler GPUs from the project's assignment permission set so nobody else run into the same problem. (Gpu species 3 shouldn't be tasked with projects that have 72,700 atoms.)
Rwolf01
Posts: 20
Joined: Mon Jun 01, 2020 8:06 pm

Re: Enormous job: how did I get it, how can I fix it?

Post by Rwolf01 »

Okay thanks. I killed the job that was running too slow to be useful.

But if "base credit" points are a good measure of actual utility, between the CPU and GPU, I'd say it's worth keeping those GPUs running. They were earning about 9400 points/day when the CPU was only making about 4000. SO it's a bad deal to turn off 9400 points/day of GPU power, just to gain one CPU thread that is worth 500 points/day.

So I'd like to try turning that GPU slot back on, if you think you've got things dialed in that it will get reasonable sized jobs.

Also, would it be disruptive if I used the remove/restore slot action as a way to reject jobs that are too large/slow? If it's not a problem on your end, I could use that as a workaround.

My other 640M GPU is doing a job for P14253 that will finish ~ 2 days before Expiration time, but after the "timeout". I'll leave that running, unless you ask me to send it back also.

Cheers,

- Ralph
HugoNotte
Posts: 66
Joined: Tue Apr 07, 2020 7:09 pm

Re: Enormous job: how did I get it, how can I fix it?

Post by HugoNotte »

That's weird that your i7 mobile with 8 threads only makes 4000 ppd. My i5-3210M is good for around 11,000 ppd on average.
PantherX
Site Moderator
Posts: 6986
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
§
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud
Contact:

Re: Enormous job: how did I get it, how can I fix it?

Post by PantherX »

If the CPU has a lot of interruptions while it is folding, it can cause a measurable impact on the PPD since the TPF (Time Per Frame) would be increased. A good balance would be to see what daily applications you use and see how much of the CPU percentage it uses and then configure the CPU Slot to fill in the "unused" CPU percentage. That way, you can potentially decrease the TPF and increase the PPD. Experiment with your system to find the optimal configuration.
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time

Welcome To The F@H Support Forum Ӂ Troubleshooting Bad WUs Ӂ Troubleshooting Server Connectivity Issues
Rwolf01
Posts: 20
Joined: Mon Jun 01, 2020 8:06 pm

Re: Enormous job: how did I get it, how can I fix it?

Post by Rwolf01 »

I was only looking at the "base credits". Total PPD is currently 12685, w/o the GPU.

These are retired machines that were only turned back on to run FAH. (My new latop, even with distractions, is smoking at 392,000 PPD :-)

Is there an newbie's guide to tuning the slots? I am particularly interested in how I might give the GPU permission to use more system memory.

(I geet seeing "Experts Only" and I'm like a moth to a flame... :-)
peterjammo
Posts: 90
Joined: Wed Mar 25, 2020 1:19 pm

Re: Enormous job: how did I get it, how can I fix it?

Post by peterjammo »

I think I can answer most of those questions:

Process: If your WU passes Timeout, the Work Server reissues the WU to the next folder. If you complete between Timeout and Expiry, your WU is still uploaded, you still get points, and if your WU is first back, it will still trigger next in series to be issued. The second folder will however still fold a now pointless WU to completion. If your WU reaches Expiry before completing, the Client deletes it.

Transferring WU. Short answer is you can't transfer a WU from a slow machine to a faster one. Long answer is that if you run Linux,you may be able to swap hard drives and do it that way,but that won't work with Windows.

Dumping Large WU. This is discouraged. If you ask on any specific WU, you'll only be advised to dump if you're definitely going to exceed Expiry. I'm not 100% sure of the reasons for this.

Is it worth folding with a very slow GPU: I'd say no. I have a slightly faster GPU than yours sidelined. There still appears to be excess GPU capacity, so folding with a low end GPU just slows down the science, and contributes a little to global warming. The situation seems to be different for CPUs, with there being more work available than CPUs, so any CPU which can meet Timeout is still worthwhile.

Tuning slots: The only useful tuning I've found is the ability to set how many CPU cores should fold. There's plenty on the forum on this if you search, but the default setting which alows the client to decide may work just as well on your machines.

Hope this helps. I'm sure if I've missed anything or got anything wrong,someone will be along soon to sort that.
JimboPalmer
Posts: 2522
Joined: Mon Feb 16, 2009 4:12 am
Location: Greenwood MS USA

Re: Enormous job: how did I get it, how can I fix it?

Post by JimboPalmer »

HugoNotte wrote:That's weird that your i7 mobile with 8 threads only makes 4000 ppd. My i5-3210M is good for around 11,000 ppd on average.
I do not think we have ever convinced him to run all 8 threads, he is using 6. To use all 8 he has to remove the (useless) GPU slot.
Tsar of all the Rushers
I tried to remain childlike, all I achieved was childish.
A friend to those who want no friends
Joe_H
Site Admin
Posts: 7943
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: Enormous job: how did I get it, how can I fix it?

Post by Joe_H »

My experience with HT enabled Intel CPUs and processing WUs on a laptop is that setting the client to only use the the main cores often will either produce as many points or sometimes do better. With HT in use you may run into thermal throttling, So it may take a bit of experimenting to see what works best.
Image

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
Rwolf01
Posts: 20
Joined: Mon Jun 01, 2020 8:06 pm

Re: Enormous job: how did I get it, how can I fix it?

Post by Rwolf01 »

>> Is it worth folding with a very slow GPU: I'd say no.
That's a straw man argument. I am not convinced the GPU is that slow. (certainly there are faster ones, but I'm comparing the CPU and GPU in their ability to earn points. Assuming the points are a fair measure of the computational value of the work, it's a good metric)

The CPU has a job worth 525 base points (2449 estimated) that is 26% done after 1:29:18. Converting that to points/day I see the CPU (which is keeping all 8 threads busy, btw) as being worth 2201 base points/day and 10,268 estiamted points/day.

The GPU on the same machine has a job worth 2500 base points (5368 estiamted) that is 86% done after 6:41:39. That works out to 7708 base points/day or 16,551 estimated.

If it only costs me 1 thread out of 8 to support the GPU, I am paying 275 points/day to get 7700. That seems like a no brainer. Doing the same math on extended points says I'm paying 1280 points to get 16,550. Also a clear win.

I am new here and trying to understand. I respond well to reason. (Less so to bullying)

If there is an error in my math or assumptions please point it out.
Post Reply