Project: 2665 (Run 3, Clone 689, Gen 46) immediate NaN error
Moderators: Site Moderators, FAHC Science Team
Project: 2665 (Run 3, Clone 689, Gen 46) immediate NaN error
Hello all:
I have un-installed and re-installed this program for over a week and can not seem to make it work. here is the latest log after a new re-install
Ed
--- Opening Log file [October 15 04:24:46 UTC]
# Windows SMP Console Edition #################################################
###############################################################################
Folding@Home Client Version 6.22 SMP Beta2
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: C:\FAHsmp
Executable: C:\FAHsmp\Folding@home-Win32-x86 -smp.exe
Arguments: -smp
[04:24:46] - Ask before connecting: No
[04:24:46] - User name: Ed_B (Team 40051)
[04:24:46] - User ID: 236A633A1479A2B5
[04:24:46] - Machine ID: 1
[04:24:46]
[04:24:46] Work directory not found. Creating...
[04:24:46] Could not open work queue, generating new queue...
[04:24:46] - Preparing to get new work unit...
[04:24:46] + Attempting to get work packet
[04:24:46] - Connecting to assignment server
[04:24:46] - Successful: assigned to (171.64.65.64).
[04:24:46] + News From Folding@Home: Welcome to Folding@Home
[04:24:46] Loaded queue successfully.
[04:25:26] + Closed connections
[04:25:26]
[04:25:26] + Processing work unit
[04:25:26] Work type a1 not eligible for variable processors
[04:25:26] Core required: FahCore_a1.exe
[04:25:26] Core not found.
[04:25:26] - Core is not present or corrupted.
[04:25:26] - Attempting to download new core...
[04:25:26] + Downloading new core: FahCore_a1.exe
[04:25:26] + 10240 bytes downloaded
[04:25:26] + 20480 bytes downloaded
[04:25:26] + 30720 bytes downloaded
[04:25:27] + 40960 bytes downloaded
[04:25:27] + 51200 bytes downloaded
[04:25:27] + 61440 bytes downloaded
[04:25:27] + 71680 bytes downloaded
[04:25:27] + 81920 bytes downloaded
[04:25:27] + 92160 bytes downloaded
[04:25:27] + 102400 bytes downloaded
[04:25:27] + 112640 bytes downloaded
[04:25:27] + 122880 bytes downloaded
[04:25:27] + 133120 bytes downloaded
[04:25:27] + 143360 bytes downloaded
[04:25:27] + 153600 bytes downloaded
[04:25:27] + 163840 bytes downloaded
[04:25:27] + 174080 bytes downloaded
[04:25:28] + 184320 bytes downloaded
[04:25:28] + 194560 bytes downloaded
[04:25:28] + 204800 bytes downloaded
[04:25:28] + 215040 bytes downloaded
[04:25:28] + 225280 bytes downloaded
[04:25:28] + 235520 bytes downloaded
[04:25:29] + 245760 bytes downloaded
[04:25:31] + 256000 bytes downloaded
[04:25:32] + 266240 bytes downloaded
[04:25:33] + 276480 bytes downloaded
[04:25:34] + 286720 bytes downloaded
[04:25:35] + 296960 bytes downloaded
[04:25:37] + 307200 bytes downloaded
[04:25:38] + 317440 bytes downloaded
[04:25:39] + 327680 bytes downloaded
[04:25:41] + 337920 bytes downloaded
[04:25:41] + 348160 bytes downloaded
[04:25:41] + 358400 bytes downloaded
[04:25:41] + 368640 bytes downloaded
[04:25:41] + 378880 bytes downloaded
[04:25:42] + 389120 bytes downloaded
[04:25:42] + 399360 bytes downloaded
[04:25:42] + 409600 bytes downloaded
[04:25:42] + 419840 bytes downloaded
[04:25:42] + 430080 bytes downloaded
[04:25:42] + 440320 bytes downloaded
[04:25:42] + 450560 bytes downloaded
[04:25:42] + 460800 bytes downloaded
[04:25:42] + 471040 bytes downloaded
[04:25:42] + 481280 bytes downloaded
[04:25:42] + 491520 bytes downloaded
[04:25:42] + 501760 bytes downloaded
[04:25:42] + 512000 bytes downloaded
[04:25:42] + 522240 bytes downloaded
[04:25:43] + 532480 bytes downloaded
[04:25:43] + 542720 bytes downloaded
[04:25:43] + 552960 bytes downloaded
[04:25:43] + 563200 bytes downloaded
[04:25:43] + 573440 bytes downloaded
[04:25:43] + 583680 bytes downloaded
[04:25:43] + 593920 bytes downloaded
[04:25:43] + 604160 bytes downloaded
[04:25:43] + 614400 bytes downloaded
[04:25:43] + 624640 bytes downloaded
[04:25:43] + 634880 bytes downloaded
[04:25:43] + 645120 bytes downloaded
[04:25:43] + 655360 bytes downloaded
[04:25:43] + 665600 bytes downloaded
[04:25:44] + 675840 bytes downloaded
[04:25:44] + 686080 bytes downloaded
[04:25:44] + 696320 bytes downloaded
[04:25:44] + 706560 bytes downloaded
[04:25:44] + 716800 bytes downloaded
[04:25:44] + 727040 bytes downloaded
[04:25:44] + 737280 bytes downloaded
[04:25:44] + 747520 bytes downloaded
[04:25:44] + 757760 bytes downloaded
[04:25:44] + 768000 bytes downloaded
[04:25:44] + 778240 bytes downloaded
[04:25:44] + 788480 bytes downloaded
[04:25:44] + 789667 bytes downloaded
[04:25:44] Verifying core Core_a1.fah...
[04:25:44] Signature is VALID
[04:25:44]
[04:25:44] Trying to unzip core FahCore_a1.exe
[04:25:44] Decompressed FahCore_a1.exe (2035712 bytes) successfully
[04:25:49] + Core successfully engaged
[04:25:54]
[04:25:54] + Processing work unit
[04:25:54] Work type a1 not eligible for variable processors
[04:25:54] Core required: FahCore_a1.exe
[04:25:54] Core found.
[04:25:54] Using generic mpiexec calls
[04:25:54] Working on queue slot 01 [October 15 04:25:54 UTC]
[04:25:54] + Working ...
[04:25:55]
[04:25:55] *------------------------------*
[04:25:55] Folding@Home Gromacs SMP Core
[04:25:55] Version 1.74 (March 10, 2007)
[04:25:55]
[04:25:55] Preparing to commence simulation
[04:25:55] - Ensuring status. Please wait.
[04:26:00] - Starting from initial work packet
[04:26:00]
[04:26:00] Project: 2665 (Run 3, Clone 689, Gen 46)
[04:26:00]
[04:26:00] acket
[04:26:00]
[04:26:00] Project: 2665 (Run 3, Clone 689, Gen 46)
[04:26:00]
[04:26:01] M.D.
[04:26:01] ing M.D.
[04:26:17] a- Starting from initial work packet
[04:26:17]
[04:26:17] Project: 2665 (Run 3, Clone 689, Gen 46)
[04:26:17]
[04:26:18] Entering M.D.
[04:26:24] Rejecting checkpoint
[04:26:25] NaN detected: x[23838][2]=5.39614 v[23838][2]=NaN
[04:26:25] ng output
Folding@Home Client Shutdown at user request.
Folding@Home Client Shutdown.
I have un-installed and re-installed this program for over a week and can not seem to make it work. here is the latest log after a new re-install
Ed
--- Opening Log file [October 15 04:24:46 UTC]
# Windows SMP Console Edition #################################################
###############################################################################
Folding@Home Client Version 6.22 SMP Beta2
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: C:\FAHsmp
Executable: C:\FAHsmp\Folding@home-Win32-x86 -smp.exe
Arguments: -smp
[04:24:46] - Ask before connecting: No
[04:24:46] - User name: Ed_B (Team 40051)
[04:24:46] - User ID: 236A633A1479A2B5
[04:24:46] - Machine ID: 1
[04:24:46]
[04:24:46] Work directory not found. Creating...
[04:24:46] Could not open work queue, generating new queue...
[04:24:46] - Preparing to get new work unit...
[04:24:46] + Attempting to get work packet
[04:24:46] - Connecting to assignment server
[04:24:46] - Successful: assigned to (171.64.65.64).
[04:24:46] + News From Folding@Home: Welcome to Folding@Home
[04:24:46] Loaded queue successfully.
[04:25:26] + Closed connections
[04:25:26]
[04:25:26] + Processing work unit
[04:25:26] Work type a1 not eligible for variable processors
[04:25:26] Core required: FahCore_a1.exe
[04:25:26] Core not found.
[04:25:26] - Core is not present or corrupted.
[04:25:26] - Attempting to download new core...
[04:25:26] + Downloading new core: FahCore_a1.exe
[04:25:26] + 10240 bytes downloaded
[04:25:26] + 20480 bytes downloaded
[04:25:26] + 30720 bytes downloaded
[04:25:27] + 40960 bytes downloaded
[04:25:27] + 51200 bytes downloaded
[04:25:27] + 61440 bytes downloaded
[04:25:27] + 71680 bytes downloaded
[04:25:27] + 81920 bytes downloaded
[04:25:27] + 92160 bytes downloaded
[04:25:27] + 102400 bytes downloaded
[04:25:27] + 112640 bytes downloaded
[04:25:27] + 122880 bytes downloaded
[04:25:27] + 133120 bytes downloaded
[04:25:27] + 143360 bytes downloaded
[04:25:27] + 153600 bytes downloaded
[04:25:27] + 163840 bytes downloaded
[04:25:27] + 174080 bytes downloaded
[04:25:28] + 184320 bytes downloaded
[04:25:28] + 194560 bytes downloaded
[04:25:28] + 204800 bytes downloaded
[04:25:28] + 215040 bytes downloaded
[04:25:28] + 225280 bytes downloaded
[04:25:28] + 235520 bytes downloaded
[04:25:29] + 245760 bytes downloaded
[04:25:31] + 256000 bytes downloaded
[04:25:32] + 266240 bytes downloaded
[04:25:33] + 276480 bytes downloaded
[04:25:34] + 286720 bytes downloaded
[04:25:35] + 296960 bytes downloaded
[04:25:37] + 307200 bytes downloaded
[04:25:38] + 317440 bytes downloaded
[04:25:39] + 327680 bytes downloaded
[04:25:41] + 337920 bytes downloaded
[04:25:41] + 348160 bytes downloaded
[04:25:41] + 358400 bytes downloaded
[04:25:41] + 368640 bytes downloaded
[04:25:41] + 378880 bytes downloaded
[04:25:42] + 389120 bytes downloaded
[04:25:42] + 399360 bytes downloaded
[04:25:42] + 409600 bytes downloaded
[04:25:42] + 419840 bytes downloaded
[04:25:42] + 430080 bytes downloaded
[04:25:42] + 440320 bytes downloaded
[04:25:42] + 450560 bytes downloaded
[04:25:42] + 460800 bytes downloaded
[04:25:42] + 471040 bytes downloaded
[04:25:42] + 481280 bytes downloaded
[04:25:42] + 491520 bytes downloaded
[04:25:42] + 501760 bytes downloaded
[04:25:42] + 512000 bytes downloaded
[04:25:42] + 522240 bytes downloaded
[04:25:43] + 532480 bytes downloaded
[04:25:43] + 542720 bytes downloaded
[04:25:43] + 552960 bytes downloaded
[04:25:43] + 563200 bytes downloaded
[04:25:43] + 573440 bytes downloaded
[04:25:43] + 583680 bytes downloaded
[04:25:43] + 593920 bytes downloaded
[04:25:43] + 604160 bytes downloaded
[04:25:43] + 614400 bytes downloaded
[04:25:43] + 624640 bytes downloaded
[04:25:43] + 634880 bytes downloaded
[04:25:43] + 645120 bytes downloaded
[04:25:43] + 655360 bytes downloaded
[04:25:43] + 665600 bytes downloaded
[04:25:44] + 675840 bytes downloaded
[04:25:44] + 686080 bytes downloaded
[04:25:44] + 696320 bytes downloaded
[04:25:44] + 706560 bytes downloaded
[04:25:44] + 716800 bytes downloaded
[04:25:44] + 727040 bytes downloaded
[04:25:44] + 737280 bytes downloaded
[04:25:44] + 747520 bytes downloaded
[04:25:44] + 757760 bytes downloaded
[04:25:44] + 768000 bytes downloaded
[04:25:44] + 778240 bytes downloaded
[04:25:44] + 788480 bytes downloaded
[04:25:44] + 789667 bytes downloaded
[04:25:44] Verifying core Core_a1.fah...
[04:25:44] Signature is VALID
[04:25:44]
[04:25:44] Trying to unzip core FahCore_a1.exe
[04:25:44] Decompressed FahCore_a1.exe (2035712 bytes) successfully
[04:25:49] + Core successfully engaged
[04:25:54]
[04:25:54] + Processing work unit
[04:25:54] Work type a1 not eligible for variable processors
[04:25:54] Core required: FahCore_a1.exe
[04:25:54] Core found.
[04:25:54] Using generic mpiexec calls
[04:25:54] Working on queue slot 01 [October 15 04:25:54 UTC]
[04:25:54] + Working ...
[04:25:55]
[04:25:55] *------------------------------*
[04:25:55] Folding@Home Gromacs SMP Core
[04:25:55] Version 1.74 (March 10, 2007)
[04:25:55]
[04:25:55] Preparing to commence simulation
[04:25:55] - Ensuring status. Please wait.
[04:26:00] - Starting from initial work packet
[04:26:00]
[04:26:00] Project: 2665 (Run 3, Clone 689, Gen 46)
[04:26:00]
[04:26:00] acket
[04:26:00]
[04:26:00] Project: 2665 (Run 3, Clone 689, Gen 46)
[04:26:00]
[04:26:01] M.D.
[04:26:01] ing M.D.
[04:26:17] a- Starting from initial work packet
[04:26:17]
[04:26:17] Project: 2665 (Run 3, Clone 689, Gen 46)
[04:26:17]
[04:26:18] Entering M.D.
[04:26:24] Rejecting checkpoint
[04:26:25] NaN detected: x[23838][2]=5.39614 v[23838][2]=NaN
[04:26:25] ng output
Folding@Home Client Shutdown at user request.
Folding@Home Client Shutdown.
-
- Posts: 2948
- Joined: Sun Dec 02, 2007 4:36 am
- Hardware configuration: Machine #1:
Intel Q9450; 2x2GB=8GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460; Windows Server 2008 X64 (SP1).
Machine #2:
Intel Q6600; 2x2GB=4GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460 video card; Windows 7 X64.
Machine 3:
Dell Dimension 8400, 3.2GHz P4 4x512GB Ram, Video card GTX 460, Windows 7 X32
I am currently folding just on the 5x GTX 460's for aprox. 70K PPD - Location: Salem. OR USA
Re: smp not working for me!!
Lots of questions -- Is the client sending the data back to the server? Are you getting the same WU with the same Run, clone, and gen over and over? Are you overclocking? Have you checked your RAM (a common problem with machines that are repeatedly getting EUE's and NaN's)? Do you have a internet connection that likes going up and down on a regular basis (Use the Deino version to help with this problem)?
Re: smp not working for me!!
I'll check the ram, everything else seems to be working alright. my internet connection is solid and it has no problem connecting to the server and downloading work Units. Different WU's same result.
Ed
Ed
-
- Posts: 2948
- Joined: Sun Dec 02, 2007 4:36 am
- Hardware configuration: Machine #1:
Intel Q9450; 2x2GB=8GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460; Windows Server 2008 X64 (SP1).
Machine #2:
Intel Q6600; 2x2GB=4GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460 video card; Windows 7 X64.
Machine 3:
Dell Dimension 8400, 3.2GHz P4 4x512GB Ram, Video card GTX 460, Windows 7 X32
I am currently folding just on the 5x GTX 460's for aprox. 70K PPD - Location: Salem. OR USA
Re: smp not working for me!!
It isn't a question of connecting but rather staying up. The standard SMP uses the loopback connection of the network to communicate and if the network goes down even for a fraction of a second, for any reason, it will kill the WU. The Denio version works much better in situations where your networking is intermittant for example if your network card powers down when not being used...
Re: smp not working for me!!
I suspect that this particular WU is defective. It has been assigned to three people, plus some have received it more than once. Everyone has gotten zero points. I'll notify the project researcher so he can pull that WU out of circulation. In the meantime, you need to discard it until the server assigns you some other WU. I suspect that the next assignment will work for you, but if it doesn't we'll need a fresh copy of FAHlog so we can figure out what to do next. (You did say you got a different WU already but didn't report what it was.)
I changed the thread title to reflect this new way of looking at the problem, but you do need to switch to the client beta version 6.23 and see if the new error handling feature reports the error correctly. viewtopic.php?f=46&t=6143
I changed the thread title to reflect this new way of looking at the problem, but you do need to switch to the client beta version 6.23 and see if the new error handling feature reports the error correctly. viewtopic.php?f=46&t=6143
Posting FAH's log:
How to provide enough info to get helpful support.
How to provide enough info to get helpful support.
Re: Project: 2665 (Run 3, Clone 689, Gen 46) immediate NaN error
Bruce,
I can confirm this issue still exists, here is my log from today.
I can confirm this issue still exists, here is my log from today.
Code: Select all
--- Opening Log file [December 20 21:31:22 UTC]
# Windows SMP Console Edition #################################################
###############################################################################
Folding@Home Client Version 6.22 SMP Beta2
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: C:\Program Files (x86)\Folding@home\Folding@Home-SMP
Executable: C:\Program Files (x86)\Folding@home\Folding@Home-SMP\FAH-SMP.exe
Arguments: -smp -verbosity 9
[21:31:22] - Ask before connecting: No
[21:31:22] - User name: ParrLeyne (Team 0)
[21:31:22] - User ID: 1C537EE405B02D03
[21:31:22] - Machine ID: 2
[21:31:22]
[21:31:22] Work directory not found. Creating...
[21:31:22] Could not open work queue, generating new queue...
[21:31:22] - Preparing to get new work unit...
[21:31:22] - Autosending finished units... [December 20 21:31:22 UTC]
[21:31:22] + Attempting to get work packet
[21:31:22] Trying to send all finished work units
[21:31:22] - Will indicate memory of 2048 MB
[21:31:22] + No unsent completed units remaining.
[21:31:22] - Detect CPU.[21:31:22] - Autosend completed
Vendor: GenuineIntel, Family: 6, Model: 15, Stepping: 11
[21:31:22] - Connecting to assignment server
[21:31:22] Connecting to http://assign.stanford.edu:8080/
[21:31:23] Posted data.
[21:31:23] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[21:31:23] + News From Folding@Home: Welcome to Folding@Home
[21:31:23] Loaded queue successfully.
[21:31:23] Connecting to http://171.64.65.64:8080/
[21:31:29] Posted data.
[21:31:29] Initial: 0000; - Receiving payload (expected size: 4813215)
[21:31:42] - Downloaded at ~361 kB/s
[21:31:42] - Averaged speed for that direction ~361 kB/s
[21:31:42] + Received work.
[21:31:42] + Closed connections
[21:31:42]
[21:31:42] + Processing work unit
[21:31:42] Work type a1 not eligible for variable processors
[21:31:42] Core required: FahCore_a1.exe
[21:31:42] Core found.
[21:31:42] Using generic mpiexec calls
[21:31:42] Working on queue slot 01 [December 20 21:31:42 UTC]
[21:31:42] + Working ...
[21:31:42] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 01 -checkpoint 15 -verbose -lifeline 2388 -version 622'
[21:31:42]
[21:31:42] *------------------------------*
[21:31:42] Folding@Home Gromacs SMP Core
[21:31:42] Version 1.74 (March 10, 2007)
[21:31:42]
[21:31:42] Preparing to commence simulation
[21:31:42] - Ensuring status. Please wait.
[21:31:47] - Starting from initial work packet
[21:31:47]
[21:31:47] Project: 2665 (Run 2, Clone 597, Gen 68)
[21:31:47]
[21:31:47] Assembly optimizations on if available.
[21:31:47] Entering M.D.
[21:32:08] percent)
[21:32:08] - Starting from initial work packet
[21:32:08]
[21:32:08] Project: 2665 (Run 2, Clone 597, Gen 68)
[21:32:08]
[21:32:09] Entering M.D.
[21:32:16] Rejecting checkpoint
[21:32:18] NaN detected: x[28][1]=5.28945 v[28][1]=NaN
[21:32:18] utdown: BAD_CORE_FILES
[21:32:18] Finalizing output
[21:34:18] ES
[21:34:18]
[21:34:18] Folding@home Core Shutdown: BAD_CORE_FILES
[21:34:18] olding@home Core Shutdown: BAD_CORE_FILES
[21:34:18] Finalizing output
[21:36:21] CoreStatus = 1 (1)
[21:36:21] Client-core communications error: ERROR 0x1
[21:36:21] This is a sign of more serious problems, shutting down.
[03:31:23] - Autosending finished units... [December 21 03:31:23 UTC]
[03:31:23] Trying to send all finished work units
[03:31:23] + No unsent completed units remaining.
[03:31:23] - Autosend completed
--- Opening Log file [December 21 05:05:20 UTC]
# Windows SMP Console Edition #################################################
###############################################################################
Folding@Home Client Version 6.22 SMP Beta2
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: C:\Program Files (x86)\Folding@home\Folding@Home-SMP
Executable: C:\Program Files (x86)\Folding@home\Folding@Home-SMP\FAH-SMP.exe
Arguments: -smp -verbosity 9
[05:05:20] - Ask before connecting: No
[05:05:20] - User name: ParrLeyne (Team 0)
[05:05:20] - User ID: 1C537EE405B02D03
[05:05:20] - Machine ID: 2
[05:05:20]
[05:05:20] Loaded queue successfully.
[05:05:20]
[05:05:20] - Autosending finished units... [December 21 05:05:20 UTC]
[05:05:20] + Processing work unit
[05:05:20] Trying to send all finished work units
[05:05:20] Work type a1 not eligible for variable processors
[05:05:20] + No unsent completed units remaining.
[05:05:20] Core required: FahCore_a1.exe
[05:05:20] - Autosend completed
[05:05:20] Core found.
[05:05:20] Using generic mpiexec calls
[05:05:20] Working on queue slot 01 [December 21 05:05:20 UTC]
[05:05:20] + Working ...
[05:05:20] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 01 -checkpoint 15 -verbose -lifeline 2532 -version 622'
[05:05:20]
[05:05:20] *------------------------------*
[05:05:20] Folding@Home Gromacs SMP Core
[05:05:20] Version 1.74 (March 10, 2007)
[05:05:20]
[05:05:20] Preparing to commence simulation
[05:05:20] - Ensuring status. Please wait.
[05:05:24] - Starting from initial work packet
[05:05:24]
[05:05:24] Project: 2665 (Run 2, Clone 597, Gen 68)
[05:05:24]
[05:05:25] Assembly optimizations on if available.
[05:05:25] Entering M.D.
[05:05:45] al work packet
[05:05:46]
[05:05:46] Project: 2665 (Run 2, Clone 597, Gen 68)
[05:05:46]
[05:05:47] 65 (Run 2, Clone 597, Gen 68)
[05:05:47]
[05:05:47] Entering M.D.
[05:05:54] Rejecting checkpoint
[05:05:56] NaN detected: x[28][1]=5.28945 v[28][1]=NaN
[05:05:56] utdown: BAD_CORE_FILES
[05:05:56] Finalizing output
[05:07:30] Killing all core threads
[05:07:30] Killing 2 cores
[05:07:30] Killing core 0
[05:07:30] Killing core 1
Folding@Home Client Shutdown at user request.
[05:07:30] ***** Got a SIGTERM signal (2)
[05:07:30] Killing all core threads
[05:07:30] Killing 2 cores
[05:07:30] Killing core 0
[05:07:30] Killing core 1
Folding@Home Client Shutdown.
-
- Posts: 438
- Joined: Mon Dec 03, 2007 1:31 am
- Hardware configuration: Old Faithful CPU: Windows Graphical 5.03; Intel Pentium 4 Processor 540
(3.2GHz) HT;Windows XP
Big Red: Windows SMP Console 6.29; Windows GPU console 6.20r1; Intel Q9450 2.66G; ASUS P5Q 775 P45; [BFG 9800GTX+ old graphics card] NVidia GeForce 8800 GTX [as of 5/9/09]; Windows XP Pro SP3
Lenovo Think Pad: Windows 6.29 w/ SMP; Windows GPU Console 6.20r1 systray; Intel QX9300; NVIDIA Quadro FX-3700M; Windows XP Professional - Location: SF Peninsula
Project: 2665 (Run 3, Clone 689, Gen 46)
It's baaaccckkkkk..... Immediate bad core report. Got it three times (per normal) and then downloaded another 2665, which completed successfully without downloading another core.
Code: Select all
[12:12:48] + Closed connections
[12:12:48]
[12:12:48] + Processing work unit
[12:12:48] Work type a1 not eligible for variable processors
[12:12:48] Core required: FahCore_a1.exe
[12:12:48] Core found.
[12:12:48] Using generic mpiexec calls
[12:12:48] Working on queue slot 00 [April 26 12:12:48 UTC]
[12:12:48] + Working ...
[12:12:48] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 00 -checkpoint 15 -verbose -lifeline 4332 -version 623'
[12:12:50]
[12:12:50] *------------------------------*
[12:12:50] Folding@Home Gromacs SMP Core
[12:12:50] Version 1.74 (March 10, 2007)
[12:12:50]
[12:12:50] Preparing to commence simulation
[12:12:50] - Ensuring status. Please wait- Created dyn
[12:12:50] - Files status OK
[12:12:59] - Expanded 4835541 -> 24426905 (decompressed 505.1 percent)
[12:12:59] - Starting from initial work packet
[12:12:59]
[12:12:59] Project: 2665 (Run 3, Clone 689, Gen 46)
[12:12:59]
[12:13:00] Assembly optimizations on if available.
[12:13:00] Entering M.D.
[12:13:16] al work packet
[12:13:16]
[12:13:16] Project: 2665 (Run 3, Clone 689, Gen 46)
[12:13:16]
[12:13:17] Entering M.D.
[12:13:27] NaN detected: x[23838][2]=5.39614 v[23838][2]=NaN
[12:15:27] g@home Core Shutdown: BAD_CORE_FILES
[12:15:27] _FILES
[12:15:27] Finalizing output
[12:15:27] tdown: BAD_CORE_FILES
[12:15:27] Finalizing output
[12:15:27] 1][1]=NaN
[12:17:27]
[12:17:27] Folding@home Core Shutdown: BAD_CORE_FILES
[12:17:27] Finalizing output
[12:17:31] CoreStatus = 1 (1)
[12:17:31] Sending work to server
[12:17:31] Project: 2665 (Run 3, Clone 689, Gen 46)
[12:17:31] - Error: Could not get length of results file work/wuresults_00.dat
[12:17:31] - Error: Could not read unit 00 file. Removing from queue.
[12:17:31] Trying to send all finished work units
[12:17:31] + No unsent completed units remaining.
[12:17:31] - Preparing to get new work unit...
[12:17:32] + Attempting to get work packet
[12:17:32] - Will indicate memory of 2553 MB
[12:17:32] - Connecting to assignment server
[12:17:32] Connecting to http://assign.stanford.edu:8080/
[12:17:32] Posted data.
[12:17:32] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[12:17:32] + News From Folding@Home: Welcome to Folding@Home
[12:17:32] Loaded queue successfully.
[12:17:32] Connecting to http://171.64.65.64:8080/
[12:17:38] Posted data.
[12:17:38] Initial: 0000; - Receiving payload (expected size: 4836053)
[12:17:48] - Downloaded at ~472 kB/s
[12:17:48] - Averaged speed for that direction ~381 kB/s
[12:17:48] + Received work.
[12:17:48] Trying to send all finished work units
[12:17:48] + No unsent completed units remaining.
[12:17:49] + Closed connections
[12:17:54]
[12:17:54] + Processing work unit
[12:17:54] Work type a1 not eligible for variable processors
[12:17:54] Core required: FahCore_a1.exe
[12:17:54] Core found.
[12:17:54] Using generic mpiexec calls
[12:17:54] Working on queue slot 01 [April 26 12:17:54 UTC]
[12:17:54] + Working ...
[12:17:54] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 01 -checkpoint 15 -verbose -lifeline 4332 -version 623'
[12:17:56]
[12:17:56] *------------------------------*
[12:17:56] Folding@Home Gromacs SMP Core
[12:17:56] Version 1.74 (March 10, 2007)
[12:17:56]
[12:17:56] Preparing to commence simulation
[12:17:56] - Ensuring status. Please wait.
[12:18:01] - Starting from initial work packet
[12:18:01]
[12:18:01] Project: 2665 (Run 3, Clone 689, Gen 46)
[12:18:01]
[12:18:02] Assembly optimizations on if available.
[12:18:02] Entering M.D.
[12:18:25] al work packet
[12:18:25]
[12:18:25] Project: 2665 (Run 3, Clone 689, Gen 46)
[12:18:25]
[12:18:25] 65 (Run 3, Clone 689, Gen 46)
[12:18:25]
[12:18:27] Entering M.D.
[12:18:34] Rejecting checkpoint
[12:18:35] NaN detected: x[23838][2]=5.39614 v[23838][2]=NaN
[12:18:35] utdown: BAD_CORE_FILES
[12:18:36] Finalizing output
[12:20:36] ES
[12:20:36]
[12:20:36] Folding@home Core Shutdown: BAD_CORE_FILES
[12:20:36] aN
[12:20:36]
[12:20:36] Folding@home Core Shutdown: BAD_CORE_FILES
[12:20:36] Finalizing output
[12:22:40] CoreStatus = 1 (1)
[12:22:40] Sending work to server
[12:22:40] Project: 2665 (Run 3, Clone 689, Gen 46)
[12:22:40] - Error: Could not get length of results file work/wuresults_01.dat
[12:22:40] - Error: Could not read unit 01 file. Removing from queue.
[12:22:40] Trying to send all finished work units
[12:22:41] + No unsent completed units remaining.
[12:22:41] - Preparing to get new work unit...
[12:22:41] + Attempting to get work packet
[12:22:41] - Will indicate memory of 2553 MB
[12:22:41] - Connecting to assignment server
[12:22:41] Connecting to http://assign.stanford.edu:8080/
[12:22:41] Posted data.
[12:22:41] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[12:22:41] + News From Folding@Home: Welcome to Folding@Home
[12:22:41] Loaded queue successfully.
[12:22:41] Connecting to http://171.64.65.64:8080/
[12:22:46] Posted data.
[12:22:46] Initial: 0000; - Receiving payload (expected size: 4836053)
[12:22:57] - Downloaded at ~472 kB/s
[12:22:57] - Averaged speed for that direction ~399 kB/s
[12:22:57] + Received work.
[12:22:58] Trying to send all finished work units
[12:22:58] + No unsent completed units remaining.
[12:22:58] + Closed connections
[12:23:03]
[12:23:03] + Processing work unit
[12:23:03] Work type a1 not eligible for variable processors
[12:23:03] Core required: FahCore_a1.exe
[12:23:03] Core found.
[12:23:03] Using generic mpiexec calls
[12:23:03] Working on queue slot 02 [April 26 12:23:03 UTC]
[12:23:03] + Working ...
[12:23:03] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 02 -checkpoint 15 -verbose -lifeline 4332 -version 623'
[12:23:06]
[12:23:06] *------------------------------*
[12:23:06] Folding@Home Gromacs SMP Core
[12:23:06] Version 1.74 (March 10, 2007)
[12:23:06]
[12:23:06] Preparing to commence simulation
[12:23:06] - Ensuring status. Please wait- Created dyn
[12:23:06] - Files status OK
[12:23:12] - Expanded 4835541 -> 24426905 (decompressed 505.1 percent)
[12:23:12] - Starting from initial work packet
[12:23:12]
[12:23:12] Project: 2665 (Run 3, Clone 689, Gen 46)
[12:23:12]
[12:23:12] Assembly optimizations on if available.
[12:23:12] Entering M.D.
[12:23:35] percent)
[12:23:35] cket
[12:23:35]
[12:23:35] Project: 2665 (Run 3, Clone 689, Gen 46)
[12:23:35]
[12:23:35] 65 (Run 3, Clone 689, Gen 46)
[12:23:35]
[12:23:37] Entering M.D.
[12:23:45] ed: x[23838][2]=5.39614 v[23838][2]=NaN
[12:23:45] 8][2]=NaN
[12:23:45]
[12:23:45] Folding@home Core Shutdown: BAD_CORE_FILES
[12:23:45] Finalizing output
[12:25:45] BAD_CORE_FILES
[12:25:46] 34971][1]=6.08422 v[34971][1]=NaN
[12:25:46]
[12:25:46] Folding@home Core Shutdown: BAD_CORE_FILES
[12:25:46] Finalizing output
[12:27:50] CoreStatus = 1 (1)
[12:27:50] Sending work to server
[12:27:51] Project: 2665 (Run 3, Clone 689, Gen 46)
[12:27:51] - Error: Could not get length of results file work/wuresults_02.dat
[12:27:51] - Error: Could not read unit 02 file. Removing from queue.
[12:27:51] Trying to send all finished work units
[12:27:51] + No unsent completed units remaining.
[12:27:51] - Preparing to get new work unit...
[12:27:51] + Attempting to get work packet
[12:27:51] - Will indicate memory of 2553 MB
[12:27:51] - Connecting to assignment server
[12:27:51] Connecting to http://assign.stanford.edu:8080/
[12:27:51] Posted data.
[12:27:51] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[12:27:51] + News From Folding@Home: Welcome to Folding@Home
[12:27:51] Loaded queue successfully.
[12:27:51] Connecting to http://171.64.65.64:8080/
[12:27:52] Posted data.
[12:27:52] Initial: 0000; - Error: Bad packet type from server, expected work assignment
[12:27:52] - Attempt #1 to get work failed, and no other work to do.
Waiting before retry.
[12:28:00] + Attempting to get work packet
[12:28:01] - Will indicate memory of 2553 MB
[12:28:01] - Connecting to assignment server
[12:28:01] Connecting to http://assign.stanford.edu:8080/
[12:28:01] Posted data.
[12:28:01] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[12:28:01] + News From Folding@Home: Welcome to Folding@Home
[12:28:01] Loaded queue successfully.
[12:28:01] Connecting to http://171.64.65.64:8080/
[12:28:07] Posted data.
[12:28:07] Initial: 0000; - Receiving payload (expected size: 4752564)
[12:28:17] - Downloaded at ~464 kB/s
[12:28:17] - Averaged speed for that direction ~412 kB/s
[12:28:17] + Received work.
[12:28:17] Trying to send all finished work units
[12:28:17] + No unsent completed units remaining.
[12:28:17] + Closed connections
[12:28:22]
[12:28:22] + Processing work unit
[12:28:22] Work type a1 not eligible for variable processors
[12:28:22] Core required: FahCore_a1.exe
[12:28:22] Core found.
[12:28:22] Using generic mpiexec calls
[12:28:22] Working on queue slot 03 [April 26 12:28:22 UTC]
[12:28:22] + Working ...
[12:28:22] - Calling 'mpiexec -np 4 -channel auto -host 127.0.0.1 FahCore_a1.exe -dir work/ -suffix 03 -checkpoint 15 -verbose -lifeline 4332 -version 623'
[12:28:24]
[12:28:24] *------------------------------*
[12:28:24] Folding@Home Gromacs SMP Core
[12:28:24] Version 1.74 (March 10, 2007)
[12:28:24]
[12:28:24] Preparing to commence simulation
[12:28:24] - Ensuring status. Please wait- Created dyn
[12:28:24] - Files status OK
[12:28:30] - Expanded 4752052 -> 24426905 (decompressed 514.0 percent)
[12:28:30] - Starting from initial work packet
[12:28:30]
[12:28:30] Project: 2665 (Run 3, Clone 399, Gen 107)
-
- Site Moderator
- Posts: 6359
- Joined: Sun Dec 02, 2007 10:38 am
- Location: Bordeaux, France
- Contact:
Re: Project: 2665 (Run 3, Clone 689, Gen 46) immediate NaN error
I've marked it as bad again ...
-
- Posts: 438
- Joined: Mon Dec 03, 2007 1:31 am
- Hardware configuration: Old Faithful CPU: Windows Graphical 5.03; Intel Pentium 4 Processor 540
(3.2GHz) HT;Windows XP
Big Red: Windows SMP Console 6.29; Windows GPU console 6.20r1; Intel Q9450 2.66G; ASUS P5Q 775 P45; [BFG 9800GTX+ old graphics card] NVidia GeForce 8800 GTX [as of 5/9/09]; Windows XP Pro SP3
Lenovo Think Pad: Windows 6.29 w/ SMP; Windows GPU Console 6.20r1 systray; Intel QX9300; NVIDIA Quadro FX-3700M; Windows XP Professional - Location: SF Peninsula
Re: Project: 2665 (Run 3, Clone 689, Gen 46) immediate NaN error
Thanks, toTOW. I take it I should go ahead and drag an old thread back up if the WU is the same? I debated which was better, to resurrect [so to speak, this is more like a zombie] from Oct/Dec or to start again.