Project: 5766 (Run 3, Clone 238, Gen 695)

Moderators: Site Moderators, FAHC Science Team

Post Reply
CrustyCat
Posts: 14
Joined: Sun Jun 28, 2009 10:10 pm

Project: 5766 (Run 3, Clone 238, Gen 695)

Post by CrustyCat »

I got this with this wu.

Code: Select all

23:23:22] Trying to send all finished work units
[23:23:22] + No unsent completed units remaining.
[23:23:22] - Preparing to get new work unit...
[23:23:22] + Attempting to get work packet
[23:23:22] - Will indicate memory of 8188 MB
[23:23:22] - Connecting to assignment server
[23:23:22] Connecting to http://assign-GPU.stanford.edu:8080/
[23:23:22] Posted data.
[23:23:22] Initial: 43AB; - Successful: assigned to (171.67.108.11).
[23:23:22] + News From Folding@Home: Welcome to Folding@Home
[23:23:23] Loaded queue successfully.
[23:23:23] Connecting to http://171.67.108.11:8080/
[23:23:23] Posted data.
[23:23:23] Initial: 0000; - Receiving payload (expected size: 47231)
[23:23:23] Conversation time very short, giving reduced weight in bandwidth avg
[23:23:23] - Downloaded at ~92 kB/s
[23:23:23] - Averaged speed for that direction ~129 kB/s
[23:23:23] + Received work.
[23:23:23] Trying to send all finished work units
[23:23:23] + No unsent completed units remaining.
[23:23:23] + Closed connections
[23:23:23] 
[23:23:23] + Processing work unit
[23:23:23] Core required: FahCore_11.exe
[23:23:23] Core found.
[23:23:23] Working on queue slot 01 [August 7 23:23:23 UTC]
[23:23:23] + Working ...
[23:23:23] - Calling '.\FahCore_11.exe -dir work/ -suffix 01 -priority 96 -nocpulock -checkpoint 15 -verbose -lifeline 2576 -version 623'

[23:23:23] 
[23:23:23] *------------------------------*
[23:23:23] Folding@Home GPU Core
[23:23:23] Version 1.27 (Thu Jun 18 14:02:10 PDT 2009)
[23:23:23] 
[23:23:23] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[23:23:23] Build host: amoeba
[23:23:23] Board Type: Nvidia
[23:23:23] Core      : 
[23:23:23] Preparing to commence simulation
[23:23:23] - Looking at optimizations...
[23:23:23] DeleteFrameFiles: successfully deleted file=work/wudata_01.ckp
[23:23:23] - Created dyn
[23:23:23] - Files status OK
[23:23:23] - Expanded 46719 -> 252912 (decompressed 541.3 percent)
[23:23:23] Called DecompressByteArray: compressed_data_size=46719 data_size=252912, decompressed_data_size=252912 diff=0
[23:23:23] - Digital signature verified
[23:23:23] 
[23:23:23] Project: 5766 (Run 3, Clone 238, Gen 695)
[23:23:23] 
[23:23:23] Assembly optimizations on if available.
[23:23:23] Entering M.D.
[23:23:29] Tpr hash work/wudata_01.tpr:  4039675324 1869098365 1272378941 1320969106 2898386213
[23:23:29] 
[23:23:29] Calling fah_main args: 14 usage=85
[23:23:29] 
[23:23:30] Working on Protein
[23:23:30] Client config found, loading data.
[23:23:30] Starting GUI Server
[23:23:30] mdrun_gpu returned 
[23:23:30] NANs detected on GPU
[23:23:30] 
[23:23:30] Folding@home Core Shutdown: UNSTABLE_MACHINE
[23:23:33] CoreStatus = 7A (122)
[23:23:33] Sending work to server
[23:23:33] Project: 5766 (Run 3, Clone 238, Gen 695)
[23:23:33] - Read packet limit of 540015616... Set to 524286976.
[23:23:33] - Error: Could not get length of results file work/wuresults_01.dat
[23:23:33] - Error: Could not read unit 01 file. Removing from queue.
[23:23:33] Trying to send all finished work units
[23:23:33] + No unsent completed units remaining.
[23:23:33] - Preparing to get new work unit...
[23:23:33] + Attempting to get work packet
[23:23:33] - Will indicate memory of 8188 MB
[23:23:33] - Connecting to assignment server
[23:23:33] Connecting to http://assign-GPU.stanford.edu:8080/
[23:23:34] Posted data.
[23:23:34] Initial: 43AB; - Successful: assigned to (171.67.108.11).
[23:23:34] + News From Folding@Home: Welcome to Folding@Home
[23:23:34] Loaded queue successfully.
[23:23:34] Connecting to http://171.67.108.11:8080/
[23:23:34] Posted data.
[23:23:34] Initial: 0000; - Receiving payload (expected size: 47231)
[23:23:34] Conversation time very short, giving reduced weight in bandwidth avg
[23:23:34] - Downloaded at ~92 kB/s
[23:23:34] - Averaged speed for that direction ~125 kB/s
[23:23:34] + Received work.
[23:23:34] Trying to send all finished work units
[23:23:34] + No unsent completed units remaining.
[23:23:34] + Closed connections
[23:23:39] 
[23:23:39] + Processing work unit
[23:23:39] Core required: FahCore_11.exe
[23:23:39] Core found.
[23:23:39] Working on queue slot 02 [August 7 23:23:39 UTC]
[23:23:39] + Working ...
[23:23:39] - Calling '.\FahCore_11.exe -dir work/ -suffix 02 -priority 96 -nocpulock -checkpoint 15 -verbose -lifeline 2576 -version 623'

[23:23:39] 
[23:23:39] *------------------------------*
[23:23:39] Folding@Home GPU Core
[23:23:39] Version 1.27 (Thu Jun 18 14:02:10 PDT 2009)
[23:23:39] 
[23:23:39] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[23:23:39] Build host: amoeba
[23:23:39] Board Type: Nvidia
[23:23:39] Core      : 
[23:23:39] Preparing to commence simulation
[23:23:39] - Looking at optimizations...
[23:23:39] DeleteFrameFiles: successfully deleted file=work/wudata_02.ckp
[23:23:39] - Created dyn
[23:23:39] - Files status OK
[23:23:39] - Expanded 46719 -> 252912 (decompressed 541.3 percent)
[23:23:39] Called DecompressByteArray: compressed_data_size=46719 data_size=252912, decompressed_data_size=252912 diff=0
[23:23:39] - Digital signature verified
[23:23:39] 
[23:23:39] Project: 5766 (Run 3, Clone 238, Gen 695)
[23:23:39] 
[23:23:39] Assembly optimizations on if available.
[23:23:39] Entering M.D.
[23:23:46] Tpr hash work/wudata_02.tpr:  4039675324 1869098365 1272378941 1320969106 2898386213
[23:23:46] 
[23:23:46] Calling fah_main args: 14 usage=85
[23:23:46] 
[23:23:46] Working on Protein
[23:23:47] Client config found, loading data.
[23:23:47] Starting GUI Server
[23:23:47] mdrun_gpu returned 
[23:23:47] NANs detected on GPU
[23:23:47] 
[23:23:47] Folding@home Core Shutdown: UNSTABLE_MACHINE
[23:23:50] CoreStatus = 7A (122)
[23:23:50] Sending work to server
[23:23:50] Project: 5766 (Run 3, Clone 238, Gen 695)
[23:23:50] - Read packet limit of 540015616... Set to 524286976.
[23:23:50] - Error: Could not get length of results file work/wuresults_02.dat
[23:23:50] - Error: Could not read unit 02 file. Removing from queue.
[23:23:50] Trying to send all finished work units
[23:23:50] + No unsent completed units remaining.
[23:23:50] - Preparing to get new work unit...
[23:23:50] + Attempting to get work packet
[23:23:50] - Will indicate memory of 8188 MB
[23:23:50] - Connecting to assignment server
[23:23:50] Connecting to http://assign-GPU.stanford.edu:8080/
[23:23:50] Posted data.
[23:23:50] Initial: 43AB; - Successful: assigned to (171.67.108.11).
[23:23:50] + News From Folding@Home: Welcome to Folding@Home
[23:23:50] Loaded queue successfully.
[23:23:50] Connecting to http://171.67.108.11:8080/
[23:23:50] Posted data.
[23:23:50] Initial: 0000; - Receiving payload (expected size: 47231)
[23:23:51] - Downloaded at ~46 kB/s
[23:23:51] - Averaged speed for that direction ~109 kB/s
[23:23:51] + Received work.
[23:23:51] Trying to send all finished work units
[23:23:51] + No unsent completed units remaining.
[23:23:51] + Closed connections
[23:23:56] 
[23:23:56] + Processing work unit
[23:23:56] Core required: FahCore_11.exe
[23:23:56] Core found.
[23:23:56] Working on queue slot 03 [August 7 23:23:56 UTC]
[23:23:56] + Working ...
[23:23:56] - Calling '.\FahCore_11.exe -dir work/ -suffix 03 -priority 96 -nocpulock -checkpoint 15 -verbose -lifeline 2576 -version 623'

[23:23:56] 
[23:23:56] *------------------------------*
[23:23:56] Folding@Home GPU Core
[23:23:56] Version 1.27 (Thu Jun 18 14:02:10 PDT 2009)
[23:23:56] 
[23:23:56] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[23:23:56] Build host: amoeba
[23:23:56] Board Type: Nvidia
[23:23:56] Core      : 
[23:23:56] Preparing to commence simulation
[23:23:56] - Looking at optimizations...
[23:23:56] DeleteFrameFiles: successfully deleted file=work/wudata_03.ckp
[23:23:56] - Created dyn
[23:23:56] - Files status OK
[23:23:56] - Expanded 46719 -> 252912 (decompressed 541.3 percent)
[23:23:56] Called DecompressByteArray: compressed_data_size=46719 data_size=252912, decompressed_data_size=252912 diff=0
[23:23:56] - Digital signature verified
[23:23:56] 
[23:23:56] Project: 5766 (Run 3, Clone 238, Gen 695)
[23:23:56] 
[23:23:56] Assembly optimizations on if available.
[23:23:56] Entering M.D.
[23:24:02] Tpr hash work/wudata_03.tpr:  4039675324 1869098365 1272378941 1320969106 2898386213
[23:24:02] 
[23:24:02] Calling fah_main args: 14 usage=85
[23:24:02] 
[23:24:02] Working on Protein
[23:24:03] Client config found, loading data.
[23:24:03] mdrun_gpu returned 
[23:24:03] NANs detected on GPU
[23:24:03] 
[23:24:03] Folding@home Core Shutdown: UNSTABLE_MACHINE
[23:24:06] CoreStatus = 7A (122)
[23:24:06] Sending work to server
[23:24:06] Project: 5766 (Run 3, Clone 238, Gen 695)
[23:24:06] - Read packet limit of 540015616... Set to 524286976.
[23:24:06] - Error: Could not get length of results file work/wuresults_03.dat
[23:24:06] - Error: Could not read unit 03 file. Removing from queue.
[23:24:06] Trying to send all finished work units
[23:24:06] + No unsent completed units remaining.
[23:24:06] - Preparing to get new work unit...
[23:24:06] + Attempting to get work packet
[23:24:06] - Will indicate memory of 8188 MB
[23:24:06] - Connecting to assignment server
[23:24:06] Connecting to http://assign-GPU.stanford.edu:8080/
[23:24:06] Posted data.
[23:24:06] Initial: 43AB; - Successful: assigned to (171.67.108.11).
[23:24:06] + News From Folding@Home: Welcome to Folding@Home
[23:24:07] Loaded queue successfully.
[23:24:07] Connecting to http://171.67.108.11:8080/
[23:24:07] Posted data.
[23:24:07] Initial: 0000; - Receiving payload (expected size: 47231)
[23:24:07] Conversation time very short, giving reduced weight in bandwidth avg
[23:24:07] - Downloaded at ~92 kB/s
[23:24:07] - Averaged speed for that direction ~107 kB/s
[23:24:07] + Received work.
[23:24:07] Trying to send all finished work units
[23:24:07] + No unsent completed units remaining.
[23:24:07] + Closed connections
[23:24:12] 
[23:24:12] + Processing work unit
[23:24:12] Core required: FahCore_11.exe
[23:24:12] Core found.
[23:24:12] Working on queue slot 04 [August 7 23:24:12 UTC]
[23:24:12] + Working ...
[23:24:12] - Calling '.\FahCore_11.exe -dir work/ -suffix 04 -priority 96 -nocpulock -checkpoint 15 -verbose -lifeline 2576 -version 623'

[23:24:12] 
[23:24:12] *------------------------------*
[23:24:12] Folding@Home GPU Core
[23:24:12] Version 1.27 (Thu Jun 18 14:02:10 PDT 2009)
[23:24:12] 
[23:24:12] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[23:24:12] Build host: amoeba
[23:24:12] Board Type: Nvidia
[23:24:12] Core      : 
[23:24:12] Preparing to commence simulation
[23:24:12] - Looking at optimizations...
[23:24:12] DeleteFrameFiles: successfully deleted file=work/wudata_04.ckp
[23:24:12] - Created dyn
[23:24:12] - Files status OK
[23:24:12] - Expanded 46719 -> 252912 (decompressed 541.3 percent)
[23:24:12] Called DecompressByteArray: compressed_data_size=46719 data_size=252912, decompressed_data_size=252912 diff=0
[23:24:12] - Digital signature verified
[23:24:12] 
[23:24:12] Project: 5766 (Run 3, Clone 238, Gen 695)
[23:24:12] 
[23:24:12] Assembly optimizations on if available.
[23:24:12] Entering M.D.
[23:24:18] Tpr hash work/wudata_04.tpr:  4039675324 1869098365 1272378941 1320969106 2898386213
[23:24:18] 
[23:24:18] Calling fah_main args: 14 usage=85
[23:24:18] 
[23:24:18] Working on Protein
[23:24:19] Client config found, loading data.
[23:24:19] Starting GUI Server
[23:24:19] mdrun_gpu returned 
[23:24:19] NANs detected on GPU
[23:24:19] 
[23:24:19] Folding@home Core Shutdown: UNSTABLE_MACHINE
[23:24:22] CoreStatus = 7A (122)
[23:24:22] Sending work to server
[23:24:22] Project: 5766 (Run 3, Clone 238, Gen 695)
[23:24:22] - Read packet limit of 540015616... Set to 524286976.
[23:24:22] - Error: Could not get length of results file work/wuresults_04.dat
[23:24:22] - Error: Could not read unit 04 file. Removing from queue.
[23:24:22] Trying to send all finished work units
[23:24:22] + No unsent completed units remaining.
[23:24:22] - Preparing to get new work unit...
[23:24:22] + Attempting to get work packet
[23:24:22] - Will indicate memory of 8188 MB
[23:24:22] - Connecting to assignment server
[23:24:22] Connecting to http://assign-GPU.stanford.edu:8080/
[23:24:23] Posted data.
[23:24:23] Initial: 43AB; - Successful: assigned to (171.67.108.11).
[23:24:23] + News From Folding@Home: Welcome to Folding@Home
[23:24:23] Loaded queue successfully.
[23:24:23] Connecting to http://171.67.108.11:8080/
[23:24:23] Posted data.
[23:24:23] Initial: 0000; - Receiving payload (expected size: 47231)
[23:24:23] Conversation time very short, giving reduced weight in bandwidth avg
[23:24:23] - Downloaded at ~92 kB/s
[23:24:23] - Averaged speed for that direction ~105 kB/s
[23:24:23] + Received work.
[23:24:23] Trying to send all finished work units
[23:24:23] + No unsent completed units remaining.
[23:24:23] + Closed connections
[23:24:28] 
[23:24:28] + Processing work unit
[23:24:28] Core required: FahCore_11.exe
[23:24:28] Core found.
[23:24:28] Working on queue slot 05 [August 7 23:24:28 UTC]
[23:24:28] + Working ...
[23:24:28] - Calling '.\FahCore_11.exe -dir work/ -suffix 05 -priority 96 -nocpulock -checkpoint 15 -verbose -lifeline 2576 -version 623'

[23:24:28] 
[23:24:28] *------------------------------*
[23:24:28] Folding@Home GPU Core
[23:24:28] Version 1.27 (Thu Jun 18 14:02:10 PDT 2009)
[23:24:28] 
[23:24:28] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[23:24:28] Build host: amoeba
[23:24:28] Board Type: Nvidia
[23:24:28] Core      : 
[23:24:28] Preparing to commence simulation
[23:24:28] - Looking at optimizations...
[23:24:28] DeleteFrameFiles: successfully deleted file=work/wudata_05.ckp
[23:24:28] - Created dyn
[23:24:28] - Files status OK
[23:24:28] - Expanded 46719 -> 252912 (decompressed 541.3 percent)
[23:24:28] Called DecompressByteArray: compressed_data_size=46719 data_size=252912, decompressed_data_size=252912 diff=0
[23:24:28] - Digital signature verified
[23:24:28] 
[23:24:28] Project: 5766 (Run 3, Clone 238, Gen 695)
[23:24:28] 
[23:24:28] Assembly optimizations on if available.
[23:24:28] Entering M.D.
[23:24:34] Tpr hash work/wudata_05.tpr:  4039675324 1869098365 1272378941 1320969106 2898386213
[23:24:34] 
[23:24:34] Calling fah_main args: 14 usage=85
[23:24:34] 
[23:24:34] Working on Protein
[23:24:35] Client config found, loading data.
[23:24:35] mdrun_gpu returned 
[23:24:35] NANs detected on GPU
[23:24:35] 
[23:24:35] Folding@home Core Shutdown: UNSTABLE_MACHINE
[23:24:39] CoreStatus = 7A (122)
[23:24:39] Sending work to server
[23:24:39] Project: 5766 (Run 3, Clone 238, Gen 695)
[23:24:39] - Read packet limit of 540015616... Set to 524286976.
[23:24:39] - Error: Could not get length of results file work/wuresults_05.dat
[23:24:39] - Error: Could not read unit 05 file. Removing from queue.
[23:24:39] EUE limit exceeded. Pausing 24 hours
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Project: 5766 (Run 3, Clone 238, Gen 695)

Post by bruce »

CrustyCat wrote:I got this with this wu.
Are you asking for help? I see it was reassigned 5 times and failed at least 4.
CrustyCat
Posts: 14
Joined: Sun Jun 28, 2009 10:10 pm

Re: Project: 5766 (Run 3, Clone 238, Gen 695)

Post by CrustyCat »

Just lately I've been getting these NANS with the 353 point wu's. Everything else is running fine, but these just don't seem to want to run.
Post Reply