Project: 5791 (Run 0, Clone 318, Gen 9)

Posted: Tue Oct 06, 2009 4:42 pm
by Hyperlife
Had this one fail repeatedly with SHAKE violation errors at the start. System is otherwise stable (no OC, no history of EUEs).

[15:56:44] - Preparing to get new work unit...
[15:56:44] + Attempting to get work packet
[15:56:44] - Will indicate memory of 2013 MB
[15:56:44] - Connecting to assignment server
[15:56:44] Connecting to http://assign-GPU.stanford.edu:8080/
[15:56:44] Posted data.
[15:56:44] Initial: 40AB; - Successful: assigned to (171.64.65.106).
[15:56:44] + News From Folding@Home: Welcome to Folding@Home
[15:56:44] Loaded queue successfully.
[15:56:44] Connecting to http://171.64.65.106:8080/
[15:56:44] Posted data.
[15:56:44] Initial: 0000; - Receiving payload (expected size: 67911)
[15:56:44] Conversation time very short, giving reduced weight in bandwidth avg
[15:56:44] - Downloaded at ~132 kB/s
[15:56:44] - Averaged speed for that direction ~122 kB/s
[15:56:44] + Received work.
[15:56:44] Trying to send all finished work units
[15:56:44] + No unsent completed units remaining.
[15:56:44] + Closed connections
[15:56:44] 
[15:56:44] + Processing work unit
[15:56:44] Core required: FahCore_11.exe
[15:56:44] Core found.
[15:56:44] Working on queue slot 03 [October 6 15:56:44 UTC]
[15:56:44] + Working ...
[15:56:44] - Calling '.\FahCore_11.exe -dir work/ -suffix 03 -checkpoint 30 -verbose -lifeline 8 -version 623'

[15:56:45] 
[15:56:45] *------------------------------*
[15:56:45] Folding@Home GPU Core
[15:56:45] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[15:56:45] 
[15:56:45] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[15:56:45] Build host: amoeba
[15:56:45] Board Type: Nvidia
[15:56:45] Core      : 
[15:56:45] Preparing to commence simulation
[15:56:45] - Looking at optimizations...
[15:56:45] DeleteFrameFiles: successfully deleted file=work/wudata_03.ckp
[15:56:45] - Created dyn
[15:56:45] - Files status OK
[15:56:45] - Expanded 67399 -> 350744 (decompressed 520.3 percent)
[15:56:45] Called DecompressByteArray: compressed_data_size=67399 data_size=350744, decompressed_data_size=350744 diff=0
[15:56:45] - Digital signature verified
[15:56:45] 
[15:56:45] Project: 5791 (Run 0, Clone 318, Gen 9)
[15:56:45] 
[15:56:45] Assembly optimizations on if available.
[15:56:45] Entering M.D.
[15:56:51] Tpr hash work/wudata_03.tpr:  1084752786 1432926355 3632744897 2249080962 3255808895
[15:56:51] 
[15:56:51] Calling fah_main args: 14 usage=100
[15:56:51] 
[15:56:52] Working on Protein
[15:56:53] Client config found, loading data.
[15:56:53] mdrun_gpu returned 
[15:56:53] SHAKE violations on GPU
[15:56:53] 
[15:56:53] Folding@home Core Shutdown: UNSTABLE_MACHINE
[15:56:57] CoreStatus = 7A (122)
[15:56:57] Sending work to server
[15:56:57] Project: 5791 (Run 0, Clone 318, Gen 9)
[15:56:57] - Read packet limit of 540015616... Set to 524286976.
[15:56:57] - Error: Could not get length of results file work/wuresults_03.dat
[15:56:57] - Error: Could not read unit 03 file. Removing from queue.
[15:56:57] Trying to send all finished work units
[15:56:57] + No unsent completed units remaining.
[15:56:57] - Preparing to get new work unit...
[15:56:57] + Attempting to get work packet
[15:56:57] - Will indicate memory of 2013 MB
[15:56:57] - Connecting to assignment server
[15:56:57] Connecting to http://assign-GPU.stanford.edu:8080/
[15:56:57] Posted data.
[15:56:57] Initial: 40AB; - Successful: assigned to (171.64.65.106).
[15:56:57] + News From Folding@Home: Welcome to Folding@Home
[15:56:57] Loaded queue successfully.
[15:56:57] Connecting to http://171.64.65.106:8080/
[15:56:57] Posted data.
[15:56:57] Initial: 0000; - Receiving payload (expected size: 67911)
[15:56:58] - Downloaded at ~66 kB/s
[15:56:58] - Averaged speed for that direction ~111 kB/s
[15:56:58] + Received work.
[15:56:58] Trying to send all finished work units
[15:56:58] + No unsent completed units remaining.
[15:56:58] + Closed connections
[15:57:03] 
[15:57:03] + Processing work unit
[15:57:03] Core required: FahCore_11.exe
[15:57:03] Core found.
[15:57:03] Working on queue slot 04 [October 6 15:57:03 UTC]
[15:57:03] + Working ...
[15:57:03] - Calling '.\FahCore_11.exe -dir work/ -suffix 04 -checkpoint 30 -verbose -lifeline 8 -version 623'

[15:57:03] 
[15:57:03] *------------------------------*
[15:57:03] Folding@Home GPU Core
[15:57:03] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[15:57:03] 
[15:57:03] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[15:57:03] Build host: amoeba
[15:57:03] Board Type: Nvidia
[15:57:03] Core      : 
[15:57:03] Preparing to commence simulation
[15:57:03] - Looking at optimizations...
[15:57:03] DeleteFrameFiles: successfully deleted file=work/wudata_04.ckp
[15:57:03] - Created dyn
[15:57:03] - Files status OK
[15:57:03] - Expanded 67399 -> 350744 (decompressed 520.3 percent)
[15:57:03] Called DecompressByteArray: compressed_data_size=67399 data_size=350744, decompressed_data_size=350744 diff=0
[15:57:03] - Digital signature verified
[15:57:03] 
[15:57:03] Project: 5791 (Run 0, Clone 318, Gen 9)
[15:57:03] 
[15:57:03] Assembly optimizations on if available.
[15:57:03] Entering M.D.
[15:57:10] Tpr hash work/wudata_04.tpr:  1084752786 1432926355 3632744897 2249080962 3255808895
[15:57:10] 
[15:57:10] Calling fah_main args: 14 usage=100
[15:57:10] 
[15:57:10] Working on Protein
[15:57:11] Client config found, loading data.
[15:57:11] mdrun_gpu returned 
[15:57:11] SHAKE violations on GPU
[15:57:11] 
[15:57:11] Folding@home Core Shutdown: UNSTABLE_MACHINE
[15:57:15] CoreStatus = 7A (122)
[15:57:15] Sending work to server
[15:57:15] Project: 5791 (Run 0, Clone 318, Gen 9)
[15:57:15] - Read packet limit of 540015616... Set to 524286976.
[15:57:15] - Error: Could not get length of results file work/wuresults_04.dat
[15:57:15] - Error: Could not read unit 04 file. Removing from queue.
[15:57:15] Trying to send all finished work units
[15:57:15] + No unsent completed units remaining.
[15:57:15] - Preparing to get new work unit...
[15:57:15] + Attempting to get work packet
[15:57:15] - Will indicate memory of 2013 MB
[15:57:15] - Connecting to assignment server
[15:57:15] Connecting to http://assign-GPU.stanford.edu:8080/
[15:57:15] Posted data.
[15:57:15] Initial: 40AB; - Successful: assigned to (171.64.65.106).
[15:57:15] + News From Folding@Home: Welcome to Folding@Home
[15:57:15] Loaded queue successfully.
[15:57:15] Connecting to http://171.64.65.106:8080/
[15:57:16] Posted data.
[15:57:16] Initial: 0000; - Receiving payload (expected size: 67911)
[15:57:16] Conversation time very short, giving reduced weight in bandwidth avg
[15:57:16] - Downloaded at ~132 kB/s
[15:57:16] - Averaged speed for that direction ~113 kB/s
[15:57:16] + Received work.
[15:57:16] Trying to send all finished work units
[15:57:16] + No unsent completed units remaining.
[15:57:16] + Closed connections
[15:57:21] 
[15:57:21] + Processing work unit
[15:57:21] Core required: FahCore_11.exe
[15:57:21] Core found.
[15:57:21] Working on queue slot 05 [October 6 15:57:21 UTC]
[15:57:21] + Working ...
[15:57:21] - Calling '.\FahCore_11.exe -dir work/ -suffix 05 -checkpoint 30 -verbose -lifeline 8 -version 623'

[15:57:21] 
[15:57:21] *------------------------------*
[15:57:21] Folding@Home GPU Core
[15:57:21] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[15:57:21] 
[15:57:21] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[15:57:21] Build host: amoeba
[15:57:21] Board Type: Nvidia
[15:57:21] Core      : 
[15:57:21] Preparing to commence simulation
[15:57:21] - Looking at optimizations...
[15:57:21] DeleteFrameFiles: successfully deleted file=work/wudata_05.ckp
[15:57:21] - Created dyn
[15:57:21] - Files status OK
[15:57:21] - Expanded 67399 -> 350744 (decompressed 520.3 percent)
[15:57:21] Called DecompressByteArray: compressed_data_size=67399 data_size=350744, decompressed_data_size=350744 diff=0
[15:57:21] - Digital signature verified
[15:57:21] 
[15:57:21] Project: 5791 (Run 0, Clone 318, Gen 9)
[15:57:21] 
[15:57:21] Assembly optimizations on if available.
[15:57:21] Entering M.D.
[15:57:28] Tpr hash work/wudata_05.tpr:  1084752786 1432926355 3632744897 2249080962 3255808895
[15:57:28] 
[15:57:28] Calling fah_main args: 14 usage=100
[15:57:28] 
[15:57:28] Working on Protein
[15:57:29] Client config found, loading data.
[15:57:29] mdrun_gpu returned 
[15:57:29] SHAKE violations on GPU
[15:57:29] 
[15:57:29] Folding@home Core Shutdown: UNSTABLE_MACHINE
[15:57:33] CoreStatus = 7A (122)
[15:57:33] Sending work to server
[15:57:33] Project: 5791 (Run 0, Clone 318, Gen 9)
[15:57:33] - Read packet limit of 540015616... Set to 524286976.
[15:57:33] - Error: Could not get length of results file work/wuresults_05.dat
[15:57:33] - Error: Could not read unit 05 file. Removing from queue.
[15:57:33] Trying to send all finished work units
[15:57:33] + No unsent completed units remaining.
[15:57:33] - Preparing to get new work unit...
[15:57:33] + Attempting to get work packet
[15:57:33] - Will indicate memory of 2013 MB
[15:57:33] - Connecting to assignment server
[15:57:33] Connecting to http://assign-GPU.stanford.edu:8080/
[15:57:33] Posted data.
[15:57:33] Initial: 40AB; - Successful: assigned to (171.64.65.106).
[15:57:33] + News From Folding@Home: Welcome to Folding@Home
[15:57:33] Loaded queue successfully.
[15:57:33] Connecting to http://171.64.65.106:8080/
[15:57:34] Posted data.
[15:57:34] Initial: 0000; - Receiving payload (expected size: 67911)
[15:57:34] Conversation time very short, giving reduced weight in bandwidth avg
[15:57:34] - Downloaded at ~132 kB/s
[15:57:34] - Averaged speed for that direction ~115 kB/s
[15:57:34] + Received work.
[15:57:34] Trying to send all finished work units
[15:57:34] + No unsent completed units remaining.
[15:57:34] + Closed connections
[15:57:39] 
[15:57:39] + Processing work unit
[15:57:39] Core required: FahCore_11.exe
[15:57:39] Core found.
[15:57:39] Working on queue slot 06 [October 6 15:57:39 UTC]
[15:57:39] + Working ...
[15:57:39] - Calling '.\FahCore_11.exe -dir work/ -suffix 06 -checkpoint 30 -verbose -lifeline 8 -version 623'

[15:57:39] 
[15:57:39] *------------------------------*
[15:57:39] Folding@Home GPU Core
[15:57:39] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[15:57:39] 
[15:57:39] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[15:57:39] Build host: amoeba
[15:57:39] Board Type: Nvidia
[15:57:39] Core      : 
[15:57:39] Preparing to commence simulation
[15:57:39] - Looking at optimizations...
[15:57:39] DeleteFrameFiles: successfully deleted file=work/wudata_06.ckp
[15:57:39] - Created dyn
[15:57:39] - Files status OK
[15:57:39] - Expanded 67399 -> 350744 (decompressed 520.3 percent)
[15:57:39] Called DecompressByteArray: compressed_data_size=67399 data_size=350744, decompressed_data_size=350744 diff=0
[15:57:39] - Digital signature verified
[15:57:39] 
[15:57:39] Project: 5791 (Run 0, Clone 318, Gen 9)
[15:57:39] 
[15:57:39] Assembly optimizations on if available.
[15:57:39] Entering M.D.
[15:57:46] Tpr hash work/wudata_06.tpr:  1084752786 1432926355 3632744897 2249080962 3255808895
[15:57:46] 
[15:57:46] Calling fah_main args: 14 usage=100
[15:57:46] 
[15:57:46] Working on Protein
[15:57:47] Client config found, loading data.
[15:57:47] Starting GUI Server
[15:57:47] mdrun_gpu returned 
[15:57:47] SHAKE violations on GPU
[15:57:47] 
[15:57:47] Folding@home Core Shutdown: UNSTABLE_MACHINE
[15:57:51] CoreStatus = 7A (122)
[15:57:51] Sending work to server
[15:57:51] Project: 5791 (Run 0, Clone 318, Gen 9)
[15:57:51] - Read packet limit of 540015616... Set to 524286976.
[15:57:51] - Error: Could not get length of results file work/wuresults_06.dat
[15:57:51] - Error: Could not read unit 06 file. Removing from queue.
[15:57:51] Trying to send all finished work units
[15:57:51] + No unsent completed units remaining.
[15:57:51] - Preparing to get new work unit...
[15:57:51] + Attempting to get work packet
[15:57:51] - Will indicate memory of 2013 MB
[15:57:51] - Connecting to assignment server
[15:57:51] Connecting to http://assign-GPU.stanford.edu:8080/
[15:57:51] Posted data.
[15:57:51] Initial: 40AB; - Successful: assigned to (171.64.65.106).
[15:57:51] + News From Folding@Home: Welcome to Folding@Home
[15:57:51] Loaded queue successfully.
[15:57:51] Connecting to http://171.64.65.106:8080/
[15:57:52] Posted data.
[15:57:52] Initial: 0000; - Receiving payload (expected size: 67911)
[15:57:52] Conversation time very short, giving reduced weight in bandwidth avg
[15:57:52] - Downloaded at ~132 kB/s
[15:57:52] - Averaged speed for that direction ~117 kB/s
[15:57:52] + Received work.
[15:57:52] Trying to send all finished work units
[15:57:52] + No unsent completed units remaining.
[15:57:52] + Closed connections
[15:57:57] 
[15:57:57] + Processing work unit
[15:57:57] Core required: FahCore_11.exe
[15:57:57] Core found.
[15:57:57] Working on queue slot 07 [October 6 15:57:57 UTC]
[15:57:57] + Working ...
[15:57:57] - Calling '.\FahCore_11.exe -dir work/ -suffix 07 -checkpoint 30 -verbose -lifeline 8 -version 623'

[15:57:57] 
[15:57:57] *------------------------------*
[15:57:57] Folding@Home GPU Core
[15:57:57] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[15:57:57] 
[15:57:57] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[15:57:57] Build host: amoeba
[15:57:57] Board Type: Nvidia
[15:57:57] Core      : 
[15:57:57] Preparing to commence simulation
[15:57:57] - Looking at optimizations...
[15:57:58] DeleteFrameFiles: successfully deleted file=work/wudata_07.ckp
[15:57:58] - Created dyn
[15:57:58] - Files status OK
[15:57:58] - Expanded 67399 -> 350744 (decompressed 520.3 percent)
[15:57:58] Called DecompressByteArray: compressed_data_size=67399 data_size=350744, decompressed_data_size=350744 diff=0
[15:57:58] - Digital signature verified
[15:57:58] 
[15:57:58] Project: 5791 (Run 0, Clone 318, Gen 9)
[15:57:58] 
[15:57:58] Assembly optimizations on if available.
[15:57:58] Entering M.D.
[15:58:04] Tpr hash work/wudata_07.tpr:  1084752786 1432926355 3632744897 2249080962 3255808895
[15:58:04] 
[15:58:04] Calling fah_main args: 14 usage=100
[15:58:04] 
[15:58:04] Working on Protein
[15:58:06] Client config found, loading data.
[15:58:06] mdrun_gpu returned 
[15:58:06] SHAKE violations on GPU
[15:58:06] 
[15:58:06] Folding@home Core Shutdown: UNSTABLE_MACHINE
[15:58:09] CoreStatus = 7A (122)
[15:58:09] Sending work to server
[15:58:09] Project: 5791 (Run 0, Clone 318, Gen 9)
[15:58:09] - Read packet limit of 540015616... Set to 524286976.
[15:58:09] - Error: Could not get length of results file work/wuresults_07.dat
[15:58:09] - Error: Could not read unit 07 file. Removing from queue.
[15:58:09] EUE limit exceeded. Pausing 24 hours.

Re: Project: 5791 (Run 0, Clone 318, Gen 9)

Posted: Wed Oct 07, 2009 6:24 pm
by G-Byte
Same with me for this wu/rcg

For Monday:

[04:26:55] *------------------------------*
[04:26:55] Folding@Home GPU Core - Beta
[04:26:55] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[04:26:55] 
[04:26:55] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[04:26:55] Build host: amoeba
[04:26:55] Board Type: Nvidia
[04:26:55] Core      : 
[04:26:55] Preparing to commence simulation
[04:26:55] - Looking at optimizations...
[04:26:55] - Created dyn
[04:26:55] - Files status OK
[04:26:55] - Expanded 67399 -> 350744 (decompressed 520.3 percent)
[04:26:55] Called DecompressByteArray: compressed_data_size=67399 data_size=350744, decompressed_data_size=350744 diff=0
[04:26:55] - Digital signature verified
[04:26:55] 
[04:26:55] Project: 5791 (Run 0, Clone 318, Gen 9)
[04:26:55] 
[04:26:55] Assembly optimizations on if available.
[04:26:55] Entering M.D.
[04:27:03] Working on Protein
[04:27:03] Client config found, loading data.
[04:27:03] Starting GUI Server
[04:27:03] mdrun_gpu returned 
[04:27:03] SHAKE violations on GPU
[04:27:03] 
[04:27:03] Folding@home Core Shutdown: UNSTABLE_MACHINE
[04:27:08] CoreStatus = 7A (122)
[04:27:08] Sending work to server
[04:27:08] Project: 5791 (Run 0, Clone 318, Gen 9)
[04:27:08] - Read packet limit of 540015616... Set to 524286976.
[04:27:08] - Error: Could not get length of results file work/wuresults_08.dat
[04:27:08] - Error: Could not read unit 08 file. Removing from queue.


[04:27:16] *------------------------------*
[04:27:16] Folding@Home GPU Core - Beta
[04:27:16] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[04:27:16] 
[04:27:16] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[04:27:16] Build host: amoeba
[04:27:16] Board Type: Nvidia
[04:27:16] Core      : 
[04:27:16] Preparing to commence simulation
[04:27:16] - Looking at optimizations...
[04:27:16] - Created dyn
[04:27:16] - Files status OK
[04:27:16] - Expanded 67399 -> 350744 (decompressed 520.3 percent)
[04:27:16] Called DecompressByteArray: compressed_data_size=67399 data_size=350744, decompressed_data_size=350744 diff=0
[04:27:16] - Digital signature verified
[04:27:16] 
[04:27:16] Project: 5791 (Run 0, Clone 318, Gen 9)
[04:27:16] 
[04:27:16] Assembly optimizations on if available.
[04:27:16] Entering M.D.
[04:27:22] Working on Protein
[04:27:24] Client config found, loading data.
[04:27:24] Starting GUI Server
[04:27:24] mdrun_gpu returned 
[04:27:24] SHAKE violations on GPU
[04:27:24] 
[04:27:24] Folding@home Core Shutdown: UNSTABLE_MACHINE
[04:27:27] CoreStatus = 7A (122)
[04:27:27] Sending work to server
[04:27:27] Project: 5791 (Run 0, Clone 318, Gen 9)
[04:27:27] - Read packet limit of 540015616... Set to 524286976.
[04:27:27] - Error: Could not get length of results file work/wuresults_09.dat
[04:27:27] - Error: Could not read unit 09 file. Removing from queue.
[04:27:35] Folding@Home GPU Core - Beta
[04:27:35] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[04:27:35] 
[04:27:35] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[04:27:35] Build host: amoeba
[04:27:35] Board Type: Nvidia
[04:27:35] Core      : 
[04:27:35] Preparing to commence simulation
[04:27:35] - Looking at optimizations...
[04:27:36] - Created dyn
[04:27:36] - Files status OK
[04:27:36] - Expanded 67399 -> 350744 (decompressed 520.3 percent)
[04:27:36] Called DecompressByteArray: compressed_data_size=67399 data_size=350744, decompressed_data_size=350744 diff=0
[04:27:36] - Digital signature verified
[04:27:36] 
[04:27:36] Project: 5791 (Run 0, Clone 318, Gen 9)
[04:27:36] 
[04:27:37] Assembly optimizations on if available.
[04:27:37] Entering M.D.
[04:27:43] Working on Protein
[04:27:44] Client config found, loading data.
[04:27:44] mdrun_gpu returned 
[04:27:44] SHAKE violations on GPU
[04:27:44] 
[04:27:44] Folding@home Core Shutdown: UNSTABLE_MACHINE
[04:27:48] CoreStatus = 7A (122)
[04:27:48] Sending work to server
[04:27:48] Project: 5791 (Run 0, Clone 318, Gen 9)
[04:27:48] - Read packet limit of 540015616... Set to 524286976.
[04:27:48] - Error: Could not get length of results file work/wuresults_00.dat
[04:27:48] - Error: Could not read unit 00 file. Removing from queue.

[04:27:56] *------------------------------*
[04:27:56] Folding@Home GPU Core - Beta
[04:27:56] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[04:27:56] 
[04:27:56] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[04:27:56] Build host: amoeba
[04:27:56] Board Type: Nvidia
[04:27:56] Core      : 
[04:27:56] Preparing to commence simulation
[04:27:56] - Looking at optimizations...
[04:27:56] - Created dyn
[04:27:56] - Files status OK
[04:27:56] - Expanded 67399 -> 350744 (decompressed 520.3 percent)
[04:27:56] Called DecompressByteArray: compressed_data_size=67399 data_size=350744, decompressed_data_size=350744 diff=0
[04:27:56] - Digital signature verified
[04:27:56] 
[04:27:56] Project: 5791 (Run 0, Clone 318, Gen 9)
[04:27:56] 
[04:27:58] Assembly optimizations on if available.
[04:27:58] Entering M.D.
[04:28:04] Working on Protein
[04:28:05] Client config found, loading data.
[04:28:05] mdrun_gpu returned 
[04:28:05] SHAKE violations on GPU
[04:28:05] 
[04:28:05] Folding@home Core Shutdown: UNSTABLE_MACHINE
[04:28:07] CoreStatus = 7A (122)
[04:28:07] Sending work to server
[04:28:07] Project: 5791 (Run 0, Clone 318, Gen 9)
[04:28:07] - Read packet limit of 540015616... Set to 524286976.
[04:28:07] - Error: Could not get length of results file work/wuresults_01.dat
[04:28:07] - Error: Could not read unit 01 file. Removing from queue.

[04:28:15] *------------------------------*
[04:28:15] Folding@Home GPU Core - Beta
[04:28:15] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[04:28:15] 
[04:28:15] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[04:28:15] Build host: amoeba
[04:28:15] Board Type: Nvidia
[04:28:15] Core      : 
[04:28:15] Preparing to commence simulation
[04:28:15] - Looking at optimizations...
[04:28:15] - Created dyn
[04:28:15] - Files status OK
[04:28:15] - Expanded 67399 -> 350744 (decompressed 520.3 percent)
[04:28:15] Called DecompressByteArray: compressed_data_size=67399 data_size=350744, decompressed_data_size=350744 diff=0
[04:28:15] - Digital signature verified
[04:28:15] 
[04:28:15] Project: 5791 (Run 0, Clone 318, Gen 9)
[04:28:15] 
[04:28:15] Assembly optimizations on if available.
[04:28:15] Entering M.D.
[04:28:23] Working on Protein
[04:28:23] Client config found, loading data.
[04:28:23] Starting GUI Server
[04:28:23] mdrun_gpu returned 
[04:28:23] SHAKE violations on GPU
[04:28:23] 
[04:28:23] Folding@home Core Shutdown: UNSTABLE_MACHINE
[04:28:26] CoreStatus = 7A (122)
[04:28:26] Sending work to server
[04:28:26] Project: 5791 (Run 0, Clone 318, Gen 9)
[04:28:26] - Read packet limit of 540015616... Set to 524286976.
[04:28:26] - Error: Could not get length of results file work/wuresults_02.dat
[04:28:26] - Error: Could not read unit 02 file. Removing from queue.
[04:28:26] EUE limit exceeded. Pausing 24 hours.
[05:53:31] ***** Got a SIGTERM signal (2)
[05:53:31] Killing all core threads

And these for yesterday:

[04:26:55] Project: 5791 (Run 0, Clone 318, Gen 9)
[04:26:55] 
[04:26:55] Assembly optimizations on if available.
[04:26:55] Entering M.D.
[04:27:03] Working on Protein
[04:27:03] Client config found, loading data.
[04:27:03] Starting GUI Server
[04:27:03] mdrun_gpu returned 
[04:27:03] SHAKE violations on GPU
[04:27:03] 
[04:27:03] Folding@home Core Shutdown: UNSTABLE_MACHINE
[04:27:08] CoreStatus = 7A (122)

[04:27:16] Project: 5791 (Run 0, Clone 318, Gen 9)
[04:27:16] 
[04:27:16] Assembly optimizations on if available.
[04:27:16] Entering M.D.
[04:27:22] Working on Protein
[04:27:24] Client config found, loading data.
[04:27:24] Starting GUI Server
[04:27:24] mdrun_gpu returned 
[04:27:24] SHAKE violations on GPU

[04:27:36] Project: 5791 (Run 0, Clone 318, Gen 9)
[04:27:36] 
[04:27:37] Assembly optimizations on if available.
[04:27:37] Entering M.D.
[04:27:43] Working on Protein
[04:27:44] Client config found, loading data.
[04:27:44] mdrun_gpu returned 
[04:27:44] SHAKE violations on GPU
[04:27:44] 
[04:27:44] Folding@home Core Shutdown: UNSTABLE_MACHINE
[04:27:48] CoreStatus = 7A (122)
[04:27:48] Sending work to server
[04:27:48] Project: 5791 (Run 0, Clone 318, Gen 9)
[04:27:48] - Read packet limit of 540015616... Set to 524286976.
[04:27:48] - Error: Could not get length of results file work/wuresults_00.dat
[04:27:48] - Error: Could not read unit 00 file. Removing from queue.

Re: Project: 5791 (Run 0, Clone 318, Gen 9)

Posted: Wed Nov 04, 2009 12:13 pm
by bollix47
+1 for this WU

[11:58:46] Project: 5791 (Run 0, Clone 318, Gen 9)
[11:58:46] 
[11:58:46] Assembly optimizations on if available.
[11:58:46] Entering M.D.
[11:58:53] Working on Protein
[11:58:54] Client config found, loading data.
[11:58:54] Starting GUI Server
[11:58:55] mdrun_gpu returned 
[11:58:55] SHAKE violations on GPU
[11:58:55] 
[11:58:55] Folding@home Core Shutdown: UNSTABLE_MACHINE
[11:58:59] CoreStatus = 7A (122)

Re: Project: 5791 (Run 0, Clone 318, Gen 9)

Posted: Tue Nov 10, 2009 4:25 am
by vfolder
And again, SHAKE violation....

[01:56:27] Trying to send all finished work units
[01:56:27] + No unsent completed units remaining.
[01:56:27] - Preparing to get new work unit...
[01:56:27] + Attempting to get work packet
[01:56:27] - Will indicate memory of 2014 MB
[01:56:27] - Connecting to assignment server
[01:56:27] Connecting to http://assign-GPU.stanford.edu:8080/
[01:56:28] Posted data.
[01:56:28] Initial: 40AB; - Successful: assigned to (171.64.65.106).
[01:56:28] + News From Folding@Home: Welcome to Folding@Home
[01:56:28] Loaded queue successfully.
[01:56:28] Connecting to http://171.64.65.106:8080/
[01:56:28] Posted data.
[01:56:28] Initial: 0000; - Receiving payload (expected size: 67911)
[01:56:28] Conversation time very short, giving reduced weight in bandwidth avg
[01:56:28] - Downloaded at ~132 kB/s
[01:56:28] - Averaged speed for that direction ~105 kB/s
[01:56:28] + Received work.
[01:56:28] Trying to send all finished work units
[01:56:28] + No unsent completed units remaining.
[01:56:28] + Closed connections
[01:56:28] 
[01:56:28] + Processing work unit
[01:56:28] Core required: FahCore_11.exe
[01:56:28] Core found.
[01:56:28] Working on queue slot 08 [November 10 01:56:28 UTC]
[01:56:28] + Working ...
[01:56:28] - Calling '.\FahCore_11.exe -dir work/ -suffix 08 -checkpoint 15 -verbose -lifeline 45 -version 623'

[01:56:28] 
[01:56:28] *------------------------------*
[01:56:28] Folding@Home GPU Core - Beta
[01:56:28] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[01:56:28] 
[01:56:28] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[01:56:28] Build host: amoeba
[01:56:28] Board Type: Nvidia
[01:56:28] Core      : 
[01:56:28] Preparing to commence simulation
[01:56:28] - Looking at optimizations...
[01:56:28] - Created dyn
[01:56:28] - Files status OK
[01:56:28] - Expanded 67399 -> 350744 (decompressed 520.3 percent)
[01:56:28] Called DecompressByteArray: compressed_data_size=67399 data_size=350744, decompressed_data_size=350744 diff=0
[01:56:28] - Digital signature verified
[01:56:28] 
[01:56:28] Project: 5791 (Run 0, Clone 318, Gen 9)
[01:56:28] 
[01:56:28] Assembly optimizations on if available.
[01:56:28] Entering M.D.
[01:56:35] Working on Protein
[01:56:37] Client config found, loading data.
[01:56:37] Starting GUI Server
[01:56:37] mdrun_gpu returned 
[01:56:37] SHAKE violations on GPU
[01:56:37] 
[01:56:37] Folding@home Core Shutdown: UNSTABLE_MACHINE
[01:56:40] CoreStatus = 7A (122)
[01:56:40] Sending work to server
[01:56:40] Project: 5791 (Run 0, Clone 318, Gen 9)
[01:56:40] - Error: Could not get length of results file work/wuresults_08.dat
[01:56:40] - Error: Could not read unit 08 file. Removing from queue.
[01:56:40] Trying to send all finished work units
[01:56:40] + No unsent completed units remaining.
[01:56:40] - Preparing to get new work unit...
[01:56:40] + Attempting to get work packet
[01:56:40] - Will indicate memory of 2014 MB
[01:56:40] - Connecting to assignment server
etc. etc.

Wash, rinse, repeat....24hr jail.

This loaded into a 9600GSO (96 shader) at stock clocks.

Re: Project: 5791 (Run 0, Clone 318, Gen 9)

Posted: Tue Nov 10, 2009 5:42 am
by vvoelz
Yet another instability in this WU. I've killed this clone, so it won't ever be assigned again. Thanks for the good reporting --Vince

Re: Project: 5791 (Run 0, Clone 318, Gen 9)

Posted: Thu Nov 12, 2009 6:20 am
by Torin3
It is still getting out:
[10:06:52] + Processing work unit
[10:06:52] Core required: FahCore_11.exe
[10:06:52] Core found.
[10:06:52] Working on queue slot 00 [November 11 10:06:52 UTC]
[10:06:52] + Working ...
[10:06:52] - Calling '.\FahCore_11.exe -dir work/ -suffix 00 -priority 96 -checkpoint 15 -verbose -lifeline 2652 -version 623'

[10:06:52]
[10:06:52] *------------------------------*
[10:06:52] Folding@Home GPU Core - Beta
[10:06:52] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[10:06:52]
[10:06:52] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[10:06:52] Build host: amoeba
[10:06:52] Board Type: Nvidia
[10:06:52] Core :
[10:06:52] Preparing to commence simulation
[10:06:52] - Looking at optimizations...
[10:06:52] - Created dyn
[10:06:52] - Files status OK
[10:06:52] - Expanded 67399 -> 350744 (decompressed 520.3 percent)
[10:06:52] Called DecompressByteArray: compressed_data_size=67399 data_size=350744, decompressed_data_size=350744 diff=0
[10:06:52] - Digital signature verified
[10:06:52]
[10:06:52] Project: 5791 (Run 0, Clone 318, Gen 9)
[10:06:52]
[10:06:52] Assembly optimizations on if available.
[10:06:52] Entering M.D.
[10:06:59] Working on Protein
[10:07:00] Client config found, loading data.
[10:07:00] mdrun_gpu returned
[10:07:00] SHAKE violations on GPU
[10:07:00]
[10:07:00] Folding@home Core Shutdown: UNSTABLE_MACHINE
[10:07:02] CoreStatus = 7A (122)
[10:07:02] Sending work to server
[10:07:02] Project: 5791 (Run 0, Clone 318, Gen 9)
[10:07:02] - Read packet limit of 540015616... Set to 524286976.
[10:07:02] - Error: Could not get length of results file work/wuresults_00.dat
[10:07:02] - Error: Could not read unit 00 file. Removing from queue.
[10:07:02] Trying to send all finished work units

Re: Project: 5791 (Run 0, Clone 318, Gen 9)

Posted: Mon Nov 16, 2009 3:56 am
by Torin3
I hate to say, but it is still getting out. Got it again today:
[02:45:07] + Processing work unit
[02:45:07] Core required: FahCore_11.exe
[02:45:07] Core found.
[02:45:07] Working on queue slot 03 [November 16 02:45:07 UTC]
[02:45:07] + Working ...
[02:45:07] - Calling '.\FahCore_11.exe -dir work/ -suffix 03 -priority 96 -checkpoint 15 -verbose -lifeline 2356 -version 623'

[02:45:07]
[02:45:07] *------------------------------*
[02:45:07] Folding@Home GPU Core - Beta
[02:45:07] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[02:45:07]
[02:45:07] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[02:45:07] Build host: amoeba
[02:45:07] Board Type: Nvidia
[02:45:07] Core :
[02:45:07] Preparing to commence simulation
[02:45:07] - Looking at optimizations...
[02:45:07] - Created dyn
[02:45:07] - Files status OK
[02:45:07] - Expanded 67399 -> 350744 (decompressed 520.3 percent)
[02:45:07] Called DecompressByteArray: compressed_data_size=67399 data_size=350744, decompressed_data_size=350744 diff=0
[02:45:07] - Digital signature verified
[02:45:07]
[02:45:07] Project: 5791 (Run 0, Clone 318, Gen 9)
[02:45:07]
[02:45:07] Assembly optimizations on if available.
[02:45:07] Entering M.D.
[02:45:13] Working on Protein
[02:45:14] Client config found, loading data.
[02:45:15] mdrun_gpu returned
[02:45:15] SHAKE violations on GPU
[02:45:15]
[02:45:15] Folding@home Core Shutdown: UNSTABLE_MACHINE
[02:45:17] CoreStatus = 7A (122)

Re: Project: 5791 (Run 0, Clone 318, Gen 9)

Posted: Mon Nov 16, 2009 9:08 pm
by bruce
Torin3 wrote: I hate to say, but it is still getting out. Got it again today:
This is my theory about that. There are apparently two separate assignment methods.

1) If you finish a WU and request a new one, you will be assigned a new WU from whichever server has work for your client. When the Pande Group stops a WU, this type of assignment is prevented.

2) If you fail to finish a WU and request an assignment, the server often re-sends the exact same WU to you. Apparently this type of assignment is not stopped by whatever vvoelz did to this WU.

I suggest that you change your MachineID. The server will no longer receive a request from the SAME client, so it will be a type-1 assignment. The type-2 assignment will probably still hang around the server until that WU expires. That's 15 days after it was assigned to you, but since the same machine will not be requesting assignments, it won't be reassigned.
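
If you want to check whether your client is stuck in the type-2 loop before changing the MachineID, something like the following could count how often the same Project/Run/Clone/Gen ends in an UNSTABLE_MACHINE shutdown. This is only a rough sketch: the function name is made up for illustration, and the patterns are taken from the log lines quoted in this thread, so other client or core versions may print something different.

```python
import re
from collections import Counter

# Patterns assumed from the log excerpts in this thread; adjust if your log differs.
PROJECT_RE = re.compile(r"Project: (\d+) \(Run (\d+), Clone (\d+), Gen (\d+)\)")
FAIL_MARK = "Core Shutdown: UNSTABLE_MACHINE"


def count_unstable_shutdowns(log_text):
    """Count UNSTABLE_MACHINE shutdowns per (project, run, clone, gen).

    Hypothetical helper, not part of any Folding@home tool.
    """
    failures = Counter()
    current = None  # most recent PRCG seen before a shutdown line
    for line in log_text.splitlines():
        match = PROJECT_RE.search(line)
        if match:
            current = tuple(int(g) for g in match.groups())
        elif FAIL_MARK in line and current is not None:
            failures[current] += 1
            # Reset so one shutdown is never counted twice; the next
            # "Project:" line sets the PRCG again.
            current = None
    return failures
```

To use it, read your client log into a string and pass it in, e.g. `count_unstable_shutdowns(open(path_to_log).read())`, where `path_to_log` is wherever your client writes its log. If one PRCG shows up with a count of several failures in a row, you are most likely being handed the same WU back each time.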