Project: 5744 (Run 4, Clone 88, Gen 641)

bapriebe
Posts: 44
Joined: Sun Apr 20, 2008 8:33 am
Hardware configuration: HP xw4600 workstation (4GB)+Q9650+Sapphire Vapor-X HD4890,
HP Z600 workstation (4GB)+2xXEON E5540+Sapphire HD5770,
HP ML350 server (4GB)+2xXEON E5520+Diamond HD3850
Location: Ottawa, Ontario

Project: 5744 (Run 4, Clone 88, Gen 641)

Post by bapriebe »

This WU failed immediately with SHAKE violations five times in a row:

Code:

[15:04:51] Project: 5744 (Run 4, Clone 88, Gen 641)
[15:04:51] 
[15:04:51] Assembly optimizations on if available.
[15:04:51] Entering M.D.
[15:04:57] Tpr hash work/wudata_01.tpr:  2652713085 1063011248 2982323259 4137909011 1987027690
[15:04:57] Working on Protein
[15:04:58] Client config found, loading data.
[15:04:58] Starting GUI Server
[15:05:01] mdrun_gpu returned 
[15:05:01] SHAKE violations on GPU
[15:05:01] 
[15:05:01] Folding@home Core Shutdown: UNSTABLE_MACHINE
[15:05:04] CoreStatus = 7A (122)
[15:05:04] Sending work to server
[15:05:04] Project: 5744 (Run 4, Clone 88, Gen 641)
[15:05:04] - Read packet limit of 540015616... Set to 524286976.
[15:05:04] - Error: Could not get length of results file work/wuresults_01.dat
[15:05:04] - Error: Could not read unit 01 file. Removing from queue.
rhavern
Posts: 425
Joined: Mon Dec 03, 2007 8:45 am
Location: UK

Re: Project: 5744 (Run 4, Clone 88, Gen 641)

Post by rhavern »

Same problem here: SHAKE violations on a stable machine (WinXP 32-bit, AMD 3200+, HD3850 AGP, experimental DLLs).

Code:

[18:56:54] Trying to send all finished work units
[18:56:54] + No unsent completed units remaining.
[18:56:54] - Preparing to get new work unit...
[18:56:54] + Attempting to get work packet
[18:56:54] - Will indicate memory of 1023 MB
[18:56:54] - Detect CPU. Vendor: AuthenticAMD, Family: 6, Model: 10, Stepping: 0
[18:56:54] - Connecting to assignment server
[18:56:54] Connecting to http://assign-GPU.stanford.edu:8080/
[18:56:55] - Successful: assigned to (171.64.65.102).
[18:56:55] + News From Folding@Home: Welcome to Folding@Home
[18:56:55] Loaded queue successfully.
[18:56:55] Connecting to http://171.64.65.102:8080/
[18:56:55] - Receiving payload (expected size: 69133)
[18:56:57] - Downloaded at ~33 kB/s
[18:56:57] - Averaged speed for that direction ~66 kB/s
[18:56:57] + Received work.
[18:56:57] Trying to send all finished work units
[18:56:57] + No unsent completed units remaining.
[18:56:57] + Closed connections
[18:56:57] 
[18:56:57] + Processing work unit
[18:56:57] Core required: FahCore_11.exe
[18:56:57] Core found.
[18:56:57] Working on queue slot 04 [November 22 18:56:57 UTC]
[18:56:57] + Working ...
[18:56:57] - Calling '.\FahCore_11.exe -dir work/ -suffix 04 -checkpoint 30 -service -verbose -lifeline 2732 -version 623'

[18:56:57] 
[18:56:57] *------------------------------*
[18:56:57] Folding@Home GPU Core - Beta
[18:56:57] Version 1.24 (Mon Feb 9 11:00:12 PST 2009)
[18:56:57] 
[18:56:57] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[18:56:57] Build host: amoeba
[18:56:57] Board Type: AMD
[18:56:57] Core      : 
[18:56:57] Preparing to commence simulation
[18:56:57] - Looking at optimizations...
[18:56:57] - Created dyn
[18:56:57] - Files status OK
[18:56:57] - Expanded 68621 -> 357580 (decompressed 521.0 percent)
[18:56:57] Called DecompressByteArray: compressed_data_size=68621 data_size=357580, decompressed_data_size=357580 diff=0
[18:56:57] - Digital signature verified
[18:56:57] 
[18:56:57] Project: 5744 (Run 4, Clone 88, Gen 641)
[18:56:57] 
[18:56:57] Assembly optimizations on if available.
[18:56:57] Entering M.D.
[18:57:03] Tpr hash work/wudata_04.tpr:  2652713085 1063011248 2982323259 4137909011 1987027690
[18:57:03] Working on Protein
[18:57:13] Client config found, loading data.
[18:57:14] Starting GUI Server
[18:57:22] mdrun_gpu returned 
[18:57:22] SHAKE violations on GPU
[18:57:22] 
[18:57:22] Folding@home Core Shutdown: UNSTABLE_MACHINE
[18:57:25] CoreStatus = 7A (122)
[18:57:25] Sending work to server
[18:57:25] Project: 5744 (Run 4, Clone 88, Gen 641)
[18:57:25] - Read packet limit of 540015616... Set to 524286976.
[18:57:25] - Error: Could not get length of results file work/wuresults_04.dat
[18:57:25] - Error: Could not read unit 04 file. Removing from queue.
[18:57:25] Trying to send all finished work units
[18:57:25] + No unsent completed units remaining.
[18:57:25] - Preparing to get new work unit...
[18:57:25] + Attempting to get work packet
[18:57:25] - Will indicate memory of 1023 MB
[18:57:25] - Connecting to assignment server
[18:57:25] Connecting to http://assign-GPU.stanford.edu:8080/
[18:57:25] - Successful: assigned to (171.64.65.102).
[18:57:25] + News From Folding@Home: Welcome to Folding@Home
[18:57:25] Loaded queue successfully.
[18:57:25] Connecting to http://171.64.65.102:8080/
[18:57:26] - Receiving payload (expected size: 69133)
[18:57:27] - Downloaded at ~67 kB/s
[18:57:27] - Averaged speed for that direction ~66 kB/s
[18:57:27] + Received work.
[18:57:27] Trying to send all finished work units
[18:57:27] + No unsent completed units remaining.
[18:57:27] + Closed connections
[18:57:32] 
[18:57:32] + Processing work unit
[18:57:32] Core required: FahCore_11.exe
[18:57:32] Core found.
[18:57:32] Working on queue slot 05 [November 22 18:57:32 UTC]
[18:57:32] + Working ...
[18:57:32] - Calling '.\FahCore_11.exe -dir work/ -suffix 05 -checkpoint 30 -service -verbose -lifeline 2732 -version 623'

[18:57:32] 
[18:57:32] *------------------------------*
[18:57:32] Folding@Home GPU Core - Beta
[18:57:32] Version 1.24 (Mon Feb 9 11:00:12 PST 2009)
[18:57:32] 
[18:57:32] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[18:57:32] Build host: amoeba
[18:57:32] Board Type: AMD
[18:57:32] Core      : 
[18:57:32] Preparing to commence simulation
[18:57:32] - Looking at optimizations...
[18:57:32] - Created dyn
[18:57:32] - Files status OK
[18:57:32] - Expanded 68621 -> 357580 (decompressed 521.0 percent)
[18:57:32] Called DecompressByteArray: compressed_data_size=68621 data_size=357580, decompressed_data_size=357580 diff=0
[18:57:32] - Digital signature verified
[18:57:32] 
[18:57:32] Project: 5744 (Run 4, Clone 88, Gen 641)
[18:57:32] 
[18:57:33] Assembly optimizations on if available.
[18:57:33] Entering M.D.
[18:57:39] Tpr hash work/wudata_05.tpr:  2652713085 1063011248 2982323259 4137909011 1987027690
[18:57:39] Working on Protein
[18:57:40] Client config found, loading data.
[18:57:40] Starting GUI Server
[18:57:48] mdrun_gpu returned 
[18:57:48] SHAKE violations on GPU
[18:57:48] 
[18:57:48] Folding@home Core Shutdown: UNSTABLE_MACHINE
[18:57:50] CoreStatus = 7A (122)
[18:57:50] Sending work to server
[18:57:50] Project: 5744 (Run 4, Clone 88, Gen 641)
[18:57:50] - Read packet limit of 540015616... Set to 524286976.
[18:57:50] - Error: Could not get length of results file work/wuresults_05.dat
[18:57:50] - Error: Could not read unit 05 file. Removing from queue.
[18:57:50] Trying to send all finished work units
[18:57:50] + No unsent completed units remaining.
[18:57:50] - Preparing to get new work unit...
[18:57:50] + Attempting to get work packet
[18:57:50] - Will indicate memory of 1023 MB
[18:57:50] - Connecting to assignment server
[18:57:50] Connecting to http://assign-GPU.stanford.edu:8080/
[18:57:51] - Successful: assigned to (171.64.65.102).
[18:57:51] + News From Folding@Home: Welcome to Folding@Home
[18:57:51] Loaded queue successfully.
[18:57:51] Connecting to http://171.64.65.102:8080/
[18:57:52] - Receiving payload (expected size: 69133)
[18:57:53] - Downloaded at ~67 kB/s
[18:57:53] - Averaged speed for that direction ~66 kB/s
[18:57:53] + Received work.
[18:57:53] Trying to send all finished work units
[18:57:53] + No unsent completed units remaining.
[18:57:53] + Closed connections
[18:57:58] 
[18:57:58] + Processing work unit
[18:57:58] Core required: FahCore_11.exe
[18:57:58] Core found.
[18:57:58] Working on queue slot 06 [November 22 18:57:58 UTC]
[18:57:58] + Working ...
[18:57:58] - Calling '.\FahCore_11.exe -dir work/ -suffix 06 -checkpoint 30 -service -verbose -lifeline 2732 -version 623'

[18:57:58] 
[18:57:58] *------------------------------*
[18:57:58] Folding@Home GPU Core - Beta
[18:57:58] Version 1.24 (Mon Feb 9 11:00:12 PST 2009)
[18:57:58] 
[18:57:58] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[18:57:58] Build host: amoeba
[18:57:58] Board Type: AMD
[18:57:58] Core      : 
[18:57:58] Preparing to commence simulation
[18:57:58] - Looking at optimizations...
[18:57:58] - Created dyn
[18:57:58] - Files status OK
[18:57:58] - Expanded 68621 -> 357580 (decompressed 521.0 percent)
[18:57:58] Called DecompressByteArray: compressed_data_size=68621 data_size=357580, decompressed_data_size=357580 diff=0
[18:57:58] - Digital signature verified
[18:57:58] 
[18:57:58] Project: 5744 (Run 4, Clone 88, Gen 641)
[18:57:58] 
[18:57:59] Assembly optimizations on if available.
[18:57:59] Entering M.D.
[18:58:05] Tpr hash work/wudata_06.tpr:  2652713085 1063011248 2982323259 4137909011 1987027690
[18:58:05] Working on Protein
[18:58:06] Client config found, loading data.
[18:58:06] Starting GUI Server
[18:58:14] mdrun_gpu returned 
[18:58:14] SHAKE violations on GPU
[18:58:14] 
[18:58:14] Folding@home Core Shutdown: UNSTABLE_MACHINE
[18:58:16] CoreStatus = 7A (122)
[18:58:16] Sending work to server
[18:58:16] Project: 5744 (Run 4, Clone 88, Gen 641)
[18:58:16] - Read packet limit of 540015616... Set to 524286976.
[18:58:16] - Error: Could not get length of results file work/wuresults_06.dat
[18:58:16] - Error: Could not read unit 06 file. Removing from queue.
[18:58:16] Trying to send all finished work units
[18:58:16] + No unsent completed units remaining.
[18:58:16] - Preparing to get new work unit...
[18:58:16] + Attempting to get work packet
[18:58:16] - Will indicate memory of 1023 MB
[18:58:16] - Connecting to assignment server
[18:58:16] Connecting to http://assign-GPU.stanford.edu:8080/
[18:58:17] - Successful: assigned to (171.64.65.102).
[18:58:17] + News From Folding@Home: Welcome to Folding@Home
[18:58:17] Loaded queue successfully.
[18:58:17] Connecting to http://171.64.65.102:8080/
[18:58:18] - Receiving payload (expected size: 69133)
[18:58:19] - Downloaded at ~67 kB/s
[18:58:19] - Averaged speed for that direction ~67 kB/s
[18:58:19] + Received work.
[18:58:19] Trying to send all finished work units
[18:58:19] + No unsent completed units remaining.
[18:58:19] + Closed connections
[18:58:24] 
[18:58:24] + Processing work unit
[18:58:24] Core required: FahCore_11.exe
[18:58:24] Core found.
[18:58:24] Working on queue slot 07 [November 22 18:58:24 UTC]
[18:58:24] + Working ...
[18:58:24] - Calling '.\FahCore_11.exe -dir work/ -suffix 07 -checkpoint 30 -service -verbose -lifeline 2732 -version 623'

[18:58:24] 
[18:58:24] *------------------------------*
[18:58:24] Folding@Home GPU Core - Beta
[18:58:24] Version 1.24 (Mon Feb 9 11:00:12 PST 2009)
[18:58:24] 
[18:58:24] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[18:58:24] Build host: amoeba
[18:58:24] Board Type: AMD
[18:58:24] Core      : 
[18:58:24] Preparing to commence simulation
[18:58:24] - Looking at optimizations...
[18:58:24] - Created dyn
[18:58:24] - Files status OK
[18:58:24] - Expanded 68621 -> 357580 (decompressed 521.0 percent)
[18:58:24] Called DecompressByteArray: compressed_data_size=68621 data_size=357580, decompressed_data_size=357580 diff=0
[18:58:24] - Digital signature verified
[18:58:24] 
[18:58:24] Project: 5744 (Run 4, Clone 88, Gen 641)
[18:58:24] 
[18:58:24] Assembly optimizations on if available.
[18:58:24] Entering M.D.
[18:58:30] Tpr hash work/wudata_07.tpr:  2652713085 1063011248 2982323259 4137909011 1987027690
[18:58:30] Working on Protein
[18:58:31] Client config found, loading data.
[18:58:31] Starting GUI Server
[18:58:40] mdrun_gpu returned 
[18:58:40] SHAKE violations on GPU
[18:58:40] 
[18:58:40] Folding@home Core Shutdown: UNSTABLE_MACHINE
[18:58:42] CoreStatus = 7A (122)
[18:58:42] Sending work to server
[18:58:42] Project: 5744 (Run 4, Clone 88, Gen 641)
[18:58:42] - Read packet limit of 540015616... Set to 524286976.
[18:58:42] - Error: Could not get length of results file work/wuresults_07.dat
[18:58:42] - Error: Could not read unit 07 file. Removing from queue.
[18:58:42] Trying to send all finished work units
[18:58:42] + No unsent completed units remaining.
[18:58:42] - Preparing to get new work unit...
[18:58:42] + Attempting to get work packet
[18:58:42] - Will indicate memory of 1023 MB
[18:58:42] - Connecting to assignment server
[18:58:42] Connecting to http://assign-GPU.stanford.edu:8080/
[18:58:43] - Successful: assigned to (171.64.65.102).
[18:58:43] + News From Folding@Home: Welcome to Folding@Home
[18:58:43] Loaded queue successfully.
[18:58:43] Connecting to http://171.64.65.102:8080/
[18:58:43] - Receiving payload (expected size: 69133)
[18:58:45] - Downloaded at ~33 kB/s
[18:58:45] - Averaged speed for that direction ~60 kB/s
[18:58:45] + Received work.
[18:58:45] Trying to send all finished work units
[18:58:45] + No unsent completed units remaining.
[18:58:45] + Closed connections
[18:58:50] 
[18:58:50] + Processing work unit
[18:58:50] Core required: FahCore_11.exe
[18:58:50] Core found.
[18:58:50] Working on queue slot 08 [November 22 18:58:50 UTC]
[18:58:50] + Working ...
[18:58:50] - Calling '.\FahCore_11.exe -dir work/ -suffix 08 -checkpoint 30 -service -verbose -lifeline 2732 -version 623'

[18:58:50] 
[18:58:50] *------------------------------*
[18:58:50] Folding@Home GPU Core - Beta
[18:58:50] Version 1.24 (Mon Feb 9 11:00:12 PST 2009)
[18:58:50] 
[18:58:50] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
[18:58:50] Build host: amoeba
[18:58:50] Board Type: AMD
[18:58:50] Core      : 
[18:58:50] Preparing to commence simulation
[18:58:50] - Looking at optimizations...
[18:58:50] - Created dyn
[18:58:50] - Files status OK
[18:58:50] - Expanded 68621 -> 357580 (decompressed 521.0 percent)
[18:58:50] Called DecompressByteArray: compressed_data_size=68621 data_size=357580, decompressed_data_size=357580 diff=0
[18:58:50] - Digital signature verified
[18:58:50] 
[18:58:50] Project: 5744 (Run 4, Clone 88, Gen 641)
[18:58:50] 
[18:58:50] Assembly optimizations on if available.
[18:58:50] Entering M.D.
[18:58:56] Tpr hash work/wudata_08.tpr:  2652713085 1063011248 2982323259 4137909011 1987027690
[18:58:56] Working on Protein
[18:58:57] Client config found, loading data.
[18:58:57] Starting GUI Server
[18:59:06] mdrun_gpu returned 
[18:59:06] SHAKE violations on GPU
[18:59:06] 
[18:59:06] Folding@home Core Shutdown: UNSTABLE_MACHINE
[18:59:08] CoreStatus = 7A (122)
[18:59:08] Sending work to server
[18:59:08] Project: 5744 (Run 4, Clone 88, Gen 641)
[18:59:08] - Read packet limit of 540015616... Set to 524286976.
[18:59:08] - Error: Could not get length of results file work/wuresults_08.dat
[18:59:08] - Error: Could not read unit 08 file. Removing from queue.
[18:59:08] EUE limit exceeded. Pausing 24 hours.
[19:03:21] - Autosending finished units... [November 22 19:03:21 UTC]
[19:03:21] Trying to send all finished work units
[19:03:21] + No unsent completed units remaining.
[19:03:21] - Autosend completed
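
For anyone who wants to check their own log for the same pattern, here is a minimal sketch that tallies "SHAKE violations" lines per work unit. It assumes the log format shown above; the names `PRCG` and `count_shake_failures` are made up for this example and are not part of the client.

```python
import re
from collections import Counter

# Matches lines such as "[15:04:51] Project: 5744 (Run 4, Clone 88, Gen 641)"
PRCG = re.compile(r"Project: (\d+) \(Run (\d+), Clone (\d+), Gen (\d+)\)")


def count_shake_failures(log_text):
    """Count 'SHAKE violations' lines per (project, run, clone, gen)."""
    failures = Counter()
    current = None  # most recent work unit named in the log
    for line in log_text.splitlines():
        match = PRCG.search(line)
        if match:
            current = tuple(int(g) for g in match.groups())
        elif "SHAKE violations" in line and current is not None:
            failures[current] += 1
    return failures


# Hypothetical excerpt shaped like the logs in this thread.
sample = """\
[15:04:51] Project: 5744 (Run 4, Clone 88, Gen 641)
[15:05:01] mdrun_gpu returned
[15:05:01] SHAKE violations on GPU
[15:05:01] Folding@home Core Shutdown: UNSTABLE_MACHINE
[15:05:04] Project: 5744 (Run 4, Clone 88, Gen 641)
"""

for wu, n in count_shake_failures(sample).items():
    print(wu, n)
```

If one work unit accounts for every failure, and other machines report it too, the WU itself is the more likely culprit than the hardware.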
Folding since 1 WU=1 point
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Project: 5744 (Run 4, Clone 88, Gen 641)

Post by bruce »

I've asked the project owner to stop this WU.