Page 2 of 3

Re: 128.143.231.201 (BA) acting up?

Posted: Mon Mar 31, 2014 1:24 am
by PinHead
Finished my SMP WU from a few hours ago, but when I switch back to BA this is all that I get ( over and over ):

Code: Select all

[23:49:18] Trying to send all finished work units
[23:49:18] + No unsent completed units remaining.
[23:49:18] - Preparing to get new work unit...
[23:49:18] Cleaning up work directory
[23:49:18] + Attempting to get work packet
[23:49:18] Passkey found
[23:49:18] - Will indicate memory of 16078 MB
[23:49:18] - Connecting to assignment server
[23:49:18] Connecting to http://assign.stanford.edu:8080/
[23:49:19] Posted data.
[23:49:19] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[23:49:19] + News From Folding@Home: Welcome to Folding@Home
[23:49:19] Loaded queue successfully.
[23:49:19] Sent data
[23:49:19] Connecting to http://128.143.231.201:8080/
[23:49:19] Posted data.
[23:49:19] Initial: 0000; - Receiving payload (expected size: 512)
[23:49:19] Conversation time very short, giving reduced weight in bandwidth avg
[23:49:19] - Downloaded at ~1 kB/s
[23:49:19] - Averaged speed for that direction ~17 kB/s
[23:49:19] + Received work.
[23:49:19] + Closed connections
[23:49:24]
[23:49:24] + Processing work unit
[23:49:24] Core required: FahCore_a5.exe
[23:49:24] Core found.
[23:49:24] Working on queue slot 00 [March 30 23:49:24 UTC]
[23:49:24] + Working ...
[23:49:24] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 00 -np 24 -checkpoint 15 -verbose -lifeline 1984 -version 634'

[23:49:24]
[23:49:24] *------------------------------*
[23:49:24] Folding@Home Gromacs SMP Core
[23:49:24] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[23:49:24]
[23:49:24] Preparing to commence simulation
[23:49:24] - Looking at optimizations...
[23:49:24] - Created dyn
[23:49:24] - Files status OK
[23:49:24] Couldn't Decompress
[23:49:24] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[23:49:24] -Error: Couldn't update checksum variables
[23:49:24] Error: Could not open work file
[23:49:24]
[23:49:24] Folding@home Core Shutdown: FILE_IO_ERROR
[23:49:24] CoreStatus = 75 (117)
[23:49:24] Error opening or reading from a file.
[23:49:24] Deleting current work unit & continuing...
[23:49:24] Trying to send all finished work units


Re: 128.143.231.201 (BA) acting up?

Posted: Mon Mar 31, 2014 3:20 pm
by Nathan_P
still playing up, I've switched to smp until further notice for this rig - lets see what happens with my other rig in about 5 hours

Re: 128.143.231.201 (BA) acting up?

Posted: Mon Mar 31, 2014 3:58 pm
by P5-133XL
Just as a side question, has anyone experimented/explored running the NaCl client on one of these BA machines. Is there a CPU limit to the NaCl? Does the network speed limit PPD more than the CPU? I'm curious as to the PPD when compared to SMP/BA?

Re: 128.143.231.201 (BA) acting up?

Posted: Mon Mar 31, 2014 8:30 pm
by Nathan_P
I tried to get the NaCl client to work on my daily rig but it wouldn't start folding, followed the FAQ and still couldn't get it to fold. I might have another go this weekend. The rig in question is at the lower end of the BA spectrum (dual x5670) but its the only one that has windows and chrome on it.

As for my earlier post, my 2nd rig is just starting a 8104.

Re: 128.143.231.201 (BA) acting up?

Posted: Tue Apr 01, 2014 2:01 am
by PinHead
I think the problem still exists.

Earlier today, my 64 core boxes picked up SMP's but are now working on BA WU. The 24 core box can't seem to get a BA WU. Still trying to download 512 byte 8105, over and over.

Re: 128.143.231.201 (BA) acting up?

Posted: Tue Apr 01, 2014 10:30 pm
by PinHead
So is there anything else I can try?

There haven't been any software changes on this box and now it can't seem to pull a BA WU. This has been going on for a couple of days. Server still says 16 cores but can't seem to give my 24 core box a correct assignment and delivery. My 64 core boxes seem to work fine, only one short glitch 1 day ago and back to work on BA units.

Here is the start up:

Code: Select all

Note: Please read the license agreement (fah6 -license). Further 
use of this software requires that you have read and accepted this agreement.

24 cores detected


--- Opening Log file [April 1 22:13:56 UTC] 


# Linux SMP Console Edition ###################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /media/Fold/FAH
Executable: ./fah6
Arguments: -smp -bigadv -verbosity 9 

[22:13:56] - Ask before connecting: No
[22:13:56] - User name: PinHead (Team 4)
[22:13:56] - User ID: XXXXXXXXXXXXXX
[22:13:56] - Machine ID: 1
[22:13:56] 
[22:13:56] Loaded queue successfully.
[A2:13:56] 
[22:13:56] - Autosending finished units... [A2:13:1 22:13:56 UTC]
[22:13:56] + Processing work unit
[22:13:56] Trying to send all finished work units
[22:13:56] Core required: FahCore_a5.exe
[22:13:56] + No unsent completed units remaining.
[22:13:56] Core found.
[22:13:56] - Autosend completed
[22:13:56] Working on queue slot 03 [April 1 22:13:56 UTC]
[22:13:56] + Working ...
[22:13:56] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 03 -np 24 -checkpoint 15 -verbose -lifeline 11393 -version 634'

thekraken: The Kraken 0.7-pre15 (compiled Sat Mar 16 09:47:09 EDT 2013 by wupig@wupig-System-Product-Name)
thekraken: Processor affinity wrapper for Folding@Home
thekraken: The Kraken comes with ABSOLUTELY NO WARRANTY; licensed under GPLv2
thekraken: PID: 11398
thekraken: Logging to thekraken.log
[22:13:56] 
[22:13:56] *------------------------------*
[22:13:56] Folding@Home Gromacs SMP Core
[22:13:56] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[22:13:56] 
[22:13:56] Preparing to commence simulation
[22:13:56] - Looking at optimizations...
[22:13:56] - Created dyn
[22:13:56] - Files status OK
[22:13:56] Couldn't Decompress
[22:13:56] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[22:13:56] -Error: Couldn't update checksum variables
[22:13:56] Error: Could not open work file
[22:13:56] 
[22:13:56] Folding@home Core Shutdown: FILE_IO_ERROR
[22:13:56] CoreStatus = 75 (117)
[22:13:56] Error opening or reading from a file.
[22:13:56] Deleting current work unit & continuing...
thekraken: The Kraken 0.7-pre15 (compiled Sat Mar 16 09:47:09 EDT 2013 by wupig@wupig-System-Product-Name)
thekraken: Processor affinity wrapper for Folding@Home
thekraken: The Kraken comes with ABSOLUTELY NO WARRANTY; licensed under GPLv2
thekraken: PID: 11401
thekraken: Logging to thekraken.log
[22:13:56] Trying to send all finished work units
[22:13:56] + No unsent completed units remaining.
[22:13:56] - Preparing to get new work unit...
[22:13:56] Cleaning up work directory
[22:13:56] + Attempting to get work packet
[22:13:56] Passkey found
[22:13:56] - Will indicate memory of 16078 MB
[22:13:56] - Connecting to assignment server
[22:13:56] Connecting to http://assign.stanford.edu:8080/
[22:13:57] Posted data.
[22:13:57] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[22:13:57] + News From Folding@Home: Welcome to Folding@Home
[22:13:57] Loaded queue successfully.
[22:13:57] Sent data
[22:13:57] Connecting to http://128.143.231.201:8080/
[22:13:57] Posted data.
[22:13:57] Initial: 0000; - Receiving payload (expected size: 512)
[22:13:57] Conversation time very short, giving reduced weight in bandwidth avg
[22:13:57] - Downloaded at ~1 kB/s
[22:13:57] - Averaged speed for that direction ~29 kB/s
[22:13:57] + Received work.
[22:13:57] + Closed connections
[22:14:02] 
[22:14:02] + Processing work unit
[22:14:02] Core required: FahCore_a5.exe
[22:14:02] Core found.
[22:14:02] Working on queue slot 04 [April 1 22:14:02 UTC]
[22:14:02] + Working ...
[22:14:02] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 04 -np 24 -checkpoint 15 -verbose -lifeline 11393 -version 634'

thekraken: The Kraken 0.7-pre15 (compiled Sat Mar 16 09:47:09 EDT 2013 by wupig@wupig-System-Product-Name)
thekraken: Processor affinity wrapper for Folding@Home
thekraken: The Kraken comes with ABSOLUTELY NO WARRANTY; licensed under GPLv2
thekraken: PID: 11410
thekraken: Logging to thekraken.log
[22:14:02] 
[22:14:02] *------------------------------*
[22:14:02] Folding@Home Gromacs SMP Core
[22:14:02] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[22:14:02] 
[22:14:02] Preparing to commence simulation
[22:14:02] - Looking at optimizations...
[22:14:02] - Created dyn
[22:14:02] - Files status OK
[22:14:02] Couldn't Decompress
[22:14:02] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[22:14:02] -Error: Couldn't update checksum variables
[22:14:02] Error: Could not open work file
[22:14:02] 
[22:14:02] Folding@home Core Shutdown: FILE_IO_ERROR
[22:14:03] CoreStatus = 75 (117)
[22:14:03] Error opening or reading from a file.
[22:14:03] Deleting current work unit & continuing...
thekraken: The Kraken 0.7-pre15 (compiled Sat Mar 16 09:47:09 EDT 2013 by wupig@wupig-System-Product-Name)
thekraken: Processor affinity wrapper for Folding@Home
thekraken: The Kraken comes with ABSOLUTELY NO WARRANTY; licensed under GPLv2
thekraken: PID: 11413
thekraken: Logging to thekraken.log
[22:14:03] Trying to send all finished work units
[22:14:03] + No unsent completed units remaining.
[22:14:03] - Preparing to get new work unit...
[22:14:03] Cleaning up work directory
[22:14:03] + Attempting to get work packet
[22:14:03] Passkey found
[22:14:03] - Will indicate memory of 16078 MB
[22:14:03] - Connecting to assignment server
[22:14:03] Connecting to http://assign.stanford.edu:8080/
[22:14:03] Posted data.
[22:14:03] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[22:14:03] + News From Folding@Home: Welcome to Folding@Home
[22:14:03] Loaded queue successfully.
[22:14:03] Sent data
[22:14:03] Connecting to http://128.143.231.201:8080/
[22:14:03] Posted data.
[22:14:03] Initial: 0000; - Receiving payload (expected size: 512)
[22:14:03] Conversation time very short, giving reduced weight in bandwidth avg
[22:14:03] - Downloaded at ~1 kB/s
[22:14:03] - Averaged speed for that direction ~26 kB/s
[22:14:03] + Received work.
[22:14:03] + Closed connections
[22:14:08] 
[22:14:08] + Processing work unit
[22:14:08] Core required: FahCore_a5.exe
[22:14:08] Core found.
[22:14:08] Working on queue slot 05 [April 1 22:14:08 UTC]
[22:14:08] + Working ...
[22:14:08] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 05 -np 24 -checkpoint 15 -verbose -lifeline 11393 -version 634'

thekraken: The Kraken 0.7-pre15 (compiled Sat Mar 16 09:47:09 EDT 2013 by wupig@wupig-System-Product-Name)
thekraken: Processor affinity wrapper for Folding@Home
thekraken: The Kraken comes with ABSOLUTELY NO WARRANTY; licensed under GPLv2
thekraken: PID: 11424
thekraken: Logging to thekraken.log
[22:14:08] 
[22:14:08] *------------------------------*
[22:14:08] Folding@Home Gromacs SMP Core
[22:14:08] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[22:14:08] 
[22:14:08] Preparing to commence simulation
[22:14:08] - Looking at optimizations...
[22:14:08] - Created dyn
[22:14:08] - Files status OK
[22:14:08] Couldn't Decompress
[22:14:08] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[22:14:08] -Error: Couldn't update checksum variables
[22:14:08] Error: Could not open work file
[22:14:08] 
[22:14:08] Folding@home Core Shutdown: FILE_IO_ERROR
[22:14:09] CoreStatus = 75 (117)
[22:14:09] Error opening or reading from a file.
[22:14:09] Deleting current work unit & continuing...
thekraken: The Kraken 0.7-pre15 (compiled Sat Mar 16 09:47:09 EDT 2013 by wupig@wupig-System-Product-Name)
thekraken: Processor affinity wrapper for Folding@Home
thekraken: The Kraken comes with ABSOLUTELY NO WARRANTY; licensed under GPLv2
thekraken: PID: 11427
thekraken: Logging to thekraken.log
[22:14:09] Trying to send all finished work units
[22:14:09] + No unsent completed units remaining.
[22:14:09] - Preparing to get new work unit...
[22:14:09] Cleaning up work directory
[22:14:09] + Attempting to get work packet
[22:14:09] Passkey found
[22:14:09] - Will indicate memory of 16078 MB
[22:14:09] - Connecting to assignment server
[22:14:09] Connecting to http://assign.stanford.edu:8080/
[22:14:09] Posted data.
[22:14:09] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[22:14:09] + News From Folding@Home: Welcome to Folding@Home
[22:14:09] Loaded queue successfully.
[22:14:09] Sent data
[22:14:09] Connecting to http://128.143.231.201:8080/
[22:14:09] Posted data.
[22:14:09] Initial: 0000; - Receiving payload (expected size: 512)
[22:14:09] Conversation time very short, giving reduced weight in bandwidth avg
[22:14:09] - Downloaded at ~1 kB/s
[22:14:09] - Averaged speed for that direction ~23 kB/s
[22:14:09] + Received work.
[22:14:09] + Closed connections
[22:14:14] 
[22:14:14] + Processing work unit
[22:14:14] Core required: FahCore_a5.exe
[22:14:14] Core found.
[22:14:14] Working on queue slot 06 [April 1 22:14:14 UTC]
[22:14:14] + Working ...
[22:14:14] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 06 -np 24 -checkpoint 15 -verbose -lifeline 11393 -version 634'

thekraken: The Kraken 0.7-pre15 (compiled Sat Mar 16 09:47:09 EDT 2013 by wupig@wupig-System-Product-Name)
thekraken: Processor affinity wrapper for Folding@Home
thekraken: The Kraken comes with ABSOLUTELY NO WARRANTY; licensed under GPLv2
thekraken: PID: 11436
thekraken: Logging to thekraken.log
[22:14:15] 
[22:14:15] *------------------------------*
[22:14:15] Folding@Home Gromacs SMP Core
[22:14:15] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[22:14:15] 
[22:14:15] Preparing to commence simulation
[22:14:15] - Looking at optimizations...
[22:14:15] - Created dyn
[22:14:15] - Files status OK
[22:14:15] Couldn't Decompress
[22:14:15] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[22:14:15] -Error: Couldn't update checksum variables
[22:14:15] Error: Could not open work file
[22:14:15] 
[22:14:15] Folding@home Core Shutdown: FILE_IO_ERROR
[22:14:15] CoreStatus = 75 (117)
[22:14:15] Error opening or reading from a file.
[22:14:15] Deleting current work unit & continuing...
thekraken: The Kraken 0.7-pre15 (compiled Sat Mar 16 09:47:09 EDT 2013 by wupig@wupig-System-Product-Name)
thekraken: Processor affinity wrapper for Folding@Home
thekraken: The Kraken comes with ABSOLUTELY NO WARRANTY; licensed under GPLv2
thekraken: PID: 11439
thekraken: Logging to thekraken.log
[22:14:15] Trying to send all finished work units
[22:14:15] + No unsent completed units remaining.
[22:14:15] - Preparing to get new work unit...
[22:14:15] Cleaning up work directory
[22:14:15] + Attempting to get work packet
[22:14:15] Passkey found
[22:14:15] - Will indicate memory of 16078 MB
[22:14:15] - Connecting to assignment server
[22:14:15] Connecting to http://assign.stanford.edu:8080/
[22:14:15] Posted data.
[22:14:15] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[22:14:15] + News From Folding@Home: Welcome to Folding@Home
[22:14:15] Loaded queue successfully.
[22:14:15] Sent data
[22:14:15] Connecting to http://128.143.231.201:8080/
[22:14:16] Posted data.
[22:14:16] Initial: 0000; - Receiving payload (expected size: 512)
[22:14:16] Conversation time very short, giving reduced weight in bandwidth avg
[22:14:16] - Downloaded at ~1 kB/s
[22:14:16] - Averaged speed for that direction ~20 kB/s
[22:14:16] + Received work.
[22:14:16] + Closed connections
^C[22:14:16] ***** Got an Activate signal (2)
[22:14:16] Killing all core threads

The queueinfo indicates that it is getting assigned a BA unit, it just seems to get bad data to retrieve it:

Code: Select all

[22:23:14] Loaded queue successfully.
[22:23:14] Printing Queue Information
Current Queue: 
Slot 08  Empty/Deleted
Project: 8571 (Run 0, Clone 5, Gen 499), Core: a3
Work server: 128.143.231.202:8080
Collection server: 128.143.199.97
Download date: April 1 01:56:55
Finished date: April 1 08:46:10

Slot 09  Empty/Deleted
Project: 8568 (Run 1, Clone 3, Gen 328), Core: a3
Work server: 128.143.231.202:8080
Collection server: 128.143.199.97
Download date: April 1 08:50:21
Finished date: April 1 15:24:53

Slot 00  Empty/Deleted
Project: 8577 (Run 1, Clone 7, Gen 325), Core: a3
Work server: 128.143.231.202:8080
Collection server: 128.143.199.97
Download date: April 1 15:35:24
Finished date: April 1 22:07:36

Slot 01  Empty/Deleted
Project: 8105 (Run 0, Clone 0, Gen 337), Core: a5
Work server: 128.143.231.201:8080
Collection server: 128.143.199.97
Download date: April 1 22:13:34
Finished date: January 1 00:00:00

Slot 02  Empty/Deleted
Project: 8105 (Run 0, Clone 0, Gen 337), Core: a5
Work server: 128.143.231.201:8080
Collection server: 128.143.199.97
Download date: April 1 22:13:35
Finished date: January 1 00:00:00

Slot 03  Empty/Deleted
Project: 8105 (Run 0, Clone 0, Gen 337), Core: a5
Work server: 128.143.231.201:8080
Collection server: 128.143.199.97
Download date: April 1 22:13:41
Finished date: January 1 00:00:00

Slot 04  Empty/Deleted
Project: 8105 (Run 0, Clone 0, Gen 337), Core: a5
Work server: 128.143.231.201:8080
Collection server: 128.143.199.97
Download date: April 1 22:13:57
Finished date: January 1 00:00:00

Slot 05  Empty/Deleted
Project: 8105 (Run 0, Clone 0, Gen 337), Core: a5
Work server: 128.143.231.201:8080
Collection server: 128.143.199.97
Download date: April 1 22:14:03
Finished date: January 1 00:00:00

Slot 06  Empty/Deleted
Project: 8105 (Run 0, Clone 0, Gen 337), Core: a5
Work server: 128.143.231.201:8080
Collection server: 128.143.199.97
Download date: April 1 22:14:09
Finished date: January 1 00:00:00

Slot 07 *Ready    
Project: 8105 (Run 0, Clone 0, Gen 337), Core: a5
Work server: 128.143.231.201:8080
Collection server: 128.143.199.97
Download date: April 1 22:14:16
Deadline date: April 5 22:14:16

PF: 0.979206 based on last 4 slot(s)

Re: 128.143.231.201 (BA) acting up?

Posted: Tue Apr 01, 2014 11:08 pm
by bruce
I was told that a problem with bad WUs was fixed a few hours ago. Have you rebooted/reset everything and let the client start fresh? After that, if the problem persists, let us know.

The server 128.143.231.201 has very few WUs. Shouldn't you be redirected to another BA server?

Re: 128.143.231.201 (BA) acting up?

Posted: Tue Apr 01, 2014 11:32 pm
by PinHead
Ok, I powered off, switched the power supply off and let all residual energy drain. After restart I removed the work folder, deleted the queue.dat, machinedependent.dat and unitinfo.txt file.

This time, the steps seemed to have worked and I am getting an expected download size of more than 512.

Thanks bruce!

Re: 128.143.231.201 (BA) acting up?

Posted: Wed Apr 02, 2014 4:23 pm
by Macaholic
bruce wrote:I was told that a problem with bad WUs was fixed a few hours ago. Have you rebooted/reset everything and let the client start fresh? After that, if the problem persists, let us know.

The server 128.143.231.201 has very few WUs. Shouldn't you be redirected to another BA server?
Not fixed. Units are still there.

Code: Select all

[15:19:51] + Attempting to send results [April 2 15:19:51 UTC]
[15:19:51] - Reading file work/wuresults_05.dat from core
[15:19:51]   (Read 91401212 bytes from disk)
[15:19:51] Connecting to http://128.143.231.201:8080/
[15:36:56] Posted data.
[15:36:56] Initial: 0000; - Uploaded at ~87 kB/s
[15:36:56] - Averaged speed for that direction ~86 kB/s
[15:36:56] + Results successfully sent
[15:36:56] Thank you for your contribution to Folding@Home.
[15:36:56] + Number of Units Completed: 652

[15:53:59] Trying to send all finished work units
[15:53:59] + No unsent completed units remaining.
[15:53:59] - Preparing to get new work unit...
[15:53:59] Cleaning up work directory
[15:59:59] + Attempting to get work packet
[15:59:59] Passkey found
[15:59:59] - Will indicate memory of 32233 MB
[15:59:59] - Connecting to assignment server
[15:59:59] Connecting to http://assign.stanford.edu:8080/
[16:00:22] Posted data.
[16:00:22] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[16:00:22] + News From Folding@Home: Welcome to Folding@Home
[16:00:22] Loaded queue successfully.
[16:00:22] Sent data
[16:00:22] Connecting to http://128.143.231.201:8080/
[16:00:31] Posted data.
[16:00:31] Initial: 0000; - Receiving payload (expected size: 512)
[16:00:31] Conversation time very short, giving reduced weight in bandwidth avg
[16:00:31] - Downloaded at ~1 kB/s
[16:00:31] - Averaged speed for that direction ~360 kB/s
[16:00:31] + Received work.
[16:00:31] Trying to send all finished work units
[16:00:31] + No unsent completed units remaining.
[16:00:31] + Closed connections
[16:00:31] 
[16:00:31] + Processing work unit
[16:00:31] Core required: FahCore_a5.exe
[16:00:31] Core found.
[16:00:31] Working on queue slot 06 [April 2 16:00:31 UTC]
[16:00:31] + Working ...
[16:00:31] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 06 -np 48 -checkpoint 15 -forceasm -verbose -lifeline 2762 -version 634'

[16:00:32] 
[16:00:32] *------------------------------*
[16:00:32] Folding@Home Gromacs SMP Core
[16:00:32] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[16:00:32] 
[16:00:32] Preparing to commence simulation
[16:00:32] - Assembly optimizations manually forced on.
[16:00:32] - Not checking prior termination.
[16:00:32] Couldn't Decompress
[16:00:32] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[16:00:32] -Error: Couldn't update checksum variables
[16:00:32] Error: Could not open work file
[16:00:32] 
[16:00:32] Folding@home Core Shutdown: FILE_IO_ERROR
[16:00:32] CoreStatus = 75 (117)
[16:00:32] Error opening or reading from a file.
[16:00:32] Deleting current work unit & continuing...
[16:00:32] Trying to send all finished work units
[16:00:32] + No unsent completed units remaining.
[16:00:32] - Preparing to get new work unit...
[16:00:32] Cleaning up work directory
[16:06:30] + Attempting to get work packet
[16:06:30] Passkey found
[16:06:30] - Will indicate memory of 32233 MB
[16:06:30] - Connecting to assignment server
[16:06:30] Connecting to http://assign.stanford.edu:8080/
[16:06:31] Posted data.
[16:06:31] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[16:06:31] + News From Folding@Home: Welcome to Folding@Home
[16:06:31] Loaded queue successfully.
[16:06:31] Sent data
[16:06:31] Connecting to http://128.143.231.201:8080/
[16:06:31] Posted data.
[16:06:31] Initial: 0000; - Receiving payload (expected size: 512)
[16:06:31] Conversation time very short, giving reduced weight in bandwidth avg
[16:06:31] - Downloaded at ~1 kB/s
[16:06:31] - Averaged speed for that direction ~320 kB/s
[16:06:31] + Received work.
[16:06:31] + Closed connections
[16:06:36] 
[16:06:36] + Processing work unit
[16:06:36] Core required: FahCore_a5.exe
[16:06:36] Core found.
[16:06:36] Working on queue slot 07 [April 2 16:06:36 UTC]
[16:06:36] + Working ...
[16:06:36] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 07 -np 48 -checkpoint 15 -forceasm -verbose -lifeline 2762 -version 634'

[16:06:36] 
[16:06:36] *------------------------------*
[16:06:36] Folding@Home Gromacs SMP Core
[16:06:36] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[16:06:36] 
[16:06:36] Preparing to commence simulation
[16:06:36] - Assembly optimizations manually forced on.
[16:06:36] - Not checking prior termination.
[16:06:36] Couldn't Decompress
[16:06:36] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[16:06:36] -Error: Couldn't update checksum variables
[16:06:36] Error: Could not open work file
[16:06:36] 
[16:06:36] Folding@home Core Shutdown: FILE_IO_ERROR
[16:06:36] CoreStatus = 75 (117)
[16:06:36] Error opening or reading from a file.
[16:06:36] Deleting current work unit & continuing...
[16:06:36] Trying to send all finished work units
[16:06:36] + No unsent completed units remaining.
[16:06:36] - Preparing to get new work unit...
[16:06:36] Cleaning up work directory
[16:07:46] ***** Got an Activate signal (2)
[16:07:46] Killing all core threads

Folding@Home Client Shutdown.

Re: 128.143.231.201 (BA) acting up?

Posted: Thu Apr 03, 2014 9:39 am
by -alias-
I have the same problem randomly with all my 6 servers, and it have been going on for over a week now. This is what happend. When this this incident occurs I delete everything but not the config, and starting a new fresh fah, and this can run ok for several WUs before it happend again. Clip from the latest log:

Code: Select all

[06:42:53] Trying to send all finished work units
[06:42:53] + No unsent completed units remaining.
[06:42:53] - Preparing to get new work unit...
[06:42:53] Cleaning up work directory
[06:42:53] + Attempting to get work packet
[06:42:53] Passkey found
[06:42:53] - Will indicate memory of 32217 MB
[06:42:53] - Connecting to assignment server
[06:42:53] Connecting to http://assign.stanford.edu:8080/
[06:42:54] Posted data.
[06:42:54] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[06:42:54] + News From Folding@Home: Welcome to Folding@Home
[06:42:54] Loaded queue successfully.
[06:42:54] Sent data
[06:42:54] Connecting to http://128.143.231.201:8080/
[06:42:55] Posted data.
[06:42:55] Initial: 0000; - Receiving payload (expected size: 512)
[06:42:55] Conversation time very short, giving reduced weight in bandwidth avg
[06:42:55] - Downloaded at ~1 kB/s
[06:42:55] - Averaged speed for that direction ~1 kB/s
[06:42:55] + Received work.
[06:42:55] + Closed connections
[06:43:00] 
[06:43:00] + Processing work unit
[06:43:00] Core required: FahCore_a5.exe
[06:43:00] Core found.
[06:43:00] Working on queue slot 07 [April 3 06:43:00 UTC]
[06:43:00] + Working ...
[06:43:00] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 07 -np 64 -checkpoint 3 -verbose -lifeline 9401 -version 634'

[06:43:00] 
[06:43:00] *------------------------------*
[06:43:00] Folding@Home Gromacs SMP Core
[06:43:00] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[06:43:00] 
[06:43:00] Preparing to commence simulation
[06:43:00] - Looking at optimizations...
[06:43:00] - Created dyn
[06:43:00] - Files status OK
[06:43:00] Couldn't Decompress
[06:43:00] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[06:43:00] -Error: Couldn't update checksum variables
[06:43:00] Error: Could not open work file
[06:43:00] 
[06:43:00] Folding@home Core Shutdown: FILE_IO_ERROR
[06:43:00] CoreStatus = 75 (117)
[06:43:00] Error opening or reading from a file.
[06:43:00] Deleting current work unit & continuing...
[06:43:00] Trying to send all finished work units
[06:43:00] + No unsent completed units remaining.
[06:43:00] - Preparing to get new work unit...
If I am not there to stop it, this could go on for several hours, until I detects it, deletes it and start a new fresch fah again!

To me, this look like bad WUs over and over again from server http://128.143.231.201

To show my point, I print a clip from the same log that shows that Project: 8583 (Run 0, Clone 1, Gen 477) is downloaded from server http://128.143.231.202 and folding normal before the bad WU occors again, as before P8583 came down.

Code: Select all

[01:28:17] + News From Folding@Home: Welcome to Folding@Home
[01:28:17] Loaded queue successfully.
[01:28:17] Sent data
[01:28:17] Connecting to http://128.143.231.201:8080/
[01:28:18] Posted data.
[01:28:18] Initial: 0000; - Receiving payload (expected size: 512)
[01:28:18] Conversation time very short, giving reduced weight in bandwidth avg
[01:28:18] - Downloaded at ~1 kB/s
[01:28:18] - Averaged speed for that direction ~1 kB/s
[01:28:18] + Received work.
[01:28:18] + Closed connections
[01:28:23] 
[01:28:23] + Processing work unit
[01:28:23] Core required: FahCore_a5.exe
[01:28:23] Core found.
[01:28:23] Working on queue slot 01 [April 3 01:28:23 UTC]
[01:28:23] + Working ...
[01:28:23] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 01 -np 64 -checkpoint 3 -verbose -lifeline 9401 -version 634'

[01:28:23] 
[01:28:23] *------------------------------*
[01:28:23] Folding@Home Gromacs SMP Core
[01:28:23] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[01:28:23] 
[01:28:23] Preparing to commence simulation
[01:28:23] - Looking at optimizations...
[01:28:23] - Created dyn
[01:28:23] - Files status OK
[01:28:23] Couldn't Decompress
[01:28:23] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[01:28:23] -Error: Couldn't update checksum variables
[01:28:23] Error: Could not open work file
[01:28:23] 
[01:28:23] Folding@home Core Shutdown: FILE_IO_ERROR
[01:28:23] CoreStatus = 75 (117)
[01:28:23] Error opening or reading from a file.
[01:28:23] Deleting current work unit & continuing...
[01:28:23] Trying to send all finished work units
[01:28:23] + No unsent completed units remaining.
[01:28:23] - Preparing to get new work unit...
[01:28:23] Cleaning up work directory
[01:28:23] + Attempting to get work packet
[01:28:23] Passkey found
[01:28:23] - Will indicate memory of 32217 MB
[01:28:23] - Connecting to assignment server
[01:28:23] Connecting to http://assign.stanford.edu:8080/
[01:28:24] Posted data.
[01:28:24] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[01:28:24] + News From Folding@Home: Welcome to Folding@Home
[01:28:24] Loaded queue successfully.
[01:28:24] Sent data
[01:28:24] Connecting to http://128.143.231.201:8080/
[01:28:25] Posted data.
[01:28:25] Initial: 0000; - Receiving payload (expected size: 512)
[01:28:25] Conversation time very short, giving reduced weight in bandwidth avg
[01:28:25] - Downloaded at ~1 kB/s
[01:28:25] - Averaged speed for that direction ~1 kB/s
[01:28:25] + Received work.
[01:28:25] + Closed connections
[01:28:30] 
[01:28:30] + Processing work unit
[01:28:30] Core required: FahCore_a5.exe
[01:28:30] Core found.
[01:28:30] Working on queue slot 02 [April 3 01:28:30 UTC]
[01:28:30] + Working ...
[01:28:30] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 02 -np 64 -checkpoint 3 -verbose -lifeline 9401 -version 634'

[01:28:30] 
[01:28:30] *------------------------------*
[01:28:30] Folding@Home Gromacs SMP Core
[01:28:30] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[01:28:30] 
[01:28:30] Preparing to commence simulation
[01:28:30] - Looking at optimizations...
[01:28:30] - Created dyn
[01:28:30] - Files status OK
[01:28:30] Couldn't Decompress
[01:28:30] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[01:28:30] -Error: Couldn't update checksum variables
[01:28:30] Error: Could not open work file
[01:28:30] 
[01:28:30] Folding@home Core Shutdown: FILE_IO_ERROR
[01:28:30] CoreStatus = 75 (117)
[01:28:30] Error opening or reading from a file.
[01:28:30] Deleting current work unit & continuing...
[01:28:30] Trying to send all finished work units
[01:28:30] + No unsent completed units remaining.
[01:28:30] - Preparing to get new work unit...
[01:28:30] Cleaning up work directory
[01:28:30] + Attempting to get work packet
[01:28:30] Passkey found
[01:28:30] - Will indicate memory of 32217 MB
[01:28:30] - Connecting to assignment server
[01:28:30] Connecting to http://assign.stanford.edu:8080/
[01:28:31] Posted data.
[01:28:31] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[01:28:31] + News From Folding@Home: Welcome to Folding@Home
[01:28:31] Loaded queue successfully.
[01:28:31] Sent data
[01:28:31] Connecting to http://128.143.231.201:8080/
[01:28:32] Posted data.
[01:28:32] Initial: 0000; - Receiving payload (expected size: 512)
[01:28:32] Conversation time very short, giving reduced weight in bandwidth avg
[01:28:32] - Downloaded at ~1 kB/s
[01:28:32] - Averaged speed for that direction ~1 kB/s
[01:28:32] + Received work.
[01:28:32] + Closed connections
[01:28:37] 
[01:28:37] + Processing work unit
[01:28:37] Core required: FahCore_a5.exe
[01:28:37] Core found.
[01:28:37] Working on queue slot 03 [April 3 01:28:37 UTC]
[01:28:37] + Working ...
[01:28:37] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 03 -np 64 -checkpoint 3 -verbose -lifeline 9401 -version 634'

[01:28:37] 
[01:28:37] *------------------------------*
[01:28:37] Folding@Home Gromacs SMP Core
[01:28:37] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[01:28:37] 
[01:28:37] Preparing to commence simulation
[01:28:37] - Looking at optimizations...
[01:28:37] - Created dyn
[01:28:37] - Files status OK
[01:28:37] Couldn't Decompress
[01:28:37] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[01:28:37] -Error: Couldn't update checksum variables
[01:28:37] Error: Could not open work file
[01:28:37] 
[01:28:37] Folding@home Core Shutdown: FILE_IO_ERROR
[01:28:37] CoreStatus = 75 (117)
[01:28:37] Error opening or reading from a file.
[01:28:37] Deleting current work unit & continuing...
[01:28:37] Trying to send all finished work units
[01:28:37] + No unsent completed units remaining.
[01:28:37] - Preparing to get new work unit...
[01:28:37] Cleaning up work directory
[01:28:37] + Attempting to get work packet
[01:28:37] Passkey found
[01:28:37] - Will indicate memory of 32217 MB
[01:28:37] - Connecting to assignment server
[01:28:37] Connecting to http://assign.stanford.edu:8080/
[01:28:38] Posted data.
[01:28:38] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[01:28:38] + News From Folding@Home: Welcome to Folding@Home
[01:28:38] Loaded queue successfully.
[01:28:38] Sent data
[01:28:38] Connecting to http://128.143.231.201:8080/
[01:28:39] Posted data.
[01:28:39] Initial: 0000; - Receiving payload (expected size: 512)
[01:28:39] Conversation time very short, giving reduced weight in bandwidth avg
[01:28:39] - Downloaded at ~1 kB/s
[01:28:39] - Averaged speed for that direction ~1 kB/s
[01:28:39] + Received work.
[01:28:39] + Closed connections
[01:28:44] 
[01:28:44] + Processing work unit
[01:28:44] Core required: FahCore_a5.exe
[01:28:44] Core found.
[01:28:44] Working on queue slot 04 [April 3 01:28:44 UTC]
[01:28:44] + Working ...
[01:28:44] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 04 -np 64 -checkpoint 3 -verbose -lifeline 9401 -version 634'

[01:28:44] 
[01:28:44] *------------------------------*
[01:28:44] Folding@Home Gromacs SMP Core
[01:28:44] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[01:28:44] 
[01:28:44] Preparing to commence simulation
[01:28:44] - Looking at optimizations...
[01:28:44] - Created dyn
[01:28:44] - Files status OK
[01:28:44] Couldn't Decompress
[01:28:44] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[01:28:44] -Error: Couldn't update checksum variables
[01:28:44] Error: Could not open work file
[01:28:44] 
[01:28:44] Folding@home Core Shutdown: FILE_IO_ERROR
[01:28:44] CoreStatus = 75 (117)
[01:28:44] Error opening or reading from a file.
[01:28:44] Deleting current work unit & continuing...
[01:28:44] Trying to send all finished work units
[01:28:44] + No unsent completed units remaining.
[01:28:44] - Preparing to get new work unit...
[01:28:44] Cleaning up work directory
[01:28:44] + Attempting to get work packet
[01:28:44] Passkey found
[01:28:44] - Will indicate memory of 32217 MB
[01:28:44] - Connecting to assignment server
[01:28:44] Connecting to http://assign.stanford.edu:8080/
[01:40:00] - Couldn't send HTTP request to server
[01:40:00] + Could not connect to Assignment Server
[01:40:00] Connecting to http://assign2.stanford.edu:80/
[01:40:02] Posted data.
[01:40:02] Initial: 8F80; - Successful: assigned to (128.143.231.202).
[01:40:02] + News From Folding@Home: Welcome to Folding@Home
[01:40:02] Loaded queue successfully.
[01:40:02] Sent data
[01:40:02] Connecting to http://128.143.231.202:80/
[01:40:03] Posted data.
[01:40:03] Initial: 0000; - Receiving payload (expected size: 3848746)
[01:40:12] - Downloaded at ~417 kB/s
[01:40:12] - Averaged speed for that direction ~84 kB/s
[01:40:12] + Received work.
[01:40:12] + Closed connections
[01:40:17] 
[01:40:17] + Processing work unit
[01:40:17] Core required: FahCore_a3.exe
[01:40:17] Core found.
[01:40:17] Working on queue slot 05 [April 3 01:40:17 UTC]
[01:40:17] + Working ...
[01:40:17] - Calling './FahCore_a3.exe -dir work/ -nice 19 -suffix 05 -np 64 -checkpoint 3 -verbose -lifeline 9401 -version 634'

[01:40:17] 
[01:40:17] *------------------------------*
[01:40:17] Folding@Home Gromacs SMP Core
[01:40:17] Version 2.27 (Dec. 15, 2010)
[01:40:17] 
[01:40:17] Preparing to commence simulation
[01:40:17] - Looking at optimizations...
[01:40:17] - Created dyn
[01:40:17] - Files status OK
[01:40:18] - Expanded 3848234 -> 4382484 (decompressed 113.8 percent)
[01:40:18] Called DecompressByteArray: compressed_data_size=3848234 data_size=4382484, decompressed_data_size=4382484 diff=0
[01:40:18] - Digital signature verified
[01:40:18] 
[01:40:18] Project: 8583 (Run 0, Clone 1, Gen 477)
[01:40:18] 
[01:40:18] Assembly optimizations on if available.
[01:40:18] Entering M.D.
[01:40:24] Mapping NT from 64 to 64 
[01:40:25] Completed 0 out of 500000 steps  (0%)
[01:42:30] Completed 5000 out of 500000 steps  (1%)
[01:44:44] Completed 10000 out of 500000 steps  (2%)
[01:46:52] Completed 15000 out of 500000 steps  (3%)
[01:48:59] Completed 20000 out of 500000 steps  (4%)
[01:51:00] Completed 25000 out of 500000 steps  (5%)
[01:52:59] Completed 30000 out of 500000 steps  (6%)
[01:54:56] Completed 35000 out of 500000 steps  (7%)
[01:56:57] Completed 40000 out of 500000 steps  (8%)
[01:58:57] Completed 45000 out of 500000 steps  (9%)
[02:00:56] Completed 50000 out of 500000 steps  (10%)
[02:02:57] Completed 55000 out of 500000 steps  (11%)
[02:04:56] Completed 60000 out of 500000 steps  (12%)
[02:06:58] Completed 65000 out of 500000 steps  (13%)
[02:08:59] Completed 70000 out of 500000 steps  (14%)
[02:11:01] Completed 75000 out of 500000 steps  (15%)
[02:13:00] Completed 80000 out of 500000 steps  (16%)
[02:14:58] Completed 85000 out of 500000 steps  (17%)
[02:16:56] Completed 90000 out of 500000 steps  (18%)
[02:18:55] Completed 95000 out of 500000 steps  (19%)
[02:20:53] Completed 100000 out of 500000 steps  (20%)
[02:22:57] Completed 105000 out of 500000 steps  (21%)
[02:25:01] Completed 110000 out of 500000 steps  (22%)
[02:27:02] Completed 115000 out of 500000 steps  (23%)
[02:29:03] Completed 120000 out of 500000 steps  (24%)
[02:31:03] Completed 125000 out of 500000 steps  (25%)
[02:33:02] Completed 130000 out of 500000 steps  (26%)
[02:35:02] Completed 135000 out of 500000 steps  (27%)
[02:37:01] Completed 140000 out of 500000 steps  (28%)
[02:39:02] Completed 145000 out of 500000 steps  (29%)
[02:41:01] Completed 150000 out of 500000 steps  (30%)
[02:43:00] Completed 155000 out of 500000 steps  (31%)
[02:44:57] Completed 160000 out of 500000 steps  (32%)
[02:46:55] Completed 165000 out of 500000 steps  (33%)
[02:48:54] Completed 170000 out of 500000 steps  (34%)
[02:51:21] Completed 175000 out of 500000 steps  (35%)
[02:53:20] Completed 180000 out of 500000 steps  (36%)
[02:55:20] Completed 185000 out of 500000 steps  (37%)
[02:57:29] Completed 190000 out of 500000 steps  (38%)
[02:59:28] Completed 195000 out of 500000 steps  (39%)
[03:01:29] Completed 200000 out of 500000 steps  (40%)
[03:03:30] Completed 205000 out of 500000 steps  (41%)
[03:05:35] Completed 210000 out of 500000 steps  (42%)
[03:07:37] Completed 215000 out of 500000 steps  (43%)
[03:09:38] Completed 220000 out of 500000 steps  (44%)
[03:11:36] Completed 225000 out of 500000 steps  (45%)
[03:13:35] Completed 230000 out of 500000 steps  (46%)
[03:15:36] Completed 235000 out of 500000 steps  (47%)
[03:17:36] Completed 240000 out of 500000 steps  (48%)
[03:19:36] Completed 245000 out of 500000 steps  (49%)
[03:21:36] Completed 250000 out of 500000 steps  (50%)
[03:23:42] Completed 255000 out of 500000 steps  (51%)
[03:25:41] Completed 260000 out of 500000 steps  (52%)
[03:27:42] Completed 265000 out of 500000 steps  (53%)
[03:29:43] Completed 270000 out of 500000 steps  (54%)
[03:31:41] Completed 275000 out of 500000 steps  (55%)
[03:33:42] Completed 280000 out of 500000 steps  (56%)
[03:35:41] Completed 285000 out of 500000 steps  (57%)
[03:37:40] Completed 290000 out of 500000 steps  (58%)
[03:39:41] Completed 295000 out of 500000 steps  (59%)
[03:41:40] Completed 300000 out of 500000 steps  (60%)
[03:43:44] Completed 305000 out of 500000 steps  (61%)
[03:45:46] Completed 310000 out of 500000 steps  (62%)
[03:47:51] Completed 315000 out of 500000 steps  (63%)
[03:49:52] Completed 320000 out of 500000 steps  (64%)
[03:51:54] Completed 325000 out of 500000 steps  (65%)
[03:53:53] Completed 330000 out of 500000 steps  (66%)
[03:55:52] Completed 335000 out of 500000 steps  (67%)
[03:57:58] Completed 340000 out of 500000 steps  (68%)
[04:00:35] Completed 345000 out of 500000 steps  (69%)
[04:02:35] Completed 350000 out of 500000 steps  (70%)
[04:04:36] Completed 355000 out of 500000 steps  (71%)
[04:06:35] Completed 360000 out of 500000 steps  (72%)
[04:08:39] Completed 365000 out of 500000 steps  (73%)
[04:10:37] Completed 370000 out of 500000 steps  (74%)
[04:12:37] Completed 375000 out of 500000 steps  (75%)
[04:14:39] Completed 380000 out of 500000 steps  (76%)
[04:17:02] Completed 385000 out of 500000 steps  (77%)
[04:19:01] Completed 390000 out of 500000 steps  (78%)
[04:21:03] Completed 395000 out of 500000 steps  (79%)
[04:23:05] Completed 400000 out of 500000 steps  (80%)
[04:25:07] Completed 405000 out of 500000 steps  (81%)
[04:27:09] Completed 410000 out of 500000 steps  (82%)
[04:29:09] Completed 415000 out of 500000 steps  (83%)
[04:31:09] Completed 420000 out of 500000 steps  (84%)
[04:33:10] Completed 425000 out of 500000 steps  (85%)
[04:35:16] Completed 430000 out of 500000 steps  (86%)
[04:37:17] Completed 435000 out of 500000 steps  (87%)
[04:39:19] Completed 440000 out of 500000 steps  (88%)
[04:41:31] Completed 445000 out of 500000 steps  (89%)
[04:43:35] Completed 450000 out of 500000 steps  (90%)
[04:46:00] Completed 455000 out of 500000 steps  (91%)
[04:48:01] Completed 460000 out of 500000 steps  (92%)
[04:50:00] Completed 465000 out of 500000 steps  (93%)
[04:52:00] Completed 470000 out of 500000 steps  (94%)
[04:53:59] Completed 475000 out of 500000 steps  (95%)
[04:55:59] Completed 480000 out of 500000 steps  (96%)
[04:58:00] Completed 485000 out of 500000 steps  (97%)
[05:00:00] Completed 490000 out of 500000 steps  (98%)
[05:02:01] Completed 495000 out of 500000 steps  (99%)
[05:04:04] Completed 500000 out of 500000 steps  (100%)
[05:04:06] DynamicWrapper: Finished Work Unit: sleep=10000
[05:04:16] 
[05:04:16] Finished Work Unit:
[05:04:16] - Reading up to 8055024 from "work/wudata_05.trr": Read 8055024
[05:04:16] trr file hash check passed.
[05:04:16] edr file hash check passed.
[05:04:16] logfile size: 61112
[05:04:16] Leaving Run
[05:04:18] - Writing 8152968 bytes of core data to disk...
[05:04:19] Done: 8152456 -> 7529428 (compressed to 92.3 percent)
[05:04:19]   ... Done.
[05:04:20] - Shutting down core
[05:04:20] 
[05:04:20] Folding@home Core Shutdown: FINISHED_UNIT
[05:04:20] CoreStatus = 64 (100)
[05:04:20] Unit 5 finished with 99 percent of time to deadline remaining.
[05:04:20] Updated performance fraction: 0.848119
[05:04:20] Sending work to server
[05:04:20] Project: 8583 (Run 0, Clone 1, Gen 477)


[05:04:20] + Attempting to send results [April 3 05:04:20 UTC]
[05:04:20] - Reading file work/wuresults_05.dat from core
[05:04:20]   (Read 7529940 bytes from disk)
[05:04:20] Connecting to http://128.143.231.202:8080/
[05:04:34] Posted data.
[05:04:34] Initial: 0000; - Uploaded at ~525 kB/s
[05:04:34] - Averaged speed for that direction ~556 kB/s
[05:04:34] + Results successfully sent
[05:04:34] Thank you for your contribution to Folding@Home.
[05:04:34] + Number of Units Completed: 857

[05:04:34] Trying to send all finished work units
[05:04:34] + No unsent completed units remaining.
[05:04:34] - Preparing to get new work unit...
[05:04:34] Cleaning up work directory
[05:04:34] + Attempting to get work packet
[05:04:34] Passkey found
[05:04:34] - Will indicate memory of 32217 MB
[05:04:34] - Connecting to assignment server
[05:04:34] Connecting to http://assign.stanford.edu:8080/
[05:04:36] Posted data.
[05:04:36] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[05:04:36] + News From Folding@Home: Welcome to Folding@Home
[05:04:36] Loaded queue successfully.
[05:04:36] Sent data
[05:04:36] Connecting to http://128.143.231.201:8080/
[05:04:36] Posted data.
[05:04:36] Initial: 0000; - Receiving payload (expected size: 512)
[05:04:36] Conversation time very short, giving reduced weight in bandwidth avg
[05:04:36] - Downloaded at ~1 kB/s
[05:04:36] - Averaged speed for that direction ~75 kB/s
[05:04:36] + Received work.
[05:04:36] Trying to send all finished work units
[05:04:36] + No unsent completed units remaining.
[05:04:36] + Closed connections
[05:04:36] 
[05:04:36] + Processing work unit
[05:04:36] Core required: FahCore_a5.exe
[05:04:36] Core found.
[05:04:36] Working on queue slot 06 [April 3 05:04:36 UTC]
[05:04:36] + Working ...
[05:04:36] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 06 -np 64 -checkpoint 3 -verbose -lifeline 9401 -version 634'

[05:04:36] 
[05:04:36] *------------------------------*
[05:04:36] Folding@Home Gromacs SMP Core
[05:04:36] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[05:04:36] 
[05:04:36] Preparing to commence simulation
[05:04:36] - Looking at optimizations...
[05:04:36] - Created dyn
[05:04:36] - Files status OK
[05:04:36] Couldn't Decompress
[05:04:36] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[05:04:36] -Error: Couldn't update checksum variables
[05:04:36] Error: Could not open work file
[05:04:36] 
[05:04:36] Folding@home Core Shutdown: FILE_IO_ERROR
[05:04:37] CoreStatus = 75 (117)
[05:04:37] Error opening or reading from a file.
[05:04:37] Deleting current work unit & continuing...
[05:04:37] Trying to send all finished work units
[05:04:37] + No unsent completed units remaining.
[05:04:37] - Preparing to get new work unit...
[05:04:37] Cleaning up work directory
[05:04:37] + Attempting to get work packet

Re: 128.143.231.201 (BA) acting up?

Posted: Thu Apr 03, 2014 10:55 am
by EXT64
I am also getting this randomly as well. Also seemingly random, sometimes after some failures it will pickup an SMP, then start failing again. Not a big deal as the SMP run pretty fast, so I just wait until they are completed and delete the previously mentioned files.

Re: 128.143.231.201 (BA) acting up?

Posted: Thu Apr 03, 2014 4:33 pm
by Nathan_P
I've given up, until its fixed both my machines are on SMP, which is still nice at 300k PPD and a lot less strain on the net connection

Re: 128.143.231.201 (BA) acting up?

Posted: Thu Apr 03, 2014 6:42 pm
by -alias-
It does not seem to be a priority to fix this problem at PG, when no one takes the time to comment on the issue properly? I think I give up also, but I choose to shut them all down if this does not stop very soon, or maybe this is a way to get rid of us BA-folders sooner.

Re: 128.143.231.201 (BA) acting up?

Posted: Thu Apr 03, 2014 7:17 pm
by bollix47
I've had no problems with my bigadv setup but I am using v7. No bad WUs and no switching to SMP. Is anyone that is using v7 having a problem? It might help if we can narrow the focus for PG.

Re: 128.143.231.201 (BA) acting up?

Posted: Thu Apr 03, 2014 7:36 pm
by Nicolas_orleans
(until now) no issues with v6 + Langouste