Project 5801 issues. [Should be Offline]
Moderators: Site Moderators, FAHC Science Team
- 
				VijayPande
- Pande Group Member
- Posts: 2058
- Joined: Fri Nov 30, 2007 6:25 am
- Location: Stanford
Re: Project 5801 issues.
We've taken these off line until we can see what's up.
			
			
									
						
										
						- 
				theo343
- Posts: 74
- Joined: Thu Jul 03, 2008 12:43 pm
- Hardware configuration: Home Network:
 ADSL 12Mbps - 807 / 14.439
 USRobotics 8port 10/100/1000
 All computers connected with TP CAT5
 HC1:
 E8400@4GHz(8*500)
 4GB PC2-8000
 9800GX2@stock
 removed - (8800GTS-512MB@724/1810/972)
 Mist 600W rev2
 WinXP Pro 32bit SP3
 FW180.43
 1xGPU2 v6.20 R1 Core 1.18/1.19
 1xSMP v6.23 Beta R1
 HC2:
 AM2+ x4 Phenom 9950@3GHz
 2GB Crucial Ballistix PC2-5300
 3x8800GS-384MB
 Corsair TX 750W
 WinXP Pro 32bit SP3
 FW178.24
 3xGPU2 v6.20 R1 Core 1.18
 1xSMP v6.23 Beta R1
 HC3: (not folding atm and outdated)
 X2 6000+@Stock
 HD3850OC-512MB@783MHz
 2GB PC6400
 PSU 420W
 WinXP Pro 32bit SP3
 CCC8.6
 1*GPU2 v6.12 Beta8 Core 1.04
 1*CPU 5.04
 Office Network:
 SDSL 20Mbps
 100/1000
 All computers connected with TP CAT5
 OC1:
 E6750@stock
 8800GT-256MB@702/1755/900
 4GB PC6400
 Tagan 480W
 Vista Ultimate 32bit SP1
 FW 178.24
 1xGPU2 v6.20 R1 Core 1.15
 1xSMP v6.23 Beta R1
 OC2:
 E8400@stock
 2x8800GT-512MB@stock
 8GB PC3-8500
 Corsair HX520
 Vista Business 64bit
 FW178.24
 2xGPU2 v6.20 R1 Core 1.15
 1xSMP v6.23 Beta R1
- Location: Norway
Re: Project 5801 issues.
Here is my log:
			
			
									
						
										
						Code: Select all
[22:43:32] Project: 5801 (Run 1, Clone 43, Gen 0)
[22:43:32] 
[22:43:32] Assembly optimizations on if available.
[22:43:32] Entering M.D.
[22:43:38] mdrun_gpu returned 
[22:43:38] Going to send back what have done -- stepsTotalG=0
[22:43:38] Work fraction=0.0000 steps=0.
[22:43:42] logfile size=0 infoLength=0 edr=0 trr=25
[22:43:42] - Writing 637 bytes of core data to disk...
[22:43:42] Done: 125 -> 124 (compressed to 99.2 percent)
[22:43:42]   ... Done.
[22:43:42] 
[22:43:42] Folding@home Core Shutdown: UNSTABLE_MACHINE
[22:43:46] CoreStatus = 7A (122)
[22:43:46] Sending work to server
[22:43:46] Project: 5801 (Run 1, Clone 43, Gen 0)
[22:43:46] - Read packet limit of 540015616... Set to 524286976.Re: Project 5801 issues.
why does it force me to connect to 171.67.108.11 even thought i URL blocked that server very annoying it should goto another server after so many fails 
scrach that the URL BLock on 171.67.108.11 /.11:8080 /.11:80 worked after failing 4 connections to it redirected me to the 5016 server
will remove the block from that server when project is removed 100% from it going sleep now, Server code could do with been tweeked an little to detect when Every work unit is failing and stop handing them out or at lest the server that hands out the project server Should Not keep handing out the same server on every fail
			
			
													scrach that the URL BLock on 171.67.108.11 /.11:8080 /.11:80 worked after failing 4 connections to it redirected me to the 5016 server
will remove the block from that server when project is removed 100% from it going sleep now, Server code could do with been tweeked an little to detect when Every work unit is failing and stop handing them out or at lest the server that hands out the project server Should Not keep handing out the same server on every fail
					Last edited by leexgx on Wed Oct 29, 2008 12:14 am, edited 1 time in total.
									
			
						
							- 
				toTOW
- Site Moderator
- Posts: 6501
- Joined: Sun Dec 02, 2007 10:38 am
- Location: Bordeaux, France
- Contact:
Re: Project 5801 issues.
Thank you.VijayPande wrote:We've taken these off line until we can see what's up.
I can now got to bed in peace of mind

Re: Project 5801 issues.
This is getting ridicules as far as Im concerned. Get it fixed already.
			
			
									
						
										
						Re: Project 5801 issues. [Should be Offline]
Glad these have been taken away - have had to restart 3 clients due to them
			
			
									
						
										
						Re: Project 5801 issues. [Should be Offline]
same problem here 2 days now
on 9800GTX
			
			
									
						
										
						on 9800GTX

- 
				theo343
- Posts: 74
- Joined: Thu Jul 03, 2008 12:43 pm
- Hardware configuration: Home Network:
 ADSL 12Mbps - 807 / 14.439
 USRobotics 8port 10/100/1000
 All computers connected with TP CAT5
 HC1:
 E8400@4GHz(8*500)
 4GB PC2-8000
 9800GX2@stock
 removed - (8800GTS-512MB@724/1810/972)
 Mist 600W rev2
 WinXP Pro 32bit SP3
 FW180.43
 1xGPU2 v6.20 R1 Core 1.18/1.19
 1xSMP v6.23 Beta R1
 HC2:
 AM2+ x4 Phenom 9950@3GHz
 2GB Crucial Ballistix PC2-5300
 3x8800GS-384MB
 Corsair TX 750W
 WinXP Pro 32bit SP3
 FW178.24
 3xGPU2 v6.20 R1 Core 1.18
 1xSMP v6.23 Beta R1
 HC3: (not folding atm and outdated)
 X2 6000+@Stock
 HD3850OC-512MB@783MHz
 2GB PC6400
 PSU 420W
 WinXP Pro 32bit SP3
 CCC8.6
 1*GPU2 v6.12 Beta8 Core 1.04
 1*CPU 5.04
 Office Network:
 SDSL 20Mbps
 100/1000
 All computers connected with TP CAT5
 OC1:
 E6750@stock
 8800GT-256MB@702/1755/900
 4GB PC6400
 Tagan 480W
 Vista Ultimate 32bit SP1
 FW 178.24
 1xGPU2 v6.20 R1 Core 1.15
 1xSMP v6.23 Beta R1
 OC2:
 E8400@stock
 2x8800GT-512MB@stock
 8GB PC3-8500
 Corsair HX520
 Vista Business 64bit
 FW178.24
 2xGPU2 v6.20 R1 Core 1.15
 1xSMP v6.23 Beta R1
- Location: Norway
Re: Project 5801 issues. [Should be Offline]
pitty i cant reach half of my GPU clients, but I will get to them in the morning.
			
			
									
						
										
						Re: Project 5801 issues. [Should be Offline]
server was handing out them work units an little i remove that block in 24hrs (or when ever i come back home)
			
			
									
						
							- 
				VijayPande
- Pande Group Member
- Posts: 2058
- Joined: Fri Nov 30, 2007 6:25 am
- Location: Stanford
Re: Project 5801 issues. [Should be Offline]
Sorry about the really nasty problem on this one.  It was definitely strange since these WU's were QA'd before.  I think this may be an issue where they were QA'd on an earlier core and 1.15 is causing issues.
			
			
									
						
										
						- 
				MoneyGuyBK
- Posts: 179
- Joined: Sun Dec 02, 2007 6:40 am
- Location: Team_XPS ..... OC, S. Calif
Re: Project 5801 issues.
Welcome to the party toTOW ... I mean what feels like a funeral !!!
God only knows how much PpD I lost and how much benefit Humanity missed today.

I am surprised that:
1) F@H released this WU in such a bad state
However, more stumped that:
2) F@H has not chimed in here officially after 7 Pages of comments
EDIT/Added..... I see VP chimed in on the cause while I was writing.... Thanx VP
Peace
			
			
									
						
							I have finally stopped getting the 5801s, did not do anything else except a restart, I got all 5506s....toTOW wrote:I feel alone, depressed and helpless
God only knows how much PpD I lost and how much benefit Humanity missed today.

I am surprised that:
1) F@H released this WU in such a bad state

However, more stumped that:
2) F@H has not chimed in here officially after 7 Pages of comments

EDIT/Added..... I see VP chimed in on the cause while I was writing.... Thanx VP
Peace
T.E.A.M. “Together Everyone Accomplishes Miracles!”

OC, S. California ... God Bless All
			
						OC, S. California ... God Bless All
- 
				Insidious
Re: Project 5801 issues. [Should be Offline]
Thanks Dr. Pande,VijayPande wrote:Sorry about the really nasty problem on this one. It was definitely strange since these WU's were QA'd before. I think this may be an issue where they were QA'd on an earlier core and 1.15 is causing issues.
They are dying on 1.18 too.
-Sid
Re: Project 5801 issues. [Should be Offline]
Well, there lies part of the problem... poor QA.  Can you honestly say that not even one p5801 WU was not run on the most recent core before deploying them?
I'm a software developer... I won't go into details about the software I develop but suffice it to say an engineering design engine is the meat of the software. What was done here is akin to us developing an updated engine, then not running a single piece of data through it before releasing it out into the wild. Then when it fails we'll just shrug our shoulders and say... "Well, it worked on the previous version."
I understand resources are limited... failures happen... and the software is beta. As long as lessons are learned and processes are improved, then that's all we can ask for.
This recent string of debacles with the GPU2 core and WUs have really cast a shadow on what was, IMO, the best rollout in FAH history.
			
			
									
						
										
						I'm a software developer... I won't go into details about the software I develop but suffice it to say an engineering design engine is the meat of the software. What was done here is akin to us developing an updated engine, then not running a single piece of data through it before releasing it out into the wild. Then when it fails we'll just shrug our shoulders and say... "Well, it worked on the previous version."
I understand resources are limited... failures happen... and the software is beta. As long as lessons are learned and processes are improved, then that's all we can ask for.
This recent string of debacles with the GPU2 core and WUs have really cast a shadow on what was, IMO, the best rollout in FAH history.
- 
				VijayPande
- Pande Group Member
- Posts: 2058
- Joined: Fri Nov 30, 2007 6:25 am
- Location: Stanford
Re: Project 5801 issues.
PS In case you're curious:
			
			
									
						
										
						This was beta tested before (this was a project # change due to a move onto a new server -- which was done to try to keep work around while the CS servers were down).MoneyGuyBK wrote: I am surprised that:
1) F@H released this WU in such a bad state
We keep an eye on the forum, but the first post was just a few hours ago. Due to staff having other responsibilities, our response will typically be on the hours time scale not minutes time scales for issues like this. I wish it could be faster, but that's what we're staffed to do at the moment.However, more stumped that:
2) F@H has not chimed in here officially after 7 Pages of comments

