Project 5801 issues. [Should be Offline]
Moderators: Site Moderators, FAHC Science Team
-
- Pande Group Member
- Posts: 2058
- Joined: Fri Nov 30, 2007 6:25 am
- Location: Stanford
Re: Project 5801 issues.
We've taken these off line until we can see what's up.
-
- Posts: 74
- Joined: Thu Jul 03, 2008 12:43 pm
- Hardware configuration: Home Network:
ADSL 12Mbps - 807 / 14.439
USRobotics 8port 10/100/1000
All computers connected with TP CAT5
HC1:
E8400@4GHz(8*500)
4GB PC2-8000
9800GX2@stock
removed - (8800GTS-512MB@724/1810/972)
Mist 600W rev2
WinXP Pro 32bit SP3
FW180.43
1xGPU2 v6.20 R1 Core 1.18/1.19
1xSMP v6.23 Beta R1
HC2:
AM2+ x4 Phenom 9950@3GHz
2GB Crucial Ballistix PC2-5300
3x8800GS-384MB
Corsair TX 750W
WinXP Pro 32bit SP3
FW178.24
3xGPU2 v6.20 R1 Core 1.18
1xSMP v6.23 Beta R1
HC3: (not folding atm and outdated)
X2 6000+@Stock
HD3850OC-512MB@783MHz
2GB PC6400
PSU 420W
WinXP Pro 32bit SP3
CCC8.6
1*GPU2 v6.12 Beta8 Core 1.04
1*CPU 5.04
Office Network:
SDSL 20Mbps
100/1000
All computers connected with TP CAT5
OC1:
E6750@stock
8800GT-256MB@702/1755/900
4GB PC6400
Tagan 480W
Vista Ultimate 32bit SP1
FW 178.24
1xGPU2 v6.20 R1 Core 1.15
1xSMP v6.23 Beta R1
OC2:
E8400@stock
2x8800GT-512MB@stock
8GB PC3-8500
Corsair HX520
Vista Business 64bit
FW178.24
2xGPU2 v6.20 R1 Core 1.15
1xSMP v6.23 Beta R1
- Location: Norway
Re: Project 5801 issues.
Here is my log:
[22:43:32] Project: 5801 (Run 1, Clone 43, Gen 0)
[22:43:32]
[22:43:32] Assembly optimizations on if available.
[22:43:32] Entering M.D.
[22:43:38] mdrun_gpu returned
[22:43:38] Going to send back what have done -- stepsTotalG=0
[22:43:38] Work fraction=0.0000 steps=0.
[22:43:42] logfile size=0 infoLength=0 edr=0 trr=25
[22:43:42] - Writing 637 bytes of core data to disk...
[22:43:42] Done: 125 -> 124 (compressed to 99.2 percent)
[22:43:42] ... Done.
[22:43:42]
[22:43:42] Folding@home Core Shutdown: UNSTABLE_MACHINE
[22:43:46] CoreStatus = 7A (122)
[22:43:46] Sending work to server
[22:43:46] Project: 5801 (Run 1, Clone 43, Gen 0)
[22:43:46] - Read packet limit of 540015616... Set to 524286976.
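For anyone with several clients to check, the failure signature in the log above (a core shutdown reason followed by a hex CoreStatus with its decimal value in parentheses) can be picked out with a short script. This is only a rough sketch: the patterns are copied from the lines in this one log, and the function name, the sample string, and the returned dict layout are my own assumptions, not anything from the client.

```python
import re

# Patterns mirror the log lines quoted above; nothing here comes from the client itself.
PROJECT_RE = re.compile(r"Project: (\d+) \(Run (\d+), Clone (\d+), Gen (\d+)\)")
SHUTDOWN_RE = re.compile(r"Folding@home Core Shutdown: (\w+)")
STATUS_RE = re.compile(r"CoreStatus = ([0-9A-Fa-f]+) \((\d+)\)")

SAMPLE_LOG = """\
[22:43:32] Project: 5801 (Run 1, Clone 43, Gen 0)
[22:43:38] mdrun_gpu returned
[22:43:42] Folding@home Core Shutdown: UNSTABLE_MACHINE
[22:43:46] CoreStatus = 7A (122)
[22:43:46] Project: 5801 (Run 1, Clone 43, Gen 0)
"""


def summarize_failures(log_text):
    """Return one dict per core shutdown line found in the log text."""
    results = []
    current_unit = None
    for line in log_text.splitlines():
        match = PROJECT_RE.search(line)
        if match:
            # Remember the most recent (project, run, clone, gen) tuple.
            current_unit = tuple(int(g) for g in match.groups())
            continue
        match = SHUTDOWN_RE.search(line)
        if match:
            results.append({"unit": current_unit,
                            "shutdown": match.group(1),
                            "status": None})
            continue
        match = STATUS_RE.search(line)
        if match and results and results[-1]["status"] is None:
            # The hex code is parsed; the decimal in parentheses is the same value.
            results[-1]["status"] = int(match.group(1), 16)
    return results


if __name__ == "__main__":
    for failure in summarize_failures(SAMPLE_LOG):
        print(failure)
```

Running it over a full FAHlog would list each unit that ended in a core shutdown, which makes it easier to see whether only 5801 units are affected.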
Re: Project 5801 issues.
Why does it force me to connect to 171.67.108.11 even though I URL-blocked that server? Very annoying. It should go to another server after so many failures.
Scratch that: the URL block on 171.67.108.11 /.11:8080 /.11:80 worked. After 4 failed connections it redirected me to the 5016 server.
I will remove the block from that server when the project is removed 100% from it. Going to sleep now. The server code could do with being tweaked a little to detect when every work unit is failing and stop handing them out, or at least the server that hands out the project server should not keep handing out the same server on every failure.
Last edited by leexgx on Wed Oct 29, 2008 12:14 am, edited 1 time in total.
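The suggestion in this post (stop handing out a project once nearly every returned unit is failing) could look roughly like the sketch below. It is purely illustrative and assumes nothing about how the real assignment or work servers are written: the class name, the thresholds, and the idea of recording a success/failure flag per returned unit are all hypothetical.

```python
from collections import defaultdict, deque


class ProjectFailureGate:
    """Hypothetical gate: pause a project when recent returns are almost all failures."""

    def __init__(self, window=50, min_results=20, max_failure_rate=0.9):
        self.min_results = min_results          # do not judge on too few returns
        self.max_failure_rate = max_failure_rate
        # Keep only the last `window` outcomes per project.
        self._recent = defaultdict(lambda: deque(maxlen=window))
        self._paused = set()

    def record(self, project, succeeded):
        """Record one returned work unit and pause the project if it looks broken."""
        results = self._recent[project]
        results.append(bool(succeeded))
        if len(results) >= self.min_results:
            failure_rate = results.count(False) / len(results)
            if failure_rate >= self.max_failure_rate:
                self._paused.add(project)

    def can_assign(self, project):
        """True if the project may still be handed out."""
        return project not in self._paused

    def resume(self, project):
        """Manually re-enable a project, for example after the units are fixed."""
        self._paused.discard(project)
        self._recent[project].clear()


if __name__ == "__main__":
    gate = ProjectFailureGate(window=10, min_results=5)
    for _ in range(5):
        gate.record(5801, succeeded=False)
    print(gate.can_assign(5801))
    print(gate.can_assign(5506))
```

A minimum sample size is included so that a handful of failures from one unstable machine would not pause a healthy project; where to set those numbers is a judgment call.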
-
- Site Moderator
- Posts: 6359
- Joined: Sun Dec 02, 2007 10:38 am
- Location: Bordeaux, France
Re: Project 5801 issues.
VijayPande wrote: We've taken these off line until we can see what's up.
Thank you.
I can now go to bed with peace of mind.
Re: Project 5801 issues.
This is getting ridiculous as far as I'm concerned. Get it fixed already.
Re: Project 5801 issues. [Should be Offline]
Glad these have been taken away - have had to restart 3 clients due to them
Re: Project 5801 issues. [Should be Offline]
Same problem here for 2 days now, on a 9800GTX.
-
- Posts: 74
- Joined: Thu Jul 03, 2008 12:43 pm
- Location: Norway
Re: Project 5801 issues. [Should be Offline]
Pity I can't reach half of my GPU clients, but I will get to them in the morning.
Re: Project 5801 issues. [Should be Offline]
The server was handing out those work units a little. I'll remove that block in 24 hrs (or whenever I come back home).
-
- Pande Group Member
- Posts: 2058
- Joined: Fri Nov 30, 2007 6:25 am
- Location: Stanford
Re: Project 5801 issues. [Should be Offline]
Sorry about the really nasty problem on this one. It was definitely strange since these WU's were QA'd before. I think this may be an issue where they were QA'd on an earlier core and 1.15 is causing issues.
-
- Posts: 179
- Joined: Sun Dec 02, 2007 6:40 am
- Location: Team_XPS ..... OC, S. Calif
Re: Project 5801 issues.
Welcome to the party toTOW ... I mean what feels like a funeral !!!
God only knows how much PpD I lost and how much benefit Humanity missed today.
I am surprised that:
1) F@H released this WU in such a bad state
However, more stumped that:
2) F@H has not chimed in here officially after 7 Pages of comments
EDIT/Added..... I see VP chimed in on the cause while I was writing.... Thanx VP
Peace
toTOW wrote: I feel alone, depressed and helpless
I have finally stopped getting the 5801s. I did not do anything else except a restart, and I got all 5506s....
T.E.A.M. “Together Everyone Accomplishes Miracles!”
OC, S. California ... God Bless All
Re: Project 5801 issues. [Should be Offline]
VijayPande wrote: Sorry about the really nasty problem on this one. It was definitely strange since these WU's were QA'd before. I think this may be an issue where they were QA'd on an earlier core and 1.15 is causing issues.
Thanks Dr. Pande,
They are dying on 1.18 too.
-Sid
Re: Project 5801 issues. [Should be Offline]
Well, there lies part of the problem... poor QA. Can you honestly say that not even one p5801 WU was run on the most recent core before deploying them?
I'm a software developer... I won't go into details about the software I develop but suffice it to say an engineering design engine is the meat of the software. What was done here is akin to us developing an updated engine, then not running a single piece of data through it before releasing it out into the wild. Then when it fails we'll just shrug our shoulders and say... "Well, it worked on the previous version."
I understand resources are limited... failures happen... and the software is beta. As long as lessons are learned and processes are improved, then that's all we can ask for.
This recent string of debacles with the GPU2 core and WUs has really cast a shadow on what was, IMO, the best rollout in FAH history.
-
- Pande Group Member
- Posts: 2058
- Joined: Fri Nov 30, 2007 6:25 am
- Location: Stanford
Re: Project 5801 issues.
PS In case you're curious:
MoneyGuyBK wrote: I am surprised that:
1) F@H released this WU in such a bad state
This was beta tested before (this was a project # change due to a move onto a new server -- which was done to try to keep work around while the CS servers were down).
MoneyGuyBK wrote: However, more stumped that:
2) F@H has not chimed in here officially after 7 Pages of comments
We keep an eye on the forum, but the first post was just a few hours ago. Due to staff having other responsibilities, our response will typically be on a time scale of hours, not minutes, for issues like this. I wish it could be faster, but that's what we're staffed to do at the moment.