bad wu - Project: 6882, run 808 clone 0 gen 13

Moderators: Site Moderators, FAHC Science Team

Post Reply
betchern0t
Posts: 5
Joined: Thu Jul 07, 2011 8:24 am

bad wu - Project: 6882, run 808 clone 0 gen 13

Post by betchern0t »

Hi,
I have a bad wu:

project 6882, run 808 clone 0 gen 13

It completes 145000 steps of 250000 runs the next chunk for a while then the terminal it is running in then disappears. Log states: corestatus 8B (139) Client core communications error 0x8b this is a sign of more serious problems shutting down. I caught a glimpse as the terminal window was disappearing of a segmentation fault error.

Box is brand new intel i5-2500 running ubuntu 11.04 in a KVM vm. The folding client is running in the VM. Please advise or point me at docco on what to do next.

Cheers Paul
bruce
Posts: 20822
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: bad wu -

Post by bruce »

Welcome to foldingforum.org, betchern0t.

It's unlikely that you have a bad WU, but we can't be certain yet. Error 0x8b is not a FAH error but rather an unstable system detected by the OS. It's generally due to overclocking/overheating or a memory failure. FAH only shuts down due to "more serious problems" when the errors come from the OS detecting hardware that is malfunctioning.

If you're overclocking, what benchmark did you use to insure stability? Have you thoroughly tested RAM? Is the VM you're running somehow interfering with the processing of the WU?

The stats server has no reports associated with this WU yet, but that's not unexpected. Your client is not uploading an error report and project 6882 has a Preferred Deadline of 14 days. Once your assignment times out, it will be assigned to someone else for completion. I'll mark this topic for followup.
betchern0t
Posts: 5
Joined: Thu Jul 07, 2011 8:24 am

Re: bad wu -

Post by betchern0t »

Hi Bruce,
thanks for the response. I am not overclocking. The temps are in the normal range - 50 -60 degrees C with a rated max of 72.5. CPU core is generally 100% while the job is running. To check that contention wasn't a problem removed the other load - a series of boinc projects with the same result. In any cases I have assigned two of my four cores to the vm.

I am using the VM to supply a fixed resource for these kind of things rather than using the idle approach

I have completed about 5 or 6 jobs for folding at home so far. One thing I have just thought about is to up the RAM for the VM - currently sitting on 512mb. I will try that and report back. I haven't specifically run memory tests on the RAM - there is 8Gb - but may schedule for overnight.

Is there any way to cancel this WU so that I can continue? Early days and I am keen to do something useful.

Cheers Paul
Grandpa_01
Posts: 1122
Joined: Wed Mar 04, 2009 7:36 am
Hardware configuration: 3 - Supermicro H8QGi-F AMD MC 6174=144 cores 2.5Ghz, 96GB G.Skill DDR3 1333Mhz Ubuntu 10.10
2 - Asus P6X58D-E i7 980X 4.4Ghz 6GB DDR3 2000 A-Data 64GB SSD Ubuntu 10.10
1 - Asus Rampage Gene III 17 970 4.3Ghz DDR3 2000 2-500GB Segate 7200.11 0-Raid Ubuntu 10.10
1 - Asus G73JH Laptop i7 740QM 1.86Ghz ATI 5870M

Re: bad wu -

Post by Grandpa_01 »

If it is continually failing at the same spot it is possible it is a bad WU it is also possible it is just a tough spot in the WU that needs more resources than you are alloeing. But bruce is right error 0x8b is usually an hardware error I have found that it is usually associated with memory. 512 mb is not enough for a VM and folding I would give the VM 2GB of memory if you have 8GB available.
Image
2 - SM H8QGi-F AMD 6xxx=112 cores @ 3.2 & 3.9Ghz
5 - SM X9QRI-f+ Intel 4650 = 320 cores @ 3.15Ghz
2 - I7 980X 4.4Ghz 2-GTX680
1 - 2700k 4.4Ghz GTX680
Total = 464 cores folding
betchern0t
Posts: 5
Joined: Thu Jul 07, 2011 8:24 am

Re: bad wu -

Post by betchern0t »

Hi Grandpa,
looks like you nailed it. It has now completed that section of the WU. VM has all four cores and 3Gb RAM.

Cheers Paul
betchern0t
Posts: 5
Joined: Thu Jul 07, 2011 8:24 am

Re: bad wu -

Post by betchern0t »

crashed again on 155000....
Grandpa_01
Posts: 1122
Joined: Wed Mar 04, 2009 7:36 am
Hardware configuration: 3 - Supermicro H8QGi-F AMD MC 6174=144 cores 2.5Ghz, 96GB G.Skill DDR3 1333Mhz Ubuntu 10.10
2 - Asus P6X58D-E i7 980X 4.4Ghz 6GB DDR3 2000 A-Data 64GB SSD Ubuntu 10.10
1 - Asus Rampage Gene III 17 970 4.3Ghz DDR3 2000 2-500GB Segate 7200.11 0-Raid Ubuntu 10.10
1 - Asus G73JH Laptop i7 740QM 1.86Ghz ATI 5870M

Re: bad wu -

Post by Grandpa_01 »

If it made it past the previous failing stage it is most likely not a bad WU.
VM can be fitful at times. Are you using v6 or v7 of FAH
Image
2 - SM H8QGi-F AMD 6xxx=112 cores @ 3.2 & 3.9Ghz
5 - SM X9QRI-f+ Intel 4650 = 320 cores @ 3.15Ghz
2 - I7 980X 4.4Ghz 2-GTX680
1 - 2700k 4.4Ghz GTX680
Total = 464 cores folding
betchern0t
Posts: 5
Joined: Thu Jul 07, 2011 8:24 am

Re: bad wu - Project: 6882, run 808 clone 0 gen 13

Post by betchern0t »

Hi Grandpa,
I have done 14 passes of memtest86+ with no errors. Memory testing is never definitive when no errors show but....

I am using 6.34. Being a newbie I went with what was offered. :lol:

Cheers Paul
PantherX
Site Moderator
Posts: 6986
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
§
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud
Contact:

Re: bad wu - Project: 6882, run 808 clone 0 gen 13

Post by PantherX »

The WU Project: 6882 (Run 808, Clone 0, Gen 13), isn't a bad one as it was successfully completed by another donor:
Your WU (P6882 R808 C0 G13) was added to the stats database on 2011-07-23 05:05:15 for 69 points of credit.
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time

Welcome To The F@H Support Forum Ӂ Troubleshooting Bad WUs Ӂ Troubleshooting Server Connectivity Issues
SombraGuerrero
Posts: 117
Joined: Mon Mar 16, 2009 3:06 am

Re: bad wu - Project: 6882, run 808 clone 0 gen 13

Post by SombraGuerrero »

I've seen this in other threads as well. 6.34 was working perfectly for me until (big shock) the most recent round of glibc updates. I have installed, configured, and run nscd to no avail. I have reverted back to client version 6.02 which is what i did the last time this happened with 6.29 and I am once again successfully folding all cores.
Post Reply