Project 2665 several EUE's ?

Moderators: Site Moderators, FAHC Science Team

ra40
Posts: 13
Joined: Wed Aug 27, 2008 7:26 pm

Project 2665 several EUE's ?

Post by ra40 »

I just put the CPUs back onto the SMP projects. One box is working fine so far, but the other is trying my patience. :evil:
These are the WUs that have been EUE'ing with 7B errors since it began on the 11th:

Project 2665 (Run 3, Clone 630, Gen 169)
[12:52:33] Completed 30000 out of 250000 steps (12 percent)
[12:56:09] disk...
[12:56:09] ... Done.
[12:56:09] - Failed to delete work/wudata_01.xtc
[12:56:09] - Failed to delete - Failed to delete work/wudata_01.chk
[12:56:09] - Failed to delete work/wudata_01.pdo
[12:56:09] Warning: check for stray files
[12:56:09] elete - Failed to delete Warning: check for stray files
[12:56:09] elete work/wudata_01.xvg
[12:56:09] Warning: check for stray files
[12:56:09] h to check the stability of your computer (issues
[12:56:09] such as high temperature, overclocking, etc.).
[12:56:09] Going to send back what have done.
[12:56:09] logfile size: 40710
[12:56:09] - Writing 41260 bytes of core data to disk...
[12:56:09] ... Done.
[12:56:09] - Failed to delete work/wudata_01.arc
[12:56:09] - Failed to delete work/wudata_01.xtc
[12:56:09] No C.P. to delete.
[12:56:09] - Failed to delete work/wudata_01.dyn
[12:56:09] - Failed to delete work/wudata_01.sas
[12:56:09] - Failed to delete work/wudata_01.goe
[12:56:09] - Failed to delete work/wudata_01.pdo
[12:56:09] Warning: check for stray files
[12:56:09]
[12:56:09] Folding@home Core Shutdown: EARLY_UNIT_END
[12:56:09]
[12:56:09] Folding@home Core Shutdown: EARLY_UNIT_END
[12:56:12] CoreStatus = 7B (123)
[12:56:12] Sending work to server

Project 2665 (Run 2, Clone 998, Gen 170). This WU died instantly:
[12:56:56] Entering M.D.
[12:57:04] GG with glycosylations
[12:57:04] Writing local files
[12:57:04] cal files
[12:57:06] Extra SSE boost OK.
[12:57:16] cal files
[12:57:16] Completed 0 out of 250000 steps (0 percent)
[13:21:13] Warning: long 1-4 interactions
[13:21:15] ed to delete work/wudata_02.xtc
[13:21:15] Warning: check for stray files
[13:21:15] iled to delete work/wudata_02.bed-- Failed to delete work/wudata_02.goe-Warning: check for stray files
[13:21:15] .pdo
[13:21:15] Warning: check for stray files
[13:21:15]
[13:21:15] Folding@home Core Shutdown: EARLY_UNIT_END
[13:21:15] Finalizing output
[13:23:18] CoreStatus = 7B (123)
[13:23:18] Sending work to server

Project 2665 (Run 1, Clone 321, Gen 169)
[22:15:35] Completed 47500 out of 250000 steps (19 percent)
[22:17:22] Gromacs cannot continue further.
[22:17:22] Going to send back what have done.
[22:17:22] logfile size: 54149
[22:17:22] - Writing 54685 bytes of core data to disk...
[22:17:22] ... Done.
[22:17:23] - Failed to delete work/wudata_03.sas
[22:17:23] - Failed to delete work/wudata_03.goe
[22:17:23] Warning: check for stray files
[22:17:23]
[22:17:23] Folding@home Core Shutdown: EARLY_UNIT_END
[22:17:23]
[22:17:23] Folding@home Core Shutdown: EARLY_UNIT_END
[22:17:26] CoreStatus = 7B (123)
[22:17:26] Sending work to server
So that I can better tune the system, is it possible to verify whether these are fresh WUs or recycled ones waiting for completion? The other box is crunching a Run 17, so these low run numbers had me curious. This system ran a basic uniprocessor project before I let it attempt the SMP. I thought that was a sufficient initial test, but it seems not. :(
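For anyone tallying failures like the three above, here is a minimal sketch that counts `CoreStatus` codes in a log. It only assumes the line format shown in the excerpts; the function name `tally_core_status` and the sample text are illustrative and not part of the client.

```python
import re
from collections import Counter

# Matches lines like "[12:56:12] CoreStatus = 7B (123)" from the excerpts above.
CORE_STATUS = re.compile(r"CoreStatus = ([0-9A-Fa-f]+) \((\d+)\)")

def tally_core_status(log_text):
    """Count how often each CoreStatus hex code appears in a log (hypothetical helper)."""
    counts = Counter()
    for line in log_text.splitlines():
        match = CORE_STATUS.search(line)
        if match:
            hex_code = match.group(1).upper()
            dec_code = int(match.group(2))
            # Sanity check: the hex and decimal forms should agree (0x7B == 123).
            if int(hex_code, 16) == dec_code:
                counts[hex_code] += 1
    return counts

# Sample lines copied from the three failed WUs above.
sample = """[12:56:09] Folding@home Core Shutdown: EARLY_UNIT_END
[12:56:12] CoreStatus = 7B (123)
[13:23:18] CoreStatus = 7B (123)
[22:17:26] CoreStatus = 7B (123)"""

print(tally_core_status(sample))
```

The number in parentheses is just the decimal form of the hex code, so 7B and 123 are the same status.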

TIA
toTOW
Site Moderator
Posts: 6359
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France

Re: Project 2665 several EUE's ?

Post by toTOW »

Can you post some details about the machine it's failing on? Is it overclocked? Did you check RAM integrity?

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
Pick2
Posts: 85
Joined: Fri Feb 13, 2009 12:38 pm
Hardware configuration: Linux & CPUs
Location: USA

Re: Project 2665 several EUE's ?

Post by Pick2 »

Project 2665 is an older project, and SMP will run your CPU harder than uniprocessor (uP) will.
You might want to clear the dust, check your fans, and lower the overclock.
ra40
Posts: 13
Joined: Wed Aug 27, 2008 7:26 pm

Re: Project 2665 several EUE's ?

Post by ra40 »

System:
MSI K9A2 Platinum
AMD 7750 dual core at stock speeds
2x EVGA 8800GT video cards
Patriot EPP 2x1GB memory
Corsair 620HX
XP Home SP2

Regularly cleaned to preserve good airflow to components.
Last memtest was good. I'll run some tests to verify.

I was curious about the low run numbers this box pulls. I might pull the WU off the other box to test with, since it is Project 2653 (Run 29, Clone 60, Gen 135).

EDIT
Ran Memtest through its 8 normal tests, zero errors. Super Pi 32M completes. The temps observed through AMD OverDrive were hovering around 21C while running the SMP. I'm :?:
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Project 2665 several EUE's ?

Post by bruce »

SuperPi is better than nothing, but not by much. Try StressCPU2, which is referenced in our 3rd-party forum.
ra40
Posts: 13
Joined: Wed Aug 27, 2008 7:26 pm

Re: Project 2665 several EUE's ?

Post by ra40 »

Downloaded StressCPU2, but the compression software I have doesn't decompress this file... which utility is used? IZArc is unsuccessful even though it lists the TGZ format. Gzip, which I downloaded, does not give an exe file when unpacked. What happened to an old-fashioned Zip file? :e?: Hunting site after site for a utility to unpack the utility for the test program is frustrating.

Shows me I don't know my way around a computer either, since I have had such a struggle. <dooh>
toTOW
Site Moderator
Posts: 6359
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France

Re: Project 2665 several EUE's ?

Post by toTOW »

tgz is a very common format in *nix ... it's a gzipped tar file.

On Windows, WinRAR is able to decompress it.
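If Python happens to be installed, its standard `tarfile` module can also unpack a .tgz in one step. Running gzip alone only removes the compression layer and leaves a .tar archive, which would explain why no exe appeared. This is a minimal sketch: the helper name `unpack_tgz` and the file names in the usage comment are placeholders, not the actual name of the StressCPU2 download.

```python
import tarfile
from pathlib import Path

def unpack_tgz(archive, dest):
    """Extract a gzipped tar archive (.tgz / .tar.gz) and return its member names."""
    dest = Path(dest)
    dest.mkdir(parents=True, exist_ok=True)
    # "r:gz" opens a tar archive that has been compressed with gzip.
    with tarfile.open(archive, "r:gz") as tar:
        names = tar.getnames()
        tar.extractall(dest)  # only extract archives from a source you trust
    return names

# Hypothetical usage; substitute the real file name of your download:
# print(unpack_tgz("stresscpu2.tgz", "stresscpu2"))
```

The returned list shows what was inside the archive, which makes it easy to spot the executable.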
ra40
Posts: 13
Joined: Wed Aug 27, 2008 7:26 pm

Re: Project 2665 several EUE's ?

Post by ra40 »

Thanks. After the hunt, I managed to run the tests. Ran a variety at stock speed and they came up with no errors. Applied some OC'ing and, again, fine. That led me to conclude it wasn't hardware-based, as it is in most instances.

Uninstalled and reinstalled the SMP client. In the past, I would transfer a backup of the currently running WU over, but it seems the client detects the different machine, so that was a no-go. I was forced to take a live WU and hope it wouldn't EUE. The WU completed fine. Looks like something was corrupted during the original install and a fresh install cleaned it up. :D