Page 1 of 1

Project 2665 several EUE's ?

Posted: Wed Jan 13, 2010 6:11 am
by ra40
I just started the CPU's back into the SMP projects. One box is working fine thus far but one is trying my patience. :evil:
This is what's been EUE'ing with the 7B errors since it began on the 11'th:

2665 3, 630, 169
[12:52:33] Completed 30000 out of 250000 steps (12 percent)
[12:56:09] disk...
[12:56:09] ... Done.
[12:56:09] - Failed to delete work/wudata_01.xtc
[12:56:09] - Failed to delete - Failed to delete work/wudata_01.chk
[12:56:09] - Failed to delete work/wudata_01.pdo
[12:56:09] Warning: check for stray files
[12:56:09] elete - Failed to delete Warning: check for stray files
[12:56:09] elete work/wudata_01.xvg
[12:56:09] Warning: check for stray files
[12:56:09] h to check the stability of your computer (issues
[12:56:09] such as high temperature, overclocking, etc.).
[12:56:09] Going to send back what have done.
[12:56:09] logfile size: 40710
[12:56:09] - Writing 41260 bytes of core data to disk...
[12:56:09] ... Done.
[12:56:09] - Failed to delete work/wudata_01.arc
[12:56:09] - Failed to delete work/wudata_01.xtc
[12:56:09] No C.P. to delete.
[12:56:09] - Failed to delete work/wudata_01.dyn
[12:56:09] - Failed to delete work/wudata_01.sas
[12:56:09] - Failed to delete work/wudata_01.goe
[12:56:09] - Failed to delete work/wudata_01.pdo
[12:56:09] Warning: check for stray files
[12:56:09]
[12:56:09] Folding@home Core Shutdown: EARLY_UNIT_END
[12:56:09]
[12:56:09] Folding@home Core Shutdown: EARLY_UNIT_END
[12:56:12] CoreStatus = 7B (123)
[12:56:12] Sending work to server

2665 2, 998, 170 This WU died instantly
[12:56:56] Entering M.D.
[12:57:04] GG with glycosylations
[12:57:04] Writing local files
[12:57:04] cal files
[12:57:06] Extra SSE boost OK.
[12:57:16] cal files
[12:57:16] Completed 0 out of 250000 steps (0 percent)
[13:21:13] Warning: long 1-4 interactions
[13:21:15] ed to delete work/wudata_02.xtc
[13:21:15] Warning: check for stray files
[13:21:15] iled to delete work/wudata_02.bed-- Failed to delete work/wudata_02.goe-Warning: check for stray files
[13:21:15] .pdo
[13:21:15] Warning: check for stray files
[13:21:15]
[13:21:15] Folding@home Core Shutdown: EARLY_UNIT_END
[13:21:15] Finalizing output
[13:23:18] CoreStatus = 7B (123)
[13:23:18] Sending work to server
2665 1, 321, 169
[22:15:35] Completed 47500 out of 250000 steps (19 percent)
[22:17:22] Gromacs cannot continue further.
[22:17:22] Going to send back what have done.
[22:17:22] logfile size: 54149
[22:17:22] - Writing 54685 bytes of core data to disk...
[22:17:22] ... Done.
[22:17:23] - Failed to delete work/wudata_03.sas
[22:17:23] - Failed to delete work/wudata_03.goe
[22:17:23] Warning: check for stray files
[22:17:23]
[22:17:23] Folding@home Core Shutdown: EARLY_UNIT_END
[22:17:23]
[22:17:23] Folding@home Core Shutdown: EARLY_UNIT_END
[22:17:26] CoreStatus = 7B (123)
[22:17:26] Sending work to server
So that I can better tune the system, is it possible to verify if these are fresh projects or recycled ones waiting for completion? The other box is crunching a run 17 so these low run projects had me curious. This system ran a basic uniprocessor project before I let it attempt the SMP. I thought it was a sufficient initial test but seems not. :(

TIA

Re: Project 2665 several EUE's ?

Posted: Wed Jan 13, 2010 10:14 am
by toTOW
Can you post some details about the machine it's failing on ? Is it overclocked ? Did you check RAM integrity ?

Re: Project 2665 several EUE's ?

Posted: Wed Jan 13, 2010 1:16 pm
by Pick2
Project 2665 is an older project , and SMP will run your CPU harder than uniprocessor ( uP ) will.
You might want to clear the dust , check your fans and lower the overclock.

Re: Project 2665 several EUE's ?

Posted: Wed Jan 13, 2010 7:29 pm
by ra40
System:
MSi K9A2 Platinum
AMD 7750 dual core at stock speeds
2-8800GT EVGA video cards
Patriot EPP 1Gx2 memory
Corsair 620HX
XP Home SP2

Regularly cleaned to preserve good airflow to components.
Last memtest was good. I'll run some tests to verify.

I was curious about the low run numbers this box pulls. I might pull the WU off the other box to test with since it is 2653 29, 60, 135.

EDIT
Ran Memtest through it's 8 normal tests, zero errors. Super Pi 32M, completes. The observed temps through AMD Over Drive were hovering 21C while running the SMP. I'm :?:

Re: Project 2665 several EUE's ?

Posted: Thu Jan 14, 2010 5:42 am
by bruce
SuperPi is better than nothing, but not by much. Try StressCPU2 which is referenced in our 3rd party forum.

Re: Project 2665 several EUE's ?

Posted: Sat Jan 16, 2010 1:40 am
by ra40
Downloaded StressCPU2, the compression software I have doesn't decompress this file...which utility is used? IZARC is unsuccessful even though it lists the TGZ format. Gzip which I DL'ed does not give an exe file when unpacked. What happened to an old fashioned Zip file :e?: Hunting site after site for a utility to the utility for the test program is frustrating.

Shows me I don't know my way around a computer either since I have had such a struggle. <dooh>

Re: Project 2665 several EUE's ?

Posted: Sun Jan 17, 2010 6:27 pm
by toTOW
tgz is a very common format in *nix ... it's a gzipped tar file.

On Windows, WinRAR is able to decompress it.

Re: Project 2665 several EUE's ?

Posted: Tue Jan 19, 2010 8:42 am
by ra40
Thanks. After the hunt, manged to run the tests. Ran a variety at stock speed and that came up no errors. Applied some OC'ing, again, fine. That lead me to conclude it wasn't hardware based like it is most instances.

Uninstalled and reinstalled the SMP client. In past times, I would transfer a back-up of current running WU over but seems the client detects the different machine so it was a no go. Was forced to take a live WU and hope it wouldn't EUE. The WU completed fine. Looks like something during the install was corrupted and a fresh install cleaned it up. :D