Page 1 of 1

Project: 6041 (Run 0, Clone 182, Gen 13)

Posted: Tue May 25, 2010 5:27 pm
by Slythern
Repeated Client-core communications error: ERROR 0xc0000029 and downloading of WU. Have reloaded the a3 core to verify it is OK, this machine is not OCed.

Code: Select all

[17:10:04] Verifying core Core_a3.fah...
[17:10:04] Signature is VALID
[17:10:04] 
[17:10:04] Trying to unzip core FahCore_a3.exe
[17:10:05] Decompressed FahCore_a3.exe (8084992 bytes) successfully
[17:10:10] + Core successfully engaged
[17:10:17] 
[17:10:17] + Processing work unit
[17:10:17] Core required: FahCore_a3.exe
[17:10:17] Core found.
[17:10:17] Working on queue slot 01 [May 25 17:10:17 UTC]
[17:10:17] + Working ...
[17:10:17] - Calling '.\FahCore_a3.exe -dir work/ -nice 19 -suffix 01 -np 2 -checkpoint 10 -verbose -lifeline 5124 -version 629'

[17:10:19] 
[17:10:19] *------------------------------*
[17:10:19] Folding@Home Gromacs SMP Core
[17:10:19] Version 2.19 (Mar 12, 2010)
[17:10:19] 
[17:10:19] Preparing to commence simulation
[17:10:19] - Looking at optimizations...
[17:10:19] - Created dyn
[17:10:19] - Files status OK
[17:10:21] - Expanded 7879063 -> 10126021 (decompressed 128.5 percent)
[17:10:21] Called DecompressByteArray: compressed_data_size=7879063 data_size=10126021, decompressed_data_size=10126021 diff=0
[17:10:21] - Digital signature verified
[17:10:21] 
[17:10:21] Project: 6041 (Run 0, Clone 182, Gen 13)
[17:10:21] 
[17:10:21] Assembly optimizations on if available.
[17:10:21] Entering M.D.
[17:10:30] Completed 0 out of 250000 steps  (0%)
[17:10:40] CoreStatus = C0000029 (-1073741783)
[17:10:40] Client-core communications error: ERROR 0xc0000029
[17:10:40] Deleting current work unit & continuing...
[17:10:42] Killing all core threads
[17:10:42] Killing 1 cores
[17:10:42] Killing core 0
Repeats this cycle, please check for bad WU.

Re: Project: 6041 (Run 0, Clone 182, Gen 13)

Posted: Tue May 25, 2010 5:34 pm
by bruce
Do you use Slythern for your UserName in the client?

When was the last time you tested your Memory?

Re: Project: 6041 (Run 0, Clone 182, Gen 13)

Posted: Tue May 25, 2010 5:39 pm
by Slythern
Yes
[17:08:13] - User name: Slythern (Team 32362)

Re: Project: 6041 (Run 0, Clone 182, Gen 13)

Posted: Tue May 25, 2010 5:40 pm
by Slythern
Never tested memory, but can do this afternoon.

Re: Project: 6041 (Run 0, Clone 182, Gen 13)

Posted: Tue May 25, 2010 5:52 pm
by bruce
The fact that the error caused the WU to be deleted rather than uploading some sort of error report makes determining if this is a bad WU more difficult. One person has reported an error on this WU so there's a good chance that it's a bad WU.

The inconsistency in reporting may depend on the answers to these questions:
Which version of the client are you running?
Which OS are you running?
I can see that you're running Version 2.19 of FahCore_a3.exe.

Re: Project: 6041 (Run 0, Clone 182, Gen 13)

Posted: Tue May 25, 2010 6:50 pm
by Slythern
Bruce
Thanks for your help on this, answers to your questions:
Which version of the client are you running?
Folding@Home Client Version 6.29
Which OS are you running?
XP Professional SP3
Intel Core 2, 2GB of RAM
When was the last time you tested your Memory?
Finished 2 passes with MemTest86 with no errors

Also restarted the client, with RealTemp showing peak temp of 63C, immediate error as before.

Let me know if you need anything else or want me to try something to capture missing information.

Re: Project: 6041 (Run 0, Clone 182, Gen 13)

Posted: Wed May 26, 2010 10:13 am
by [Inpact]Terminou
6041 looks to have lots of bad WU.. We can noticed several topics about it

Re: Project: 6041 (Run 0, Clone 182, Gen 13)

Posted: Thu May 27, 2010 5:57 am
by RAH
Well this one keeps coming back to haunt me. I have deleted 7 times already.
This the only WU out there?

Have tried all the flags, no flags but smp. Same error.

Project: 6041 (Run 0, Clone 182, Gen 13)
[05:41:33]
[05:41:33] Entering M.D.
[05:41:41] Completed 0 out of 250000 steps (0%)
[05:41:59] CoreStatus = C0000029 (-1073741783)
[05:41:59] Client-core communications error: ERROR 0xc0000029

no mas.

Well up to 12 times.

Re: Project: 6041 (Run 0, Clone 182, Gen 13)

Posted: Thu May 27, 2010 1:41 pm
by Tynat
Just started to get this one too. Keeps crashing the client and the A3 core has to be manually closed. It's done this many times. The client has also downloaded the core twice and each time it gets the same WU, so it crashes again. It's stuck in a loop.

Code: Select all

[13:35:09] Preparing to commence simulation
[13:35:09] - Looking at optimizations...
[13:35:09] - Created dyn
[13:35:09] - Files status OK
[13:35:11] - Expanded 7879063 -> 10126021 (decompressed 128.5 percent)
[13:35:11] Called DecompressByteArray: compressed_data_size=7879063 data_size=10126021, decompressed_data_size=10126021 diff=0
[13:35:11] - Digital signature verified
[13:35:11] 
[13:35:11] Project: 6041 (Run 0, Clone 182, Gen 13)
[13:35:11] 
[13:35:11] Assembly optimizations on if available.
[13:35:11] Entering M.D.
[13:35:20] Completed 0 out of 250000 steps  (0%)
[13:38:41] CoreStatus = C0000029 (-1073741783)
[13:38:41] Client-core communications error: ERROR 0xc0000029
[13:38:41] - Attempting to download new core...

Re: Project: 6041 (Run 0, Clone 182, Gen 13)

Posted: Thu May 27, 2010 2:27 pm
by RAH
Well it took me 14 deletes of work/queue/unitinfo, before I got another wu. This is with waits in between hoping
for the AS to lose it. Couple of hours anyhow. :(

Really sad too, its one of the big A3s. Out of hundreds of A3s I've done, I have only done one. And it was on
a C2d. :ewink:

Re: Project: 6041 (Run 0, Clone 182, Gen 13)

Posted: Thu May 27, 2010 6:35 pm
by toTOW
I marked the WU as bad.

Re: Project: 6041 (Run 0, Clone 182, Gen 13)

Posted: Fri May 28, 2010 5:33 am
by Tynat
Thanks.