Project: 3064 (Run 1, Clone 87, Gen 43)

Moderators: Site Moderators, FAHC Science Team

parkut
Posts: 363
Joined: Tue Feb 12, 2008 7:33 am
Hardware configuration: Running exclusively Linux headless blades. All are dedicated crunching machines.
Location: SE Michigan, USA

Project: 3064 (Run 1, Clone 87, Gen 43)

Post by parkut »

Work unit errored out at 80% complete 7 times in a row.
Resolved by deleting entire FAH6 folder. Was then issued different WU.

Each time, something similar to this was written to the error log.

[16:25:29] Completed 4000000 out of 5000000 steps (80 percent)
[16:29:42] Warning: long 1-4 interactions
[16:29:46] CoreStatus = 0 (0)
[16:29:46] Client-core communications error: ERROR 0x0
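
For anyone who wants to check how far a WU got before the warning on each attempt, here is a minimal Python sketch. It assumes the log lines look like the excerpt above; the function name `progress_before_warnings` and the regular expression are illustrative and not part of the client.

```python
import re

# Pattern for progress lines such as:
# "[16:25:29] Completed 4000000 out of 5000000 steps (80 percent)"
# (assumed format, taken from the excerpt in this post)
COMPLETED = re.compile(r"Completed (\d+) out of (\d+) steps\s+\((\d+) percent\)")
WARNING = "Warning: long 1-4 interactions"


def progress_before_warnings(lines):
    """Return the last reported percent seen before each long 1-4 warning.

    An entry is None if a warning appears before any progress line.
    """
    last_percent = None
    hits = []
    for line in lines:
        match = COMPLETED.search(line)
        if match:
            last_percent = int(match.group(3))
        elif WARNING in line:
            hits.append(last_percent)
    return hits


sample = [
    "[16:25:29] Completed 4000000 out of 5000000 steps (80 percent)",
    "[16:29:42] Warning: long 1-4 interactions",
    "[16:29:46] CoreStatus = 0 (0)",
    "[16:29:46] Client-core communications error: ERROR 0x0",
]
print(progress_before_warnings(sample))
```

If every entry in the result is the same percentage, the failure is recurring at the same point of the WU.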
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Project: 3064 (Run 1, Clone 87, Gen 43)

Post by bruce »

Thank you for the report.

I presume that was on Linux.

For future reference, some folks with similar reports have found that if they stop the WU shortly before the point where "Warning: long 1-4 interactions" appears and then restart, the WU will finish. Others just delete it and move on to other WUs.
RafaPolit
Posts: 17
Joined: Tue Apr 29, 2008 10:00 pm

Re: Project: 3064 (Run 1, Clone 87, Gen 43)

Post by RafaPolit »

I am on Vista running the 5.92 client and started experiencing these long 1-4 interactions warnings just yesterday. I cannot get past 1% of a WU that Stanford is determined that I carry to completion. Do I need to reinstall the entire thing?

Could this be attributed to a Vista update? Is anyone else experiencing this issue? Thanks,

Rafa.
bruce
Posts: 20824
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Project: 3064 (Run 1, Clone 87, Gen 43)

Post by bruce »

RafaPolit wrote: Could this be attributed to a Vista update? Is anyone else experiencing this issue? Thanks,
Probably not, and yes, others do experience this issue.

This is beta software and the causes of certain errors are uncertain. It's very likely (based on the reports we've seen like the one above) that most Long 1-4 interaction messages are errors which are inherent in the WU itself, not in the software installation. On the other hand, we see messages about Long 1-4 interactions which do not produce errors and processing continues.

Did each of the errors that you've seen come from repeated processing of the same assignment? (Look for the Project, Run, Clone, and Gen numbers in FAHlog.txt)
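
If it helps, a rough Python sketch like the one below can count how often each assignment shows up in the log. It assumes the assignment is logged in the same form as this thread's title, "Project: 3064 (Run 1, Clone 87, Gen 43)"; the function name `count_assignments` and the sample lines are made up for illustration.

```python
import re
from collections import Counter

# Assumed line format, matching the thread title:
# "Project: 3064 (Run 1, Clone 87, Gen 43)"
ASSIGNMENT = re.compile(
    r"Project:\s*(\d+)\s*\(Run\s*(\d+),\s*Clone\s*(\d+),\s*Gen\s*(\d+)\)"
)


def count_assignments(lines):
    """Count occurrences of each (project, run, clone, gen) tuple."""
    counts = Counter()
    for line in lines:
        match = ASSIGNMENT.search(line)
        if match:
            counts[tuple(int(g) for g in match.groups())] += 1
    return counts


# Hypothetical log lines for demonstration only.
sample_log = [
    "[16:00:01] Project: 3064 (Run 1, Clone 87, Gen 43)",
    "[16:29:42] Warning: long 1-4 interactions",
    "[16:31:10] Project: 3064 (Run 1, Clone 87, Gen 43)",
    "[17:02:55] Warning: long 1-4 interactions",
]
for assignment, seen in count_assignments(sample_log).items():
    print(assignment, seen)
```

To run it against a real log, pass the lines of FAHlog.txt (for example, an open file object) instead of `sample_log`. A count above 1 for a single tuple suggests the same assignment is being retried.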
RafaPolit
Posts: 17
Joined: Tue Apr 29, 2008 10:00 pm

Re: Project: 3064 (Run 1, Clone 87, Gen 43)

Post by RafaPolit »

Thanks, yes. As I posted in a nearby thread, the process repeated itself at the same point of the same WU more than 6 times. I have deleted the queue and work files, and after several retries I was assigned a new WU. I'll post if further problems arise.

Thanks,
Rafa.