Project: 8031 (Run 11, Clone 95, Gen 59)

Posted: Wed Mar 21, 2012 6:09 pm
by v00d00
It ran over and over again until the EUE trigger activated; it always died at 27%.

When I discovered it, I deleted the queue, etc., and am back up folding on a different 8031.

GTX460, Win XP, stock speeds. It has done about 15 or so 8031s without a problem and is now folding a different 8031 without a problem.

Code:

[19:57:29] Project: 8031 (Run 11, Clone 95, Gen 59)
[19:57:29] 
[19:57:30] Assembly optimizations on if available.
[19:57:30] Entering M.D.
[19:57:32] Tpr hash work/wudata_06.tpr:  2134229536 3093036034 2766798122 1140413223 239276457
[19:57:32] GPU device info: vendor=0 device=0 name=<NA> match=0
[19:57:32] Working on Protein
[19:57:32] Client config found, loading data.
[19:57:32] Starting GUI Server
[19:58:36] Setting checkpoint frequency: 250000
[19:58:36] Completed         3 out of 25000000 steps (0%).
[20:06:11] Completed    250000 out of 25000000 steps (1%).
[20:13:46] Completed    500000 out of 25000000 steps (2%).
[20:21:21] Completed    750000 out of 25000000 steps (3%).
[20:28:55] Completed   1000000 out of 25000000 steps (4%).
[20:36:30] Completed   1250000 out of 25000000 steps (5%).
[20:44:05] Completed   1500000 out of 25000000 steps (6%).
[20:51:39] Completed   1750000 out of 25000000 steps (7%).
[20:59:14] Completed   2000000 out of 25000000 steps (8%).
[21:06:49] Completed   2250000 out of 25000000 steps (9%).
[21:14:24] Completed   2500000 out of 25000000 steps (10%).
[21:21:58] Completed   2750000 out of 25000000 steps (11%).
[21:29:34] Completed   3000000 out of 25000000 steps (12%).
[21:37:08] Completed   3250000 out of 25000000 steps (13%).
[21:44:43] Completed   3500000 out of 25000000 steps (14%).
[21:52:23] Completed   3750000 out of 25000000 steps (15%).
[22:00:08] Completed   4000000 out of 25000000 steps (16%).
[22:07:52] Completed   4250000 out of 25000000 steps (17%).
[22:15:37] Completed   4500000 out of 25000000 steps (18%).
[22:23:20] Completed   4750000 out of 25000000 steps (19%).
[22:31:02] Completed   5000000 out of 25000000 steps (20%).
[22:38:45] Completed   5250000 out of 25000000 steps (21%).
[22:46:26] Completed   5500000 out of 25000000 steps (22%).
[22:54:09] Completed   5750000 out of 25000000 steps (23%).
[23:01:44] Completed   6000000 out of 25000000 steps (24%).
[23:09:19] Completed   6250000 out of 25000000 steps (25%).
[23:16:53] Completed   6500000 out of 25000000 steps (26%).
[23:24:28] Completed   6750000 out of 25000000 steps (27%).
[23:24:28] mdrun_gpu returned 52
[23:24:28] NANs detected on GPU
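
For anyone wanting to check a log like this one for the point of failure, here is a minimal sketch in Python. It assumes the log keeps the "Completed N out of M steps (P%)" line format shown above; the function name `last_progress` and the `sample` string are illustrative only and are not part of the client.

```python
import re

# Matches progress lines such as:
# [23:24:28] Completed   6750000 out of 25000000 steps (27%).
PROGRESS = re.compile(
    r"\[(\d\d:\d\d:\d\d)\] Completed\s+(\d+) out of (\d+) steps\s+\((\d+)%\)"
)

def last_progress(log_text):
    """Return (timestamp, steps_done, total_steps, percent) for the
    last progress line in the log, or None if there is none."""
    last = None
    for line in log_text.splitlines():
        m = PROGRESS.search(line)
        if m:
            last = (m.group(1), int(m.group(2)), int(m.group(3)), int(m.group(4)))
    return last

# The final lines of the log above, copied verbatim.
sample = """\
[23:16:53] Completed   6500000 out of 25000000 steps (26%).
[23:24:28] Completed   6750000 out of 25000000 steps (27%).
[23:24:28] mdrun_gpu returned 52
[23:24:28] NANs detected on GPU
"""

result = last_progress(sample)
failed = "NANs detected on GPU" in sample
print(result, failed)
```

Running the same function over each retry's log would show whether every attempt stopped at the same step count.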

Re: Project: 8031 (Run 11, Clone 95, Gen 59)

Posted: Thu Mar 22, 2012 4:20 pm
by v00d00
Since that one I've completed one more 8031.

Code:

[17:56:33] Project: 8031 (Run 17, Clone 35, Gen 43)
[17:56:33] 
[17:56:33] Assembly optimizations on if available.
[17:56:33] Entering M.D.
[17:56:35] Will resume from checkpoint file work/wudata_01.ckp
[17:56:35] Tpr hash work/wudata_01.tpr:  520974183 1892819350 2507068049 3561217719 2944770379
[17:56:35] calling fah_main gpuDeviceId=0
[17:56:35] Working on Protein
[17:56:35] Client config found, loading data.
[17:56:36] Starting GUI Server
[17:57:40] Resuming from checkpoint
[17:57:41] fcCheckPointResume: retreived and current tpr file hash:
[17:57:41]    0    520974183    520974183
[17:57:41]    1   1892819350   1892819350
[17:57:41]    2   2507068049   2507068049
[17:57:41]    3   3561217719   3561217719
[17:57:41]    4   2944770379   2944770379
[17:57:41] fcCheckPointResume: file hashes same.
[17:57:41] fcCheckPointResume: state restored.
[17:57:41] fcCheckPointResume: name work/wudata_01.log Verified work/wudata_01.log
[17:57:41] fcCheckPointResume: name work/wudata_01.trr Verified work/wudata_01.trr
[17:57:41] fcCheckPointResume: name work/wudata_01.xtc Verified work/wudata_01.xtc
[17:57:41] fcCheckPointResume: name work/wudata_01.edr Verified work/wudata_01.edr
[17:57:41] fcCheckPointResume: state restored 2
[17:57:41] Resumed from checkpoint
[17:57:41] Setting checkpoint frequency: 250000
[17:57:41] Completed   1000001 out of 25000000 steps (4%).
~
[06:18:31] Completed  25000000 out of 25000000 steps (100%).
[06:18:31] Finished fah_main status=0
[06:18:31] Successful run
Now processing something from P7643.

Bruce (or someone else in the know), could you tell me if anyone completed that so-called bad workunit? I can't see any problems on my end, and the fact that it completed a workunit from the same project straight after says to me it was maybe an anomaly.

Re: Project: 8031 (Run 11, Clone 95, Gen 59)

Posted: Mon Mar 26, 2012 8:19 pm
by bruce
No data back from query.

The project has a deadline of 9.86 days, which allows it to get up to 27% several times before being reassigned. Assuming it was created from Project: 8031 (Run 11, Clone 95, Gen 58) which was added to the stats database on 2012-03-20 08:07:53, you're the only one who has worked on the WU, so we can't assume anything beyond what you've posted.
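
As a rough check on "several times", the arithmetic can be sketched in Python from the timestamps in the first log. This assumes every attempt starts from step 0, fails at the same point, and restarts immediately; none of that is confirmed by the logs, so treat the result as an upper bound only.

```python
from datetime import datetime, timedelta

# Timestamps taken from the first log in this topic (same day).
start = datetime.strptime("19:57:29", "%H:%M:%S")   # WU started
failed = datetime.strptime("23:24:28", "%H:%M:%S")  # "NANs detected on GPU"

attempt = failed - start            # wall-clock time for one failed attempt
deadline = timedelta(days=9.86)     # project deadline quoted above

# Assumption: back-to-back retries with no downtime in between.
attempts_before_deadline = deadline // attempt

print(attempt)                      # time to reach 27%
print(attempts_before_deadline)     # whole attempts that fit in the deadline
```

With these numbers one attempt takes a little under three and a half hours, so dozens of retries would fit inside the deadline if the client kept trying.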

I'll flag this topic for later Mod review.

Re: Project: 8031 (Run 11, Clone 95, Gen 59)

Posted: Tue Apr 24, 2012 11:58 pm
by sortofageek
My apologies for the late follow-up but, for the record, this one was completed successfully.
Your WU (P8031 R11 C95 G59) was added to the stats database on 2012-03-30 11:09:23 for 3843.4 points of credit.