Page 1 of 1

Project: 2684 (Run 3, Clone 18, Gen 0)

Posted: Fri May 28, 2010 7:33 am
by Magic Michael
Hi guys,

this WU is haunting me (it's coming back and back ...). Any advice what to do now ?

Code: Select all

[13:00:13] Working on queue slot 05 [May 25 13:00:13 UTC]
[13:00:13] + Working ...
[13:00:13] - Calling './FahCore_a3.exe -dir work/ -nice 19 -suffix 05 -np 16 -checkpoint 15 -verbose -lifeline 2505 -version 629'
[13:00:13]
[13:00:13] *------------------------------*
[13:00:13] Folding@Home Gromacs SMP Core
[13:00:13] Version 2.21 (May 10, 2010)
[13:00:13]
[13:00:13] Preparing to commence simulation
[13:00:13] - Ensuring status. Please wait.
[13:00:23] - Looking at optimizations...
[13:00:23] - Working with standard loops on this execution.
[13:00:23] - Previous termination of core was improper.
[13:00:23] - Files status OK
[13:00:25] - Expanded 20082788 -> 30791309 (decompressed 153.3 percent)
[13:00:25] Called DecompressByteArray: compressed_data_size=20082788 data_size=30791309, decompressed_data_size=30791309 diff=0
[13:00:26] - Digital signature verified
[13:00:26]
[13:00:26] Project: 2684 (Run 3, Clone 18, Gen 0)
[13:00:26]
[13:00:26] Entering M.D.
[13:00:32] Using Gromacs checkpoints
[13:00:53] Resuming from checkpoint
[13:00:55] Verified work/wudata_05.log
[13:00:56] Verified work/wudata_05.trr
[13:00:56] Verified work/wudata_05.xtc
[13:00:56] Verified work/wudata_05.edr

[...]

[05:46:52] Completed 250000 out of 250000 steps  (100%)
[05:47:06] DynamicWrapper: Finished Work Unit: sleep=10000
[05:47:16]
[05:47:16] Finished Work Unit:
[05:47:16] - Reading up to 52629024 from "work/wudata_05.trr": Read 52629024
[05:47:16] trr file hash check passed.
[05:47:16] - Reading up to 46984840 from "work/wudata_05.xtc": Read 46984840
[05:47:16] xtc file hash check passed.
[05:47:16] edr file hash check passed.
[05:47:16] logfile size: 204963
[05:47:16] Leaving Run
[05:47:17] - Writing 99986771 bytes of core data to disk...
[05:47:19]   ... Done.
[05:47:34] - Shutting down core
[05:47:34]
[05:47:34] Folding@home Core Shutdown: FINISHED_UNIT
[05:47:35] CoreStatus = 64 (100)
[05:47:35] Unit 5 finished with 52 percent of time to deadline remaining.
[05:47:35] Updated performance fraction: 0.612720
[05:47:35] Sending work to server
[05:47:35] Project: 2684 (Run 3, Clone 18, Gen 0)
[05:47:35] + Attempting to send results [May 28 05:47:35 UTC]
[05:47:35] - Reading file work/wuresults_05.dat from core
[05:47:35]   (Read 99986771 bytes from disk)
[05:47:35] Connecting to http://171.67.108.22:8080/
[06:06:34] Posted data.
[06:06:34] Initial: 0000; - Uploaded at ~85 kB/s
[06:06:41] - Averaged speed for that direction ~43 kB/s
[06:06:41] + Results successfully sent
[06:06:41] Thank you for your contribution to Folding@Home.
[06:06:41] + Number of Units Completed: 679

[06:06:47] Trying to send all finished work units
[06:06:47] + No unsent completed units remaining.
[06:06:47] - Preparing to get new work unit...
[06:06:47] Cleaning up work directory
[06:06:49] + Attempting to get work packet
[06:06:49] Passkey found
[06:06:49] - Will indicate memory of 7996 MB
[06:06:49] - Connecting to assignment server
[06:06:49] Connecting to http://assign.stanford.edu:8080/
[06:06:50] Posted data.
[06:06:50] Initial: 43AB; - Successful: assigned to (171.67.108.22).
[06:06:50] + News From Folding@Home: Welcome to Folding@Home
[06:06:51] Loaded queue successfully.
[06:06:51] Connecting to http://171.67.108.22:8080/
[06:07:02] Posted data.
[06:07:02] Initial: 0000; - Receiving payload (expected size: 20083300)
[06:09:14] - Downloaded at ~148 kB/s
[06:09:14] - Averaged speed for that direction ~108 kB/s
[06:09:14] + Received work.
[06:09:14] Trying to send all finished work units
[06:09:14] + No unsent completed units remaining.
[06:09:14] + Closed connections
[06:09:14]
[06:09:14] + Processing work unit
[06:09:14] Core required: FahCore_a3.exe
[06:09:14] Core found.
[06:09:14] Working on queue slot 06 [May 28 06:09:14 UTC]
[06:09:14] + Working ...
[06:09:14] - Calling './FahCore_a3.exe -dir work/ -nice 19 -suffix 06 -np 16 -checkpoint 15 -verbose -lifeline 2505 -version 629'
[06:09:15]
[06:09:15] *------------------------------*
[06:09:15] Folding@Home Gromacs SMP Core
[06:09:15] Version 2.21 (May 10, 2010)
[06:09:15]
[06:09:15] Preparing to commence simulation
[06:09:15] - Looking at optimizations...
[06:09:15] - Created dyn
[06:09:15] - Files status OK
[06:09:17] - Expanded 20082788 -> 30791309 (decompressed 153.3 percent)
[06:09:17] Called DecompressByteArray: compressed_data_size=20082788 data_size=30791309, decompressed_data_size=30791309 diff=0
[06:09:17] - Digital signature verified
[06:09:17]
[06:09:17] Project: 2684 (Run 3, Clone 18, Gen 0)
[06:09:17]
[06:09:17] Assembly optimizations on if available.
[06:09:17] Entering M.D.
[06:09:31] Completed 0 out of 250000 steps  (0%)
[06:50:41] Completed 2500 out of 250000 steps  (1%)
[07:00:13] - Autosending finished units... [May 28 07:00:13 UTC]
[07:00:13] Trying to send all finished work units
[07:00:13] + No unsent completed units remaining.
[07:00:13] - Autosend completed
[07:31:46] Completed 5000 out of 250000 steps  (2%)
Thanks for any help
Michael

Re: Project: 2684 (Run 3, Clone 18, Gen 0)

Posted: Fri May 28, 2010 7:51 am
by toTOW
Strange ... it has been successfully sent, but the server reassigned it to you :? ... I don't have access to the WU DB right now, but if another mod see this thread before I can, please check the status of this WU.

Re: Project: 2684 (Run 3, Clone 18, Gen 0)

Posted: Fri May 28, 2010 10:00 am
by Magic Michael
Please do: Magic-Michael, Team 11298 (Gentoo Linux Users Everywhere)

In the meantime I keep crunching that WU.

Re: Project: 2684 (Run 3, Clone 18, Gen 0)

Posted: Fri May 28, 2010 3:12 pm
by bruce
Your log shows the WU was uploaded at [May 28 06:06:41 UTC] which would be [May 27 23:06:41 PDT] and by the time the stats are collected and entered into the database (hourly) the db time looks right.

Hi Magic-Michael (team 11298),
Your WU (P2684 R3 C18 G0) was added to the stats database on 2010-05-28 00:05:28 for 66177.5 points of credit.

I can't explain why the same WU was assigned at [06:09:14 UTC] but I'll ask someone to look into it. You should probably delete the WU and move on to something else.

Re: Project: 2684 (Run 3, Clone 18, Gen 0)

Posted: Mon May 31, 2010 7:36 am
by Magic Michael
Hey guys, it's getting weird.

I let the WU finish, got full points again and ... got the WU again:

Code: Select all

[02:39:53] - Shutting down core
[02:39:53]
[02:39:53] Folding@home Core Shutdown: FINISHED_UNIT
[02:39:55] CoreStatus = 64 (100)
[02:39:55] Unit 6 finished with 52 percent of time to deadline remaining.
[02:39:55] Updated performance fraction: 0.595021
[02:39:55] Sending work to server
[02:39:55] Project: 2684 (Run 3, Clone 18, Gen 0)


[02:39:55] + Attempting to send results [May 31 02:39:55 UTC]
[02:39:55] - Reading file work/wuresults_06.dat from core
[02:39:55]   (Read 99986590 bytes from disk)
[02:39:55] Connecting to http://171.67.108.22:8080/
[02:53:24] Posted data.
[02:53:25] Initial: 0000; - Uploaded at ~120 kB/s
[02:53:25] - Averaged speed for that direction ~58 kB/s
[02:53:25] + Results successfully sent
[02:53:25] Thank you for your contribution to Folding@Home.
[02:53:25] + Number of Units Completed: 680

[02:53:30] Trying to send all finished work units
[02:53:30] + No unsent completed units remaining.
[02:53:30] - Preparing to get new work unit...
[02:53:30] Cleaning up work directory
[02:53:31] + Attempting to get work packet
[02:53:31] Passkey found
[02:53:31] - Will indicate memory of 7996 MB
[02:53:31] - Connecting to assignment server
[02:53:31] Connecting to http://assign.stanford.edu:8080/
[02:53:32] Posted data.
[02:53:32] Initial: 43AB; - Successful: assigned to (171.67.108.22).
[02:53:32] + News From Folding@Home: Welcome to Folding@Home
[02:53:32] Loaded queue successfully.
[02:53:32] Connecting to http://171.67.108.22:8080/
[02:53:40] Posted data.
[02:53:40] Initial: 0000; - Receiving payload (expected size: 20083300)
[02:56:16] - Downloaded at ~125 kB/s
[02:56:16] - Averaged speed for that direction ~111 kB/s
[02:56:16] + Received work.
[02:56:16] Trying to send all finished work units
[02:56:16] + No unsent completed units remaining.
[02:56:16] + Closed connections
[02:56:16]
[02:56:16] + Processing work unit
[02:56:16] Core required: FahCore_a3.exe
[02:56:16] Core found.
[02:56:16] Working on queue slot 07 [May 31 02:56:16 UTC]
[02:56:16] + Working ...
[02:56:16] - Calling './FahCore_a3.exe -dir work/ -nice 19 -suffix 07 -np 16 -ch                                                                                      eckpoint 15 -verbose -lifeline 2505 -version 629'

[02:56:16]
[02:56:16] *------------------------------*
[02:56:16] Folding@Home Gromacs SMP Core
[02:56:16] Version 2.21 (May 10, 2010)
[02:56:16]
[02:56:16] Preparing to commence simulation
[02:56:16] - Looking at optimizations...
[02:56:16] - Created dyn
[02:56:16] - Files status OK
[02:56:18] - Expanded 20082788 -> 30791309 (decompressed 153.3 percent)
[02:56:18] Called DecompressByteArray: compressed_data_size=20082788 data_size=3                                                                                      0791309, decompressed_data_size=30791309 diff=0
[02:56:18] - Digital signature verified
[02:56:18]
[02:56:18] Project: 2684 (Run 3, Clone 18, Gen 0)
[02:56:18]
[02:56:18] Assembly optimizations on if available.
[02:56:18] Entering M.D.
[02:56:33] Completed 0 out of 250000 steps  (0%)
[03:37:42] Completed 2500 out of 250000 steps  (1%)
Can anyone from PG look into that ? Meanwhile I keep crunching. Otherwise some other guy, who doesn't know about these problems, will get that WU, I suppose.

Bye
Michael

Edit:

Code: Select all

mm2-bln foldingathome # ls -la
insgesamt 15196
drwxr-xr-x  4 foldingathome nogroup          4096 25. Mai 09:29 .
drwxr-xr-x 17 root          root             4096 29. Apr 10:24 ..
-rwxr-x---  1 foldingathome nogroup           198 31. Mai 04:53 client.cfg
-rw-r--r--  1 foldingathome nogroup        671885 23. Mai 03:25 dead.letter
-rwxr-xr-x  1 foldingathome nogroup       1104112 19. Feb 17:57 fah6
-rwxr-x---  1 foldingathome nogroup       3625104 24. Jan 18:03 FahCore_a1.exe
-rwxr-x---  1 foldingathome nogroup       5509624  8. Feb 04:12 FahCore_a2.exe
-rwxr-x---  1 foldingathome nogroup       4375728 24. Mai 19:29 FahCore_a3.exe
-rw-r--r--  1 foldingathome foldingathome   75301 25. Mai 09:28 FAHlog-Prev.txt
-rw-r--r--  1 foldingathome foldingathome   32042 31. Mai 09:02 FAHlog.txt
drwx------  2 foldingathome nogroup          4096  8. Feb 17:24 .fci
-rwxr-xr-x  1 foldingathome nogroup           125 19. Feb 16:31 initfolding
-rw-r--r--  1 foldingathome nogroup             8  9. Mär 2009  machinedependent.dat
-rwxr-xr-x  1 foldingathome nogroup         54848 19. Feb 16:31 mpiexec
-rw-r--r--  1 foldingathome nogroup          1517 29. Jul 2009  MyFolding.html
-rw-r--r--  1 foldingathome nogroup          7168 31. Mai 04:56 queue.dat
-rw-r--r--  1 foldingathome nogroup           152 31. Mai 09:02 unitinfo.txt
drwxr-x---  2 foldingathome nogroup          4096 31. Mai 09:26 work
mm2-bln foldingathome #

Code: Select all

mm2-bln foldingathome # cd work
mm2-bln work # ls -la
insgesamt 552604
drwxr-x--- 2 foldingathome nogroup           4096 31. Mai 09:26 .
drwxr-xr-x 4 foldingathome nogroup           4096 25. Mai 09:29 ..
-rw-r--r-- 1 foldingathome foldingathome        0 31. Mai 04:56 core78.sta
-rwxr-x--- 1 foldingathome foldingathome      221 22. Mai 05:56 logfile_00.txt
-rwxr-x--- 1 foldingathome foldingathome      221 24. Mai 19:25 logfile_01.txt
-rwxr-x--- 1 foldingathome foldingathome      725 25. Mai 09:29 logfile_03.txt
-rwxr-x--- 1 foldingathome foldingathome      610 25. Mai 10:15 logfile_04.txt
-rwxr-x--- 1 foldingathome foldingathome      777 31. Mai 09:02 logfile_07.txt
-rwxr-x--- 1 foldingathome foldingathome      221 17. Mai 02:26 logfile_08.txt
-rwxr-x--- 1 foldingathome foldingathome      221 19. Mai 16:01 logfile_09.txt
-rw-r--r-- 1 foldingathome foldingathome 26358348 22. Mai 04:42 wudata_00_prev.cpt
-rw-r--r-- 1 foldingathome foldingathome 26358348 24. Mai 18:33 wudata_01_prev.cpt
-rw-r--r-- 1 foldingathome foldingathome    75160 25. Mai 09:17 wudata_03.ckp
-rw-r--r-- 1 foldingathome foldingathome 26316808 25. Mai 09:17 wudata_03.cpt
-rw-r--r-- 1 foldingathome foldingathome 20083300 24. Mai 19:32 wudata_03.dat
-rw-r--r-- 1 foldingathome foldingathome       16 25. Mai 09:29 wudata_03.dyn
-rw-r--r-- 1 foldingathome foldingathome    32640 25. Mai 09:29 wudata_03.edr
-rw-r--r-- 1 foldingathome foldingathome    47681 25. Mai 09:29 wudata_03.log
-rw-r--r-- 1 foldingathome foldingathome 26316808 25. Mai 09:03 wudata_03_prev.cpt
-rw-r--r-- 1 foldingathome foldingathome 30791309 25. Mai 09:29 wudata_03.tpr
-rw-r--r-- 1 foldingathome foldingathome 26314512 24. Mai 19:32 wudata_03.trr
-rw-r--r-- 1 foldingathome foldingathome  8542272 25. Mai 02:48 wudata_03.xtc
-rw-r--r-- 1 foldingathome foldingathome    75160 25. Mai 10:19 wudata_04.ckp
-rw-r--r-- 1 foldingathome foldingathome 26316808 25. Mai 10:19 wudata_04.cpt
-rw-r--r-- 1 foldingathome foldingathome 20083300 25. Mai 09:34 wudata_04.dat
-rw-r--r-- 1 foldingathome foldingathome        8 25. Mai 09:34 wudata_04.dyn
-rw-r--r-- 1 foldingathome foldingathome     3424 25. Mai 10:23 wudata_04.edr
-rw-r--r-- 1 foldingathome foldingathome    15013 25. Mai 10:23 wudata_04.log
-rw-r--r-- 1 foldingathome foldingathome 26316808 25. Mai 10:04 wudata_04_prev.cpt
-rw-r--r-- 1 foldingathome foldingathome 30791309 25. Mai 09:34 wudata_04.tpr
-rw-r--r-- 1 foldingathome foldingathome 26314512 25. Mai 09:34 wudata_04.trr
-rw-r--r-- 1 foldingathome foldingathome  4271316 25. Mai 09:34 wudata_04.xtc
-rw-r--r-- 1 foldingathome foldingathome 26316808 28. Mai 07:45 wudata_05_prev.cpt
-rw-r--r-- 1 foldingathome foldingathome 26316808 31. Mai 04:24 wudata_06_prev.cpt
-rw-r--r-- 1 foldingathome foldingathome    75160 31. Mai 09:26 wudata_07.ckp
-rw-r--r-- 1 foldingathome foldingathome 26316808 31. Mai 09:26 wudata_07.cpt
-rw-r--r-- 1 foldingathome foldingathome 20083300 31. Mai 04:56 wudata_07.dat
-rw-r--r-- 1 foldingathome foldingathome        8 31. Mai 04:56 wudata_07.dyn
-rw-r--r-- 1 foldingathome foldingathome    12720 31. Mai 09:35 wudata_07.edr
-rw-r--r-- 1 foldingathome foldingathome    25308 31. Mai 09:35 wudata_07.log
-rw-r--r-- 1 foldingathome foldingathome 26316808 31. Mai 09:11 wudata_07_prev.cpt
-rw-r--r-- 1 foldingathome foldingathome 30791309 31. Mai 04:56 wudata_07.tpr
-rw-r--r-- 1 foldingathome foldingathome 26314512 31. Mai 04:56 wudata_07.trr
-rw-r--r-- 1 foldingathome foldingathome  4271316 31. Mai 04:56 wudata_07.xtc
-rw-r--r-- 1 foldingathome foldingathome 26358348 17. Mai 01:16 wudata_08_prev.cpt
-rw-r--r-- 1 foldingathome foldingathome 26358348 19. Mai 13:28 wudata_09_prev.cpt
-rwxr-x--- 1 foldingathome foldingathome      512 25. Mai 09:29 wuinfo_03.dat
-rwxr-x--- 1 foldingathome foldingathome      512 25. Mai 10:15 wuinfo_04.dat
-rwxr-x--- 1 foldingathome foldingathome      512 31. Mai 09:02 wuinfo_07.dat
mm2-bln work #

Re: Project: 2684 (Run 3, Clone 18, Gen 0)

Posted: Thu Jun 03, 2010 7:18 am
by Magic Michael
Help ! I folded that WU for the third time, got credit for the third time - and got this WU for the fourth time in a row ! I deleted the work folder, unitinfo.txt and queue.dat - and got that WU again !!
Can someone from PG take this curse away ?

Edit: Deleted the WU again, and now I don't get any WU at all: "Can't get work from the server: "Bad packet type from server" . I replaced -bigadv with -advmethods and crunch one (-oneunit) normal WU now. After that I'll try -bigadv again.