What is a WU?
Moderators: Site Moderators, FAHC Science Team
-
- Posts: 26
- Joined: Fri Oct 17, 2008 9:04 am
- Hardware configuration: i7-980X@4200 (AirCooled) w/6GB DDR3-1600 (8-8-8-20), 200GB Vertex2 SSD & 3xGTS-250 w/22" Eizo Monitors.
What is a WU?
I know a WU is a WorkUnit and a workunit is a very little part of a large project - a few nanoseconds of proteins folding/development in real-time, but is the WU's we folders calculate a part of the project, like a Cube (3D), a Puzzle (2D) or string/time (1D)?
Re: What is a WU?
Most work units have a file named current.xyz in the work folder that you can open up with a 3d viewer.
Re: What is a WU?
A WU is set of initial configuration of atoms/models at given temperature. Each WUs is tied to the WU processing method as well (broadly speaking the Core).
-
- Posts: 10179
- Joined: Thu Nov 29, 2007 4:30 pm
- Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
- Location: Arizona
- Contact:
Re: What is a WU?
http://fahwiki.net/index.php/WorkUnits and http://fahwiki.net/index.php/Runs,_Clones_and_Gens
I would also point you at the Project FAQs, but it seems that a work unit is not defined well there.
I would also point you at the Project FAQs, but it seems that a work unit is not defined well there.
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
Tell me and I forget. Teach me and I remember. Involve me and I learn.
-
- Posts: 26
- Joined: Fri Oct 17, 2008 9:04 am
- Hardware configuration: i7-980X@4200 (AirCooled) w/6GB DDR3-1600 (8-8-8-20), 200GB Vertex2 SSD & 3xGTS-250 w/22" Eizo Monitors.
Re: What is a WU?
I was thinking in wider terms, trying to understand the work we do. If it's a part of a "3D"-structure you can still get a good picture with one WU missing. If it's a "2D"-structure, you miss a part of the puzzle if a WU is missing, but you might miss a vital part of the "movie" if a frame "1D"-structure! Please help me to understand....
-
- Posts: 522
- Joined: Mon Dec 03, 2007 4:33 am
- Location: Australia
Re: What is a WU?
I'd say a WU is a 3-D structure wrt space (x, y, z co-ordinates) but holds minimal significance at any given instant or to an extent, as a standalone unit. When the state of the 3-D structure, i.e., the protein changes over time, that is what FAH is interested in. These timeslices are called Gens or generations. Now, you may ask, can't FAH have one single WU that tracks the transition of a protein from State 0 (initial) to the final state? It can, but the WU would probably run for several months depending on the complexity of the protein. Also, as I understand it, the final state of a protein may not be defined or be something that the researchers want to see. Having these WUs grouped by Gens gives the reasearchers the flexibility of stopping a simulation sequence if its not what they want/ if its reached what they want to see.
Do note, all the above is from someone who's knowledge about proteins/ MD is obtained from being here or looking at the Wiki. I wouldn't be surprised if I've got it mixed up
Do note, all the above is from someone who's knowledge about proteins/ MD is obtained from being here or looking at the Wiki. I wouldn't be surprised if I've got it mixed up
-
- Posts: 1579
- Joined: Fri Jun 27, 2008 2:20 pm
- Hardware configuration: Q6600 - 8gb - p5q deluxe - gtx275 - hd4350 ( not folding ) win7 x64 - smp:4 - gpu slot
E6600 - 4gb - p5wdh deluxe - 9600gt - 9600gso - win7 x64 - smp:2 - 2 gpu slots
E2160 - 2gb - ?? - onboard gpu - win7 x32 - 2 uniprocessor slots
T5450 - 4gb - ?? - 8600M GT 512 ( DDR2 ) - win7 x64 - smp:2 - gpu slot - Location: The Netherlands
- Contact:
Re: What is a WU?
I think you need an amendment to the above as there is an distinction to be made between a wu from the uniprocessor clients and those from the hpc clients ( smp/gpu/ps3 ).
A uniprocessor wu represented ( this might have changed with newer cores but I'm taking the information as it's given to me and it was presented in historical context so while it's correct the actual time frame might have been increased allot but still not near the level of the hpc clients ) between 50 and 100 nanaseconds. The villian protein which was used in the example presented to me ( and which I'll therefore use here ) takes around 50 miliseconds to fold. If you put that in a scientific forumula it would be represented like this.
p(folding) = 1 - exp[ -( 50 ns )/( 50,000 ns ) = ~ 0.1 %
To indicate the chanche of a full fold occuring in the timespan of a single work unit is about 0.1 percent.
A workunit is not standalone, as you can notice from the project - run - clone - gen in it's description. Project we can leave out here as that is only an indication as to which proteine the unit represents. A Run however is important as it's an initial state, the 'start position' of a proteine with the placement of molecules and the forces influencing the behaviour of those molecules. Each run consists of a number of Clones, where clones are as the name says an exact copy of the Run in terms of atom location and what they describe as 'temperature' an indication of forces influencing the folding sequence. I can't say I understand but it seems that even when a work unit has the exact same atom placement and initial forces, the distribution of those forces can change without changing the initial structure. Don't ask me, ask Dan
Then you have the Gen, which isn't refering to genetics but to generation. The first gen is the work unit as they cooked it up, and when they get it back they can formulate a new workunit with the starting position of the returned one increasing the timespan represented by the individual work units alone.
Now why I think there is a disctinction to be made is because the hpc clients enable longer total trajectories, and because of this PG is now able to not rely on thousand of new run's/clone's to 'get lucky' and find a workunit which enables them pinpoint a transition state for that proteine between it's folded state and unfolded state, but they can increase the lengt of each single run/clone with more generations ( each of which already representing a longer timeperiod ) and thus serverly increasing the change of finding what their looking for, the transition state/moment.
It also enables them to look for more complicated behaviour, a proteine which folds not one but more times to reach it's final state which using the old approach is not feasible knowing the how small the chanche of being able to witness that occuring when they have to rely on the many run's/clone's approach instead of the increased length by having more generations of the same run/clone.
I might be wrong in my explanation, if so I would enjoy being corrected and pointed to what I misinterpret.
A uniprocessor wu represented ( this might have changed with newer cores but I'm taking the information as it's given to me and it was presented in historical context so while it's correct the actual time frame might have been increased allot but still not near the level of the hpc clients ) between 50 and 100 nanaseconds. The villian protein which was used in the example presented to me ( and which I'll therefore use here ) takes around 50 miliseconds to fold. If you put that in a scientific forumula it would be represented like this.
p(folding) = 1 - exp[ -( 50 ns )/( 50,000 ns ) = ~ 0.1 %
To indicate the chanche of a full fold occuring in the timespan of a single work unit is about 0.1 percent.
A workunit is not standalone, as you can notice from the project - run - clone - gen in it's description. Project we can leave out here as that is only an indication as to which proteine the unit represents. A Run however is important as it's an initial state, the 'start position' of a proteine with the placement of molecules and the forces influencing the behaviour of those molecules. Each run consists of a number of Clones, where clones are as the name says an exact copy of the Run in terms of atom location and what they describe as 'temperature' an indication of forces influencing the folding sequence. I can't say I understand but it seems that even when a work unit has the exact same atom placement and initial forces, the distribution of those forces can change without changing the initial structure. Don't ask me, ask Dan
Then you have the Gen, which isn't refering to genetics but to generation. The first gen is the work unit as they cooked it up, and when they get it back they can formulate a new workunit with the starting position of the returned one increasing the timespan represented by the individual work units alone.
Now why I think there is a disctinction to be made is because the hpc clients enable longer total trajectories, and because of this PG is now able to not rely on thousand of new run's/clone's to 'get lucky' and find a workunit which enables them pinpoint a transition state for that proteine between it's folded state and unfolded state, but they can increase the lengt of each single run/clone with more generations ( each of which already representing a longer timeperiod ) and thus serverly increasing the change of finding what their looking for, the transition state/moment.
It also enables them to look for more complicated behaviour, a proteine which folds not one but more times to reach it's final state which using the old approach is not feasible knowing the how small the chanche of being able to witness that occuring when they have to rely on the many run's/clone's approach instead of the increased length by having more generations of the same run/clone.
I might be wrong in my explanation, if so I would enjoy being corrected and pointed to what I misinterpret.
Re: What is a WU?
That's pretty much the way I understand it, although I'll add a couple of comments.MtM wrote:I might be wrong in my explanation, if so I would enjoy being corrected and pointed to what I misinterpret.
The High Performance clients are able to do their processing more rapidly. That may mean that a single WU represents a longer slice of simulated time, but it may also mean that it is able to simulate a larger protein. As the number of atoms increases, the computer time increases, and that needs becomes a trade-off vs. simulated time.
In general, there is a "reasonable" range of computer time for a WU. It makes no sense to create WUs that will upload results and download new WUs too frequently but it also makes no sense to create WUs that will run for too long a time without uploading the result and getting a new assignment.
Posting FAH's log:
How to provide enough info to get helpful support.
How to provide enough info to get helpful support.
-
- Posts: 1579
- Joined: Fri Jun 27, 2008 2:20 pm
- Hardware configuration: Q6600 - 8gb - p5q deluxe - gtx275 - hd4350 ( not folding ) win7 x64 - smp:4 - gpu slot
E6600 - 4gb - p5wdh deluxe - 9600gt - 9600gso - win7 x64 - smp:2 - 2 gpu slots
E2160 - 2gb - ?? - onboard gpu - win7 x32 - 2 uniprocessor slots
T5450 - 4gb - ?? - 8600M GT 512 ( DDR2 ) - win7 x64 - smp:2 - gpu slot - Location: The Netherlands
- Contact:
Re: What is a WU?
Where is the 'reasonable' range based on though? Is it based on an assumption of how many exponentials are involved in the fold of a certain proteine, as in how I understand it longer simulations are the only way to be able to witness the multitude of those occuring, or is it souly based on the computational power? In short, are the technoligical computation limits the deciding factor or the assumptions to how many transitional states are attributed to that proteine?bruce wrote:That's pretty much the way I understand it, although I'll add a couple of comments.MtM wrote:I might be wrong in my explanation, if so I would enjoy being corrected and pointed to what I misinterpret.
The High Performance clients are able to do their processing more rapidly. That may mean that a single WU represents a longer slice of simulated time, but it may also mean that it is able to simulate a larger protein. As the number of atoms increases, the computer time increases, and that needs becomes a trade-off vs. simulated time.
In general, there is a "reasonable" range of computer time for a WU. It makes no sense to create WUs that will upload results and download new WUs too frequently but it also makes no sense to create WUs that will run for too long a time without uploading the result and getting a new assignment.
I'm not sure how they reach an expectation to how many transitional states are possible within a certain proteine so my question comes down to 'are they able to predict them before creating a run/clone based on folding events witnessed in lab experiments or is it 'wet vinger work'?
-
- Posts: 1024
- Joined: Sun Dec 02, 2007 12:43 pm
Re: What is a WU?
I think the "reasonable" WU is based on an assumed average computer speed for that client, with the idea of a certain frequency range for uploading/downloading. The probability of transitions is related to the total number of WU which looks at both the number of run/clones chosen for the project vs. how many months/years a project can run.MtM wrote:Where is the 'reasonable' range based on though? Is it based on an assumption of how many exponentials are involved in the fold of a certain proteine, as in how I understand it longer simulations are the only way to be able to witness the multitude of those occuring, or is it souly based on the computational power? In short, are the technoligical computation limits the deciding factor or the assumptions to how many transitional states are attributed to that proteine?bruce wrote:In general, there is a "reasonable" range of computer time for a WU. It makes no sense to create WUs that will upload results and download new WUs too frequently but it also makes no sense to create WUs that will run for too long a time without uploading the result and getting a new assignment.
-
- Posts: 1579
- Joined: Fri Jun 27, 2008 2:20 pm
- Hardware configuration: Q6600 - 8gb - p5q deluxe - gtx275 - hd4350 ( not folding ) win7 x64 - smp:4 - gpu slot
E6600 - 4gb - p5wdh deluxe - 9600gt - 9600gso - win7 x64 - smp:2 - 2 gpu slots
E2160 - 2gb - ?? - onboard gpu - win7 x32 - 2 uniprocessor slots
T5450 - 4gb - ?? - 8600M GT 512 ( DDR2 ) - win7 x64 - smp:2 - gpu slot - Location: The Netherlands
- Contact:
Re: What is a WU?
If you read my previous posts ( did you? ) you would know the probability of multiple transitions is not based on total number of wu'scodysluder wrote:I think the "reasonable" WU is based on an assumed average computer speed for that client, with the idea of a certain frequency range for uploading/downloading. The probability of transitions is related to the total number of WU which looks at both the number of run/clones chosen for the project vs. how many months/years a project can run.MtM wrote:Where is the 'reasonable' range based on though? Is it based on an assumption of how many exponentials are involved in the fold of a certain proteine, as in how I understand it longer simulations are the only way to be able to witness the multitude of those occuring, or is it souly based on the computational power? In short, are the technoligical computation limits the deciding factor or the assumptions to how many transitional states are attributed to that proteine?bruce wrote:In general, there is a "reasonable" range of computer time for a WU. It makes no sense to create WUs that will upload results and download new WUs too frequently but it also makes no sense to create WUs that will run for too long a time without uploading the result and getting a new assignment.
-
- Posts: 1024
- Joined: Sun Dec 02, 2007 12:43 pm
Re: What is a WU?
The number of WUs is related to the number of ns which is simulated in a single WU. One might simulate 50 ns of a simple protein in the same "sized" WU as 10 ns of a complex protein.MtM wrote:If you read my previous posts ( did you? ) you would know the probability of multiple transitions is not based on total number of wu's
-
- Posts: 1579
- Joined: Fri Jun 27, 2008 2:20 pm
- Hardware configuration: Q6600 - 8gb - p5q deluxe - gtx275 - hd4350 ( not folding ) win7 x64 - smp:4 - gpu slot
E6600 - 4gb - p5wdh deluxe - 9600gt - 9600gso - win7 x64 - smp:2 - 2 gpu slots
E2160 - 2gb - ?? - onboard gpu - win7 x32 - 2 uniprocessor slots
T5450 - 4gb - ?? - 8600M GT 512 ( DDR2 ) - win7 x64 - smp:2 - gpu slot - Location: The Netherlands
- Contact:
Re: What is a WU?
That's very true, but I thought to understand the hpc wu's are not that much 'bigger' but do have much longer simulation times.
-
- Posts: 174
- Joined: Sun Nov 16, 2008 6:41 am
Re: What is a WU?
so if we're needing to run a project through its course, why are we jumping arou nd from projcet to project so much? if we all focused on (obviously the servers not us) getting one project until completion wouldnt we get a lot more done?
also im sure this is going to sound very stupid, but i just cant understand how you can write the code for the WU, write the algorhythm that runs the code in client and not know the outcome when youre writing it?
also im sure this is going to sound very stupid, but i just cant understand how you can write the code for the WU, write the algorhythm that runs the code in client and not know the outcome when youre writing it?
Re: What is a WU?
Each project comes from a specific researcher and resides on a single server.spazzychalk wrote:so if we're needing to run a project through its course, why are we jumping arou nd from projcet to project so much? if we all focused on (obviously the servers not us) getting one project until completion wouldnt we get a lot more done?
Each researcher has particular areas of interest so there are various types of studies going on simultaneously.
Each server has some limitations and can only manage work to be distributed to a certain number of clients simultaneously and other servers can distribute other projects to other clients.
The algorithm describes only the laws of physics, not the protein or it's shape. The input data describes the initial position and velocity of each of the atoms in the protein. As time passes, each atom moves in accordance with the laws of physics, but there are so many interactions that the positions at future times are not known until they are computed.also im sure this is going to sound very stupid, but i just cant understand how you can write the code for the WU, write the algorhythm that runs the code in client and not know the outcome when youre writing it?
Suppose four people play a game of golf. The characteristics of the clubs and the course and the balls and the people are all known, but you still don't know who will win that day. (And that's only considering four golf balls. What if there were thousands or hundreds of thousands of golf balls all in flight at the same time?)
Posting FAH's log:
How to provide enough info to get helpful support.
How to provide enough info to get helpful support.