Re: 3.21.157.11 overloaded?
Posted: Wed Aug 12, 2020 9:36 am
The WU needs to return to the WS that deployed it so the next gen can be created ... for the most part under normal loads this happens seemlessly ... these is an option for the researchers to state a CS(s) which can temporarily hold the WU until the WS can receive it - but it still has to go back to the WS.
So the process already exists and works if a CS has been set - but actually that just moves the problem as now the WS is trying to receive both the folders contributions and work from the CS ... Originally I believe this option was designed for WS failures or service outages - not to balance load.
There are no doubt ways to re-architect the whole way FaH infrastructure works ... but an easier solution is to work on balancing out the loads so the servers aren't under stress - and iirc this is a huge server to get balanced correctly.
So the process already exists and works if a CS has been set - but actually that just moves the problem as now the WS is trying to receive both the folders contributions and work from the CS ... Originally I believe this option was designed for WS failures or service outages - not to balance load.
There are no doubt ways to re-architect the whole way FaH infrastructure works ... but an easier solution is to work on balancing out the loads so the servers aren't under stress - and iirc this is a huge server to get balanced correctly.