Allow balancing weights to be set per tier #125824

nicktindall · 2025-03-28T06:27:22Z

First cut of approach for allowing weights to be configured per tier

Relates: ES-11367

nicktindall · 2025-03-28T07:05:40Z

...ain/java/org/elasticsearch/cluster/routing/allocation/allocator/BalancedShardsAllocator.java

-            diskUsageBalanceFactor
-        );
-        Balancer balancer = new Balancer(writeLoadForecaster, allocation, weightFunction, threshold);
+        WeightFunction weightFunction = new SpecialisedWeightFunction(settings);


If we wanted to, we could have separate WeightFunctionFactory implementations for stateful and stateless. The stateful one would just return a SingleWeightFunction with the non-tier-specific weights, and the stateless one returns a SpecialisedWeightFunction.

henningandersen

Left a comment, may think more about it.

henningandersen · 2025-03-28T08:54:24Z

...ain/java/org/elasticsearch/cluster/routing/allocation/allocator/BalancedShardsAllocator.java

                while (true) {
                    final ModelNode minNode = modelNodes[lowIdx];
                    final ModelNode maxNode = modelNodes[highIdx];
+                    final float localThreshold = sorter.minWeightDelta(minNode) * threshold;


There is a great deal of implied knowledge here in that it assumes that indexing and search shards are separated on separate nodes and that allocation decisions are made for that. And that the minWeightDelta will give the same for any node in the same tier.

I wonder if we can instead overload BalancedShardsAllocator.allocate (in stateless) and then invoke the logic twice, once for each tier? It could pass down a stripped version of the nodes (per tier) and a specialized weight function. It is also not 100% separate (shard allocator decisions still important), but somewhat less intrusive?

I had another go at this. I investigated partitioning the RoutingAllocation but that seemed like trying to unscramble an egg. It occurred to me we could do the partitioning at the NodeSorter level.

It still feels a bit hacky, but less so than this approach. I did a prototype to test it out here #126091

I ran out of time today but I think the idea is there. I'd be interested to know if it seems sound to you. cc @pxsalehi

pxsalehi · 2025-03-28T14:41:48Z

...ain/java/org/elasticsearch/cluster/routing/allocation/allocator/BalancedShardsAllocator.java

+        Property.Dynamic,
+        Property.NodeScope
+    );
+    public static final Setting<Float> SEARCH_TIER_WRITE_LOAD_BALANCE_FACTOR_SETTING = Setting.floatSetting(


I thought we'd need a flag (like a feature flag for this new behaviour) and one new setting like the INDEXING_TIER_SHARD_BALANCE_FACTOR_SETTING that you have. And that should be enough. I mean we could add SEARCH_TIER_SHARD_BALANCE_FACTOR_SETTING to make this cleaner sure. But why do we need SEARCH_TIER_WRITE_LOAD_BALANCE_FACTOR_SETTING? That just conceptually doesn't make sense and IMO should be zero if the flag for this behaviour is turned on.

Yeah I guess it was just in case we wanted to investigate if write load was somehow a useful predictor of search activity (e.g. if it happens that the most actively updated indices are also the most actively searched, or similar). Perhaps it might be useful to have the option to tweak it to values other than zero?

It seems unlikely, but I didn't want to pre-empt the experiment/investigation. Most likely it'll just be set to zero.

pxsalehi · 2025-03-28T14:46:50Z

server/src/main/java/org/elasticsearch/cluster/routing/allocation/allocator/WeightFunction.java


-    public float calculateNodeWeight(
+    float calculateNodeWeight(


It is only this function that would be different depending on if we want per tier function or one for both tiers, right? I wonder if initially we could limit the change to this? Or is that too hacky? That reduces the size of the PR and we could potentially follow up with a refactoring.

I think minWeightDelta should also be specific. That calculation includes shardWriteLoad, so the deltas would be different in each tier.

In saying that, the POC only set the forecasted write load on search nodes to zero and left the rest untouched.

nicktindall · 2025-04-08T06:14:12Z

Superseded by #126091

nicktindall added 2 commits March 28, 2025 16:23

Allow setting balancing weights per tier

4961422

Implement SpecialisedWeightFunction

5afc234

nicktindall requested a review from a team as a code owner March 28, 2025 06:27

nicktindall marked this pull request as draft March 28, 2025 06:27

elasticsearchmachine added the v9.1.0 label Mar 28, 2025

nicktindall added 3 commits March 28, 2025 17:29

Don't deprecate non-specific weight settings

7a31eef

Split out NodeType

85d86bd

Minimise change

d37db0d

nicktindall commented Mar 28, 2025

View reviewed changes

henningandersen reviewed Mar 28, 2025

View reviewed changes

pxsalehi reviewed Mar 28, 2025

View reviewed changes

nicktindall mentioned this pull request Apr 2, 2025

Allow balancing weights to be set per tier #126091

Merged

nicktindall closed this Apr 8, 2025

nicktindall deleted the ES-11367_allow_weights_to_be_set_per_tier branch April 14, 2025 05:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow balancing weights to be set per tier #125824

Allow balancing weights to be set per tier #125824

nicktindall commented Mar 28, 2025

nicktindall Mar 28, 2025 •

edited

Loading

henningandersen left a comment

henningandersen Mar 28, 2025

henningandersen Mar 28, 2025

nicktindall Apr 2, 2025

pxsalehi Mar 28, 2025

nicktindall Apr 1, 2025

pxsalehi Mar 28, 2025 •

edited

Loading

nicktindall Apr 2, 2025

nicktindall commented Apr 8, 2025

Allow balancing weights to be set per tier #125824

Allow balancing weights to be set per tier #125824

Conversation

nicktindall commented Mar 28, 2025

nicktindall Mar 28, 2025 • edited Loading

Choose a reason for hiding this comment

henningandersen left a comment

Choose a reason for hiding this comment

henningandersen Mar 28, 2025

Choose a reason for hiding this comment

henningandersen Mar 28, 2025

Choose a reason for hiding this comment

nicktindall Apr 2, 2025

Choose a reason for hiding this comment

pxsalehi Mar 28, 2025

Choose a reason for hiding this comment

nicktindall Apr 1, 2025

Choose a reason for hiding this comment

pxsalehi Mar 28, 2025 • edited Loading

Choose a reason for hiding this comment

nicktindall Apr 2, 2025

Choose a reason for hiding this comment

nicktindall commented Apr 8, 2025

nicktindall Mar 28, 2025 •

edited

Loading

pxsalehi Mar 28, 2025 •

edited

Loading