MQE: initial implementation of common subexpression elimination #11189

charleskorn · 2025-04-11T06:04:53Z

What this PR does

This PR implements common subexpression elimination in MQE.

Prior to this PR, when given an expression like sum(foo) / (sum(foo) + sum(bar)), MQE would evaluate sum(foo) twice. This is not necessary: we can instead compute sum(foo) once and use the result in both places, and that's what this PR implements.

This improves overall query performance, and also saves CPU time in queriers, ingesters and store-gateways. However, the drawback is that queriers must buffer unused parts of common results until they are consumed by all places that need them. For queries where the size of the common result is small, this is not an issue, but in other cases this can increase the peak memory consumption of queries significantly.

This could be addressed somewhat by consuming from both sides of binary operations concurrently, but this is out of scope for this PR - for now, given the substantial performance improvements and relative cost of CPU and memory, we can provision more memory for queriers if needed. In any case, query memory consumption will be no worse than Prometheus' engine.

Benchmark results

Overall, latency improves roughly in line with the ratio of eliminated selectors to original selectors. (For example, if there were two selectors originally, and they're both the same, then latency improves around 50%, or if there were three selectors and two are the same, then latency improves around 33%.)

goos: darwin
goarch: arm64
pkg: github.com/grafana/mimir/pkg/streamingpromql/benchmarks
cpu: Apple M1 Pro
                                                                                │    Mimir     │        MimirWithQueryPlanner         │
                                                                                │    sec/op    │    sec/op      vs base               │
Query/sum(a_1),_instant_query-10                                                   144.7µ ± 3%    146.8µ ±  1%        ~ (p=0.180 n=6)
Query/sum(a_1),_range_query_with_100_steps-10                                      149.8µ ± 1%    153.6µ ±  2%   +2.51% (p=0.004 n=6)
Query/sum(a_1),_range_query_with_1000_steps-10                                     202.9µ ± 1%    208.0µ ±  0%   +2.47% (p=0.002 n=6)
Query/sum(a_100),_instant_query-10                                                 757.4µ ± 1%    768.9µ ±  2%   +1.52% (p=0.015 n=6)
Query/sum(a_100),_range_query_with_100_steps-10                                    1.270m ± 1%    1.273m ±  1%        ~ (p=0.394 n=6)
Query/sum(a_100),_range_query_with_1000_steps-10                                   5.541m ± 1%    5.538m ±  1%        ~ (p=0.937 n=6)
Query/sum(a_2000),_instant_query-10                                                9.997m ± 2%   10.067m ±  1%        ~ (p=0.132 n=6)
Query/sum(a_2000),_range_query_with_100_steps-10                                   19.41m ± 1%    19.46m ±  1%        ~ (p=0.394 n=6)
Query/sum(a_2000),_range_query_with_1000_steps-10                                  100.0m ± 1%    100.3m ±  1%        ~ (p=0.485 n=6)
Query/a_1_+_a_1,_instant_query-10                                                  273.4µ ± 1%    150.8µ ±  1%  -44.85% (p=0.002 n=6)
Query/a_1_+_a_1,_range_query_with_100_steps-10                                     286.9µ ± 2%    159.1µ ±  1%  -44.54% (p=0.002 n=6)
Query/a_1_+_a_1,_range_query_with_1000_steps-10                                    397.2µ ± 2%    219.3µ ±  1%  -44.79% (p=0.002 n=6)
Query/a_100_+_a_100,_instant_query-10                                             1482.5µ ± 1%    814.3µ ±  1%  -45.07% (p=0.002 n=6)
Query/a_100_+_a_100,_range_query_with_100_steps-10                                 2.549m ± 1%    1.414m ±  1%  -44.53% (p=0.002 n=6)
Query/a_100_+_a_100,_range_query_with_1000_steps-10                               11.587m ± 5%    6.552m ±  2%  -43.45% (p=0.002 n=6)
Query/a_2000_+_a_2000,_instant_query-10                                            21.76m ± 1%    11.82m ±  1%  -45.69% (p=0.002 n=6)
Query/a_2000_+_a_2000,_range_query_with_100_steps-10                               41.85m ± 3%    22.82m ±  1%  -45.47% (p=0.002 n=6)
Query/a_2000_+_a_2000,_range_query_with_1000_steps-10                              212.6m ± 1%    118.7m ±  1%  -44.16% (p=0.002 n=6)
Query/sum(a_1)_+_sum(a_1),_instant_query-10                                        283.2µ ± 1%    155.7µ ±  2%  -45.00% (p=0.002 n=6)
Query/sum(a_1)_+_sum(a_1),_range_query_with_100_steps-10                           306.8µ ± 3%    165.8µ ±  1%  -45.95% (p=0.002 n=6)
Query/sum(a_1)_+_sum(a_1),_range_query_with_1000_steps-10                          412.5µ ± 0%    230.0µ ±  2%  -44.23% (p=0.002 n=6)
Query/sum(a_100)_+_sum(a_100),_instant_query-10                                   1505.3µ ± 3%    785.8µ ±  3%  -47.80% (p=0.002 n=6)
Query/sum(a_100)_+_sum(a_100),_range_query_with_100_steps-10                       2.503m ± 3%    1.300m ±  2%  -48.05% (p=0.002 n=6)
Query/sum(a_100)_+_sum(a_100),_range_query_with_1000_steps-10                     11.011m ± 3%    5.704m ±  4%  -48.20% (p=0.002 n=6)
Query/sum(a_2000)_+_sum(a_2000),_instant_query-10                                  20.50m ± 2%    10.29m ±  4%  -49.79% (p=0.002 n=6)
Query/sum(a_2000)_+_sum(a_2000),_range_query_with_100_steps-10                     39.49m ± 5%    19.69m ±  1%  -50.14% (p=0.002 n=6)
Query/sum(a_2000)_+_sum(a_2000),_range_query_with_1000_steps-10                    201.0m ± 5%    100.3m ±  1%  -50.09% (p=0.002 n=6)
Query/max(a_1)_-_min(a_1),_instant_query-10                                        277.6µ ± 1%    156.0µ ±  3%  -43.80% (p=0.002 n=6)
Query/max(a_1)_-_min(a_1),_range_query_with_100_steps-10                           291.1µ ± 3%    164.7µ ±  1%  -43.40% (p=0.002 n=6)
Query/max(a_1)_-_min(a_1),_range_query_with_1000_steps-10                          408.8µ ± 0%    232.7µ ±  1%  -43.09% (p=0.002 n=6)
Query/max(a_100)_-_min(a_100),_instant_query-10                                   1448.0µ ± 1%    796.8µ ±  1%  -44.98% (p=0.002 n=6)
Query/max(a_100)_-_min(a_100),_range_query_with_100_steps-10                       2.461m ± 1%    1.361m ±  1%  -44.69% (p=0.002 n=6)
Query/max(a_100)_-_min(a_100),_range_query_with_1000_steps-10                     11.058m ± 0%    6.083m ±  0%  -44.99% (p=0.002 n=6)
Query/max(a_2000)_-_min(a_2000),_instant_query-10                                  20.21m ± 1%    10.49m ±  1%  -48.10% (p=0.002 n=6)
Query/max(a_2000)_-_min(a_2000),_range_query_with_100_steps-10                     38.97m ± 2%    20.53m ±  2%  -47.31% (p=0.002 n=6)
Query/max(a_2000)_-_min(a_2000),_range_query_with_1000_steps-10                    203.3m ± 0%    108.4m ±  0%  -46.68% (p=0.002 n=6)
Query/a_1_/_(a_1_+_b_1),_instant_query-10                                          399.9µ ± 2%    282.9µ ±  2%  -29.26% (p=0.002 n=6)
Query/a_1_/_(a_1_+_b_1),_range_query_with_100_steps-10                             422.6µ ± 1%    297.5µ ±  1%  -29.60% (p=0.002 n=6)
Query/a_1_/_(a_1_+_b_1),_range_query_with_1000_steps-10                            588.6µ ± 1%    419.6µ ±  3%  -28.72% (p=0.002 n=6)
Query/a_100_/_(a_100_+_b_100),_instant_query-10                                    2.229m ± 1%    1.571m ±  1%  -29.53% (p=0.002 n=6)
Query/a_100_/_(a_100_+_b_100),_range_query_with_100_steps-10                       3.822m ± 1%    2.723m ±  0%  -28.76% (p=0.002 n=6)
Query/a_100_/_(a_100_+_b_100),_range_query_with_1000_steps-10                      17.64m ± 0%    12.68m ±  0%  -28.12% (p=0.002 n=6)
Query/a_2000_/_(a_2000_+_b_2000),_instant_query-10                                 34.20m ± 1%    22.96m ±  1%  -32.87% (p=0.002 n=6)
Query/a_2000_/_(a_2000_+_b_2000),_range_query_with_100_steps-10                    64.82m ± 2%    45.91m ±  3%  -29.17% (p=0.002 n=6)
Query/a_2000_/_(a_2000_+_b_2000),_range_query_with_1000_steps-10                   326.8m ± 1%    235.0m ±  0%  -28.09% (p=0.002 n=6)
Query/sum(a_1)_/_(sum(a_1)_+_sum(b_1)),_instant_query-10                           407.8µ ± 1%    289.6µ ±  1%  -28.99% (p=0.002 n=6)
Query/sum(a_1)_/_(sum(a_1)_+_sum(b_1)),_range_query_with_100_steps-10              427.1µ ± 1%    302.4µ ±  1%  -29.20% (p=0.002 n=6)
Query/sum(a_1)_/_(sum(a_1)_+_sum(b_1)),_range_query_with_1000_steps-10             611.2µ ± 1%    433.1µ ± 14%  -29.14% (p=0.002 n=6)
Query/sum(a_100)_/_(sum(a_100)_+_sum(b_100)),_instant_query-10                     2.280m ± 2%    1.520m ±  4%  -33.36% (p=0.002 n=6)
Query/sum(a_100)_/_(sum(a_100)_+_sum(b_100)),_range_query_with_100_steps-10        3.636m ± 1%    2.469m ±  2%  -32.09% (p=0.002 n=6)
Query/sum(a_100)_/_(sum(a_100)_+_sum(b_100)),_range_query_with_1000_steps-10       16.08m ± 1%    11.11m ±  3%  -30.90% (p=0.002 n=6)
Query/sum(a_2000)_/_(sum(a_2000)_+_sum(b_2000)),_instant_query-10                  31.07m ± 7%    20.65m ±  5%  -33.54% (p=0.002 n=6)
Query/sum(a_2000)_/_(sum(a_2000)_+_sum(b_2000)),_range_query_with_100_steps-10     60.15m ± 2%    38.97m ±  2%  -35.21% (p=0.002 n=6)
Query/sum(a_2000)_/_(sum(a_2000)_+_sum(b_2000)),_range_query_with_1000_steps-10    300.0m ± 0%    200.4m ±  1%  -33.20% (p=0.002 n=6)
Query/min(a_1)_/_(max(a_1)_+_max(b_1)),_instant_query-10                           410.2µ ± 1%    293.5µ ±  1%  -28.43% (p=0.002 n=6)
Query/min(a_1)_/_(max(a_1)_+_max(b_1)),_range_query_with_100_steps-10              429.2µ ± 2%    309.6µ ±  2%  -27.85% (p=0.002 n=6)
Query/min(a_1)_/_(max(a_1)_+_max(b_1)),_range_query_with_1000_steps-10             615.3µ ± 1%    443.6µ ±  2%  -27.90% (p=0.002 n=6)
Query/min(a_100)_/_(max(a_100)_+_max(b_100)),_instant_query-10                     2.177m ± 1%    1.517m ±  1%  -30.33% (p=0.002 n=6)
Query/min(a_100)_/_(max(a_100)_+_max(b_100)),_range_query_with_100_steps-10        3.659m ± 1%    2.568m ±  5%  -29.83% (p=0.002 n=6)
Query/min(a_100)_/_(max(a_100)_+_max(b_100)),_range_query_with_1000_steps-10       16.52m ± 1%    11.54m ±  1%  -30.15% (p=0.002 n=6)
Query/min(a_2000)_/_(max(a_2000)_+_max(b_2000)),_instant_query-10                  29.94m ± 1%    20.97m ±  1%  -29.97% (p=0.002 n=6)
Query/min(a_2000)_/_(max(a_2000)_+_max(b_2000)),_range_query_with_100_steps-10     60.46m ± 1%    40.88m ±  2%  -32.38% (p=0.002 n=6)
Query/min(a_2000)_/_(max(a_2000)_+_max(b_2000)),_range_query_with_1000_steps-10    306.0m ± 1%    209.1m ±  0%  -31.67% (p=0.002 n=6)
geomean                                                                            4.254m         2.804m        -34.09%

                                                                                │    Mimir     │        MimirWithQueryPlanner         │
                                                                                │      B       │       B        vs base               │
Query/sum(a_1),_instant_query-10                                                  64.78Mi ± 1%    64.65Mi ± 1%        ~ (p=0.781 n=6)
Query/sum(a_1),_range_query_with_100_steps-10                                     65.02Mi ± 2%    65.15Mi ± 1%        ~ (p=0.729 n=6)
Query/sum(a_1),_range_query_with_1000_steps-10                                    64.42Mi ± 2%    64.80Mi ± 2%        ~ (p=0.394 n=6)
Query/sum(a_100),_instant_query-10                                                60.60Mi ± 2%    60.54Mi ± 1%        ~ (p=0.461 n=6)
Query/sum(a_100),_range_query_with_100_steps-10                                   60.86Mi ± 0%    60.93Mi ± 1%        ~ (p=0.818 n=6)
Query/sum(a_100),_range_query_with_1000_steps-10                                  61.23Mi ± 1%    61.10Mi ± 1%        ~ (p=0.894 n=6)
Query/sum(a_2000),_instant_query-10                                               62.08Mi ± 1%    62.10Mi ± 2%        ~ (p=0.974 n=6)
Query/sum(a_2000),_range_query_with_100_steps-10                                  61.98Mi ± 1%    61.66Mi ± 2%        ~ (p=0.699 n=6)
Query/sum(a_2000),_range_query_with_1000_steps-10                                 66.87Mi ± 2%    66.79Mi ± 4%        ~ (p=0.589 n=6)
Query/a_1_+_a_1,_instant_query-10                                                 64.73Mi ± 1%    63.14Mi ± 2%   -2.46% (p=0.015 n=6)
Query/a_1_+_a_1,_range_query_with_100_steps-10                                    63.73Mi ± 2%    63.35Mi ± 1%        ~ (p=0.258 n=6)
Query/a_1_+_a_1,_range_query_with_1000_steps-10                                   63.87Mi ± 2%    63.03Mi ± 2%   -1.31% (p=0.045 n=6)
Query/a_100_+_a_100,_instant_query-10                                             60.95Mi ± 1%    60.48Mi ± 2%        ~ (p=0.119 n=6)
Query/a_100_+_a_100,_range_query_with_100_steps-10                                61.92Mi ± 1%    61.39Mi ± 1%   -0.86% (p=0.011 n=6)
Query/a_100_+_a_100,_range_query_with_1000_steps-10                               64.84Mi ± 3%    64.33Mi ± 1%        ~ (p=0.589 n=6)
Query/a_2000_+_a_2000,_instant_query-10                                           64.12Mi ± 1%    63.77Mi ± 2%        ~ (p=0.132 n=6)
Query/a_2000_+_a_2000,_range_query_with_100_steps-10                              72.92Mi ± 3%    73.05Mi ± 2%        ~ (p=1.000 n=6)
Query/a_2000_+_a_2000,_range_query_with_1000_steps-10                             135.9Mi ± 3%    128.4Mi ± 1%   -5.55% (p=0.002 n=6)
Query/sum(a_1)_+_sum(a_1),_instant_query-10                                       63.82Mi ± 1%    63.26Mi ± 1%   -0.88% (p=0.002 n=6)
Query/sum(a_1)_+_sum(a_1),_range_query_with_100_steps-10                          64.19Mi ± 1%    63.12Mi ± 2%   -1.66% (p=0.037 n=6)
Query/sum(a_1)_+_sum(a_1),_range_query_with_1000_steps-10                         63.64Mi ± 1%    63.38Mi ± 1%        ~ (p=0.589 n=6)
Query/sum(a_100)_+_sum(a_100),_instant_query-10                                   61.14Mi ± 1%    60.96Mi ± 1%        ~ (p=0.818 n=6)
Query/sum(a_100)_+_sum(a_100),_range_query_with_100_steps-10                      61.50Mi ± 1%    60.95Mi ± 1%        ~ (p=0.071 n=6)
Query/sum(a_100)_+_sum(a_100),_range_query_with_1000_steps-10                     61.45Mi ± 0%    61.46Mi ± 2%        ~ (p=1.000 n=6)
Query/sum(a_2000)_+_sum(a_2000),_instant_query-10                                 63.97Mi ± 2%    62.30Mi ± 1%   -2.60% (p=0.002 n=6)
Query/sum(a_2000)_+_sum(a_2000),_range_query_with_100_steps-10                    62.81Mi ± 2%    61.77Mi ± 2%   -1.67% (p=0.015 n=6)
Query/sum(a_2000)_+_sum(a_2000),_range_query_with_1000_steps-10                   72.16Mi ± 2%    66.46Mi ± 2%   -7.90% (p=0.002 n=6)
Query/max(a_1)_-_min(a_1),_instant_query-10                                       63.54Mi ± 2%    63.20Mi ± 1%        ~ (p=0.513 n=6)
Query/max(a_1)_-_min(a_1),_range_query_with_100_steps-10                          63.63Mi ± 2%    63.33Mi ± 2%        ~ (p=0.310 n=6)
Query/max(a_1)_-_min(a_1),_range_query_with_1000_steps-10                         63.67Mi ± 1%    63.30Mi ± 2%        ~ (p=0.509 n=6)
Query/max(a_100)_-_min(a_100),_instant_query-10                                   61.09Mi ± 1%    60.88Mi ± 2%        ~ (p=0.619 n=6)
Query/max(a_100)_-_min(a_100),_range_query_with_100_steps-10                      61.45Mi ± 1%    61.32Mi ± 0%        ~ (p=0.900 n=6)
Query/max(a_100)_-_min(a_100),_range_query_with_1000_steps-10                     61.66Mi ± 3%    64.75Mi ± 1%   +5.00% (p=0.002 n=6)
Query/max(a_2000)_-_min(a_2000),_instant_query-10                                 63.40Mi ± 0%    62.30Mi ± 2%   -1.74% (p=0.015 n=6)
Query/max(a_2000)_-_min(a_2000),_range_query_with_100_steps-10                    63.20Mi ± 2%    70.59Mi ± 1%  +11.68% (p=0.002 n=6)
Query/max(a_2000)_-_min(a_2000),_range_query_with_1000_steps-10                   73.17Mi ± 4%   128.59Mi ± 1%  +75.73% (p=0.002 n=6)
Query/a_1_/_(a_1_+_b_1),_instant_query-10                                         64.17Mi ± 1%    63.75Mi ± 2%        ~ (p=0.310 n=6)
Query/a_1_/_(a_1_+_b_1),_range_query_with_100_steps-10                            64.41Mi ± 1%    63.42Mi ± 1%   -1.54% (p=0.011 n=6)
Query/a_1_/_(a_1_+_b_1),_range_query_with_1000_steps-10                           63.26Mi ± 1%    63.19Mi ± 1%        ~ (p=0.777 n=6)
Query/a_100_/_(a_100_+_b_100),_instant_query-10                                   61.45Mi ± 1%    61.40Mi ± 1%        ~ (p=0.485 n=6)
Query/a_100_/_(a_100_+_b_100),_range_query_with_100_steps-10                      61.73Mi ± 1%    61.42Mi ± 1%        ~ (p=0.310 n=6)
Query/a_100_/_(a_100_+_b_100),_range_query_with_1000_steps-10                     65.74Mi ± 2%    64.52Mi ± 1%   -1.87% (p=0.015 n=6)
Query/a_2000_/_(a_2000_+_b_2000),_instant_query-10                                66.45Mi ± 1%    64.16Mi ± 1%   -3.43% (p=0.002 n=6)
Query/a_2000_/_(a_2000_+_b_2000),_range_query_with_100_steps-10                   77.25Mi ± 2%    73.34Mi ± 2%   -5.06% (p=0.002 n=6)
Query/a_2000_/_(a_2000_+_b_2000),_range_query_with_1000_steps-10                  141.9Mi ± 6%    139.6Mi ± 2%        ~ (p=0.818 n=6)
Query/sum(a_1)_/_(sum(a_1)_+_sum(b_1)),_instant_query-10                          64.63Mi ± 1%    63.60Mi ± 1%   -1.60% (p=0.004 n=6)
Query/sum(a_1)_/_(sum(a_1)_+_sum(b_1)),_range_query_with_100_steps-10             64.25Mi ± 1%    63.61Mi ± 1%        ~ (p=0.071 n=6)
Query/sum(a_1)_/_(sum(a_1)_+_sum(b_1)),_range_query_with_1000_steps-10            63.80Mi ± 1%    63.47Mi ± 1%   -0.51% (p=0.041 n=6)
Query/sum(a_100)_/_(sum(a_100)_+_sum(b_100)),_instant_query-10                    61.36Mi ± 1%    61.10Mi ± 1%        ~ (p=0.195 n=6)
Query/sum(a_100)_/_(sum(a_100)_+_sum(b_100)),_range_query_with_100_steps-10       61.53Mi ± 2%    61.16Mi ± 1%        ~ (p=0.394 n=6)
Query/sum(a_100)_/_(sum(a_100)_+_sum(b_100)),_range_query_with_1000_steps-10      62.01Mi ± 1%    61.67Mi ± 1%        ~ (p=0.093 n=6)
Query/sum(a_2000)_/_(sum(a_2000)_+_sum(b_2000)),_instant_query-10                 63.94Mi ± 1%    63.09Mi ± 1%   -1.33% (p=0.026 n=6)
Query/sum(a_2000)_/_(sum(a_2000)_+_sum(b_2000)),_range_query_with_100_steps-10    65.85Mi ± 1%    63.16Mi ± 2%   -4.08% (p=0.002 n=6)
Query/sum(a_2000)_/_(sum(a_2000)_+_sum(b_2000)),_range_query_with_1000_steps-10   75.27Mi ± 4%    72.55Mi ± 4%   -3.61% (p=0.039 n=6)
Query/min(a_1)_/_(max(a_1)_+_max(b_1)),_instant_query-10                          63.62Mi ± 1%    63.49Mi ± 2%        ~ (p=0.255 n=6)
Query/min(a_1)_/_(max(a_1)_+_max(b_1)),_range_query_with_100_steps-10             63.91Mi ± 2%    63.62Mi ± 1%        ~ (p=0.420 n=6)
Query/min(a_1)_/_(max(a_1)_+_max(b_1)),_range_query_with_1000_steps-10            63.81Mi ± 1%    63.41Mi ± 1%        ~ (p=0.100 n=6)
Query/min(a_100)_/_(max(a_100)_+_max(b_100)),_instant_query-10                    61.72Mi ± 1%    61.20Mi ± 1%   -0.84% (p=0.011 n=6)
Query/min(a_100)_/_(max(a_100)_+_max(b_100)),_range_query_with_100_steps-10       61.48Mi ± 1%    61.85Mi ± 1%        ~ (p=0.485 n=6)
Query/min(a_100)_/_(max(a_100)_+_max(b_100)),_range_query_with_1000_steps-10      61.28Mi ± 1%    64.51Mi ± 2%   +5.27% (p=0.002 n=6)
Query/min(a_2000)_/_(max(a_2000)_+_max(b_2000)),_instant_query-10                 64.19Mi ± 1%    63.72Mi ± 1%        ~ (p=0.132 n=6)
Query/min(a_2000)_/_(max(a_2000)_+_max(b_2000)),_range_query_with_100_steps-10    65.00Mi ± 1%    72.51Mi ± 2%  +11.55% (p=0.002 n=6)
Query/min(a_2000)_/_(max(a_2000)_+_max(b_2000)),_range_query_with_1000_steps-10   75.03Mi ± 5%   133.38Mi ± 2%  +77.76% (p=0.002 n=6)
geomean                                                                           65.78Mi         66.63Mi        +1.29%

Which issue(s) this PR fixes or relates to

(none)

Checklist

Tests updated.
[n/a] Documentation added.
[covered by Mimir Query Engine #10067] CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX].
[n/a] about-versioning.md updated with experimental features.

github-actions · 2025-04-11T06:06:37Z

💻 Deploy preview available:

tacole02

Thank you!

jhesketh

Looks solid, thanks for that! Just small things/nits :-).

pkg/streamingpromql/config.go

jhesketh · 2025-04-16T01:56:28Z

pkg/streamingpromql/optimize/plan/commonsubexpressionelimination/node.go

+}
+
+func (d *Duplicate) Describe() string {
+	return ""


Is this a TOOD?

No - there's nothing to describe about this node (compare this with a node representing a function call where we'd include the function name, for example).

pkg/streamingpromql/optimize/plan/commonsubexpressionelimination/operator.go

jhesketh · 2025-04-16T02:07:37Z

pkg/streamingpromql/optimize/plan/commonsubexpressionelimination/operator.go

+		return metadata, nil
+	}
+
+	// Return a copy of the original series metadata.


Will it be possible in the future to determine if a downstream operator will mutate the metadata or not. And if it will not we can avoid copying it?

If so, perhaps a comment here for now about a future optimisation?

Anything's possible 🙂 But I don't think avoiding the copy here will have a huge impact compared to the cost of evaluating the rest of the query, so I'm not sure it's worth the added complexity.

pkg/streamingpromql/optimize/plan/commonsubexpressionelimination/operator.go

jhesketh · 2025-04-16T05:02:59Z

pkg/streamingpromql/optimize/plan/commonsubexpressionelimination/optimization_pass.go

+
+func NewOptimizationPass(reg prometheus.Registerer) *OptimizationPass {
+	return &OptimizationPass{
+		duplicationNodesIntroduced: promauto.With(reg).NewCounter(prometheus.CounterOpts{


what if other optimisations want to use the duplication node? Wouldn't the metric be better off held on the duplication node and then a label used to determine the source?

For me, this metric is about evaluating the impact of this optimization pass, rather than the node. So if we end up using the node for other optimization passes, I'd expect that optimization pass to have its own metrics.

pkg/streamingpromql/optimize/plan/commonsubexpressionelimination/series_data_ring_buffer.go

jhesketh · 2025-04-16T05:22:24Z

pkg/streamingpromql/optimize/plan/commonsubexpressionelimination/series_data_ring_buffer.go

+}
+
+func (b *SeriesDataRingBuffer) Remove(seriesIndex int) types.InstantVectorSeriesData {
+	if seriesIndex != b.firstSeriesIndex {


(nit) why not just always remove b.firstSeriesIndex (ie why have this and not just RemoveFirst)? Or is this a check to make sure the caller knows the index it wants deleted is correct?

Or is this a check to make sure the caller knows the index it wants deleted is correct?

This - if we had a bug where the positions of each consumer were wrong, I'd want this to fail rather than quietly return incorrect results.

pkg/streamingpromql/planning/plan.go

pkg/streamingpromql/benchmarks/comparison_test.go

… query planning is enabled

…thout query planner enabled

…pression can be reached from different leaf nodes

…electors

…rrectly deduplicated

…hen no longer needed

…orBenchmarkQueries`

charleskorn mentioned this pull request Apr 11, 2025

Mimir Query Engine #10067

Open

charleskorn force-pushed the charleskorn/mqe-common-subexpression-elimination branch 6 times, most recently from 18942bc to 3fda6ae Compare April 14, 2025 05:42

charleskorn marked this pull request as ready for review April 14, 2025 05:59

charleskorn requested review from tacole02 and a team as code owners April 14, 2025 05:59

tacole02 approved these changes Apr 14, 2025

View reviewed changes

charleskorn force-pushed the charleskorn/mqe-common-subexpression-elimination branch from 3fda6ae to 1a81f9e Compare April 16, 2025 00:37

jhesketh reviewed Apr 16, 2025

View reviewed changes

charleskorn added 16 commits April 16, 2025 16:11

Initial commit of previously deleted code

5fbba34

Remove TODOs

c6b5e88

Remove more TODOs

9d1411a

Move optimisation pass registration to NewQueryPlanner

6832088

Add benchmarks

e535235

Rename receiver

ef75e20

Initial version of tests for optimization pass

fa6a4bb

Introduce exported NodeTypeName helper method

d4db12d

Move NewTestEngineOpts to config.go, and enable CSE by default if…

5153005

… query planning is enabled

Run TestBothEnginesReturnSameResultsForBenchmarkQueries with and wi…

e8948f4

…thout query planner enabled

Don't insert duplication node multiple times if the same common subex…

5fe51f1

…pression can be reached from different leaf nodes

Don't deduplicate subqueries directly.

e375813

Don't do unnecessary work if duplicated subqueries contain multiple s…

7d59127

…electors

Reorder methods, remove outdated comments

94e26be

Fix issue where nested subqueries with different ranges could be inco…

f33815e

…rrectly deduplicated

Simplify accumulatePath

12ec9f4

charleskorn added 18 commits April 16, 2025 16:14

Add test for duplicated string literal

ebe60cf

Register node type for deserialization

52756ec

Fix broken tests

6480a5f

Initial implementation of operator

312c0a8

Add overview comment.

8e474f5

Supress linting warning

c9db3cb

Move to separate package and split into multiple files

90e4916

Add some correctness tests

e0098c8

Add tests for operator behaviour, correctly release buffered series w…

41a26c5

…hen no longer needed

Use a ring buffer instead of a map to buffer series

12840e0

Make sure it's safe to close a DuplicateConsumer multiple times

77f6586

Actually enable query planning for `TestBothEnginesReturnSameResultsF…

5015534

…orBenchmarkQueries`

Address PR feedback: be more explicit

c178134

Address PR feedback: make names consistent

11837fc

Address PR feedback: move cloning to InstantVectorSeriesData

f0f56b5

Address PR feedback: extract portion of CloseConsumer

6911f43

Address PR feedback: simplify SeriesDataRingBuffer.Append

349ee18

Address PR feedback: add test for QueryPlan.String()

8e35d43

charleskorn force-pushed the charleskorn/mqe-common-subexpression-elimination branch from 052def7 to 8e35d43 Compare April 16, 2025 06:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MQE: initial implementation of common subexpression elimination #11189

MQE: initial implementation of common subexpression elimination #11189

charleskorn commented Apr 11, 2025 •

edited

Loading

github-actions bot commented Apr 11, 2025 •

edited

Loading

tacole02 left a comment

jhesketh left a comment

jhesketh Apr 16, 2025

charleskorn Apr 16, 2025

jhesketh Apr 16, 2025

charleskorn Apr 16, 2025

jhesketh Apr 16, 2025

charleskorn Apr 16, 2025

jhesketh Apr 16, 2025

charleskorn Apr 16, 2025

MQE: initial implementation of common subexpression elimination #11189

Are you sure you want to change the base?

MQE: initial implementation of common subexpression elimination #11189

Conversation

charleskorn commented Apr 11, 2025 • edited Loading

What this PR does

Benchmark results

Which issue(s) this PR fixes or relates to

Checklist

github-actions bot commented Apr 11, 2025 • edited Loading

tacole02 left a comment

Choose a reason for hiding this comment

jhesketh left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

charleskorn commented Apr 11, 2025 •

edited

Loading

github-actions bot commented Apr 11, 2025 •

edited

Loading