[SQL] Unused field analysis for aggregates by mihaibudiu · Pull Request #5601 · feldera/feldera

mihaibudiu · 2026-02-11T05:04:25Z

mihaibudiu · 2026-02-11T05:25:48Z

Interestingly, this pass actually discovers unused aggregate functions which are introduced by the Calcite decorrelator!

mihaibudiu · 2026-02-12T00:02:30Z

Calcite compiles the following query:

create materialized view q4
                as select
                        o_orderpriority,
                        count(*) as order_count
                from
                        orders
                where
                        o_orderdate >= date '1993-07-01'
                        and o_orderdate < date '1993-07-01' + interval '3' month
                        and exists (
                                select
                                        *
                                from
                                        lineitem
                                where
                                        l_orderkey = o_orderkey
                                        and l_commitdate < l_receiptdate
                        )
                group by
                        o_orderpriority

To a plan containing the following fragment:

              LogicalJoin(condition=[=($0, $9)], joinType=[inner]), id = 574
                LogicalFilter(condition=[SEARCH($4, Sarg[[1993-07-01..1993-10-01)])]), id = 552
                  LogicalTableScan(table=[[schema, orders]]), id = 68
                LogicalAggregate(group=[{0}], agg#0=[MIN($1)]), id = 559
                  LogicalProject(l_orderkey=[$0], $f0=[true]), id = 557
                    LogicalFilter(condition=[<($11, $12)]), id = 555
                      LogicalTableScan(table=[[schema, lineitem]]), id = 70

By using a MIN(true) aggregate to figure out whether the EXISTS subquery produces any rows. Turns out that the result of the MIN is never used, and this new optimization can actually completely remove it, leaving an aggregate... which does nothing. This is implemented much more efficiently by using a linear aggregate, which returns Tup0 if there is any value in the collection, or nothing otherwise.

Signed-off-by: Mihai Budiu <mbudiu@feldera.com>

mihaibudiu · 2026-02-12T01:25:59Z

The analysis can also discover that the following SUM aggregate is not used, since it's used only for ORDER BY, which is ignored by default:

                select deptno
                from emp
                group by deptno
                order by sum(sal) filter (where job = 'CLERK')

mihaibudiu · 2026-02-12T01:26:49Z

However, if an aggregate is evaluated just for side-effects (i.e., crash on overflow), removing aggregates is not sound.

mihaibudiu marked this pull request as draft February 11, 2026 05:25

mihaibudiu force-pushed the issue5541 branch from 71af2a8 to cba90af Compare February 11, 2026 06:09

mihaibudiu marked this pull request as ready for review February 11, 2026 06:10

mihaibudiu marked this pull request as draft February 11, 2026 06:33

mihaibudiu force-pushed the issue5541 branch from cba90af to bcd07b4 Compare February 11, 2026 23:49

mihaibudiu marked this pull request as ready for review February 11, 2026 23:50

mihaibudiu force-pushed the issue5541 branch 2 times, most recently from 887e62d to 03616ff Compare February 11, 2026 23:56

mihaibudiu marked this pull request as draft February 12, 2026 00:01

mihaibudiu force-pushed the issue5541 branch from 03616ff to d7551db Compare February 12, 2026 00:51

mihaibudiu marked this pull request as ready for review February 12, 2026 00:51

[SQL] Unused field analysis for aggregates

0e3c38e

Signed-off-by: Mihai Budiu <mbudiu@feldera.com>

mihaibudiu force-pushed the issue5541 branch from d7551db to 0e3c38e Compare February 12, 2026 00:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SQL] Unused field analysis for aggregates#5601

[SQL] Unused field analysis for aggregates#5601
mihaibudiu wants to merge 1 commit intomainfrom
issue5541

mihaibudiu commented Feb 11, 2026

Uh oh!

mihaibudiu commented Feb 11, 2026

Uh oh!

mihaibudiu commented Feb 12, 2026

Uh oh!

mihaibudiu commented Feb 12, 2026

Uh oh!

mihaibudiu commented Feb 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

mihaibudiu commented Feb 11, 2026

Uh oh!

mihaibudiu commented Feb 11, 2026

Uh oh!

mihaibudiu commented Feb 12, 2026

Uh oh!

mihaibudiu commented Feb 12, 2026

Uh oh!

mihaibudiu commented Feb 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant