[SPARK-56032][SQL][FOLLOWUP] Skip FilterExec subexpression elimination codegen when there is no common subexpression by cloud-fan · Pull Request #56209 · apache/spark

cloud-fan · 2026-05-29T16:40:50Z

What changes were proposed in this pull request?

This is a follow-up of #54862, which introduced subexpression elimination (CSE) in FilterExec whole-stage codegen.

FilterExec takes the CSE codegen path whenever subexpressionEliminationEnabled && otherPreds.nonEmpty, regardless of whether any common subexpression actually exists. That path emits an inputVarsEvalCode prologue at the top of the per-row loop that eagerly evaluates every input column referenced by otherPreds (required so eliminated subexpressions can be materialized into shared variables). When there is nothing to eliminate, this prologue provides no benefit but still defeats the short-circuiting the non-CSE path gets from loading columns lazily, just before the predicate that needs them.
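To illustrate the cost described above, here is a minimal sketch in plain Scala of the two loop shapes. It is a toy model, not the generated code: `Row`, `decodeDecimal`, `eagerFilter`, and `lazyFilter` are hypothetical names, and the counter only stands in for the work an expensive column decode would do.

```scala
// Toy model only: no Spark classes are used, and every name here is hypothetical.
object EagerVsLazyLoad {
  final case class Row(qInt: Int, rawDecimal: String)

  var decodeCount = 0

  // Stands in for an expensive column decode (e.g. a high-precision decimal).
  def decodeDecimal(row: Row): BigDecimal = {
    decodeCount += 1
    BigDecimal(row.rawDecimal)
  }

  // Shape of the CSE path: every referenced column is evaluated up front.
  def eagerFilter(row: Row): Boolean = {
    val q = row.qInt
    val d = decodeDecimal(row) // prologue: runs for every row
    q >= 0 && q <= 5 && d >= BigDecimal(10)
  }

  // Shape of the non-CSE path: a column is loaded just before the predicate that needs it.
  def lazyFilter(row: Row): Boolean = {
    val q = row.qInt
    if (!(q >= 0 && q <= 5)) return false // the cheap predicate rejects the row first
    val d = decodeDecimal(row)
    d >= BigDecimal(10)
  }

  // Returns (rows kept, number of decimal decodes performed).
  def countDecodes(filter: Row => Boolean, rows: Seq[Row]): (Int, Int) = {
    decodeCount = 0
    val kept = rows.count(filter)
    (kept, decodeCount)
  }

  val rows: Seq[Row] = (0 until 100).map(i => Row(i, (i * 3).toString))
}
```

On these 100 rows both filters keep the same 2 rows, but the eager shape decodes the decimal 100 times while the lazy shape decodes it only for the 6 rows that pass the integer range check.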

This PR gates the CSE path on whether otherPreds actually contain a common subexpression, using the same EquivalentExpressions analysis (and output binding) as the CSE codegen, so it agrees exactly with whether that path would find anything. When there is none, it falls back to the non-CSE generatePredicateCode, which loads columns lazily and preserves short-circuiting. Filters that do have a common subexpression are unaffected.

To avoid analyzing the predicates twice, subexpressionEliminationForWholeStageCodegen gains an overload that accepts a pre-built EquivalentExpressions, so the single analysis used by the gate is reused by the codegen.
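The gate and the reuse of a single analysis can be sketched roughly as follows. This is a simplified model under stated assumptions: `Expr`, `Analysis`, `cseCodegen`, `lazyCodegen`, and `generate` are hypothetical stand-ins, not Spark's `Expression`, `EquivalentExpressions`, or the actual `FilterExec` code, and the "codegen" functions only return a label.

```scala
// Toy model only: all types and functions here are hypothetical stand-ins.
object CseGate {
  sealed trait Expr
  final case class Col(name: String) extends Expr
  final case class Lit(value: Int) extends Expr
  final case class Add(left: Expr, right: Expr) extends Expr
  final case class Gt(left: Expr, right: Expr) extends Expr

  // Counts how many analyses were constructed, to show the reuse.
  var analysesBuilt = 0

  // Finds non-leaf subtrees that occur more than once across the predicates.
  final class Analysis(preds: Seq[Expr]) {
    analysesBuilt += 1

    private def subtrees(e: Expr): Seq[Expr] = e match {
      case Add(l, r) => e +: (subtrees(l) ++ subtrees(r))
      case Gt(l, r)  => e +: (subtrees(l) ++ subtrees(r))
      case _         => Seq.empty // leaves are not worth sharing in this model
    }

    val common: Set[Expr] =
      preds.flatMap(subtrees).groupBy(identity)
        .collect { case (e, occurrences) if occurrences.size > 1 => e }
        .toSet

    def hasCommon: Boolean = common.nonEmpty
  }

  // Overload that accepts a pre-built analysis, so the gate's work is not repeated.
  def cseCodegen(preds: Seq[Expr], analysis: Analysis): String =
    s"cse(shared=${analysis.common.size}, preds=${preds.size})"

  // Convenience overload that builds its own analysis.
  def cseCodegen(preds: Seq[Expr]): String = cseCodegen(preds, new Analysis(preds))

  def lazyCodegen(preds: Seq[Expr]): String = s"lazy(preds=${preds.size})"

  // Take the CSE path only when the analysis actually found something to share.
  def generate(preds: Seq[Expr], cseEnabled: Boolean): String =
    if (cseEnabled && preds.nonEmpty) {
      val analysis = new Analysis(preds)
      if (analysis.hasCommon) cseCodegen(preds, analysis) else lazyCodegen(preds)
    } else lazyCodegen(preds)
}
```

In this model, two predicates that both contain `Add(Col("a"), Col("b"))` go down the CSE path with exactly one `Analysis` built, while predicates with no repeated subtree fall back to the lazy path even when CSE is enabled.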

Why are the changes needed?

For a filter with no common subexpression but multiple conjuncts over different columns (e.g. q_int BETWEEN ... AND (decimal_a BETWEEN ... OR decimal_b BETWEEN ...)), the eager prologue decodes the decimal columns for every row, including rows a cheaper earlier predicate would have rejected. Decoding a high-precision decimal allocates a BigInteger/BigDecimal per call, so this is pure waste and shows up as a measurable performance regression versus the lazy non-CSE path (observed on TPC-DS q28).

Does this PR introduce any user-facing change?

No. This is a codegen-only change; query results are unchanged.

How was this patch tested?

New unit test in WholeStageCodegenSuite asserting that, for a filter with no common subexpression, the CSE-enabled generated code is identical to the CSE-disabled code (i.e. it falls back to the lazy, short-circuiting path). The existing FilterExec CSE tests, which use genuine common subexpressions, still exercise the CSE path and pass.

Was this patch authored or co-authored using generative AI tooling?

Generated-by: Claude (Claude Code)

…when there is no common subexpression

### What changes were proposed in this pull request?

`FilterExec` whole-stage codegen takes the subexpression-elimination (CSE) path whenever `subexpressionEliminationEnabled && otherPreds.nonEmpty`, regardless of whether any common subexpression actually exists. That path emits an `inputVarsEvalCode` prologue at the top of the per-row loop that eagerly evaluates every input column referenced by `otherPreds` (required so eliminated subexpressions can be materialized into shared variables). When there is nothing to eliminate, this prologue provides no benefit but still defeats the short-circuiting the non-CSE path gets from loading columns lazily, just before the predicate that needs them.

This PR gates the CSE path on whether `otherPreds` actually contain a common subexpression (`hasCommonSubexpressions`, using the same `EquivalentExpressions` detection and `output` binding as the CSE codegen). When there is none, it falls back to the non-CSE `generatePredicateCode`, which loads columns lazily and preserves short-circuiting.

### Why are the changes needed?

For a filter with no common subexpression but multiple conjuncts over different columns (e.g. `q_int BETWEEN ... AND (decimal_a BETWEEN ... OR decimal_b BETWEEN ...)`), the eager prologue decodes the decimal columns for every row, including rows a cheaper earlier predicate would have rejected. Decoding a high-precision decimal allocates a BigInteger/BigDecimal per call, so this is pure waste and shows up as a measurable performance regression versus the lazy non-CSE path.

### Does this PR introduce _any_ user-facing change?

No. This is a codegen-only change; query results are unchanged.

### How was this patch tested?

New unit test in `WholeStageCodegenSuite` asserting that, for a filter with no common subexpression, the CSE-enabled generated code is identical to the CSE-disabled code (i.e. it falls back to the lazy, short-circuiting path). The existing `FilterExec` CSE tests, which use genuine common subexpressions, still exercise the CSE path and pass.

### Was this patch authored or co-authored using generative AI tooling?

Generated-by: Claude (Claude Code)

Co-authored-by: Isaac

@transient

Reuse a single EquivalentExpressions analysis between the has-common-subexpression gate and the CSE codegen, instead of building it twice. Add a subexpressionEliminationForWholeStageCodegen overload that takes a pre-built EquivalentExpressions, and have FilterExec hold the bound predicates and their CSE analysis as @transient lazy vals (driver-only codegen state; EquivalentExpressions is not serializable).

Co-authored-by: Isaac
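As a rough illustration of why `@transient lazy val` suits state that is not serializable, the following self-contained sketch round-trips a serializable object through Java serialization. `ToyFilter`, `Analysis`, and `roundTrip` are hypothetical names used only for this example; only the field pattern mirrors what the commit message describes.

```scala
import java.io.{ByteArrayInputStream, ByteArrayOutputStream, ObjectInputStream, ObjectOutputStream}

object TransientLazyValDemo {
  // Deliberately NOT Serializable, standing in for a driver-only analysis object.
  final class Analysis(val predicateCount: Int)

  // Hypothetical operator: serializable itself, but it carries non-serializable derived state.
  final case class ToyFilter(predicates: Seq[String]) {
    // @transient keeps the field out of the serialized form;
    // lazy means it is rebuilt on first access after deserialization.
    @transient lazy val analysis: Analysis = new Analysis(predicates.size)
  }

  // Serializes a value to bytes and reads it back.
  def roundTrip[T <: AnyRef](value: T): T = {
    val buffer = new ByteArrayOutputStream()
    val out = new ObjectOutputStream(buffer)
    out.writeObject(value)
    out.close()
    val in = new ObjectInputStream(new ByteArrayInputStream(buffer.toByteArray))
    try in.readObject().asInstanceOf[T] finally in.close()
  }
}
```

Even if `analysis` has already been forced before serialization, the write succeeds because the field is skipped, and the deserialized copy recomputes its own `Analysis` on first access. Without `@transient`, writing an instance whose `analysis` had been initialized would be expected to fail with `NotSerializableException`.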

cloud-fan added 2 commits May 29, 2026 16:40

cloud-fan changed the title ~~[SPARK-56032][SQL] Skip FilterExec subexpression elimination codegen when there is no common subexpression~~ [SPARK-56032][SQL][FOLLOWUP] Skip FilterExec subexpression elimination codegen when there is no common subexpression May 30, 2026

cloud-fan wants to merge 2 commits into apache:master from cloud-fan:wenchen/filter-cse-skip-when-no-common-subexpr
