refactor(hash-aggr): Use `EmitTo` to output by 2010YOUY01 · Pull Request #23055 · apache/datafusion

2010YOUY01 · 2026-06-20T12:58:06Z

Which issue does this PR close?

Rationale for this change

Regarding the EPIC issue: I have drafted all the migrations locally, and verified that after deleting the old implementation, UTs are passing.

We are now about 4 feature migration PRs away from completing the EPIC. Before continuing with those migrations, this PR performs some cleanup and refactoring.

What changes are included in this PR?

This PR can be read commit by commit:

commit 1: use EmitTo for incremental outputting
commit 2: split hash_table.rs into small files

Are these changes tested?

Are there any user-facing changes?

alamb · 2026-06-22T12:19:42Z

Regarding the EPIC issue: I have drafted all the migrations locally, and verified that after deleting the old implementation, UTs are passing.

Amazing

alamb

This is amazing @2010YOUY01 -- thank you. I found this code really easy to follow and understand. While it is complicated, I think it much more closely mirrors the complexity of the problem being solved now and setting up the control flow logic in this way means we will be in a much better place to improve the performance / featuers going forward

👏

cc @Rachelint

alamb · 2026-06-22T12:29:50Z

+    AggregateExec, PhysicalGroupBy, aggregate_expressions, evaluate_group_by,
+};
+
+/// Marker for raw rows -> partial state aggregation.


I like this structure and how it makes it clearer what is going on with the state here

alamb · 2026-06-22T12:34:20Z

+    pub(super) accumulator_args: Vec<EvaluatedHashAggregateAccumulator>,
+}
+
+/// Hash table state while grouped aggregation is consuming input.


These comments seem a little out of date as this structure also seems to be used while emitting (in addition to building / consuming input)

pub(super) enum AggregateHashTableState { Building(BuildingHashTableState), Outputting(BuildingHashTableState), <--- suggests that "Building" state is also used for outputting Done, }

Maybe a name like AggregateHashTableStateInner would be more generic 🤷

alamb · 2026-06-22T12:37:01Z

+    pub(super) _mode: PhantomData<AggrMode>,
+}
+
+pub(super) struct HashAggregateAccumulator {


A few sentences that describe what this structure is might help future readers

something like

/// State and argument information for a single Aggregate /// /// For example, for `SELECT COUNT(x), SUM(y WHERE z > 10) ...` there would be two /// `HashAggregateAccumulator`, one each for `COUNT(x)` and `SUM(y WHERE z > 10)` pub(super) struct HashAggregateAccumulator {

alamb · 2026-06-22T12:38:49Z

+        }
+    }
+
+    pub(super) fn empty_like(&self) -> Result<Self> {


can you add some comments about what this is used for?

alamb · 2026-06-22T12:40:05Z

+    accumulator: Box<dyn GroupsAccumulator>,
+}
+
+pub(super) struct EvaluatedHashAggregateAccumulator {


Nit -- this seems liess like an "accumulator" and more like "evaluated arguments"

Maybe it would be better called EvaluatedHashAggregateArgs?

Or maybe I mis understand 🤔

In either event, some comments would also help

alamb · 2026-06-22T12:42:11Z

Minor is that the structuis called final but the module is called final_table.rs -- should we keep it consistent with final.rs?

alamb · 2026-06-22T12:45:17Z

likewise here, the struct is named Partial but the module partial_table.rs -- recommend partial.rs to be consistent

alamb · 2026-06-22T12:51:06Z

+    ) -> Result<Option<RecordBatch>> {
+        let output_schema = Arc::clone(&self.output_schema);
+        let batch_size = self.batch_size;
+        match &mut self.state {


this state match and some of the outputtting state is duplicated across the types of tables, but I think it is ok

alamb · 2026-06-22T12:53:59Z

+            .all(|acc| acc.supports_convert_to_state())
+    }
+
+    /// In skip-partial-aggregation optimization, when a decision has made to skip


Suggested change

/// In skip-partial-aggregation optimization, when a decision has made to skip

/// In skip-partial-aggregation optimization, when a decision has been made to skip

alamb · 2026-06-22T12:54:58Z

+    /// In skip-partial-aggregation optimization, when a decision has made to skip
+    /// partial stage, build a typed hash table only for aggregation state conversion
+    /// row-by-row.
+    pub(in crate::aggregates) fn partial_skip_table(


I wonder if we could avoid some clones below if this consumed self rather than took it by reference

Maybe it doesn't matter

alamb · 2026-06-22T13:16:55Z

+            .building()
+            .accumulators
+            .iter()
+            .all(|acc| acc.supports_convert_to_state())


I think we should try and remove this "supports_convert_to_state" API (as a follow on PR / project) to simplify the hash aggregate code and ensure all our groups accumulators have the high performance APIs.

I filed a ticket

Remove GroupsAccumulator::supports_convert_to_state and make convert_to_state mandatory #23081

Rachelint · 2026-06-23T00:17:42Z

+        ))
+    }
+
+    fn evaluate(&self, batch: &RecordBatch) -> Result<EvaluatedHashAggregateAccumulator> {


How about name it evaluate_acc_args like evaluate_group_by ?

Rachelint · 2026-06-23T00:18:42Z

+            .merge_batch(&values.arguments, group_indices, total_num_groups)
+    }
+
+    pub(super) fn evaluate_final(&mut self, emit_to: EmitTo) -> Result<ArrayRef> {


And can just name it evaluate after renaming above.

Rachelint · 2026-06-23T00:32:25Z

+    }
+}
+
+/// Methods shared by all aggregate hash table modes.


Seems move method impls near where it define may be clearer?

Rachelint · 2026-06-23T00:41:42Z

+                acc.update_batch(values, group_indices, total_num_groups)?;
+            }
+        }
+        drop(timer);


Explicit timer drop here seems can be removed, but not really matter.

2010YOUY01 added 2 commits June 20, 2026 20:28

refactor: use EmitTo for aggregate state output

2e7892b

split hash_table.rs into small files

d96b68c

github-actions Bot added the physical-plan Changes to the physical-plan crate label Jun 20, 2026

small comments update

6feef68

2010YOUY01 marked this pull request as draft June 21, 2026 01:16

2010YOUY01 marked this pull request as ready for review June 21, 2026 01:16

2010YOUY01 closed this Jun 21, 2026

2010YOUY01 reopened this Jun 21, 2026

alamb approved these changes Jun 22, 2026

View reviewed changes

alamb reviewed Jun 22, 2026

View reviewed changes

alamb mentioned this pull request Jun 22, 2026

Remove GroupsAccumulator::supports_convert_to_state and make convert_to_state mandatory #23081

Open

Rachelint reviewed Jun 23, 2026

View reviewed changes

	/// In skip-partial-aggregation optimization, when a decision has made to skip
	/// In skip-partial-aggregation optimization, when a decision has been made to skip

Conversation

2010YOUY01 commented Jun 20, 2026

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

Uh oh!

alamb commented Jun 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

alamb left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

alamb commented Jun 22, 2026 •

edited

Loading