Skip to content

Commit 323802e

Browse files
committed
Optimize groupb y for single partition topics
Group by operations on topics with a single partition are now optimized to avoid creating a repartition topic. Instead, the messages are directly transformed to use the new key, as all messages go to the same partition.
1 parent 5999c3a commit 323802e

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

quixstreams/dataframe/registry.py

+1
Original file line numberDiff line numberDiff line change
@@ -71,6 +71,7 @@ def register_groupby(
7171
:param source_sdf: the SDF used by `sdf.group_by()`
7272
:param new_sdf: the SDF generated by `sdf.group_by()`.
7373
"""
74+
7475
if source_sdf.stream_id in self._repartition_origins:
7576
raise GroupByNestingLimit(
7677
"Subsequent (nested) `SDF.group_by()` operations are not allowed."

0 commit comments

Comments
 (0)