Skip to content

HIVE-28853: SharedWorkOptimizer should always consider MapJoin operators' InMemoryDataSize. #5717

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 3 commits into
base: master
Choose a base branch
from

Conversation

ngsg
Copy link
Contributor

@ngsg ngsg commented Mar 28, 2025

What changes were proposed in this pull request?

The patch modifies SharedWorkOptimizer.extractSharedOptimizationInfoForRoot to consider MapJoin's inMemoryDataSize.

Why are the changes needed?

Current SharedWorkOptimizer may produce a vertex with a large number of MapJoins, potentially causing the MapJoin HashTables to exceed available memory. This patch prevents such misbehaviour.

Does this PR introduce any user-facing change?

No

Is the change a dependency upgrade?

No

How was this patch tested?

I tested the patch by qfile test.

Copy link

Copy link
Contributor

@SourabhBadhya SourabhBadhya left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Minor comment.

Copy link
Contributor

@SourabhBadhya SourabhBadhya left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM +1
@kasakrisz Can you please take a second look.

@ngsg ngsg force-pushed the HIVE-28853-fix-extractSharedOptimizationInfoForRoot branch from ec5d1ba to 1de8b9a Compare May 15, 2025 03:59
@ngsg
Copy link
Contributor Author

ngsg commented May 15, 2025

@kasakrisz , I have updated the patch according to your review. Could you review the changes again?

@kasakrisz
Copy link
Contributor

@ngsg
The test sharedwork_mapjoin_datasize_check failed in the last PTest run.
Could you please check it?

@ngsg
Copy link
Contributor Author

ngsg commented May 16, 2025

@kasakrisz , I have updated the qfile output. It looks like I forgot to compile the project after rebase. Thanks for pointing that out.

@kasakrisz
Copy link
Contributor

@ngsg
Could you please rebase this patch to master and check the test failures.

@ngsg ngsg force-pushed the HIVE-28853-fix-extractSharedOptimizationInfoForRoot branch from 0858fa4 to 0d61ae0 Compare May 27, 2025 08:02
Copy link

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants