-
Notifications
You must be signed in to change notification settings - Fork 6.2k
8374307: Fix deoptimization storm caused by Action_none in GraphKit::uncommon_trap #28966
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
|
👋 Welcome back bulasevich! A progress list of the required criteria for merging this PR into |
|
❗ This change is not yet ready to be integrated. |
|
@bulasevich The following label will be automatically applied to this pull request:
When this pull request is ready to be reviewed, an "RFR" email will be sent to the corresponding mailing list. If you would like to change these labels, use the /label pull request command. |
Webrevs
|
src/hotspot/share/opto/parse2.cpp
Outdated
| } | ||
| return seems_never_taken(prob) && | ||
| // Skip optimization if recompile limit is exceeded to avoid deopts without recompilation. | ||
| !C->too_many_recompiles(method(), bci(), Deoptimization::Reason_unstable_if) && |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You can use Compile::too_many_traps_or_recompile here.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this related to JDK-8243615? Could you convert your UnstableIf.java test to a jtreg test? Maybe by running in a different process and counting the number of deoptimization events? JDK-8243615 also has a test attached.
74c02fe to
074fade
Compare
|
@bulasevich Please do not rebase or force-push to an active PR as it invalidates existing review comments. Note for future reference, the bots always squash all changes into a single commit automatically as part of the integration. See OpenJDK Developers’ Guide for more information. |
Oh, yes - this is related, and we had a similar fix five years ago.. @WZhuo
Done. I converted it to a jtreg test. I’m skipping the heavyweight part (200+ lines of code) that reproduces the issue without changing the PerMethodRecompilationCutoff limit. |
|
I believe all places where an uncommon trap with The culprit seems to be the discrepancy between As the bug demonstrates, disabling recompilation while keeping the uncommon trap in place (substituting Speaking of the proposed fix, my concern is that it addresses only one particular instance of the problem. Can we do better and fix similar bugs all at once? That would require aligning |
|
BTW JDK-6529811 did not introduce the heuristic in |
We observed a deoptimization storm caused by GraphKit::uncommon_trap generator logic. GraphKit::uncommon_trap considers the too_many_recompiles metric. If the threshold is overflowed, it replaces Deoptimization::Action_reinterpret with Deoptimization::Action_none (see code snippet below).
This replacement changes the uncommon_trap logic: once execution hits a trap, the VM performs deoptimization but does not recompile the method anymore. In an "unlucky" case, when the code part calling this uncommon_trap becomes frequent, a deoptimization storm occurs (thousands of deoptimizations per second) causing a significant performance drop.
The original problematic method, which triggered repeated recompilations, is a high-performance compressed binary serialization algorithm with heavy use of conditional branches driven by bitmasks. See a standalone synthetic benchmark to reproduce the issue.
The issue arises when the method overcomes a global recompilation threshold before stabilizing specific trap counters.
Current thresholds:
Condition: decompile_count() >= (PerMethodRecompilationCutoff / 2) + 1
Default: 201 (derived from default PerMethodRecompilationCutoff = 400).
Checks if the trap count for a specific reason exceeds:
PerMethodTrapLimit (Default: 100) - for Reason_unstable_if, Reason_unstable_fused_if, etc.
PerMethodSpecTrapLimit (Default: 5000) - for Reason_speculate_class_check, Reason_speculate_null_check, etc.
With the gived defaults, if the only reason for the method recompilation is unstable_if, the system stabilizes after 100 traps (PerMethodTrapLimit). However, if the method experiences traps and recompilations for different reasons, the total number of recompilations can exceed 200 before hitting the limit for unstable_if traps. This triggers Action_none and causes the deopt storm.
The proposal is a minimal change in GraphKit::uncommon_trap: apply the same
too_many_recompilesthreshold insideParse::path_is_suitable_for_uncommon_trap- this ensures that on the final recompilation C2 gets a hint not to speculate on untaken branches anymore.As an alternative solution, we can revisit GraphKit::uncommon_trap. This "Temporary fix" has persisted in the codebase for 17 years, so it is probably time to change it as well. Any comments are welcome
Progress
Issue
Reviewing
Using
gitCheckout this PR locally:
$ git fetch https://git.openjdk.org/jdk.git pull/28966/head:pull/28966$ git checkout pull/28966Update a local copy of the PR:
$ git checkout pull/28966$ git pull https://git.openjdk.org/jdk.git pull/28966/headUsing Skara CLI tools
Checkout this PR locally:
$ git pr checkout 28966View PR using the GUI difftool:
$ git pr show -t 28966Using diff file
Download this PR as a diff file:
https://git.openjdk.org/jdk/pull/28966.diff
Using Webrev
Link to Webrev Comment