HBASE-29245 Region reopening batch size should be increased when backoff is 0 #6892

junegunn · 2025-04-08T08:12:33Z

No description provided.

Apache-HBase · 2025-04-08T09:06:23Z

🎊 +1 overall

Vote	Subsystem	Runtime	Logfile	Comment
+0 🆗	reexec	0m 42s		Docker mode activated.
			_ Prechecks _
+1 💚	dupname	0m 0s		No case conflicting files found.
+0 🆗	codespell	0m 0s		codespell was not available.
+0 🆗	detsecrets	0m 0s		detect-secrets was not available.
+1 💚	@author	0m 0s		The patch does not contain any @author tags.
+1 💚	hbaseanti	0m 0s		Patch does not have any anti-patterns.
			_ master Compile Tests _
+1 💚	mvninstall	4m 48s		master passed
+1 💚	compile	4m 10s		master passed
+1 💚	checkstyle	0m 54s		master passed
+1 💚	spotbugs	1m 58s		master passed
+1 💚	spotless	1m 2s		branch has no errors when running spotless:check.
			_ Patch Compile Tests _
+1 💚	mvninstall	4m 4s		the patch passed
+1 💚	compile	3m 40s		the patch passed
+1 💚	javac	3m 40s		the patch passed
+1 💚	blanks	0m 0s		The patch has no blanks issues.
+1 💚	checkstyle	0m 58s		the patch passed
+1 💚	spotbugs	2m 27s		the patch passed
+1 💚	hadoopcheck	14m 0s		Patch does not cause any errors with Hadoop 3.3.6 3.4.0.
+1 💚	spotless	1m 0s		patch has no errors when running spotless:check.
			_ Other Tests _
+1 💚	asflicense	0m 17s		The patch does not generate ASF License warnings.
		49m 44s

Subsystem	Report/Notes
Docker	ClientAPI=1.43 ServerAPI=1.43 base: https://ci-hbase.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-6892/1/artifact/yetus-general-check/output/Dockerfile
GITHUB PR	#6892
Optional Tests	dupname asflicense javac spotbugs checkstyle codespell detsecrets compile hadoopcheck hbaseanti spotless
uname	Linux ea72d147db4a 5.4.0-1103-aws #111~18.04.1-Ubuntu SMP Tue May 23 20:04:10 UTC 2023 x86_64 x86_64 x86_64 GNU/Linux
Build tool	maven
Personality	dev-support/hbase-personality.sh
git revision	master / `6cdd226`
Default Java	Eclipse Adoptium-17.0.11+9
Max. process+thread count	84 (vs. ulimit of 30000)
modules	C: hbase-server U: hbase-server
Console output	https://ci-hbase.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-6892/1/console
versions	git=2.34.1 maven=3.9.8 spotbugs=4.7.3
Powered by	Apache Yetus 0.15.0 https://yetus.apache.org

This message was automatically generated.

Apache-HBase · 2025-04-08T12:26:16Z

🎊 +1 overall

Vote	Subsystem	Runtime	Logfile	Comment
+0 🆗	reexec	0m 28s		Docker mode activated.
-0 ⚠️	yetus	0m 2s		Unprocessed flag(s): --brief-report-file --spotbugs-strict-precheck --author-ignore-list --blanks-eol-ignore-file --blanks-tabs-ignore-file --quick-hadoopcheck
			_ Prechecks _
			_ master Compile Tests _
+1 💚	mvninstall	3m 14s		master passed
+1 💚	compile	0m 57s		master passed
+1 💚	javadoc	0m 28s		master passed
+1 💚	shadedjars	5m 53s		branch has no errors when building our shaded downstream artifacts.
			_ Patch Compile Tests _
+1 💚	mvninstall	3m 13s		the patch passed
+1 💚	compile	0m 57s		the patch passed
+1 💚	javac	0m 57s		the patch passed
+1 💚	javadoc	0m 27s		the patch passed
+1 💚	shadedjars	5m 52s		patch has no errors when building our shaded downstream artifacts.
			_ Other Tests _
+1 💚	unit	222m 23s		hbase-server in the patch passed.
		248m 29s

Subsystem	Report/Notes
Docker	ClientAPI=1.43 ServerAPI=1.43 base: https://ci-hbase.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-6892/1/artifact/yetus-jdk17-hadoop3-check/output/Dockerfile
GITHUB PR	#6892
Optional Tests	javac javadoc unit compile shadedjars
uname	Linux 417b24f80060 5.4.0-1103-aws #111~18.04.1-Ubuntu SMP Tue May 23 20:04:10 UTC 2023 x86_64 x86_64 x86_64 GNU/Linux
Build tool	maven
Personality	dev-support/hbase-personality.sh
git revision	master / `6cdd226`
Default Java	Eclipse Adoptium-17.0.11+9
Test Results	https://ci-hbase.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-6892/1/testReport/
Max. process+thread count	4805 (vs. ulimit of 30000)
modules	C: hbase-server U: hbase-server
Console output	https://ci-hbase.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-6892/1/console
versions	git=2.34.1 maven=3.9.8
Powered by	Apache Yetus 0.15.0 https://yetus.apache.org

This message was automatically generated.

rmdmattingly

Nice catch. Back in the day, I think my intention was that you'd never configure batching without also configuring a backoff, but that wasn't a good idea because it makes this hard to configure.

FWIW I do wonder if there's no significant value in configuring your clusters this way because it will work basically the same as an unbatched table modification (regions will be reopened as quickly as the HMaster can process their reopen procedures). So maybe it's worth adding a warning log recommending that the operator raise hbase.reopen.table.regions.progressive.batch.backoff.ms?
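The warning suggested above could look roughly like the following. This is only a sketch of the idea, not code from the patch: the class name, the method `warningFor`, and the convention that a non-positive maximum batch size means "batching disabled" are all assumptions made for illustration. Only the configuration key is taken from the comment itself.

```java
import java.util.Optional;

public class BackoffWarningSketch {
  // Configuration key quoted in the review comment.
  static final String BACKOFF_KEY =
    "hbase.reopen.table.regions.progressive.batch.backoff.ms";

  /**
   * Returns a warning message when batching is enabled but no backoff is set.
   * Assumption for this sketch: maxBatchSize <= 0 means batching is disabled.
   */
  public static Optional<String> warningFor(int maxBatchSize, long backoffMs) {
    boolean batchingEnabled = maxBatchSize > 0;
    if (batchingEnabled && backoffMs <= 0) {
      return Optional.of("Batched region reopening is enabled without a backoff; "
        + "consider raising " + BACKOFF_KEY);
    }
    return Optional.empty();
  }

  public static void main(String[] args) {
    // Batching with zero backoff: a warning is produced.
    System.out.println(warningFor(16, 0L).orElse("no warning"));
    // Batching with a 2-second backoff: no warning.
    System.out.println(warningFor(16, 2000L).orElse("no warning"));
  }
}
```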

junegunn · 2025-04-08T15:12:51Z

Thanks for the review.

> FWIW I do wonder if there's no significant value in configuring your clusters this way because it will work basically the same as an unbatched table modification (regions will be reopened as quickly as the HMaster can process their reopen procedures).

We're expecting two benefits of using the option, even without backoff.

1. Reduce the number of regions that are (temporarily) unavailable at any given point during an alter operation (otherwise determined by the number of region servers and their hbase.regionserver.executor.closeregion.threads), to achieve better region availability and less overall service impact. For example, we can set the option to something like 16 to ensure that at most 16 regions are unavailable at any given point. This helps minimize service disruption for a latency-sensitive application.
   - CLOSING is when the region is marked CLOSING on hbase:meta
   - REJECT is when the region actually becomes unavailable (client starts getting NotServingRegionException and retries)
   - The accompanying plot (not preserved here) shows the number of regions between REJECT and OPEN at a given point in time
2. It protects the table from a faulty alter operation, as pointed out in HBASE-29136, because only one region is affected.
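To make the first benefit concrete, here is a minimal, self-contained sketch of how a progressive batch schedule bounds the number of regions in flight. It does not use any HBase classes. The class and method names are hypothetical, and the growth rule (start at one region, double after each batch, cap at the configured maximum) is an assumption made for illustration, not a quote of the patch.

```java
import java.util.ArrayList;
import java.util.List;

public class ProgressiveBatchSketch {
  /**
   * Returns the size of each reopen batch for regionCount regions.
   * Assumed rule: the batch size doubles after every batch, capped at maxBatch,
   * and grows whether or not a backoff is configured.
   */
  public static List<Integer> batchSizes(int regionCount, int initialBatch, int maxBatch) {
    if (initialBatch < 1 || maxBatch < initialBatch) {
      throw new IllegalArgumentException("need 1 <= initialBatch <= maxBatch");
    }
    List<Integer> sizes = new ArrayList<>();
    int remaining = regionCount;
    int batch = initialBatch;
    while (remaining > 0) {
      int n = Math.min(batch, remaining); // the last batch may be smaller
      sizes.add(n);
      remaining -= n;
      batch = Math.min(maxBatch, batch * 2); // grow, but never beyond the cap
    }
    return sizes;
  }

  public static void main(String[] args) {
    // 100 regions, starting at 1, capped at 16 regions per batch.
    System.out.println(batchSizes(100, 1, 16));
  }
}
```

Under these assumptions no batch exceeds the cap of 16, so at most 16 regions are being reopened at once, and the first batch touches a single region, which is what limits the damage of a faulty alter.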

rmdmattingly · 2025-04-08T15:33:14Z

Nice, I suppose there's enough latency baked into awaiting the slowest of each batch & issuing the next batch to really limit the disruption at any point in time. Great stuff 🚀

Apache9 · 2025-04-20T08:40:40Z

@rmdmattingly Can we merge this now?

junegunn · 2025-04-25T13:11:49Z

@Apache9 @rmdmattingly I think it's okay to merge. It's a relatively simple patch. I extended the existing test cases with more assertions and I also tested it manually to confirm that it works as expected with and without backoff.

No backoff (default)

2-second backoff

junegunn · 2025-04-30T15:37:47Z

Please take a look at #6951 as well. Appreciate it!

junegunn · 2025-05-29T03:51:11Z

@Apache9 @rmdmattingly Sorry to ping again, but is there anything else I can do to help move this and #6951 forward? We can live without this one because we can set the backoff value to something like 1ms, but we're particularly interested in getting #6951 into branch-2 so we can avoid maintaining it in our internal fork. I believe supporting multiple throttling configurations without requiring a master server restart makes the feature much more practical.

HBASE-29245 Region reopening batch size should be increased when backoff is 0 (commit 6cdd226)

rmdmattingly approved these changes Apr 8, 2025
