[Feature Request] Speed up percentile aggregation by switching implementation #18122

peteralfonsi · 2025-04-28T21:55:13Z

Is your feature request related to a problem? Please describe

The percentiles aggregation can be very slow. We rely on the t-digest library to get approximate percentiles. While poking around in the code I noticed we use their AVLTreeDigest implementation, but the recommended one is now MergingDigest. It looks like OpenSearch's TDigestState was last meaningfully modified in March 2017, but this new implementation was introduced after that in April 2017, which explains why we aren't already using it.

The comments claim this implementation is both faster and also uses "much less than half" of the memory of AVLTreeDigest. I couldn't find any actual numbers for speed posted online but I did run some benchmarks with OpenSearch that look good.

Describe the solution you'd like

We should switch to the new implementation. Since these extend the same abstract class it would be a drag-and-drop change.

I benchmarked this change on http_logs which has 247M docs. I did it for the "@timestamp" field (high cardinality) and the "status" field (low cardinality since it's an HTTP status code). The speedup was especially large for status:

Field	Baseline latency (ms)	Modififed latency (ms)
timestamp	13,085	6,293
status	196,794	6,212

Related component

Search:Performance

Describe alternatives you've considered

No response

Additional context

No response

The text was updated successfully, but these errors were encountered:

peteralfonsi added enhancement Enhancement or improvement to existing feature or request untriaged labels Apr 28, 2025

github-actions bot added the Search:Performance label Apr 28, 2025

github-project-automation bot added this to Search Project Board Apr 28, 2025

github-project-automation bot moved this to 🆕 New in Search Project Board Apr 28, 2025

peteralfonsi self-assigned this Apr 28, 2025

peteralfonsi mentioned this issue Apr 28, 2025

Switch percentiles implementation to MergingDigest #18124

Merged

1 task

mch2 removed the untriaged label May 7, 2025

msfroh closed this as completed in #18124 May 28, 2025

github-project-automation bot moved this from 🆕 New to ✅ Done in Search Project Board May 28, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Feature Request] Speed up percentile aggregation by switching implementation #18122

[Feature Request] Speed up percentile aggregation by switching implementation #18122

peteralfonsi commented Apr 28, 2025

[Feature Request] Speed up percentile aggregation by switching implementation #18122

[Feature Request] Speed up percentile aggregation by switching implementation #18122

Comments

peteralfonsi commented Apr 28, 2025

Is your feature request related to a problem? Please describe

Describe the solution you'd like

Related component

Describe alternatives you've considered

Additional context