
stats: remove storm of refcount bumping during metrics snapshot. #43958

Draft
jmarantz wants to merge 10 commits into envoyproxy:main from jmarantz:refcount-storm-on-metrics-snapshot

Conversation

@jmarantz
Contributor

@jmarantz jmarantz commented Mar 14, 2026

Commit Message:

Changes Envoy’s stats snapshotting/lifetime strategy to avoid per-metric refcount churn during a metrics flush by retaining scope references instead of taking a refcounted handle to every individual stat.

Changes:

Update MetricSnapshotImpl to keep metrics alive by capturing ConstScopeSharedPtrs for all scopes, rather than incrementing refcounts on every stat during snapshot creation.
Restrict Allocator/AllocatorImpl stat-construction APIs (makeCounter/makeGauge/makeTextReadout) to protected, pushing call sites toward Scope APIs and enabling the snapshot optimization assumption (“stats are scope-owned”).
Update related tests/benchmarks to use scope-based stat construction and adjust benchmark ranges.

This change makes the existing bmFlushToSinks benchmark in test/server/server_stats_flush_benchmark_test.cc run about 5x faster. The effect is especially visible when the benchmark is extended to 10M stats (done in this PR), which highlights a problematic metrics flush duration of >5s when the flush interval is 5s. I will also note that the excessive ref-count churn is likely much worse on ARM, though I did not measure that.

Before:

Benchmark                                         Time             CPU   Iterations
-----------------------------------------------------------------------------------
bmFlushToSinks/10                             0.002 ms        0.002 ms       299435
bmFlushToSinks/100                            0.011 ms        0.011 ms        61284
bmFlushToSinks/1000                           0.109 ms        0.109 ms         6353
bmFlushToSinks/10000                           1.12 ms         1.12 ms          616
bmFlushToSinks/100000                          27.0 ms         27.0 ms           23
bmFlushToSinks/1000000                          394 ms          394 ms            2
bmFlushToSinks/10000000                        4841 ms         4840 ms            1

After:

Benchmark                                         Time             CPU   Iterations
-----------------------------------------------------------------------------------
bmFlushToSinks/10                             0.002 ms        0.002 ms       436476
bmFlushToSinks/100                            0.004 ms        0.004 ms       202299
bmFlushToSinks/1000                           0.030 ms        0.030 ms        23043
bmFlushToSinks/10000                          0.376 ms        0.375 ms         1879
bmFlushToSinks/100000                          6.73 ms         6.73 ms          103
bmFlushToSinks/1000000                          100 ms          100 ms            7
bmFlushToSinks/10000000                        1065 ms         1065 ms            1

Additional Description:
Risk Level: medium -- changes the mechanism for keeping stats alive during a metrics sink flush
Testing: //test/... but, man, Envoy test has gotten super flaky since I last did dev. Things eventually pass when I re-run 5 or so integration tests one at a time, on a clean client. IDK what's up with CI.
Docs Changes: n/a
Release Notes: n/a
Platform Specific Features: n/a
Fixes #43836

Signed-off-by: Joshua Marantz <jmarantz@google.com>
@repokitteh-read-only

As a reminder, PRs marked as draft will not be automatically assigned reviewers,
or be handled by maintainer-oncall triage.

Please mark your PR as ready when you want it to be reviewed!


@jmarantz
Contributor Author

/retest

Contributor

Copilot AI left a comment


Pull request overview

This PR changes Envoy’s stats snapshotting/lifetime strategy to avoid per-metric refcount churn during a metrics flush, primarily by retaining scope references instead of taking a refcounted handle to every individual stat.

Changes:

  • Update MetricSnapshotImpl to keep metrics alive by capturing ConstScopeSharedPtrs for all scopes, rather than incrementing refcounts on every stat during snapshot creation.
  • Restrict Allocator/AllocatorImpl stat-construction APIs (makeCounter/makeGauge/makeTextReadout) to protected, pushing call sites toward Scope APIs and enabling the snapshot optimization assumption (“stats are scope-owned”).
  • Update related tests/benchmarks to use scope-based stat construction and adjust benchmark ranges.

Reviewed changes

Copilot reviewed 9 out of 9 changed files in this pull request and generated 3 comments.

Summary per file:

  • test/server/server_stats_flush_benchmark_test.cc: Expands benchmark ranges for sink flush performance testing.
  • test/server/admin/prometheus_stats_test.cc: Migrates test metric creation to ThreadLocalStoreImpl/Scope APIs.
  • test/common/stats/thread_local_store_test.cc: Avoids repeated allocator-based counter creation for use_count() assertions by reusing a store-owned counter.
  • test/common/stats/metric_impl_test.cc: Adds fixture wrappers for allocator stat creation (now protected) and updates call sites.
  • test/common/stats/allocator_impl_test.cc: Adds fixture wrappers for allocator stat creation (now protected) and updates call sites.
  • source/server/server.h: Replaces per-stat "snapped_*" holders with scopes_ to retain metric lifetimes via scopes.
  • source/server/server.cc: Implements the new snapshot behavior: capture scopes first, then collect sinked metric references.
  • source/common/stats/allocator_impl.h: Moves allocator construction methods to protected and adds friend access for tests/isolated store.
  • envoy/stats/allocator.h: Makes allocator construction methods protected and documents the rationale (scope ownership + snapshot performance).

Comment on lines 63 to 65
void addCounter(const std::string& name, Stats::StatNameTagVector cluster_tags) {
Stats::StatNameManagedStorage name_storage(baseName(name, cluster_tags), *symbol_table_);
Stats::StatNameManagedStorage tag_extracted_name_storage(name, *symbol_table_);
counters_.push_back(alloc_.makeCounter(name_storage.statName(),
tag_extracted_name_storage.statName(), cluster_tags));
counters_.push_back(&scope_->counterFromStatNameWithTags(pool_.add(name), cluster_tags));
}
Contributor Author


Copilot is wrong, here and below; this compiles fine.

Comment on lines 67 to 71
void addGauge(const std::string& name, Stats::StatNameTagVector cluster_tags,
Stats::Gauge::ImportMode import_mode = Stats::Gauge::ImportMode::Accumulate) {
Stats::StatNameManagedStorage name_storage(baseName(name, cluster_tags), *symbol_table_);
Stats::StatNameManagedStorage tag_extracted_name_storage(name, *symbol_table_);
gauges_.push_back(alloc_.makeGauge(
name_storage.statName(), tag_extracted_name_storage.statName(), cluster_tags, import_mode));
gauges_.push_back(
&scope_->gaugeFromStatNameWithTags(pool_.add(name), cluster_tags, import_mode));
}
Contributor Author


Copilot is incorrect; this compiles fine.

Comment on lines +75 to +76
Stats::TextReadoutSharedPtr textReadout =
&scope_->textReadoutFromStatNameWithTags(pool_.add(name), cluster_tags);
Contributor Author


Copilot is incorrect; this compiles fine.

@jmarantz jmarantz marked this pull request as ready for review March 14, 2026 14:50
@jmarantz jmarantz changed the title WIP stats: remove storm of refcount bumping during metrics snapshot. stats: remove storm of refcount bumping during metrics snapshot. Mar 14, 2026
@jmarantz
Contributor Author

jmarantz commented Mar 14, 2026

@pradeepcrao @ggreenway I want to get feedback on this as a whole, but I think it would be reasonable to ask that I make one refactor before this merges.

I tried to ensure we don't directly generate stats from the Stats::Allocator by making its makeCounter (etc) methods protected. I couldn't make them private because of some integration test infrastructure around notifying counters.

However I think what I should do is:

  • merge Stats::AllocatorImpl and Stats::Allocator into one class, and make the methods non-virtual.
  • change the NotifyingCounter mechanism in the integration tests to use a delegate at the Scope level instead of at the Allocator level.

I think what I have here is safe today but someone could work around the restricted access to makeCounter by subclassing; it would be better for that to be private.

But I thought I'd get high level comments on this PR first. I think this may go a long way to improving Envoy performance on ARM, at least with large numbers of stats.

@agrawroh
Member

Amazing! <3

/assign @ggreenway

@jmarantz
Contributor Author

I think #43968 should merge first and then this PR will be more obviously correct -- will make makeCounter et al private instead of protected.

@jmarantz
Contributor Author

/gemini review

Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request introduces a significant performance optimization by changing how metric snapshots are handled to avoid reference counting churn. The core idea is to hold references to scopes instead of individual stats. This is enforced by making the stat creation methods on the Allocator protected, forcing usage of Scope APIs. The changes are well-implemented, and the necessary updates to tests and benchmarks are included. The benchmark results clearly demonstrate the performance gain. I have kept the minor suggestion to improve a comment that has become misleading due to the refactoring, as it does not contradict any existing rules.

// less than a second, and in fastbuild it takes less than 5.
CounterSharedPtr counter =
alloc_.makeCounter(my_counter_scoped_name_, StatName(), StatNameTagVector{});
uint32_t use_count = counter->use_count() - 1; // Subtract off this instance.
Contributor


medium

The comment // Subtract off this instance. is a leftover from the previous implementation where the test held a CounterSharedPtr instance. Now that counter is a raw pointer, this comment is misleading. The -1 is to subtract the reference held by the central allocator, so that use_count reflects only the references from the TLS caches. A more accurate comment would be // Subtract off the central allocator's reference..

Suggested change
uint32_t use_count = counter->use_count() - 1; // Subtract off this instance.
uint32_t use_count = counter->use_count() - 1; // Subtract off the central allocator's reference.

@ggreenway
Member

I haven't looked at any code yet, but one thing immediately came to mind: in the stats access logger (it's new), for a matched add/subtract pair (for something like an active/concurrency stat), it holds a reference to a gauge separate from any scope, so holding a count on all the scopes doesn't necessarily keep all the metrics in the store alive (although in nearly all cases it would). I'm not sure the best way to handle that. It's currently being worked on in #43812.

@jmarantz
Contributor Author

How would a metric be added to the store without being held by a scope? Would it call Allocator::makeCounter directly (as this PR attempts to prevent by making makeCounter protected, soon to be private when #43968 lands)?

@jmarantz
Contributor Author

I haven't read about stats access loggers yet but can't they own a scope for the stats they create/manage?

@kyessenov
Contributor

Looks nice, thanks for re-working the snapshotting. The stats logger has its own long-living scope, so I think we're fine there. We can now also defer-delete scopes and extend their lifetime to the next flush, which solves the long-standing problem of stats going missing when a scope is added and deleted within a single flush period.

@ggreenway
Member

How would a metric be added to the store without being held by a scope? Would it call Allocator::makeCounter directly (as this PR attempts to prevent by putting makeCounter in protected section (soon to be private section when #43968 lands)?

It's initially created via a scope, but the scope has eviction enabled, and during an operation the logger holds an additional reference/sharedptr to the metric, so it could be removed from the only scope that holds it, but there could still be a reference/sharedptr.

@jmarantz
Contributor Author

I think I need to catch up on what scope eviction is. I hadn't thought about the possibility of someone holding onto a reference to a stat for longer than its scope lives.

If that's the case in the code today for stats access loggers, then I think this PR is not safe :(

Do we need stats access logger's stats to be sinked?

@kyessenov
Contributor

I think I need to catch up on what scope eviction is. I hadn't thought about the possibility of someone holding onto a reference to a stat for longer than its scope lives.

This is possible without eviction. I think it's probably not happening in the tree; all stats are allocated off a scope, which should outlast them, but I don't know how to verify it.

If that's the case in the code today for stats access loggers, then I think this PR is not safe :(

Eviction actually requires not holding a reference to a stat. Gauge is a special case, and they are tricky.

ggreenway pushed a commit that referenced this pull request Mar 17, 2026
… implementation from Stats::AllocatorImpl to Stats::Allocator (#43968)

A long time ago, all Envoy stats were stored in a large shared memory
block, significantly limiting how many you could get, and the lengths of
their names. This facilitated keeping stat continuity across
parent/child hot-restart. The fixed shared-mem block size was very
limiting and a bit inefficient, so an alternative mechanism of keeping
stats on the heap was added for when hot-restart was not needed. To
switch between them, we used the Allocator interface, and kept two
implementations of that.

Then we removed the shared-memory version in favor of using message
passing to update counters from parent to child during hot restart.

Now it's time to remove that interface; it's making it a bit harder to
reason about guaranteeing that stats users don't call the allocator
directly; it's really an implementation detail of the stats system. So
this is a pure refactor (no functional change) which can help that, and
make it easier to reason about the safety of #43958 .

As removing headers and class names can be a breaking change, temporary
forwarding headers are left behind; out-of-repo references should be
changed by June.

Signed-off-by: Joshua Marantz <jmarantz@google.com>
@jmarantz
Contributor Author

I'm having some trouble understanding the lifecycle of a stats-access-logger stat and how it interacts with scopes. Also, why are gauges different?

Is there some doc or design or detailed PR description, describing the stats-access-logger feature and how it works?

@kyessenov
Contributor

I'm having some trouble understanding the lifecycle of a stats-access-logger stat and how it interacts with scopes. Also, why are gauges different?

Is there some doc or design or detailed PR description, describing the stats-access-logger feature and how it works?

Gauges are different because we actually want UpDownCounter (which is just like two counters that you take a diff of). So we're effectively overlaying the semantics of UpDownCounter over the Gauge implementation, and there's some friction.

@jmarantz
Contributor Author

I'm not really following how the semantics of adjusting and interpreting Gauge values impact their lifetime.

I think the centerpiece of this PR is that I was trying to lighten the cost of lifetime management for individual stats by holding onto the scopes and assuming that would be sufficient. This premise is defeated by:

{ // on main thread
  ScopePtr scope = createScope();
  CounterSharedPtr counter = &scope->counter(...); // bumps ref count
  scope.reset(); // counter still exists and is reachable from stats allocator iteration
}                // (assume `counter` has been handed off to a worker)
{ // on worker thread
  counter.reset(); // frees the counter, which could race with a metrics snapshot
}

I'm mulling over another refactor to try to limit the appearance of CounterSharedPtr et al outside the stats system itself. I think only by creating that boundary (if it's not too late) can I make the leap to saying "if you hold onto a scope, all the stats you found in the Allocator are safe".

Another option I am considering, which might be easier, would be to avoid looping over the allocator during metrics collection; I would only loop over scopes and the stats held inside them. I could use a flag bit in the stat to de-dup. This would exclude from the metrics snapshot any stats that are not in scopes. That might be OK.

@ggreenway
Member

All the crazy stuff in the stats access logger is new (hasn't made it into a release yet) and isn't working correctly, and the solutions have all felt convoluted. If we can get a huge efficiency gain for an operation that happens all the time, I'd be tempted to completely re-work how that's done in the stats access logger, or delete the feature from it.

@kyessenov
Contributor

@ggreenway The problem that Josh highlighted is present regardless of the log stats extension. We need some way to couple scope with stat lifetimes throughout the code base, maybe by returning some struct that shares ScopeSharedPtr with raw pointer to the stat to the workers. I don't think it's safe to merge this change without doing that first, since Josh's example may very well lurk somewhere, and the general deletion order with xDS is highly complex.

@jmarantz
Contributor Author

The discussion about the new access-logger stats only helped highlight that this is too dependent on the patterns in which stats are typically used, and leaves time bombs for anyone straying outside them.

Summarizing my plan above, it will be one of:

  1. Remove CounterSharedPtr et al from the codebase outside stats impl and tests.
  2. Iterate over stats inside scopes while holding a reference to the scopes. This will mean we sink only stats in scopes, which is probably ok.

@pradeepcrao
Contributor

  1. Remove CounterSharedPtr et al from the codebase outside stats impl and tests.
  2. Iterate over stats inside scopes while holding a reference to the scopes. This will mean we sink only stats in scopes, which is probably ok.

This sounds like the right thing to do.

@ggreenway
Member

  1. Remove CounterSharedPtr et al from the codebase outside stats impl and tests.

I agree this seems like the right plan. However, if this is the plan, now is the time to chime in on #43812

@jmarantz
Contributor Author

I'm leaning toward option 2 (iterate over scopes) rather than trying to hide CounterSharedPtr from the masses. It's pretty ingrained at this point, and for some use-cases it's convenient to have the stat kept alive after the return.

@pradeepcrao
Contributor

I'm leaning toward option 2 (iterate over scopes) rather than trying to hide CounterSharedPtr from the masses. It's pretty ingrained at this point, and for some use-cases it's convenient to have the stat kept alive after the return.

I think we will need both: use scopes to ensure the stats are not deleted during the flush operation, but still iterate over stats in the allocator. We improved the flush speed by an order of magnitude by not iterating over scopes and deduping stats using a StatNameHashSet in #17909

@jmarantz
Contributor Author

jmarantz commented Mar 18, 2026

Oh, thanks for the benchmark data from earlier. It seems things got a lot slower between the "with change" numbers on that PR and the "before" numbers on this one, degrading from 100ms for num_stats=1M to 304ms measured this week. That 3x performance loss might have happened in #18144, a few weeks after #17909 was merged.

Let me get a read on how fast we can make the scope-based iteration; I think we can de-dup with a spare flag bit in the stat rather than a hash-set, which might help.

I think there's going to be huge churn, and probably multiple refactoring PRs, if we try to remove CounterSharedPtr from the codebase generally.

@ggreenway
Member

ggreenway commented Mar 18, 2026

A quick grep shows not too many hits, but the dynamic modules ABI might be hard to fix:

edit: nevermind, I don't think that'll be hard to fix. It's a use of stats_store.iterate() and that's in the signature of the required callback.

./server/admin/stats_handler.cc:54:  for (const Stats::CounterSharedPtr& counter : server_.stats().counters()) {
./server/admin/prometheus_stats.cc:883:    const std::vector<Stats::CounterSharedPtr>& counters,
./server/admin/prometheus_stats.cc:950:    const std::vector<Stats::CounterSharedPtr>& counters,
./server/admin/prometheus_stats.cc:963:    const std::vector<Stats::CounterSharedPtr>& counters,
./server/admin/prometheus_stats.cc:981:    const std::vector<Stats::CounterSharedPtr>& counters,
./server/admin/stats_request.cc:141:      renderStat<Stats::CounterSharedPtr>(iter->first, response, variant);
./server/admin/prometheus_stats.h:77:  static uint64_t statsAsPrometheus(const std::vector<Stats::CounterSharedPtr>& counters,
./server/admin/prometheus_stats.h:88:  statsAsPrometheusText(const std::vector<Stats::CounterSharedPtr>& counters,
./server/admin/prometheus_stats.h:97:  statsAsPrometheusProtobuf(const std::vector<Stats::CounterSharedPtr>& counters,
./server/admin/prometheus_stats.h:107:  generateWithOutputFormat(const std::vector<Stats::CounterSharedPtr>& counters,
./server/admin/stats_request.h:18:  using StatOrScopes = absl::variant<ScopeVec, Stats::TextReadoutSharedPtr, Stats::CounterSharedPtr,
./server/server.cc:180:        snapped_counters_.push_back(Stats::CounterSharedPtr(&counter));
./server/server.h:494:  std::vector<Stats::CounterSharedPtr> snapped_counters_;
./extensions/bootstrap/dynamic_modules/abi_impl.cc:99:      [&name_view, &found, value_ptr](const Envoy::Stats::CounterSharedPtr& counter) -> bool {
./common/stats/thread_local_store.cc:85:    removeRejectedStats<CounterSharedPtr>(central_cache->counters_,
./common/stats/thread_local_store.cc:86:                                          [this](const CounterSharedPtr& counter) mutable {
./common/stats/thread_local_store.cc:153:std::vector<CounterSharedPtr> ThreadLocalStoreImpl::counters() const {
./common/stats/thread_local_store.cc:155:  std::vector<CounterSharedPtr> ret;
./common/stats/thread_local_store.cc:157:                 [&ret](Counter& counter) { ret.emplace_back(CounterSharedPtr(&counter)); });
./common/stats/thread_local_store.cc:601:         const StatNameTagVector& tags) -> CounterSharedPtr {
./common/stats/thread_local_store.cc:1105:  StatNameHashMap<CounterSharedPtr> counters_;
./common/stats/isolated_store_impl.cc:28:                       StatNameTagVectorOptConstRef tags) -> CounterSharedPtr {
./common/stats/allocator_impl.h:27:  CounterSharedPtr makeCounter(StatName name, StatName tag_extracted_name,
./common/stats/allocator_impl.h:62:  void markCounterForDeletion(const CounterSharedPtr& counter) override;
./common/stats/allocator_impl.h:105:  // to exist to hold it as (e.g.) a CounterSharedPtr rather than a Counter&
./common/stats/allocator_impl.h:107:  std::vector<CounterSharedPtr> deleted_counters_ ABSL_GUARDED_BY(mutex_);
./common/stats/thread_local_store.h:184:  std::vector<CounterSharedPtr> counters() const override;
./common/stats/thread_local_store.h:282:    StatNameHashMap<CounterSharedPtr> counters_;
./common/stats/thread_local_store.h:602:  // to exist to hold it as (e.g.) a CounterSharedPtr rather than a Counter&
./common/stats/thread_local_store.h:619:  CounterSharedPtr counters_overflow_;
./common/stats/thread_local_store.h:620:  CounterSharedPtr gauges_overflow_;
./common/stats/thread_local_store.h:621:  CounterSharedPtr histograms_overflow_;
./common/stats/allocator_impl.cc:300:CounterSharedPtr AllocatorImpl::makeCounter(StatName name, StatName tag_extracted_name,
./common/stats/allocator_impl.cc:309:  auto counter = CounterSharedPtr(makeCounterInternal(name, tag_extracted_name, stat_name_tags));
./common/stats/allocator_impl.cc:470:void AllocatorImpl::markCounterForDeletion(const CounterSharedPtr& counter) {
./common/stats/isolated_store_impl.h:43:  std::vector<CounterSharedPtr> counters() const override { return counters_.toVector(); }

@pradeepcrao
Contributor

I think we can de-dup with a spare flag bit in the stat rather than a hash-set, which might help.

Neat!

@kyessenov
Contributor

You could also use the "used" bit, since it's cleared anyway during the flush. So that could serve as the marker if you combine the loops.

@ggreenway
Member

Is this ready for review, or is this waiting on further changes?

@jmarantz
Contributor Author

No, sorry, I haven't checked in anything yet to address the race I described yesterday. Working on it.

I can shift to draft-mode to disambiguate.

@jmarantz jmarantz marked this pull request as draft March 19, 2026 01:55


Development

Successfully merging this pull request may close these issues.

stats: excessive ref-count inc/dec causing performance bottleneck

6 participants