(8/5) Reduce VMM reservation contention #7533

Merged: 159 commits from vmm-reduce-contention into main on Mar 20, 2025

Conversation

@smklein (Collaborator) commented Feb 12, 2025

#7498 was introduced to benchmark the cost of concurrent instance provisioning, and it demonstrated that contention can significantly degrade performance on the VMM reservation pathway.

This PR optimizes that pathway by removing the VMM reservation transaction and replacing it with a handful of non-transactional queries:

  1. First, we query to see if the VMM reservation has already succeeded (for idempotency).
  2. Next, we query for all viable sled targets and affinity information (sled_find_targets_query).
  3. After parsing that data and picking a sled, we call sled_insert_resource_query to INSERT the desired VMM record and to re-validate our constraints.

This change significantly improves performance in the vmm-reservation benchmark, while upholding the necessary constraints implicit to VMM provisioning.
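
In outline, the new flow looks something like the sketch below. Everything in it is hypothetical (the types, helper names, and id/memory parameters stand in for the queries named above); it is meant to show the shape of the path, not the actual datastore code.

```rust
// Sketch only: all types and helpers are stand-ins for the real Nexus queries.
struct SledTarget {
    sled_id: u64,
    // Affinity information returned alongside each viable sled by
    // `sled_find_targets_query`, used to rank candidates in Rust.
    violates_allow_level_rules: bool,
}

// Stubs standing in for the three database interactions described above.
fn existing_reservation(_propolis_id: u64) -> Option<u64> {
    None // step 1: look up a prior successful reservation, if any
}
fn find_targets(_propolis_id: u64, _reservoir_ram: u64) -> Vec<SledTarget> {
    Vec::new() // step 2: all viable sleds plus affinity info, in one read
}
fn insert_resource(
    _propolis_id: u64,
    sled_id: u64,
    _reservoir_ram: u64,
) -> Result<u64, &'static str> {
    // step 3: the INSERT re-validates capacity and anti-affinity constraints,
    // so a concurrent reservation cannot silently invalidate this one.
    Ok(sled_id)
}

fn reserve_vmm(propolis_id: u64, reservoir_ram: u64) -> Result<u64, &'static str> {
    // 1. Idempotency: if a reservation already exists for this VMM, reuse it.
    if let Some(sled_id) = existing_reservation(propolis_id) {
        return Ok(sled_id);
    }

    // 2. One read-only query for every viable sled and its affinity data.
    let targets = find_targets(propolis_id, reservoir_ram);

    // Pick a sled, preferring ones that break no "allow"-level affinity rules.
    let chosen = targets
        .iter()
        .find(|t| !t.violates_allow_level_rules)
        .or_else(|| targets.first())
        .ok_or("no viable sleds")?;

    // 3. INSERT the VMM record while re-checking the constraints in SQL.
    insert_resource(propolis_id, chosen.sled_id, reservoir_ram)
}
```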

Comment on lines +65 to +71
other_aa_instances AS (
SELECT anti_affinity_group_instance_membership.group_id,instance_id
FROM anti_affinity_group_instance_membership
JOIN our_aa_groups
ON anti_affinity_group_instance_membership.group_id = our_aa_groups.group_id
WHERE instance_id != ").param().sql("
),
Contributor:

Optional: I'd almost be inclined to make these individual subqueries into individual const &strs, in part so they can be shared between sled_insert_resource_query and sled_find_targets_query, and in part so you can add doc comments to them (or inline comments in the queries) to help describe what's going on. Up to you how far to go here, though.
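
For instance, something along these lines (the CTE text is taken from the diff above; the constant name and the splicing comment are only illustrative, not the actual code):

```rust
/// CTE listing members of the placed instance's anti-affinity groups, minus
/// the instance itself (its id is bound as a parameter after the `!=`).
/// As a plain `const`, the fragment can be spliced into both
/// `sled_find_targets_query` and `sled_insert_resource_query`, and its doc
/// comment only has to be written once.
const OTHER_AA_INSTANCES: &str = "
other_aa_instances AS (
    SELECT anti_affinity_group_instance_membership.group_id, instance_id
    FROM anti_affinity_group_instance_membership
    JOIN our_aa_groups
    ON anti_affinity_group_instance_membership.group_id = our_aa_groups.group_id
    WHERE instance_id != ";

// e.g. builder.sql(OTHER_AA_INSTANCES).param().sql("),") in both queries.
```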

@smklein (Collaborator, author):

This gets a fair bit more complicated in #7572 - do you mind if I punt some of the cleanup to that PR, to avoid a merge conflict with myself?

Contributor:

Go for it!

COALESCE(SUM(CAST(sled_resource_vmm.reservoir_ram AS INT8)), 0) + "
).param().sql(" <= sled.reservoir_size
),
our_aa_groups AS (
Contributor:

We discussed this offline at the 27 Feb 2025 hypervisor huddle. An alternative approach we discussed (IIRC) went along these lines:

  • assign a score to each sled based on how preferable it is (e.g., add one point to the score for every "allow"-level affinity rule that would be violated by placing the instance on that sled)
  • when trying to reserve on a sled, see if the sled's score is at least as good (less than or equal to, in this example) as it was originally
  • if not, don't reserve there; instead, set its current score to its new score and add it back to the candidate pool

Assuming the instance's affinity group assignments are fixed (so that there are a fixed number of "allow"-level rules that it can break) I think this will always terminate (fixed number of rules implies a maximum possible score, and retrying always increases the threshold score).
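
As a rough sketch of that loop (everything here is hypothetical: the types, the scoring and reservation helpers, and the heap-based candidate pool are only meant to illustrate the retry-with-rising-threshold idea and its termination argument):

```rust
use std::cmp::Reverse;
use std::collections::BinaryHeap;

type SledId = u64;

/// Hypothetical: the number of "allow"-level affinity rules that placing the
/// instance on this sled would violate, recomputed from fresh data.
fn current_score(_sled: SledId) -> u32 {
    0
}

/// Hypothetical: attempt the reservation; may fail (e.g. the sled filled up).
fn try_reserve(_sled: SledId) -> bool {
    true
}

/// Candidates arrive with the score computed when they were first read.
fn place(candidates: Vec<(SledId, u32)>) -> Option<SledId> {
    // Min-heap keyed on score: lower score == fewer broken "allow" rules.
    let mut pool: BinaryHeap<Reverse<(u32, SledId)>> = candidates
        .into_iter()
        .map(|(sled, score)| Reverse((score, sled)))
        .collect();

    while let Some(Reverse((threshold, sled))) = pool.pop() {
        let score = current_score(sled);
        if score > threshold {
            // The sled got worse since it was scored: don't reserve, just put
            // it back with its new, higher score. Scores are bounded by the
            // number of "allow"-level rules, so this happens finitely often.
            pool.push(Reverse((score, sled)));
            continue;
        }
        if try_reserve(sled) {
            // Still at least as good as when it was scored: reserve here.
            return Some(sled);
        }
        // The reservation itself failed (e.g. out of capacity); drop the sled.
    }
    None
}
```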


Even if my recollection is accurate, I'm not sure I would bother going too far with this in this PR, mainly because I'm inclined to think of the "allow"-level affinity rules as being entirely best-effort: Nexus might eventually decide to violate them for other reasons (e.g. "co-locating these instances minimizes fragmentation" or "separating these instances is necessary to evacuate this sled for update"), even for instances that were "ideally" placed originally. If that's so (and especially if we expect that the system may eventually choose to migrate VMs to optimize their placements, including their adherence to affinity rules; RFD 494 hints very lightly at this), I'm not sure I would do a lot of extra work in this path to make sure that instances always end up with an ideal initial placement.

I don't feel too strongly about this, though; if implementing a scoring system like the one above turns out to be pretty easy to do, I wouldn't be opposed to having it. But I think what's here is good enough to merge if something more sophisticated would take a lot of extra time or add a lot of complexity.

Base automatically changed from vmm-reserve-bench to main March 4, 2025 20:02
@smklein merged commit e8bfadf into main Mar 20, 2025
16 checks passed
@smklein deleted the vmm-reduce-contention branch March 20, 2025 16:40