config_tables: Use zerocopy crate to reduce unsafe code blocks #1062

VivekYadav7272 · 2025-11-15T16:51:38Z

Please see issue #893 for why this change was needed.

I'm marking this as draft because I have some cleanup to do, but to give early visibility in case I've done something obviously dumb, I'm raising the draft PR anyways.

VivekYadav7272 · 2025-11-15T17:08:21Z

debug_image_info_table.rs's most of unsafe usage came from things I could not mutate, because either the shape was UEFI-spec mandated (like the union structure), or an operation was UEFI-spec mandated (like the volatile_{read|write}s on DebugImageInfoTableHeader). So I did not see a huge reduction in unsafe there.

joschock · 2025-11-16T08:29:42Z

patina_dxe_core/src/config_tables/debug_image_info_table.rs

+// as that pointer is not shared with unprotected structures, and can only be accessed through the MutexGuard here.
+unsafe impl<'a> Send for DebugImageInfoTableMetadata<'a> {}
+
+static METADATA_TABLE: Mutex<Option<DebugImageInfoTableMetadata>> = Mutex::new(None);


spin::Mutex is problematic in the UEFI context because of the kind of multi-threading that UEFI supports. Unlike most high-level operating systems, UEFI only supports single-threaded execution with interrupts. This means that there is no concurrent execution. As such, if one context acquires the lock, and then an interrupt occurs to a higher-priority context that also attempts to acquire the lock, then deadlock will occur because the lower-priority context will not regain control until the higher-priority context yields, which it will not do since it is blocked attempting to acquire the lock.

If you really need Mutex+Guard capabilities here, recommend using TplMutex instead (implemented in the tpl_lock.rs module). TplMutex raises the priority level of the executing context while the lock is held, preventing pre-emption by other levels that would potentially want to acquire the lock, making it a safer option for avoiding deadlock.

We don't need a lock here, this is only ever invoked when loading an image, which won't happen during an interrupt. Cell should be sufficient.

My DebugImageInfoTable changes seem to already have been merged in the commit that reworked atomics, although it uses RwLock instead of Cell. Perhaps that still exposes us to potential interrupt pre-emption deadlocks since there are a bunch of .write() calls.

Oh - I forgot you had this PR outstanding - if there are additional improvements here that are not part of that commit, please feel free to rebase.

With respect to pre-emption - all the non-unit-test calls to .write() occur as part of an initialization flow (in initialize_debug_image_info_table) before the event subsystem is up, so there should be no danger of pre-emption at that point.

os-d

Thanks for the contribution! I think the general approach is okay, just a few notes on the Mutex vs Cell/TplMutex.

os-d · 2025-11-17T15:20:56Z

patina_dxe_core/src/config_tables/debug_image_info_table.rs

+// as that pointer is not shared with unprotected structures, and can only be accessed through the MutexGuard here.
+unsafe impl<'a> Send for DebugImageInfoTableMetadata<'a> {}
+
+static METADATA_TABLE: Mutex<Option<DebugImageInfoTableMetadata>> = Mutex::new(None);


We don't need a lock here, this is only ever invoked when loading an image, which won't happen during an interrupt. Cell should be sufficient.

patina_dxe_core/src/config_tables/debug_image_info_table.rs