std.mem: Add `packedArrayByteLen` function (and use it) #23406

squeek502 · 2025-03-30T00:36:26Z

Context: #21682

This is a helper function intended to make using writePackedInt/readPackedInt to write to/read from packed arrays a bit less error prone. The math involved is easy, but a naive implementation will overflow when calculating a valid length. For example:

std.math.divCeil(usize, @bitSizeOf(u4) * some_slice.len, byte_size_in_bits)

This will overflow during the multiplication if some_slice.len is >= maxInt(usize) / 4 + 1, even though the final calculated byte count would be able to fit in a usize.

When using this newly added function, error.Overflow is returned only when the final byte count can't fit in a usize.

squeek502 · 2025-03-30T00:36:59Z

lib/compiler/aro/aro/Preprocessor.zig

+    const if_kind_buf_size = comptime mem.packedArrayByteLen(u2, 256) catch unreachable;
+    var if_kind: [if_kind_buf_size]u8 = .{0} ** if_kind_buf_size;


I'm aware that this code is out-of-sync with upstream Aro (Vexu/arocc#787). Will submit an upstream PR if this is merged.

Would var if_kind: [if_kind_buf_size]u8 = @splat(0); be better here?

This is a helper function intended to make using writePackedInt/readPackedInt to write to/read from packed arrays a bit less error prone. The math involved is easy, but a naive implementation will overflow when calculating a valid length. For example: std.math.divCeil(usize, @bitSizeOf(u4) * some_slice.len, byte_size_in_bits) This will overflow during the multiplication if `some_slice.len` is >= `maxInt(usize) / 4 + 1`, even though the final calculated byte count would be able to fit in a `usize`. When using this newly added function, `error.Overflow` is returned only when the final byte count can't fit in a `usize`.

aqrit · 2025-04-05T16:25:17Z

This PR optimizes for maintainability since packedArrayByteLen is likely on a cold path
(which is good and correct).

However since I'm thinking about it...,
A widening multiply is probably only good for sizes u3, or u5.
Finding the array size of u4, u6, u7 types can utilize a shortcut that requires just two instructions.

pub fn packedArrayByteLen(comptime T: type, num_elements: usize) error{Overflow}!usize {
    if (@bitSizeOf(T) == 0) return 0;
    const max_num_elements: comptime_int = (std.math.maxInt(usize) * 8) / @bitSizeOf(T);
    if (num_elements > max_num_elements) return error.Overflow;

    const whole_bytes: usize = num_elements *% (@bitSizeOf(T) / 8);
    const part_bytes: usize = switch (@as(u3, @truncate(@bitSizeOf(T)))) {
        0 => 0,
        // 1 => (num_elements >> 3) +% @intFromBool(((num_elements & 7) != 0)),
        1 => ((num_elements -% (num_elements >> 1)) +% 0x3) >> 2,
        2 => ((num_elements -% (num_elements >> 1)) +% 0x1) >> 1,
        3 => ((num_elements >> 3) *% 3) +%
            ((((num_elements & 0x07) *% 3) +% 0x07) >> 3),
        4 => num_elements -% (num_elements >> 1),
        5 => ((num_elements >> 3) *% 5) +%
            ((((num_elements & 0x07) *% 5) +% 0x07) >> 3),
        6 => num_elements -% (num_elements >> 2),
        7 => num_elements -% (num_elements >> 3),
    };
    return whole_bytes +% part_bytes;
}

squeek502 commented Mar 30, 2025

View reviewed changes

squeek502 force-pushed the packed-byte-len branch from a4c5b07 to 1a4d2d6 Compare April 1, 2025 04:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

std.mem: Add `packedArrayByteLen` function (and use it) #23406

std.mem: Add `packedArrayByteLen` function (and use it) #23406

Uh oh!

squeek502 commented Mar 30, 2025 •

edited

Loading

Uh oh!

squeek502 Mar 30, 2025 •

edited

Loading

Uh oh!

tauoverpi Mar 31, 2025

Uh oh!

aqrit commented Apr 5, 2025 •

edited

Loading

Uh oh!

Uh oh!

		const if_kind_buf_size = comptime mem.packedArrayByteLen(u2, 256) catch unreachable;
		var if_kind: [if_kind_buf_size]u8 = .{0} ** if_kind_buf_size;

Uh oh!

std.mem: Add packedArrayByteLen function (and use it) #23406

Are you sure you want to change the base?

std.mem: Add packedArrayByteLen function (and use it) #23406

Uh oh!

Conversation

squeek502 commented Mar 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

squeek502 Mar 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tauoverpi Mar 31, 2025

Choose a reason for hiding this comment

Uh oh!

aqrit commented Apr 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

std.mem: Add `packedArrayByteLen` function (and use it) #23406

std.mem: Add `packedArrayByteLen` function (and use it) #23406

squeek502 commented Mar 30, 2025 •

edited

Loading

squeek502 Mar 30, 2025 •

edited

Loading

aqrit commented Apr 5, 2025 •

edited

Loading