CSHARP-5603: Add Big Endian support in BinaryVectorReader and BinaryVectorWriter #1682

medhatiwari · 2025-05-05T10:53:54Z

Description

This PR adds Big Endian support for System.Single (Float32) to the BinaryVectorWriter.WriteToBytes() method.

Background

While running the MongoDB.Bson.Tests test suite on a Big Endian (s390x) system, we encountered 34 consistent test failures within the BinaryVectorSerializerTests class.
Each failure was caused by a System.NotSupportedException indicating that binary vector data of float32 type is not yet supported on Big Endian architectures.

Exception Observed

System.NotSupportedException: Binary vector data is not supported on Big Endian architecture yet.

Sample Failing Tests

Some of the test cases that failed due to this limitation include:

BinaryVectorSerializerTests.BinaryVectorSerializer_should_deserialize_bson_vector<Float32>

BinaryVectorSerializerTests.BinaryVectorSerializer_should_serialize_bson_vector<Float32>

BinaryVectorSerializerTests.ArrayAsBinaryVectorSerializer_should_deserialize_bson_vector<Float32>

BinaryVectorSerializerTests.ArrayAsBinaryVectorSerializer_should_serialize_bson_vector<Float32>

BinaryVectorSerializerTests.MemoryAsBinaryVectorSerializer_should_serialize_bson_vector<Float32>

BinaryVectorSerializerTests.MemoryAsBinaryVectorSerializer_should_deserialize_bson_vector<Float32>

BinaryVectorSerializerTests.ReadOnlyMemoryAsBinaryVectorSerializer_should_serialize_bson_vector<Float32>

BinaryVectorSerializerTests.ReadOnlyMemoryAsBinaryVectorSerializer_should_deserialize_bson_vector<Float32>

Why This Fix Is Necessary

This limitation was blocking test pass status on Big Endian platforms such as s390x. Adding support for float32 serialization in Big Endian format:

Enables consistent behavior across architectures

Completes existing deserialization support added earlier in BinaryVectorReader.cs

Changes Introduced

Added Big Endian branch to BinaryVectorWriter.WriteToBytes() for T == float.

Used BinaryPrimitives.WriteSingleBigEndian() to write bytes in the correct order.

Left existing Little Endian logic untouched to preserve behavior.

cc: @giritrivedi

…<T>() Signed-off-by: Medha Tiwari <[email protected]>

Signed-off-by: Medha Tiwari <[email protected]>

medhatiwari · 2025-05-06T09:31:34Z

Hi @BorisDog, if everything if fine, can this be merged?

medhatiwari · 2025-05-19T04:42:04Z

Hi @BorisDog, just following up to check if there's any update on this PR. Please let me know if any further changes are needed.

src/MongoDB.Bson/Serialization/BinaryVectorReader.cs

src/MongoDB.Bson/Serialization/BinaryVectorWriter.cs

src/MongoDB.Bson/Serialization/BinaryVectorReader.cs

BorisDog

Review is pending on requested changes.

…or float32 on all platforms Signed-off-by: Medha Tiwari <[email protected]>

…ation Signed-off-by: Medha Tiwari <[email protected]>

…ndling Signed-off-by: Medha Tiwari <[email protected]>

BorisDog

The tests fail on net472.

src/MongoDB.Bson/IO/BinaryPrimitivesCompat.cs

src/MongoDB.Bson/Serialization/BinaryVectorReader.cs

src/MongoDB.Bson/IO/BinaryPrimitivesCompat.cs

src/MongoDB.Bson/Serialization/BinaryVectorWriter.cs

tests/MongoDB.Bson.Tests/Serialization/Serializers/BinaryVectorSerializerTests.cs

src/MongoDB.Bson/IO/BinaryPrimitivesCompat.cs

Signed-off-by: Medha Tiwari <[email protected]>

BorisDog

Looks good! Tests are passing as well.
Few styling comments + tests improvement.

tests/MongoDB.Bson.Tests/IO/BinaryPrimitivesCompatTests.cs

tests/MongoDB.Bson.Tests/Serialization/Serializers/BinaryVectorSerializerTests.cs

src/MongoDB.Bson/Serialization/BinaryVectorWriter.cs

src/MongoDB.Bson/IO/BinaryPrimitivesCompat.cs

tests/MongoDB.Bson.Tests/IO/BinaryPrimitivesCompatTests.cs

BorisDog

Few more minor comments.

src/MongoDB.Bson/Serialization/BinaryVectorWriter.cs

src/MongoDB.Bson/Serialization/BinaryVectorReader.cs

tests/MongoDB.Bson.Tests/IO/BinaryPrimitivesCompatTests.cs

src/MongoDB.Bson/IO/BinaryPrimitivesCompat.cs

Signed-off-by: Medha Tiwari <[email protected]>

BorisDog

ReadSingleLittleEndian_should_throw_on_insufficient_length and
WriteSingleLittleEndian_should_throw_on_insufficient_length fail on net427 and netstandard2.1.

Signed-off-by: Medha Tiwari <[email protected]>

medhatiwari · 2025-06-03T10:21:32Z

ReadSingleLittleEndian_should_throw_on_insufficient_length and WriteSingleLittleEndian_should_throw_on_insufficient_length fail on net427 and netstandard2.1.

Thanks for the feedback! I tested this behavior on both .NET 8 and netstandard2.1, and in both cases, nameof(source) and nameof(source.Length) surprisingly both resulted in ParamName being "length" when the exception was thrown — likely due to compiler or runtime optimizations in span handling.

However, in a minimal standalone test program, nameof(source) correctly produces "source", and nameof(source.Length) produces "Length", as expected.

I haven't been able to test this yet on .NET Framework 4.7.2, but based on this inconsistency and to keep the tests passing across frameworks, I’ve temporarily hardcoded "length" in the exception. I agree this isn't ideal — open to suggestions if you think there's a cleaner cross-target alternative.

BorisDog · 2025-06-03T20:45:08Z

@medhatiwari So would passing nameof(source.Length) and expecting Length solve the issue?

medhatiwari · 2025-06-04T07:34:11Z

@medhatiwari So would passing nameof(source.Length) and expecting Length solve the issue?

@BorisDog nameof(source.Length) — in the driver, it gives "length" and expecting "Length" causes the tests to fail. But in a standalone program, it gives "Length" as I mentioned earlier. I'm a bit confused about why this difference exists, but that's the current behavior I'm seeing

BorisDog · 2025-06-04T19:47:26Z

@medhatiwari
I think nameof(source.Length) returns Length on all platforms.
The problem is probably with BinaryPrimitives exception, which does not use nameof(source.Length).

You can just move the validation to the begging of the method.

Signed-off-by: Medha Tiwari <[email protected]>

medhatiwari · 2025-06-05T06:21:48Z

@medhatiwari I think nameof(source.Length) returns Length on all platforms. The problem is probably with BinaryPrimitives exception, which does not use nameof(source.Length).

You can just move the validation to the begging of the method.

ah, moving the validation outside the #else block and using nameof(source.Length) did the trick — it now consistently returns "Length", and the tests pass. I’m still a bit curious though: when the same check was inside the #else, it was returning "length" instead. Any idea why that subtle difference in placement affects the casing?

BorisDog · 2025-06-05T20:20:48Z

I’m still a bit curious though: when the same check was inside the #else, it was returning "length" instead.
I couldn't reproduce that, but BinaryPrimitives does return low case "length".

adelinowona

LGTM

BorisDog

LGTM!

Add Big Endian Support for Float32 in BinaryVectorWriter.WriteToBytes…

ff89368

…<T>() Signed-off-by: Medha Tiwari <[email protected]>

medhatiwari requested a review from a team as a code owner May 5, 2025 10:53

medhatiwari requested review from rstam and removed request for a team May 5, 2025 10:53

BorisDog requested review from BorisDog and removed request for rstam May 5, 2025 20:14

medhatiwari force-pushed the binaryvectorsupport branch from 5cd9ca1 to a4384e3 Compare May 6, 2025 09:00

Added comments for clarity

2c2cae1

Signed-off-by: Medha Tiwari <[email protected]>

medhatiwari force-pushed the binaryvectorsupport branch from a4384e3 to 2c2cae1 Compare May 6, 2025 09:01

BorisDog requested changes May 22, 2025

View reviewed changes

medhatiwari requested a review from BorisDog May 26, 2025 06:01

BorisDog requested changes May 27, 2025

View reviewed changes

medhatiwari added 3 commits May 28, 2025 14:42

Fix BinaryVectorSerializerTests to generate little-endian test data f…

2c4a16a

…or float32 on all platforms Signed-off-by: Medha Tiwari <[email protected]>

Add BinaryPrimitivesCompat methods for float32 little-endian serializ…

0ee1694

…ation Signed-off-by: Medha Tiwari <[email protected]>

Add float32 BinaryVector serialization/deserialization with endian ha…

f43d935

…ndling Signed-off-by: Medha Tiwari <[email protected]>

medhatiwari requested a review from BorisDog May 28, 2025 12:52

BorisDog requested changes May 28, 2025

View reviewed changes

medhatiwari added 2 commits May 29, 2025 15:10

added tests for new methods in BinaryPrimitivesCompat

92b7ed2

Signed-off-by: Medha Tiwari <[email protected]>

resolved all the review comments

530ecda

Signed-off-by: Medha Tiwari <[email protected]>

medhatiwari force-pushed the binaryvectorsupport branch from ee0aa0a to 530ecda Compare May 29, 2025 13:16

medhatiwari requested a review from BorisDog May 29, 2025 14:37

BorisDog requested changes May 29, 2025

View reviewed changes

resolved all the comments

02579c1

medhatiwari requested a review from BorisDog May 30, 2025 09:49

BorisDog requested changes May 30, 2025

View reviewed changes

BorisDog changed the title ~~Add Big Endian Support for Float32 in BinaryVectorWriter.WriteToBytes<T>()~~ CSHARP-5603: Add Big Endian support in BinaryVectorReader and BinaryVectorWriter May 30, 2025

BorisDog added the improvement label May 30, 2025

another set of changes to resolve minor issues

c547a15

Signed-off-by: Medha Tiwari <[email protected]>

medhatiwari force-pushed the binaryvectorsupport branch from 4078bfa to c547a15 Compare May 30, 2025 19:00

medhatiwari requested a review from BorisDog May 30, 2025 19:02

BorisDog requested changes May 30, 2025

View reviewed changes

hardcoded ParamName to length

c47d4bb

Signed-off-by: Medha Tiwari <[email protected]>

medhatiwari requested a review from BorisDog June 3, 2025 10:23

removed hardocode length to nameof(source.Length)

ed4f5f0

Signed-off-by: Medha Tiwari <[email protected]>

adelinowona approved these changes Jun 5, 2025

View reviewed changes

BorisDog approved these changes Jun 5, 2025

View reviewed changes

BorisDog merged commit e62da2b into mongodb:main Jun 5, 2025
27 of 31 checks passed

medhatiwari mentioned this pull request Jun 9, 2025

CSHARP-5614: Fix deserialization of primitive arrays on Big Endian systems #1683

Merged

CSHARP-5603: Add Big Endian support in BinaryVectorReader and BinaryVectorWriter #1682

CSHARP-5603: Add Big Endian support in BinaryVectorReader and BinaryVectorWriter #1682

Uh oh!

Conversation

medhatiwari commented May 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Background

Exception Observed

Sample Failing Tests

Why This Fix Is Necessary

Changes Introduced

Uh oh!

medhatiwari commented May 6, 2025

Uh oh!

medhatiwari commented May 19, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

BorisDog left a comment

Choose a reason for hiding this comment

Uh oh!

BorisDog left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

BorisDog left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

BorisDog left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

BorisDog left a comment

Choose a reason for hiding this comment

Uh oh!

medhatiwari commented Jun 3, 2025

Uh oh!

BorisDog commented Jun 3, 2025

Uh oh!

medhatiwari commented Jun 4, 2025

Uh oh!

BorisDog commented Jun 4, 2025

Uh oh!

medhatiwari commented Jun 5, 2025

Uh oh!

BorisDog commented Jun 5, 2025

Uh oh!

adelinowona left a comment

Choose a reason for hiding this comment

Uh oh!

BorisDog left a comment

medhatiwari commented May 5, 2025 •

edited

Loading