added DecodingResult struct, changed decoding and predictor API to allow in-place #103
feefladder wants to merge 3 commits into developmentseed:main
Oof, it seems putting the change at the decoder step introduces some Python headaches. This pyo3 issue seems relevant. Mainly that
Made it a draft, because currently the Python side would need changes as well, and I'm as of yet inexperienced with Rust-Python things. I can look into it further after Wednesday. For the moment, any thoughts @weiji14? Especially on whether changing to a typed array at the unpredicting step is the way to go? (That would be slightly sad because of the extra copy, but it's an acceptable compromise, I think, if the Python changes turn out to be a lot of work or too breaking.) Alternatively, this PR could be split into:
Yes please, it might be good to split this into two (I'm generally in favour of reviewing smaller PRs). I'm wondering if there's an opportunity to use
Will do! I'll keep it in one step for now, because most changes are coherent and basically I won't know what is needed before I see things working together. So the plan:
I've given the whole Python part a bit more thought.

```rust
impl Decoder for PyDecoder {
    fn decode_tile(
        &self,
        compressed_buffer: Bytes,
        result_buffer: &mut [u8],
        _photometric_interpretation: PhotometricInterpretation,
        _jpeg_tables: Option<&[u8]>,
    ) -> AsyncTiffResult<()> {
        let decoded_buffer = Python::with_gil(|py| self.call(py, compressed_buffer))
            .map_err(|err| AsyncTiffError::General(err.to_string()))?;
        result_buffer.copy_from_slice(&decoded_buffer.into_inner()); // only adds this extra copy
        Ok(())
    }
}
```

Then there is a bit of sadness (a needless copy) in the case of a user-implemented decoder combined with the floating-point predictor, but solving that would also be unnecessarily ugly for saving only a copy.
No, that
```rust
macro_rules! integral_slice_as_bytes{($int:ty, $const:ident $(,$mut:ident)*) => {
    pub(crate) fn $const(slice: &[$int]) -> &[u8] {
        assert!(mem::align_of::<$int>() <= mem::size_of::<$int>());
        unsafe { slice::from_raw_parts(slice.as_ptr() as *const u8, mem::size_of_val(slice)) }
    }
    $(pub(crate) fn $mut(slice: &mut [$int]) -> &mut [u8] {
        assert!(mem::align_of::<$int>() <= mem::size_of::<$int>());
        unsafe { slice::from_raw_parts_mut(slice.as_mut_ptr() as *mut u8, mem::size_of_val(slice)) }
    })*
}}
```
I'm a pretty strong 👎 on having any `unsafe` in our code, especially for a transmute like this.
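For what it's worth, there are safe alternatives to the transmute. A minimal std-only sketch (not the crate's actual code) serializes each integer through `to_ne_bytes` at the cost of a copy; for a zero-copy safe route, the `bytemuck` crate's `cast_slice` works for `Pod` types:

```rust
// Safe, copy-based alternative to the unsafe transmute: each u16 is
// serialized through to_ne_bytes, so no raw-pointer casts are needed.
fn u16_slice_to_bytes(slice: &[u16]) -> Vec<u8> {
    let mut out = Vec::with_capacity(slice.len() * std::mem::size_of::<u16>());
    for v in slice {
        out.extend_from_slice(&v.to_ne_bytes());
    }
    out
}

fn main() {
    let samples: [u16; 2] = [0x0102, 0x0304];
    let bytes = u16_slice_to_bytes(&samples);
    // 2 u16 values -> 4 bytes, regardless of endianness
    assert_eq!(bytes.len(), 4);
}
```

The copy makes this unsuitable for the hot path the macro targets, but it shows the `unsafe` is an optimization, not a necessity.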
```rust
compressed_buffer: Bytes,
result_buffer: &mut [u8],
```
It's not clear to me that passing in a result buffer is a meaningful improvement. I'd much rather use `bytes::Bytes` (i.e. an `Arc<Vec<u8>>`) that provides cheap cloning. (I.e. keep the existing API.)
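As a std-only sketch of why that cloning is cheap (`bytes::Bytes` is conceptually reference-counted, much like an `Arc<Vec<u8>>`; the function name below is illustrative, not a real API):

```rust
use std::sync::Arc;

// Cloning an Arc bumps a refcount; the underlying bytes are not copied.
fn cheap_clone(buf: &Arc<Vec<u8>>) -> Arc<Vec<u8>> {
    Arc::clone(buf)
}

fn main() {
    let buf: Arc<Vec<u8>> = Arc::new(vec![0u8; 1 << 20]); // e.g. a 1 MiB tile
    let view = cheap_clone(&buf); // O(1), no byte copy
    // Both handles point at the same allocation.
    assert!(Arc::ptr_eq(&buf, &view));
}
```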
This is another reason why we should use
So my final snippet did not change anything on the Python-API side and solved the issue I raised here. To keep everything `Bytes`, one would:

```rust
impl Decoder for PyDecoder {
    fn decode_tile(
        &self,
        compressed_buffer: Bytes,
        result_buffer: &mut [u8],
        _photometric_interpretation: PhotometricInterpretation,
        _jpeg_tables: Option<&[u8]>,
    ) -> AsyncTiffResult<()> {
        let decoded_buffer = Python::with_gil(|py| self.call(py, compressed_buffer))
            .map_err(|err| AsyncTiffError::General(err.to_string()))?;
        result_buffer.copy_from_slice(&decoded_buffer.into_inner()); // only adds this extra copy
        Ok(())
    }
}
```

...I guess it could indeed have some more thought.
Closes #87; #86 is also somewhat relevant.
Makes the output of a decoding operation a typed array.
The goals:
The problem:
lzw, jpeg) not. Uncompressed copies because of 1.

The typed array can be filled in different places:
This PR does it during the decoding step, which:
`decode_into`. It changes quite a bit of public API, so open to discussion. More comments coming on the changes.
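To illustrate the API trade-off under discussion, here is a hypothetical contrast between an allocate-and-return decode and a `decode_into`-style decode (function names and bodies are illustrative stand-ins, not the actual async-tiff signatures):

```rust
// Allocate-and-return: every call produces a fresh Vec.
fn decode_alloc(compressed: &[u8]) -> Vec<u8> {
    compressed.to_vec() // stand-in for real decompression
}

// decode_into style: the caller supplies (and can reuse) the output buffer.
fn decode_into(compressed: &[u8], out: &mut [u8]) {
    out.copy_from_slice(compressed); // stand-in for real decompression
}

fn main() {
    let tile = [1u8, 2, 3, 4];
    let mut reusable = [0u8; 4]; // allocated once, reusable across tiles
    decode_into(&tile, &mut reusable);
    assert_eq!(reusable, tile);
    assert_eq!(decode_alloc(&tile), tile.to_vec());
}
```

The in-place form avoids a per-tile allocation and lets a predictor step write directly into the final typed buffer, at the cost of a wider, more error-prone signature.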
Sidenote: there is a redundant copy with no compression and floating-point prediction, but which sensible person would ever use that configuration?