docs: Details for ambigious channel dimension assignment #37600

yaner-here · 2025-04-18T07:33:32Z

What does this PR do?

Refines the ambiguous-channel-dimension warning so that it explains how to assign the channel dimension of inputs.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

github-actions · 2025-04-18T07:33:46Z

Hi 👋, thank you for opening this pull request! The pull request is converted to draft by default. The CI will be paused while the PR is in draft mode. When it is ready for review, please click the Ready for review button (at the bottom of the PR page). This will assign reviewers and trigger CI.

stevhliu

Thanks, I think this should be ok! We can merge once our vision expert approves as well 😄

stevhliu · 2025-04-18T15:10:23Z

src/transformers/image_utils.py

@@ -368,7 +368,7 @@ def infer_channel_dimension_format(

    if image.shape[first_dim] in num_channels and image.shape[last_dim] in num_channels:
        logger.warning(
-            f"The channel dimension is ambiguous. Got image shape {image.shape}. Assuming channels are the first dimension."
+            f"The channel dimension is ambiguous. Got image shape {image.shape}. Assuming channels are the first dimension. Use the `input_data_format` parameter to assign the channel dimension, more details in `https://huggingface.co/docs/transformers/main/internal/image_processing_utils#transformers.image_transforms.rescale.input_data_format`."


cc @qubvel, would you mind taking a quick look and verifying this please? 🤗

Suggested change

-            f"The channel dimension is ambiguous. Got image shape {image.shape}. Assuming channels are the first dimension. Use the `input_data_format` parameter to assign the channel dimension, more details in `https://huggingface.co/docs/transformers/main/internal/image_processing_utils#transformers.image_transforms.rescale.input_data_format`."
+            f"The channel dimension is ambiguous. Got image shape {image.shape}. Assuming channels are the first dimension. Use the [input_data_format](https://huggingface.co/docs/transformers/main/internal/image_processing_utils#transformers.image_transforms.rescale.input_data_format) parameter to assign the channel dimension."
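For readers following the thread, here is a minimal, dependency-free sketch of why the warning fires and how an explicit format avoids it. It is not the library's implementation: `infer_channel_axis` is a hypothetical helper that works on plain shape tuples, and its `input_data_format` argument only mimics the parameter named in the warning. The batch-axis handling and the `(1, 3)` default are assumptions made for the illustration.

```python
import logging

logger = logging.getLogger(__name__)


def infer_channel_axis(shape, num_channels=(1, 3), input_data_format=None):
    """Hypothetical helper: decide whether `shape` is channels-first or channels-last.

    An explicit `input_data_format` ("channels_first" or "channels_last")
    skips the guess entirely, which is what the warning recommends.
    """
    if input_data_format is not None:
        if input_data_format not in ("channels_first", "channels_last"):
            raise ValueError(f"Unknown input_data_format: {input_data_format}")
        return input_data_format

    if len(shape) == 3:
        first_dim, last_dim = 0, 2
    elif len(shape) == 4:  # assumed leading batch axis
        first_dim, last_dim = 1, 3
    else:
        raise ValueError(f"Unsupported number of image dimensions: {len(shape)}")

    first_ok = shape[first_dim] in num_channels
    last_ok = shape[last_dim] in num_channels
    if first_ok and last_ok:
        # Both ends look like a channel axis, so the guess may be wrong.
        logger.warning(
            "The channel dimension is ambiguous. Got image shape %s. "
            "Assuming channels are the first dimension.",
            shape,
        )
        return "channels_first"
    if first_ok:
        return "channels_first"
    if last_ok:
        return "channels_last"
    raise ValueError("Unable to infer channel dimension format")


# A 3-pixel-tall RGB image stored as (height, width, channels).
shape = (3, 224, 3)
guessed = infer_channel_axis(shape)  # ambiguous: falls back to channels-first
explicit = infer_channel_axis(shape, input_data_format="channels_last")
```

With the shape `(3, 224, 3)` the guess and the true layout disagree, which is the case the improved message is meant to help with: passing the format explicitly removes the guess.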

Co-authored-by: Steven Liu <[email protected]>

yaner-here · 2025-04-26T13:19:29Z

Hi, any progress?

qubvel

Thanks for the update, looks good to me 👍

HuggingFaceDocBuilderDev · 2025-04-29T14:31:38Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

…#37600)

* docs: Details for ambigious channel dimension inference
* Update src/transformers/image_utils.py

Co-authored-by: Steven Liu <[email protected]>

---------

Co-authored-by: Steven Liu <[email protected]>

docs: Details for ambigious channel dimension inference

7e76c5d

github-actions bot marked this pull request as draft April 18, 2025 07:33

stevhliu reviewed Apr 18, 2025

Update src/transformers/image_utils.py

e0a641e

Co-authored-by: Steven Liu <[email protected]>

yaner-here marked this pull request as ready for review April 23, 2025 15:20

yaner-here changed the title ~~docs: Details for ambigious channel dimension inference~~ docs: Details for ambigious channel dimension assignment Apr 23, 2025

Merge branch 'main' into yaner-here-patch-1

5fa92e6

Merge branch 'main' into yaner-here-patch-1

99bc7ba

qubvel approved these changes Apr 29, 2025

stevhliu merged commit 66ad8b2 into huggingface:main Apr 29, 2025
20 checks passed
