resize transform with max pool approach added #487

kiryteo · 2025-05-23T01:40:21Z

What does this PR do?

Added maxpool_resize transform to obtain maxpool-like functionality when resizing inputs.

MONAI provides resize functionality via monai.transforms.Resized which uses torch.nn.functional.interpolate along with one of the available interpolation modes. None of the modes offer the maxpool-like functionality and thus we need a custom transform. No additional dependencies required. This is an optional transform to be used with config and does not introduce any breaking changes.

Before submitting

Did you make sure title is self-explanatory and the description concisely explains the PR?
Did you make sure your PR does only one thing, instead of bundling different changes together?
Did you list all the breaking changes introduced by this pull request?
Did you test your PR locally with pytest command?
Did you run pre-commit hooks with pre-commit run -a command?

Did you have fun?

Make sure you had fun coding 🙃

benjijamorris

Looks great! My one suggestion would be to add an explicit spatial_dims argument. I think this would take the guesswork out of trying to find the spatial dimensions and make the code simpler but I can be convinced otherwise!

benjijamorris · 2025-05-23T16:08:11Z

cyto_dl/image/transforms/maxpool_resize.py

+                raise TypeError(f"Input '{key}' must be a PyTorch tensor, got {type(x)}")
+
+            # Determine expected tensor dimensions and spatial size length
+            input_dims = x.dim()


Monai transforms expect C[Z]YX images - would it be reasonable to enforce this based on the spatial size and then just add the batch dimension right before pooling? I often have an explicit spatial_dims argument to help with this kind of check

benjijamorris · 2025-05-23T16:08:49Z

cyto_dl/image/transforms/maxpool_resize.py

+
+            # Normalize spatial_size to match expected number of spatial dimensions
+            try:
+                spatial_size = ensure_tuple_rep(self.spatial_size, expected_spatial_dims)


Adding an explicit spatial_dims argument here would let you find the spatial size once in the init

benjijamorris · 2025-05-23T16:08:59Z

cyto_dl/image/transforms/maxpool_resize.py

+
+            orig_size = x.shape[-expected_spatial_dims:]
+
+            # Replace non-positive spatial_size values with original dimensions


resize transform with max pool approach added

0c90c07

benjijamorris suggested changes May 23, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

resize transform with max pool approach added #487

resize transform with max pool approach added #487

Uh oh!

kiryteo commented May 23, 2025

Uh oh!

benjijamorris left a comment

Uh oh!

benjijamorris May 23, 2025

Uh oh!

benjijamorris May 23, 2025

Uh oh!

benjijamorris May 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants


		orig_size = x.shape[-expected_spatial_dims:]

		# Replace non-positive spatial_size values with original dimensions

resize transform with max pool approach added #487

Are you sure you want to change the base?

resize transform with max pool approach added #487

Uh oh!

Conversation

kiryteo commented May 23, 2025

What does this PR do?

Before submitting

Did you have fun?

Uh oh!

benjijamorris left a comment

Choose a reason for hiding this comment

Uh oh!

benjijamorris May 23, 2025

Choose a reason for hiding this comment

Uh oh!

benjijamorris May 23, 2025

Choose a reason for hiding this comment

Uh oh!

benjijamorris May 23, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants