add utility for memory-efficient maximum pairwise distance computation with GPU support #838
base: branch-25.04
Conversation
Compute the maximal pairwise distance without storing all distances. Provides a fallback to a CPU implementation if the optional cuVS/pylibraft dependency is not available.
Thanks @grlee77 for this update! I left some minor comments on this.
if _distance_on_cpu:
    warnings.warn(
        "cuVS >= 25.02 or pylibraft < 24.12 must be installed to use "
        "GPU-accelerated pairwaise distance computations. Falling back "
"GPU-accelerated pairwaise distance computations. Falling back " | |
"GPU-accelerated pairwise distance computations. Falling back " |
    Internally, calls to cdist will be made with subsets of coords where
    the subset size is (coords_per_block, ndim).
compute_argmax : bool, optional
    If True, the value of the coordate indices corresponding to the maxima
If True, the value of the coordate indices corresponding to the maxima
If True, the value of the coordinate indices corresponding to the maxima
    requirement. The memory used at runtime will be proportional to
    ``coords_per_block**2``.

    A block size of >= 2000 is recommended to overhead poor GPU resource usage
A block size of >= 2000 is recommended to overhead poor GPU resource usage
A block size of >= 2000 is recommended to avoid poor GPU resource usage
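As a rough illustration of why that block size is comfortable (assuming the per-block distance matrix is stored as float32; the numbers are an estimate, not taken from the PR):

```python
coords_per_block = 2000
bytes_per_value = 4  # float32
block_matrix_bytes = coords_per_block**2 * bytes_per_value
print(f"{block_matrix_bytes / 1e6:.0f} MB")  # ~16 MB per (2000, 2000) distance block
```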
    )
    current_output = temp
else:
    # omit out= for the last block as size may be
This comment sentence doesn't seem to be complete.
coords : np.ndarray (num_points, ndim)
    The coordinates to process.
The code converts input coordinates to float32 by default, which could lead to precision loss. I think it would be good to note that this `coords` would be converted to the `float32` data type.
num_coords, _ = coords.shape
Input validation might be needed here.
num_coords, _ = coords.shape
if not isinstance(coords, (np.ndarray, cp.ndarray)):
    raise TypeError("coords must be a numpy or cupy array")
if coords.ndim != 2:
    raise ValueError(
        f"coords must be a 2-dimensional array, got shape {coords.shape}"
    )
num_coords, _ = coords.shape
"to SciPy-based CPU implementation." | ||
) | ||
xp = np | ||
coords = cp.asnumpy(coords) |
We don't need to use numpy here?
coords = cp.asnumpy(coords)
coords = np.asnumpy(coords)
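For reference, `cupy.asnumpy` is CuPy's documented helper for copying a device array to host memory (NumPy itself does not provide an `asnumpy` function). A minimal sketch of a fallback conversion that tolerates either array type, assuming CuPy may be absent; this is illustrative only, not the PR's code:

```python
import numpy as np

try:
    import cupy as cp
except ImportError:  # CuPy is an optional dependency
    cp = None


def _to_host(coords):
    """Return ``coords`` as a NumPy array, copying from the GPU if needed."""
    if cp is not None and isinstance(coords, cp.ndarray):
        # cupy.asnumpy copies the device array to host memory
        return cp.asnumpy(coords)
    return np.asarray(coords)
```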
Parameters
----------
coords : np.ndarray (num_points, ndim)
coords : np.ndarray (num_points, ndim)
coords : numpy.ndarray or cupy.ndarray of shape (num_points, ndim)
The maximum Feret diameter computation computes pairwise distances between all points, which can lead to out-of-memory errors if the number of points on the object boundary is large (the memory used is quadratic in the number of points).
This MR implements a block-wise version that retains efficiency but has much lower memory requirements. It also handles checking for optional GPU acceleration via CuPy's optional cuVS/pylibraft dependencies, falling back to the CPU if those are not available.
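A minimal sketch of the block-wise idea described above, using SciPy's `cdist` for the per-block distances; the function name, signature, and defaults here are illustrative and not taken from this PR:

```python
import numpy as np
from scipy.spatial.distance import cdist


def max_pairwise_distance_blockwise(coords, coords_per_block=2000):
    """Maximum pairwise distance without forming the full distance matrix.

    Peak memory is proportional to ``coords_per_block**2`` rather than
    ``num_points**2``.
    """
    coords = np.asarray(coords, dtype=np.float32)
    num_coords = coords.shape[0]
    max_dist = 0.0
    for i in range(0, num_coords, coords_per_block):
        block_i = coords[i:i + coords_per_block]
        # distances are symmetric, so only block pairs with j >= i are needed
        for j in range(i, num_coords, coords_per_block):
            block_j = coords[j:j + coords_per_block]
            max_dist = max(max_dist, cdist(block_i, block_j).max())
    return float(max_dist)
```

The actual utility in this PR additionally supports GPU-accelerated distances via the optional cuVS/pylibraft dependencies and a `compute_argmax` option for returning the indices of the maximizing pair, as described in the docstring excerpts above.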