Add multi-channel support #279

guarin · 2025-09-04T17:11:58Z

What has changed and why?

Add multi-channel support with transform_args={"num_channels": <num channels>}
Add DINOv2, DINOv3, and TIMM model multi-channel support
Add multi-channel support for all pretraining methods
Add multi-channel support for all fine-tuning methods
It is no longer necessary to set LIGHTLY_TRAIN_IMAGE_MODE when loading images with more than 3 channels

Multi-channel support is currently limited to 4 channel images because we use PIL for image loading. We'll add support for TIFF/DICOM with more than 4 channels in a follow-up PR.

The PR looks huge but is mostly passing num_channels variables around. I used num_channels when the code is related to the image data (e.g. dataset, transforms, etc.). And num_input_channels when the code is in a model context. This is to disambiguate between input and output channels in a model.

How has it been tested?

Added integration tests
Manual tests

Did you update CHANGELOG.md?

Yes
Not needed (internal change)

Did you update the documentation?

Yes
Not needed (internal change without effects for user)

Copilot

Pull Request Overview

This PR adds multi-channel input support to the lightly_train framework by introducing a num_input_channels parameter throughout the system. This enables training on images with different channel counts beyond the standard 3-channel RGB format.

Key changes include:

Added num_input_channels parameter to all method constructors and model creation functions
Updated transform configurations to support multi-channel normalization and handle incompatible transforms
Modified DINOv2/DINOv3 models to accept different input channel counts with proper weight loading
Enhanced datasets to determine image mode based on channel count

Reviewed Changes

Copilot reviewed 57 out of 57 changed files in this pull request and generated 6 comments.

Show a summary per file

File	Description
`tests/helpers.py`	Updated test helper methods to include `num_input_channels` parameter
`tests/_models/test_package_helpers.py`	Added `num_input_channels` to all model creation tests
`tests/_methods//test_.py`	Updated method instantiation tests with `num_input_channels` parameter
`tests/_data/test_*.py`	Modified dataset tests to include channel count parameter
`tests/_commands/test_*.py`	Added multi-channel training tests and updated existing tests
`src/lightly_train/_transforms/*.py`	Enhanced transform handling for multi-channel support and auto-resolution
`src/lightly_train/_models/*/`	Updated model packages to support configurable input channels
`src/lightly_train/_methods/*/`	Added `num_input_channels` to all method constructors
`src/lightly_train/_data/*.py`	Modified datasets to handle multi-channel images
`src/lightly_train/_commands/*.py`	Updated training commands to propagate channel count

Comments suppressed due to low confidence (2)

src/lightly_train/_transforms/semantic_segmentation_transform.py:1

The type annotation for random_flip has been changed from RandomFlipArgs to RandomFlipArgs | None, but it still has a default factory. This creates inconsistency - if it can be None, the default should be None instead of creating an instance. The same issue exists with color_jitter.

src/lightly_train/_data/mask_semantic_segmentation_dataset.py:1

The import import albumentations as A has been removed but was likely used elsewhere in the file. Ensure that all references to A have been properly updated to use the full module name or alternative imports.

src/lightly_train/_transforms/transform.py

src/lightly_train/_task_models/dinov2_linear_semantic_segmentation/task_model.py

src/lightly_train/_plot.py

src/lightly_train/_env.py

src/lightly_train/_commands/train_helpers.py

src/lightly_train/_data/image_dataset.py

src/lightly_train/_data/mask_semantic_segmentation_dataset.py

src/lightly_train/_methods/densecl/densecl.py

src/lightly_train/_models/_model_helpers.py

src/lightly_train/_models/dinov3/dinov3_package.py

src/lightly_train/_models/dinov3/dinov3_src/hub/backbones.py

src/lightly_train/_plot.py

src/lightly_train/_task_models/dinov2_eomt_semantic_segmentation/task_model.py

src/lightly_train/_task_models/dinov2_eomt_semantic_segmentation/transforms.py

src/lightly_train/_transforms/transform.py

tests/_commands/test_train_task.py

tests/_data/test_mask_semantic_segmentation_dataset.py

yutong-xiang-97

Mostly good. I left some comments.

The biggest issue is that when num_channels is auto with a default normalization in transform_args the num_channels is set to 3 and used afterwards.

The suggestions for adding multi-channel to some of the tests are optional. I believe if the high-level tests for .train() or train_sem_seg() work then we should be fine.

Btw don't forget the changelog :)

src/lightly_train/_commands/train_helpers.py

src/lightly_train/_models/ultralytics/ultralytics_package.py

src/lightly_train/_models/rfdetr/rfdetr_package.py

src/lightly_train/_task_models/dinov2_linear_semantic_segmentation/transforms.py

src/lightly_train/_task_models/dinov3_eomt_semantic_segmentation/transforms.py

tests/_data/test_image_dataset.py

src/lightly_train/_data/mask_semantic_segmentation_dataset.py

tests/_models/test_package_helpers.py

src/lightly_train/_transforms/transform.py

src/lightly_train/_data/image_dataset.py

yutong-xiang-97

LGTM

guarin added 8 commits September 4, 2025 13:39

WIP

21a9479

WIP

f1f84f6

WIP

b6e3129

Update

836839f

Update

e855bea

More num_channels passing

42eae94

Add DINOv3 multi-channel support

b24840a

Add tests

17da42a

Copilot AI review requested due to automatic review settings September 4, 2025 17:11

Copilot AI reviewed Sep 4, 2025

View reviewed changes

Revert

7d1fc09