Detection trainer fails with In_channels>3 #2749

robmarkcole · 2025-04-22T14:30:39Z

Description

Eg if using a 4 channel dataset, the error will be raised:

  File "/usr/local/lib/python3.11/site-packages/torchvision/models/detection/transform.py", line 141, in forward
    image = self.normalize(image)
            ^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/torchvision/models/detection/transform.py", line 169, in normalize
    return (image - mean[:, None, None]) / std[:, None, None]
            ~~~~~~^~~~~~~~~~~~~~~~~~~~~
RuntimeError: The size of tensor a (4) must match the size of tensor b (3) at non-singleton dimension 0

This is because GeneralizedRCNNTransform only supports 3 channels. We want to use Kornia for norm instead or disable this behaviour from torchvision

Steps to reproduce

I'm using a proprietary 4 channel dataset

model:
  class_path: ObjectDetectionTask
  init_args:
    model: faster-rcnn
    backbone: resnet18
    weights: True
    lr: 5e-4
    in_channels: 4

Version

torchgeo==0.7.0

The text was updated successfully, but these errors were encountered:

robmarkcole · 2025-04-22T14:32:14Z

As a workaround in my script:

def dummy_normalize(self, image):
    # Simply return the image as-is; assumes it's already normalized
    return image

# Patch the normalize method of GeneralizedRCNNTransform
detection_transform.GeneralizedRCNNTransform.normalize = dummy_normalize

adamjstewart · 2025-04-22T15:16:17Z

Glad someone actually tested this. Likely affects instance segmentation too. Wish we had some non-RGB datasets in TorchGeo to test this properly.

isaaccorley · 2025-04-22T19:13:32Z

Are we okay just converting this transform to nn.Identity by default? This is one thing I dislike about the torchvision RCNN models baking in the transform into the model. I've been screwed over by this in the past as well.

adamjstewart added this to the 0.7.1 milestone Apr 22, 2025

adamjstewart added the trainers PyTorch Lightning trainers label Apr 22, 2025

isaaccorley linked a pull request Apr 23, 2025 that will close this issue

ObjectDetection/InstanceSegmentationTask: fix support for non-RGB images #2752

Open

adamjstewart removed this from the 0.7.1 milestone May 1, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Detection trainer fails with In_channels>3 #2749

Detection trainer fails with In_channels>3 #2749

robmarkcole commented Apr 22, 2025

robmarkcole commented Apr 22, 2025

adamjstewart commented Apr 22, 2025

isaaccorley commented Apr 22, 2025 •

edited

Loading

Detection trainer fails with In_channels>3 #2749

Detection trainer fails with In_channels>3 #2749

Comments

robmarkcole commented Apr 22, 2025

Description

Steps to reproduce

Version

robmarkcole commented Apr 22, 2025

adamjstewart commented Apr 22, 2025

isaaccorley commented Apr 22, 2025 • edited Loading

isaaccorley commented Apr 22, 2025 •

edited

Loading