Conversation

@GardevoirX
Contributor

@GardevoirX GardevoirX commented Nov 21, 2025

Contributor (creator of pull-request) checklist

  • Tests updated (for new features and bugfixes)?
  • Documentation updated (for new features)?
  • Issue referenced (for PRs that solve an issue)?

Maintainer/Reviewer checklist

  • CHANGELOG updated with public API or any other important changes?
  • GPU tests passed (maintainer comment: "cscs-ci run")?

📚 Documentation preview 📚: https://metatrain--938.org.readthedocs.build/en/938/

@pfebrer
Contributor

pfebrer commented Nov 21, 2025

Why is this needed? This might make it look more daunting for contributors of new architectures, since it seems you can no longer use raw torch; you have to understand what metatensor's torch Module is 😅

@GardevoirX
Contributor Author

Why is this needed? This might make it look more daunting for contributors of new architectures, since it seems you can no longer use raw torch; you have to understand what metatensor's torch Module is 😅

It's said that it allows us to move between devices and dtypes more easily, because torch.nn.Module doesn't support moving things like a TensorMap when you call .to()
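A minimal pure-torch sketch of the issue (hypothetical names; a plain tensor attribute stands in for a TensorMap, since .to() only converts registered parameters and buffers):

```python
import torch

class Model(torch.nn.Module):
    def __init__(self):
        super().__init__()
        # registered parameter: converted by .to()
        self.weight = torch.nn.Parameter(torch.zeros(3))
        # plain attribute standing in for a TensorMap: .to() ignores it
        self.extra = torch.zeros(3)

model = Model().to(torch.float64)
print(model.weight.dtype)  # torch.float64 -- converted
print(model.extra.dtype)   # torch.float32 -- left behind
```

The same silent mismatch happens when moving the module to a different device.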

@pfebrer
Contributor

pfebrer commented Nov 23, 2025

Hmm I see, could we only apply the change to the modules that need it, which as far as I understand are the CompositionModel and a few others?

@pfebrer
Contributor

pfebrer commented Nov 23, 2025

Or could this be solved by overwriting the .to method of CompositionModel ?

@ceriottm
Contributor

My understanding is that this is helpful for modules that use TensorMaps. So there's no point in converting models that are 100% torch, but those that contain some metatensor objects would benefit from being converted.

@GardevoirX
Contributor Author

My understanding is that this is helpful for modules that use TensorMaps. So there's no point in converting models that are 100% torch, but those that contain some metatensor objects would benefit from being converted.

I think having a single, no-brainer choice of Module would be better. It might be confusing to have two different Modules in the repo, and when implementing new models one would have to ask: does any part of my model involve moving a TensorMap to a new device or dtype?

@pfebrer
Contributor

pfebrer commented Nov 25, 2025

From the perspective of an outsider I think it is much better to use torch.nn.Module, and it is very unlikely that these people will find the need to have TensorMaps as buffers of their modules. It is also very unlikely that they care what the CompositionModel inherits from.

@GardevoirX
Contributor Author

Okay, if so I think I don't need to replace everything with metatensor.learn.nn.Module, given that people using TensorMap know which Module to use

@pfebrer
Contributor

pfebrer commented Nov 25, 2025

That would be my opinion, yes, but maybe others have a different opinion; let's see 🙂

@Luthaf
Member

Luthaf commented Nov 26, 2025

So mts.learn.nn.Module is 100% compatible with torch.nn.Module, but as mentioned above improves compatibility with metatensor data, both in the .to function and making sure the data is included in the state_dict.

This should remove a bunch of workarounds, xxx_buffers, and function calls at the start of forward(); instead, everything works out of the box as one would expect, at the cost of changing what you inherit from.
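For context, a hypothetical sketch (invented names, not the actual metatrain code) of the kind of buffer workaround being referred to: values that conceptually belong in a TensorMap are registered as buffers so that .to() moves them, and the richer container is rebuilt at the start of forward():

```python
import torch

class CompositionLike(torch.nn.Module):
    # Hypothetical illustration: the raw values are kept as a registered
    # buffer (so .to() converts/moves them), and a TensorMap-like
    # container is rebuilt on every forward() call.
    def __init__(self, values: torch.Tensor):
        super().__init__()
        self.register_buffer("composition_values", values)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # rebuild the container here (a dict stands in for a TensorMap)
        container = {"values": self.composition_values}
        return x + container["values"]

model = CompositionLike(torch.ones(3)).to(torch.float64)
out = model(torch.zeros(3, dtype=torch.float64))
```

With mts.learn.nn.Module the metatensor objects could be held directly on the module, making this rebuild step unnecessary.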

I personally think that it is best not to have two nn.Module classes, so we should always use (and enforce with linting) mts.learn.nn.Module everywhere. I don't think most people understand what torch.nn.Module even does, so I think it is fine for us to say "use this magic class instead of this other one".

Or could this be solved by overwriting the .to method of CompositionModel ?

No, because this would only override the method in Python, not in TorchScript.
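For reference, a sketch (invented names) of what such an override looks like; it behaves as hoped in eager Python, but a scripted model calls the compiled torch.nn.Module.to instead, silently bypassing the override:

```python
import torch

class Custom(torch.nn.Module):
    def __init__(self):
        super().__init__()
        # plain attribute, invisible to the default .to()
        self.extra = torch.zeros(2)

    def to(self, *args, **kwargs):
        # eager-mode override: also move/convert the plain attribute
        self.extra = self.extra.to(*args, **kwargs)
        return super().to(*args, **kwargs)

model = Custom().to(torch.float64)
print(model.extra.dtype)  # torch.float64, but only in eager mode
```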

@pfebrer
Contributor

pfebrer commented Nov 26, 2025

This should remove a bunch of workarounds, xxx_buffers, and function calls at the start of forward(); instead, everything works out of the box as one would expect, at the cost of changing what you inherit from.

The only place where I see this PR removing code is for the CompositionModel and Scaler. Will it allow us to remove more code?

I don't think most people understand what torch.nn.Module even does, so I think it is fine for us to say "use this magic class instead of this other one".

This is not so clear to me, to be honest. The people who will contribute new architectures to metatrain surely know what torch.nn.Module does

No, because this would only override the method in Python, not in TorchScript.

Why?

@Luthaf
Member

Luthaf commented Nov 26, 2025

The only place where I see this PR removing code is for the CompositionModel and Scaler. Will it allow us to remove more code?

Any place using TensorMap/TensorBlock/Labels inside the model should become simpler. Everything else will stay the same.

I don't think most people understand what torch.nn.Module even does, so I think it is fine for us to say "use this magic class instead of this other one".

This is not so clear to me, to be honest. The people who will contribute new architectures to metatrain surely know what torch.nn.Module does

I mean most people don't know how it is implemented, or how the state_dict is generated, or why self.a = some_torch_tensor is magic inside a torch.nn.Module (which is fine!). They just use it because it is what the documentation tells them to.

No, because this would only override the method in Python, not in TorchScript.

Why?

Because that's how TorchScript works? You cannot override any of the default methods of torch.nn.Module. For mts.learn.nn.Module, implementing this required overriding it on the Python side, overriding it again for the TorchScript-in-Python execution mode, and overriding it once more in C++.

@pfebrer
Contributor

pfebrer commented Nov 27, 2025

Any place using TensorMap/TensorBlock/Labels inside the model should become simpler. Everything else will stay the same.

So, some other place apart from CompositionModel and Scaler? haha

Because that's how TorchScript works? You cannot override any of the default methods of torch.nn.Module. For mts.learn.nn.Module, implementing this required overriding it on the Python side, overriding it again for the TorchScript-in-Python execution mode, and overriding it once more in C++.

Ok, didn't know this.
