Add safetensors export feature #3345

Open — wants to merge 3 commits into main
Conversation

@jonboh (Contributor) commented on Jul 3, 2025

This is still a work in progress. I started out following antimora's indications in #3260 (comment).

I did not find a clean way to walk the Module with the Serializer and perform the serialization only with the Recorder. The problem is that once the Serializer has finished, I'd need to reconstruct each TensorData from its bytes, dtype and shape fields, since the serializer walks all the way down to the basic types.

The alternative I found was to use the Serializer to get a mapping from ParamId to the tensor name, and a ModuleVisitor to link each ParamId to its TensorData. I think this is a bit cleaner, but the API is quite different from the rest of the serializations, so I'm not sure whether it would be better to do it the other way, even if the logic ends up a bit more convoluted.
Any comments are appreciated :)

@antimora (Collaborator) commented on Jul 3, 2025

I think I know the issue you are facing regarding the serializer going all the way down. It's okay to stop when you come across a TensorData struct type and handle its serialization differently.

My recommendation:

  1. Don't use ModuleVisitor (you can get away with a serde serializer).
  2. The only intermediate data structure should be a hash map from full path name to safetensors tensor view.

@antimora antimora changed the title add safetensors export feature Add safetensors export feature Jul 12, 2025
@jonboh jonboh marked this pull request as ready for review July 13, 2025 12:01
@jonboh jonboh requested a review from antimora July 13, 2025 12:01
codecov bot commented on Jul 13, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 35.19%. Comparing base (f8273f0) to head (b504a8e).
Report is 41 commits behind head on main.

❌ Your project check has failed because the head coverage (35.19%) is below the target coverage (80.00%). You can increase the head coverage or adjust the target coverage.

Additional details and impacted files
@@             Coverage Diff             @@
##             main    #3345       +/-   ##
===========================================
- Coverage   82.49%   35.19%   -47.31%     
===========================================
  Files         990      342      -648     
  Lines      127088    53164    -73924     
===========================================
- Hits       104846    18709    -86137     
- Misses      22242    34455    +12213     


@antimora (Collaborator) left a comment

Congrats on making it work! I know dealing with Serde is not easy.

I did a quick pass; here are my high-level comments. I will dive into the details later.

  1. We should hook save_item under SafetensorsFileRecorder.
  2. We should aim not to duplicate tensor data. The tensor view should be sufficient for safetensors to pull the data directly.
  3. It seems the serde serializer could be simplified and made more robust. I am still looking into the implementation, so I do not have concrete suggestions yet.
  4. We should verify serialization of tuples, enums and vecs of modules. It might be worth updating the safetensors tests with more complete test data.
  5. We should add a module adapter just like in SafetensorsFileRecorder, because our modules are not one-to-one. SafetensorsFileRecorder's default adapter is PyTorch.

@@ -26,6 +26,7 @@ safetensors = [
"thiserror",
"zip",
"candle-core",
"dep:safetensors",
Collaborator:

You don't need to use dep: for the newer Rust editions.

Comment on lines +33 to +41
let mut file = File::create("model.safetensors").unwrap();
file.write_all(&serialized).unwrap();
let record = SafetensorsFileRecorder::<FullPrecisionSettings>::default()
    .load(
        LoadArgs::new("model.safetensors".into()).with_adapter_type(AdapterType::NoAdapter),
        &device,
    )
    .expect("Should decode state successfully");
std::fs::remove_file("model.safetensors").unwrap();
Collaborator:

I recommend using NamedTempFile https://github.com/tracel-ai/burn/blob/main/crates/burn-dataset/src/dataset/sqlite.rs#L713. Otherwise you'll get collisions and leftover files.

        .serialize(&mut ser)
        .unwrap();
    safetensors::serialize(ser.into_map(), None)
}
Collaborator:

We should also enhance SafetensorsFileRecorder's save_item method, just like in
https://github.com/tracel-ai/burn/blob/main/crates/burn-core/src/record/file.rs#L194

This way the SafetensorsFileRecorder feature is symmetrical.

Comment on lines +10 to +17
pub struct SafetensorsTensorData {
    bytes: Vec<u8>,
    shape: Vec<usize>,
    dtype: DType,
}

impl safetensors::View for SafetensorsTensorData {
    fn dtype(&self) -> safetensors::Dtype {
Collaborator:

Can't we create a view on TensorData directly, without the need to copy into bytes? That way there won't be intermediate memory usage for large models: if we have an 8 GB model sitting on the GPU, we won't create a copy in RAM.

@jonboh (Contributor, author) commented on Jul 13, 2025

Yes, you are right. I was preoccupied with making it work and didn't try using just a reference to the original bytes. I'll modify it so that instead of Vec<u8> we point to the original data with a &[u8].
