[Feature suggestion] Optionally return the inferred dimensions #321

ml-w · 2024-05-14T07:34:47Z

Description

Einops calculates automatically some of the inferred dimension. For example, in ViT, Einops is great for decomposing a tensor into stacked patches like this:

x = torch.ones([1, 1, 256, 256]) 
patches = rearrange(x, 'b c (nx px) (ny py) -> b (c nx ny) px py', px=32, py=32)

This decompose the input 256 x 256 image into a stack of 32 x 32 patches, with the number of patches nx and ny calculated automatically.

Now the calculated nx and ny sometimes need to be reuse. When there's this requirement, we generally need to calculate it ourselves before invoking rearrange, which IMO defeat the purpose of using rearrange. It would be great if there's a function that returns the inferred dimensions as a dictionary when using Einops.

Usecase

Sometimes we might want to reuse the calculated dimensions that might be variables. Especially important when the input is a sequence with variable length. Usually, people pad it to a standard length, but this standard length can also be a variable (set to be divisible by a fix number [e.g., padded to a multiple of 64]). This proposed function will be useful in these usecases.

Examples implementation

I think the implementation is straight forward, either return as a dictionary when a flag is set to True, or populate an input dictionary with the calculated infer axis symbols.

x = torch.ones([1, 1, 256, 256])
# Add optional flag `return_dims`
patches, dims = rearrange(x, 'b c (nx px) (ny py) -> b (c nx ny) px py', px=32, py=32, return_dims=True)
# Add optional input dictionary for populating
dims ={}
patches = rearrange(x, 'b c (nx px) (ny py) -> b (c nx ny) px py', px=32, py=32, return_dims_to_dict=dims)
# dims = {
  'b': 1, 
  'c': 1, 
  'nx': 8, 
  'ny': 8,
  'px': 32,
  'py': 32
}

Something like that.

The text was updated successfully, but these errors were encountered:

arogozhnikov · 2024-09-14T17:36:22Z

forgot to answer this, sorry!

I am perfectly aware about desire to have shared/reusable dimensions with einops.
I've sketched a number of options, low-hanging options aren't good enough, deeper integrations will break torch.compile / torch.jit.script (and similar tools that support only very narrow subset of python).

Users need an explicit control over dimensions passed/stored (absent in your example), but python simply doesn't have it.

E.g. in JS this would be legal:

patches, {nx, ny} = rearrange(x, 'b c (nx px) (ny py) -> b (c nx ny) px py', px=32, py=32)

But in python best we have is sequence unpacking, that's why pack/unpack operate on lists, so your problem is solved with:

patches = rearrange(x, 'b c (nx px) (ny py) -> b c nx ny px py', px=32, py=32)
patches, [[c, nx, ny]] = einops.pack([patches], 'b * px py')

or

patches = rearrange(x, 'b c (nx px) (ny py) -> b c nx ny px py', px=32, py=32)
shapes = parse_shape(x, '_ c nx ny _ _')
patches = rearrange(patches, 'b c nx ny px py -> b (c nx ny) px py')

or even with _, c, nx, ny, _, _ = patches.shape - depending on how dimensions should be used later

arogozhnikov · 2024-09-14T17:39:17Z

Also, there was a project that introduced an explicit object to store dimensions in einops, but I can't find it right now

alabamagan · 2024-09-14T18:03:48Z

Hi, thanks for the answer! Never would have thought there's such a depth to this use case, but I can now see the issue is with compatibility requirement to torch.compile.

ml-w added the feature suggestion label May 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature suggestion] Optionally return the inferred dimensions #321

[Feature suggestion] Optionally return the inferred dimensions #321

ml-w commented May 14, 2024

arogozhnikov commented Sep 14, 2024

arogozhnikov commented Sep 14, 2024

alabamagan commented Sep 14, 2024

[Feature suggestion] Optionally return the inferred dimensions #321

[Feature suggestion] Optionally return the inferred dimensions #321

Comments

ml-w commented May 14, 2024

Description

Usecase

Examples implementation

arogozhnikov commented Sep 14, 2024

arogozhnikov commented Sep 14, 2024

alabamagan commented Sep 14, 2024