Positional embedding for time in temporal attention layers in SVD #6541
Unanswered
jeanne-wang
asked this question in
Q&A
Replies: 1 comment 1 reply
-
Hey @jeanne-wang, I'm not 100% sure what you mean by "positional embedding for time", but the difference between SVD and say SDXL is that SVD conditions the unet not on discrete timesteps (e.g. 1, 2, ..., 999), but on continous "noise" values such as (3.2341).
You should also probably take a look at https://arxiv.org/abs/2206.00364 |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
It seems that SVD are not actually using positional embedding for time in the temporal attention layers in the released model. Are there any specific reason not doing this?
Beta Was this translation helpful? Give feedback.
All reactions