- Title: SViMo: Synchronized Diffusion for Video and Motion Generation in Hand-object Interaction Scenarios
- TL;DR: A novel framework that combines visual priors and dynamic constraints within a synchronized diffusion process for joint generation of video and motion in Hand-Object Interaction (HOI) scenarios.
- Project page: https://droliven.github.io/SViMo_project/
- arXiv: https://arxiv.org/abs/2506.02444
- PDF: https://arxiv.org/pdf/2506.02444
- HF Paper page: https://huggingface.co/papers/2506.02444
- Video demonstration: https://www.youtube.com/watch?v=pVkntn-8KHo
- Code: https://github.com/Droliven/SViMo_code
- Models: Coming soon.
- Dataset: Coming soon.
If you find SViMo useful for your work, please cite:
@article{dang2025svimo,
title={SViMo: Synchronized Diffusion for Video and Motion Generation in Hand-object Interaction Scenarios},
author={Dang, Lingwei and Shao, Ruizhi and Zhang, Hongwen and Min, Wei and Liu, Yebin and Wu, Qingyao},
journal={arXiv preprint arXiv:2506.02444},
year={2025}
}