|
|
--- |
|
|
title: MIMO - Character Video Synthesis |
|
|
emoji: 🎭 |
|
|
colorFrom: blue |
|
|
colorTo: purple |
|
|
sdk: gradio |
|
|
sdk_version: 4.7.1 |
|
|
app_file: app.py |
|
|
pinned: false |
|
|
license: apache-2.0 |
|
|
python_version: "3.10" |
|
|
---IMO - Character Video Synthesis |
|
|
emoji: � |
|
|
colorFrom: blue |
|
|
colorTo: purple |
|
|
sdk: gradio |
|
|
sdk_version: 4.7.1 |
|
|
app_file: app.py |
|
|
pinned: false |
|
|
license: apache-2.0 |
|
|
python_version: "3.10" |
|
|
--- |
|
|
|
|
|
# MIMO - Controllable Character Video Synthesis |
|
|
|
|
|
**🎬 Complete Implementation Matching Research Paper** |
|
|
|
|
|
Transform character images into animated videos with controllable motion and advanced video editing capabilities. |
|
|
|
|
|
## Features |
|
|
|
|
|
- **Character Animation**: Animate character images with driving 3D poses from motion datasets |
|
|
- **Spatial 3D Motion**: Support for in-the-wild video with spatial 3D motion and interactive scenes |
|
|
- **Real-time Processing**: Optimized for interactive use in web interface |
|
|
- **Multiple Templates**: Pre-built motion templates for various activities (sports, dance, martial arts, etc.) |
|
|
|
|
|
## How to Use |
|
|
|
|
|
1. **Upload a character image**: Choose a full-body, front-facing image with no occlusion or handheld objects |
|
|
2. **Select motion template**: Pick from various pre-built motion templates in the gallery |
|
|
3. **Generate**: Click "Run" to synthesize the character animation video |
|
|
|
|
|
## Technical Details |
|
|
|
|
|
- **Model Architecture**: Based on spatial decomposed modeling with UNet 2D/3D architectures |
|
|
- **Motion Control**: Uses 3D pose guidance for precise motion control |
|
|
- **Scene Handling**: Supports background separation and occlusion handling |
|
|
- **Resolution**: Generates videos at 784x784 resolution |
|
|
|
|
|
## Citation |
|
|
|
|
|
If you find this work useful, please cite: |
|
|
|
|
|
```bibtex |
|
|
@inproceedings{men2025mimo, |
|
|
title={MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling}, |
|
|
author={Men, Yifang and Yao, Yuan and Cui, Miaomiao and Liefeng Bo}, |
|
|
booktitle={Computer Vision and Pattern Recognition (CVPR), 2025 IEEE Conference on}, |
|
|
year={2025} |
|
|
} |
|
|
``` |
|
|
|
|
|
## Links |
|
|
|
|
|
- [Project Page](https://menyifang.github.io/projects/MIMO/index.html) |
|
|
- [Paper](https://arxiv.org/abs/2409.16160) |
|
|
- [Original Repository](https://github.com/menyifang/MIMO) |
|
|
- [Video Demo](https://www.youtube.com/watch?v=skw9lPKFfcE) |
|
|
|
|
|
## Acknowledgments |
|
|
|
|
|
This work builds upon several excellent open-source projects including Moore-AnimateAnyone, SAM, 4D-Humans, and ProPainter. |
|
|
|
|
|
--- |
|
|
|
|
|
**Note**: This Space requires GPU resources for optimal performance. Processing time may vary depending on video length and complexity. |