	---
	title: MIMO - Character Video Synthesis
	emoji: 🎭
	colorFrom: blue
	colorTo: purple
	sdk: gradio
	sdk_version: 4.7.1
	app_file: app.py
	pinned: false
	license: apache-2.0
	python_version: "3.10"
	---

	# MIMO - Controllable Character Video Synthesis

	🎬 Complete Implementation Matching Research Paper

	Transform character images into animated videos with controllable motion and advanced video editing capabilities.

	## Features

	- Character Animation: Animate character images with driving 3D poses from motion datasets
	- Spatial 3D Motion: Support for in-the-wild video with spatial 3D motion and interactive scenes
	- Real-time Processing: Optimized for interactive use in the web interface
	- Multiple Templates: Pre-built motion templates for various activities (sports, dance, martial arts, etc.)

	## How to Use

	1. Upload a character image: Choose a full-body, front-facing image with no occlusion or handheld objects
	2. Select motion template: Pick from various pre-built motion templates in the gallery
	3. Generate: Click "Run" to synthesize the character animation video
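Before uploading, the first two steps can be sanity-checked locally. The sketch below is illustrative only: the allowed file suffixes and the template names are hypothetical placeholders, not values read from this Space, and `check_request` is not part of the app.

```python
from pathlib import Path

# Hypothetical values for illustration; the Space may accept other
# formats and expose differently named templates.
ALLOWED_SUFFIXES = {".png", ".jpg", ".jpeg", ".webp"}
EXAMPLE_TEMPLATES = ["sports", "dance", "martial_arts"]


def check_request(image_path: str, template: str,
                  templates=EXAMPLE_TEMPLATES) -> list[str]:
    """Return a list of problems; an empty list means the request looks usable."""
    problems = []
    # Step 1: the character image should be a common raster format.
    if Path(image_path).suffix.lower() not in ALLOWED_SUFFIXES:
        problems.append(f"unsupported image type: {image_path}")
    # Step 2: the motion template should be one of the available names.
    if template not in templates:
        problems.append(f"unknown motion template: {template}")
    return problems
```

A check like this only catches file-level mistakes. Whether the image is full-body, front-facing, and free of occlusion still has to be judged by eye.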

	## Technical Details

	- Model Architecture: Based on spatial decomposed modeling with UNet 2D/3D architectures
	- Motion Control: Uses 3D pose guidance for precise motion control
	- Scene Handling: Supports background separation and occlusion handling
	- Resolution: Generates videos at 784x784 resolution
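Since the output is square, it can help to see how a non-square character image would fit a 784x784 canvas. The sketch below assumes a simple letterbox (scale the longer side to 784, then center with padding). This README does not say how the Space resizes inputs, so treat `fit_to_square` as a hypothetical helper rather than the app's actual preprocessing.

```python
from typing import NamedTuple

TARGET = 784  # output side length stated in this README


class Letterbox(NamedTuple):
    new_width: int
    new_height: int
    pad_left: int
    pad_top: int


def fit_to_square(width: int, height: int, target: int = TARGET) -> Letterbox:
    """Scale (width, height) to fit inside a target x target square, centered."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    longest = max(width, height)
    # Scale so the longer side equals `target`, preserving aspect ratio.
    new_width = max(1, round(width * target / longest))
    new_height = max(1, round(height * target / longest))
    # Split the leftover space evenly; any odd pixel goes to the right/bottom.
    pad_left = (target - new_width) // 2
    pad_top = (target - new_height) // 2
    return Letterbox(new_width, new_height, pad_left, pad_top)
```

For example, a 1080x1920 portrait photo would be scaled to 441x784 and padded by 171 pixels on the left under this assumption.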

	## Citation

	If you find this work useful, please cite:

	```bibtex
	@inproceedings{men2025mimo,
	title={MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling},
	author={Men, Yifang and Yao, Yuan and Cui, Miaomiao and Bo, Liefeng},
	booktitle={Computer Vision and Pattern Recognition (CVPR), 2025 IEEE Conference on},
	year={2025}
	}
	```

	## Links

	- [Project Page](https://menyifang.github.io/projects/MIMO/index.html)
	- [Paper](https://arxiv.org/abs/2409.16160)
	- [Original Repository](https://github.com/menyifang/MIMO)
	- [Video Demo](https://www.youtube.com/watch?v=skw9lPKFfcE)

	## Acknowledgments

	This work builds upon several excellent open-source projects including Moore-AnimateAnyone, SAM, 4D-Humans, and ProPainter.

	---

	Note: This Space requires GPU resources for optimal performance. Processing time may vary depending on video length and complexity.