zR
commited on
Commit
·
84c5dee
1
Parent(s):
7cdc618
GPU memory update
Browse files- .gitignore +9 -0
- README.md +14 -12
- README_zh.md +4 -2
.gitignore
ADDED
|
@@ -0,0 +1,9 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
output/
|
| 2 |
+
*__pycache__/
|
| 3 |
+
samples*/
|
| 4 |
+
runs/
|
| 5 |
+
checkpoints/
|
| 6 |
+
master_ip
|
| 7 |
+
logs/
|
| 8 |
+
*.DS_Store
|
| 9 |
+
.idea
|
README.md
CHANGED
|
@@ -19,7 +19,7 @@ inference: false
|
|
| 19 |
</div>
|
| 20 |
<p align="center">
|
| 21 |
<a href="https://huggingface.co/THUDM/CogVideoX-2b/blob/main/README_zh.md">📄 中文阅读</a> |
|
| 22 |
-
<a href="https://github.com/THUDM/CogVideo">🌐 Github</a> |
|
| 23 |
<a href="#">📜 arxiv (coming soon) </a>
|
| 24 |
</p>
|
| 25 |
|
|
@@ -87,18 +87,20 @@ inference: false
|
|
| 87 |
CogVideoX is an open-source video generation model that shares the same origins as [清影](https://chatglm.cn/video).
|
| 88 |
The table below provides a list of the video generation models we currently offer, along with their basic information.
|
| 89 |
|
| 90 |
-
| Model Name | CogVideoX-2B (Current Repos)
|
| 91 |
-
|
| 92 |
-
| Supported Prompt Language | English
|
| 93 |
| GPU Memory Required for Inference | 36GB (will be optimized before the PR is merged) |
|
| 94 |
-
| GPU Memory Required for Fine-tuning (bs=1) |
|
| 95 |
-
| Prompt Length | 226 Tokens
|
| 96 |
-
| Video Length | 6 seconds
|
| 97 |
-
| Frames Per Second | 8 frames
|
| 98 |
-
| Resolution | 720 * 480
|
| 99 |
-
| Positional Embeddings | Sinusoidal
|
| 100 |
-
| Quantized Inference | Not Supported
|
| 101 |
-
| Multi-card Inference | Not Supported
|
|
|
|
|
|
|
| 102 |
|
| 103 |
## Quick Start 🤗
|
| 104 |
|
|
|
|
| 19 |
</div>
|
| 20 |
<p align="center">
|
| 21 |
<a href="https://huggingface.co/THUDM/CogVideoX-2b/blob/main/README_zh.md">📄 中文阅读</a> |
|
| 22 |
+
<a href="https://github.com/THUDM/CogVideo">🌐 Github(with PDF paper)</a> |
|
| 23 |
<a href="#">📜 arxiv (coming soon) </a>
|
| 24 |
</p>
|
| 25 |
|
|
|
|
| 87 |
CogVideoX is an open-source video generation model that shares the same origins as [清影](https://chatglm.cn/video).
|
| 88 |
The table below provides a list of the video generation models we currently offer, along with their basic information.
|
| 89 |
|
| 90 |
+
| Model Name | CogVideoX-2B (Current Repos) |
|
| 91 |
+
|--------------------------------------------|-----------------------------------------------|
|
| 92 |
+
| Supported Prompt Language | English |
|
| 93 |
| GPU Memory Required for Inference | 36GB (will be optimized before the PR is merged) |
|
| 94 |
+
| GPU Memory Required for Fine-tuning (bs=1) | 42GB |
|
| 95 |
+
| Prompt Length | 226 Tokens |
|
| 96 |
+
| Video Length | 6 seconds |
|
| 97 |
+
| Frames Per Second | 8 frames |
|
| 98 |
+
| Resolution | 720 * 480 |
|
| 99 |
+
| Positional Embeddings | Sinusoidal |
|
| 100 |
+
| Quantized Inference | Not Supported |
|
| 101 |
+
| Multi-card Inference | Not Supported |
|
| 102 |
+
|
| 103 |
+
**Note** Using [SAT](https://github.com/THUDM/SwissArmyTransformer) model cost 18GB for inference. Check our github.
|
| 104 |
|
| 105 |
## Quick Start 🤗
|
| 106 |
|
README_zh.md
CHANGED
|
@@ -6,7 +6,7 @@
|
|
| 6 |
</div>
|
| 7 |
<p align="center">
|
| 8 |
<a href="https://huggingface.co/THUDM/CogVideoX-2b/blob/main/README.md">📄 Read in English</a> |
|
| 9 |
-
<a href="https://github.com/THUDM/CogVideo">🌐 Github</a> |
|
| 10 |
<a href="#">📜 arxiv (即将发布) </a>
|
| 11 |
</p>
|
| 12 |
|
|
@@ -77,7 +77,7 @@ CogVideoX是 [清影](https://chatglm.cn/video) 同源的开源版本视频生
|
|
| 77 |
|---------------|---------------------|
|
| 78 |
| 提示词语言 | English |
|
| 79 |
| 推理显存消耗 | 36GB(会在PR合并之前优化) |
|
| 80 |
-
| 微调显存消耗 (bs=1) |
|
| 81 |
| 提示词长度上限 | 226 Tokens |
|
| 82 |
| 视频生成长度 | 6 seconds |
|
| 83 |
| 视频生成帧率 (每秒) | 8 frames |
|
|
@@ -86,6 +86,8 @@ CogVideoX是 [清影](https://chatglm.cn/video) 同源的开源版本视频生
|
|
| 86 |
| 量化 | 不支持 |
|
| 87 |
| 多卡推理 | 不支持 |
|
| 88 |
|
|
|
|
|
|
|
| 89 |
## 快速上手 🤗
|
| 90 |
|
| 91 |
本模型已经支持使用 huggingface 的 diffusers 库进行部署,你可以按照以下步骤进行部署。
|
|
|
|
| 6 |
</div>
|
| 7 |
<p align="center">
|
| 8 |
<a href="https://huggingface.co/THUDM/CogVideoX-2b/blob/main/README.md">📄 Read in English</a> |
|
| 9 |
+
<a href="https://github.com/THUDM/CogVideo">🌐 Github(包含PDF论文)</a> |
|
| 10 |
<a href="#">📜 arxiv (即将发布) </a>
|
| 11 |
</p>
|
| 12 |
|
|
|
|
| 77 |
|---------------|---------------------|
|
| 78 |
| 提示词语言 | English |
|
| 79 |
| 推理显存消耗 | 36GB(会在PR合并之前优化) |
|
| 80 |
+
| 微调显存消耗 (bs=1) | 42GB |
|
| 81 |
| 提示词长度上限 | 226 Tokens |
|
| 82 |
| 视频生成长度 | 6 seconds |
|
| 83 |
| 视频生成帧率 (每秒) | 8 frames |
|
|
|
|
| 86 |
| 量化 | 不支持 |
|
| 87 |
| 多卡推理 | 不支持 |
|
| 88 |
|
| 89 |
+
**Note** 使用 [SAT](https://github.com/THUDM/SwissArmyTransformer) 推理SAT版本模型仅需18G显存。欢迎前往我们的github查看。
|
| 90 |
+
|
| 91 |
## 快速上手 🤗
|
| 92 |
|
| 93 |
本模型已经支持使用 huggingface 的 diffusers 库进行部署,你可以按照以下步骤进行部署。
|