🌀 Wan2.1_14B_FusionX

High-Performance Merged Text-to-Video Model
Built on WAN 2.1 and fused with research-grade components for cinematic motion, detail, and speed — optimized for ComfyUI and rapid iteration in as few as 6 steps.

Merged models for faster, richer motion & detail — high performance even at just 8 steps.

📌 Important: To match the quality shown here, use the linked workflows or make sure to follow the recommended settings outlined below.


🚀 Overview

A powerful text-to-video model built on top of WAN 2.1 14B, merged with several research-grade models to boost:

  • Motion quality
  • Scene consistency
  • Visual detail

Comparable with closed-source solutions, but open and optimized for ComfyUI workflows.


💡 Inside the Fusion

This model is made up of the following which is on TOP of Wan 2.1 14B 720p(FusionX would not be what it is without these Models):

All merged models are provided for research and non-commercial use only. Some components are subject to licenses such as CC BY-NC-SA 4.0, and do not fall under permissive licenses like Apache 2.0 or MIT. Please refer to each model’s original license for full usage terms.


🚨✨Hey guys! Just a quick update!

We finally cooked up FusionX LoRAs!! 🧠💥
This is huge – now you can plug FusionX into your favorite workflows as a LoRA on top of the Wan base models and SkyReels models!🔌💫 You can still stick with the base FusionX Model if you already use it, but if you would rather have more control over the "FusionX" strength and a speed boost, then this might be for you.

Oh, and there’s a nice speed boost too! ⚡
Example: (RTX 5090)

  • FusionX as a full base model: 8 steps = 160s ⏱️
  • FusionX as a LoRA on Wan 2.1 14B fp8 T2V: 8 steps = 120s 🚀

Bonus: You can bump up the FusionX LoRA strength and lower your steps for a huge speed boost while testing/drafting.
Example: strength 2.00 with 3 steps takes 72 seconds.
Or lower the strength to experiment with a less “FusionX” look. ⚡🔍

We’ve got:

  • T2V (Text to Video) 🎬 – works perfectly with VACE ⚙️
  • I2V (Image to Video) 🖼️➡️📽️
  • A dedicated Phantom LoRA 👻
    The new LoRA's are HERE Note: The LoRa's are not meant to be put on top of the FusionX main models and instead you would use them with the Wan base models. New workflows are HERE 🛠️🚀

After lots of testing 🧪, the video quality with the LoRA is just as good (and sometimes even better! 💯)
That’s thanks to it being trained on the fp16 version of FusionX 🧬💎


🌀 Preview Gallery

These are compressed GIF previews for quick viewing — final video outputs are higher quality.

FusionX_00020
FusionX_00021
FusionX_00022
FusionX_00023
FusionX_00024
FusionX_00025
FusionX_00026
FusionX_00027
FusionX_00028
FusionX_00029
FusionX_00030
FusionX_00031


📂 Workflows & Model Downloads

🧠 GGUF Variants:


🎬 Example Videos

Want to see what FusionX can do? Check out these real outputs generated using the latest workflows and settings:


🔧 Usage Details

Text-to-Video

  • CGF: Must be set to 1
  • Shift:
    • 1024x576: Start at 1
    • 1080x720: Start at 2
    • For realism → lower values
    • For stylized → test 3–9
  • Scheduler:
    • Recommended: uni_pc
    • Alternative: flowmatch_causvid (better for some details)

Image-to-Video

  • CGF: 1
  • Shift: 2 works best in most cases
  • Scheduler:
    • Recommended: dmp++_sde/beta
  • To boost motion and reduce slow-mo effect:
    • Frame count: 121
    • FPS: 24

🛠 Technical Notes

  • Works in as few as 6 steps
  • Best quality at 8–10 steps
  • Drop-in replacement for Wan2.1-T2V-14B
  • Up to 50% faster rendering, especially with SageAttn
  • Works natively and with Kaji Wan Wrapper
    Wrapper GitHub
  • Do not re-add merged LoRAs (CausVid, AccVideo, MPS)
  • Feel free to add other LoRAs for style/variation
  • Native WAN workflows also supported (slightly slower)

🧪 Performance Tips

  • RTX 5090 → ~138 sec/video at 1024x576 / 81 frames
  • If VRAM is limited:
    • Enable block swapping
    • Start with 5 blocks and adjust as needed
  • Use SageAttn for ~30% speedup (wrapper only)
  • Do not use teacache
  • "Enhance a video" (tested): Adds vibrance (try values 2–4)
  • "SLG" not tested — feel free to explore

🧠 Prompt Help

Want better cinematic prompts? Try the WAN Cinematic Video Prompt Generator GPT — it adds visual richness and makes a big difference in quality. Download Here


📣 Join The Community

We’re building a friendly space to chat, share outputs, and get help.

  • Motion LoRAs coming soon
  • Tips, updates, and support from other users

👉 Join the Discord


⚖️ License

Some merged components use permissive licenses (Apache 2.0 / MIT),
but others — such as those from research models like CausVid — may be released under non-commercial licenses (e.g., CC BY-NC-SA 4.0).

  • ✅ You can use, modify, and redistribute under original license terms
  • ❗ You must retain and respect the license of each component
  • ⚠️ Commercial use is not permitted for models or components under non-commercial licenses
  • 📌 Outputs are not automatically licensed — do your own due diligence

This model is intended for research, education, and personal use only.
For commercial use or monetization, please consult a legal advisor and verify all component licenses.


🙏 Credits

  • WAN Team (base model)
  • aejion (AccVideo)
  • Tianwei Yin (CausVid)
  • ZuluVision (MoviiGen)
  • Alibaba PAI (MPS LoRA)
  • Kijai (ComfyUI Wrapper)

And thanks to the open-source community!


Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Examples
Examples
This model isn't deployed by any Inference Provider. 🙋 6 Ask for provider support

Model tree for vrgamedevgirl84/Wan14BT2VFusioniX

Finetuned
(32)
this model
Merges
1 model
Quantizations
3 models

Spaces using vrgamedevgirl84/Wan14BT2VFusioniX 44