starsfriday's picture
Update README.md
af4082b verified
metadata
license: apache-2.0
language:
  - en
  - zh
library_name: diffusers
base_model:
  - Qwen/Qwen-Image-Edit
pipeline_tag: image-to-image
tags:
  - image-editing
  - consistency
  - aesthetics
  - DiT
  - Qwen-Image
  - ValiantCat


🌈 Qwen-Image-Edit-MeiTu

This model — Qwen-Image-Edit-MeiTu — is an improved variant of Qwen/Qwen-Image-Edit, built with DiT-based architecture fine-tuning to enhance visual consistency, aesthetic quality, and structural alignment in complex edits.

Developed by Valiant Cat AI Lab, this version aims to further close the gap between high-fidelity semantic editing and coherent artistic rendering, achieving a more natural and professional output across a wide range of prompts and subjects.


✨ Key Improvements

  • Enhanced Consistency:
    Utilizes DiT (Diffusion Transformer) fine-tuning to ensure structural stability between input and edited regions, maintaining global spatial coherence.

  • Aesthetic Optimization:
    Trained with aesthetic discriminators and curated aesthetic score datasets, producing more pleasing colors, contrast, and light balance.

  • Better Detail Preservation:
    Improved low-level reconstruction for fine details such as textures, faces, and typography.

  • Broader Scene Adaptability:
    Performs well on portraits, environments, product photos, and illustrations, supporting both semantic and appearance-based editing.


🖼️ Showcase

Below are examples of consistency and aesthetic improvement in complex editing scenarios:

Input & Output

💬 Recommended Prompts

Try these prompts to explore the model’s strengths:

  • “make the lighting soft and cinematic with better balance”
  • “enhance the photo’s composition and maintain realism”
  • “refine skin tone and texture consistency”
  • “improve the global color tone and aesthetic harmony”
  • “increase photo realism and clarity without changing content”

🧩 Integration with ComfyUI

This model works seamlessly with a modified ComfyUI Qwen-Image-Edit workflow.
Just use this model in the Unet node to workflow for edit image.


📥 Download Model

Weights available in Safetensors format:

👉 Download Qwen-Image-Edit-MeiTu


🧠 Training

This model was trained and optimized by the
AI Laboratory of Chongqing Valiant Cat Technology Co., LTD.
Visit https://vvicat.com/ for business collaborations or research partnerships.


📜 License

Licensed under Apache 2.0.


💼 Join Us

We are hiring research engineers and creative ML practitioners at
Chongqing Valiant Cat Technology Co., LTD — reach out via
📧 [email protected]