---
license: cc-by-nc-4.0
base_model:
  - black-forest-labs/FLUX.1-Fill-dev
  - bytedance-research/OneReward
language:
  - en
pipeline_tag: image-to-image
---

OneReward - ComfyUI


This repo contains the OneReward checkpoint, processed into a single model suitable for use in ComfyUI.
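As a rough illustration of using the checkpoint in ComfyUI, the sketch below copies an already downloaded model file into a ComfyUI installation. It rests on assumptions not stated in this repo: the helper name `install_model` is hypothetical, the `models/diffusion_models` subfolder is assumed to be where your ComfyUI setup looks for this kind of model (adjust `subfolder` if your installation differs), and the paths in the usage comment are placeholders.

```python
import shutil
from pathlib import Path


def install_model(downloaded_file: str, comfyui_root: str,
                  subfolder: str = "diffusion_models") -> Path:
    """Copy a downloaded model file into a ComfyUI models subfolder.

    Hypothetical helper: the default subfolder is an assumption about
    the ComfyUI layout, not something this repo specifies.
    """
    src = Path(downloaded_file)
    if not src.is_file():
        raise FileNotFoundError(f"model file not found: {src}")

    # Assumed layout: <comfyui_root>/models/<subfolder>/<file name>
    target_dir = Path(comfyui_root) / "models" / subfolder
    target_dir.mkdir(parents=True, exist_ok=True)

    target = target_dir / src.name
    shutil.copy2(src, target)  # copy the file, preserving timestamps
    return target


# Example usage (placeholder paths, replace with your own):
# install_model("/path/to/downloaded_model_file", "/path/to/ComfyUI")
```

After copying, restart ComfyUI or refresh its model list so the file shows up in the relevant loader node.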

OneReward is a novel RLHF method for the visual domain that employs Qwen2.5-VL as a generative reward model to enhance multi-task reinforcement learning, significantly improving the policy model's generation ability across multiple subtasks. Built on OneReward, FLUX.1-Fill-dev-OneReward, which is based on FLUX Fill [dev], outperforms the closed-source FLUX Fill [Pro] on inpainting and outpainting tasks and serves as a strong new baseline for future research in unified image editing.

For more details and examples, see the original model repo: bytedance-research/OneReward