flux.1-fill-dev-OneReward / README.md

yichengup

Update README.md

9289ddd verified about 1 month ago

preview code

raw

history blame contribute delete

1.29 kB

metadata

license: cc-by-nc-4.0
base_model:
  - black-forest-labs/FLUX.1-Fill-dev
  - bytedance-research/OneReward
language:
  - en
pipeline_tag: image-to-image

OneReward - ComfyUI

This repo contains the checkpoint from OneReward processed into a single model suitable for ComfyUI use.

OneReward is a novel RLHF methodology for the visual domain by employing Qwen2.5-VL as a generative reward model to enhance multitask reinforcement learning, significantly improving the policy model’s generation ability across multiple subtask. Building on OneReward, FLUX.1-Fill-dev-OneReward - based on FLUX Fill [dev], outperforms closed-source FLUX Fill [Pro] in inpainting and outpainting tasks, serving as a powerful new baseline for future research in unified image editing.

For more details and examples see original model repo: OneReward