yichengup
/

flux.1-fill-dev-OneReward

Model card Files Files and versions

flux.1-fill-dev-OneReward / README.md

yichengup's picture

Update README.md

9289ddd verified about 1 month ago

|

history blame contribute delete

1.29 kB

	---
	license: cc-by-nc-4.0
	base_model:
	- black-forest-labs/FLUX.1-Fill-dev
	- bytedance-research/OneReward
	language:
	- en
	pipeline_tag: image-to-image
	---
	# OneReward - ComfyUI

	[![arXiv](https://img.shields.io/badge/arXiv-Paper-<COLOR>.svg)](https://arxiv.org/abs/2508.21066) [![GitHub Repo](https://img.shields.io/badge/GitHub-Repo-green?logo=github)](https://github.com/bytedance/OneReward) [![GitHub Pages](https://img.shields.io/badge/GitHub-Project-blue?logo=github)](https://one-reward.github.io/)
	<br>

	This repo contains the checkpoint from [OneReward](https://huggingface.co/bytedance-research/OneReward) processed into a single model suitable for ComfyUI use.

	OneReward is a novel RLHF methodology for the visual domain by employing Qwen2.5-VL as a generative reward model to enhance multitask reinforcement learning, significantly improving the policy model’s generation ability across multiple subtask. Building on OneReward, FLUX.1-Fill-dev-OneReward - based on FLUX Fill [dev], outperforms closed-source FLUX Fill [Pro] in inpainting and outpainting tasks, serving as a powerful new baseline for future research in unified image editing.


	For more details and examples see original model repo: [OneReward](https://huggingface.co/bytedance-research/OneReward)