|
|
--- |
|
|
license: cc-by-nc-4.0 |
|
|
base_model: |
|
|
- black-forest-labs/FLUX.1-Fill-dev |
|
|
- bytedance-research/OneReward |
|
|
language: |
|
|
- en |
|
|
pipeline_tag: image-to-image |
|
|
--- |
|
|
# OneReward - ComfyUI |
|
|
|
|
|
[](https://arxiv.org/abs/2508.21066) [](https://github.com/bytedance/OneReward) [](https://one-reward.github.io/) |
|
|
<br> |
|
|
|
|
|
This repo contains the checkpoint from [OneReward](https://huggingface.co/bytedance-research/OneReward) processed into a single model suitable for ComfyUI use. |
|
|
|
|
|
**OneReward** is a novel RLHF methodology for the visual domain by employing Qwen2.5-VL as a generative reward model to enhance multitask reinforcement learning, significantly improving the policy model’s generation ability across multiple subtask. Building on OneReward, **FLUX.1-Fill-dev-OneReward** - based on FLUX Fill [dev], outperforms closed-source FLUX Fill [Pro] in inpainting and outpainting tasks, serving as a powerful new baseline for future research in unified image editing. |
|
|
|
|
|
|
|
|
For more details and examples see original model repo: [**OneReward**](https://huggingface.co/bytedance-research/OneReward) |