---
license: cc-by-nc-4.0
base_model:
- black-forest-labs/FLUX.1-Fill-dev
- bytedance-research/OneReward
language:
- en
pipeline_tag: image-to-image
---
# OneReward - ComfyUI
[arXiv](https://arxiv.org/abs/2508.21066) [GitHub](https://github.com/bytedance/OneReward) [Project page](https://one-reward.github.io/)
<br>
This repo contains the checkpoint from [OneReward](https://huggingface.co/bytedance-research/OneReward) processed into a single model suitable for ComfyUI use.
**OneReward** is a novel RLHF methodology for the visual domain that employs Qwen2.5-VL as a generative reward model to enhance multitask reinforcement learning, significantly improving the policy model’s generation ability across multiple subtasks. Building on OneReward, **FLUX.1-Fill-dev-OneReward**, which is based on FLUX Fill [dev], outperforms the closed-source FLUX Fill [Pro] in inpainting and outpainting tasks, serving as a powerful new baseline for future research in unified image editing.
For more details and examples, see the original model repo: [**OneReward**](https://huggingface.co/bytedance-research/OneReward)