yichengup linoyts HF Staff commited on
Commit
7677901
·
verified ·
1 Parent(s): 7cfdf72
Files changed (2) hide show
  1. .gitattributes +1 -0
  2. README.md +16 -3
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ show.jpg filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -1,9 +1,22 @@
1
  ---
 
2
  base_model:
 
3
  - bytedance-research/OneReward
 
 
 
4
  ---
5
- flux.1-fill-dev-OneReward
6
 
7
- Process the model into a single model suitable for ComfyUI use
8
 
9
- Original model link: [OneReward](https://huggingface.co/bytedance-research/OneReward)
 
 
 
 
 
 
 
 
 
1
  ---
2
+ license: cc-by-nc-4.0
3
  base_model:
4
+ - black-forest-labs/FLUX.1-Fill-dev
5
  - bytedance-research/OneReward
6
+ language:
7
+ - en
8
+ pipeline_tag: image-to-image
9
  ---
10
+ # OneReward - ComfyUI
11
 
12
+ **ComfyUI community** checkpoint for **[OneReward: Unified Mask-Guided Image Generation via Multi-Task Human Preference Learning](https://arxiv.org/abs/xxxx)**.
13
 
14
+ [![arXiv](https://img.shields.io/badge/arXiv-Paper-<COLOR>.svg)](https://arxiv.org/abs/2508.21066) [![GitHub Repo](https://img.shields.io/badge/GitHub-Repo-green?logo=github)](https://github.com/bytedance/OneReward) [![GitHub Pages](https://img.shields.io/badge/GitHub-Project-blue?logo=github)](https://one-reward.github.io/)
15
+ <br>
16
+
17
+ This repo contains the checkpoint from [OneReward](https://huggingface.co/bytedance-research/OneReward) processed into a single model suitable for ComfyUI use.
18
+
19
+ **OneReward** is a novel RLHF methodology for the visual domain by employing Qwen2.5-VL as a generative reward model to enhance multitask reinforcement learning, significantly improving the policy model’s generation ability across multiple subtask. Building on OneReward, **FLUX.1-Fill-dev-OneReward** - based on FLUX Fill [dev], outperforms closed-source FLUX Fill [Pro] in inpainting and outpainting tasks, serving as a powerful new baseline for future research in unified image editing.
20
+
21
+
22
+ For more details and examples see original model repo: [**OneReward**](https://huggingface.co/bytedance-research/OneReward)