Omni-Reward: Towards Generalist Omni-Modal Reward Modeling with Free-Form Preferences
Zhuoran Jin
jinzhuoran
AI & ML interests
NLP
Recent Activity
upvoted
a
paper
4 days ago
Dancing in Chains: Strategic Persuasion in Academic Rebuttal via Theory of Mind
updated
a model
8 days ago
jinzhuoran/Qwen3-4B-Instruct-16Env
published
a model
9 days ago
jinzhuoran/Qwen3-4B-Instruct-16Env
Organizations
None yet