CriteriaPO/qwen2.5-3b-orpo-coarse-2e • 3B
CriteriaPO/llama3.2-3b-orpo-vanilla
CriteriaPO/llama3.2-3b-orpo-coarse
CriteriaPO/llama3.2-3b-orpo-finegrained
CriteriaPO/qwen2.5-3b-orpo-mini
CriteriaPO/qwen2.5-3b-orpo-mini-2e • Text Generation • 3B • 7
CriteriaPO/qwen2.5-3b-dpo-coarse-40-vanilla • Text Generation • 3B • 3
CriteriaPO/qwen2.5-3b-dpo-finegrained-5-vanilla • Text Generation • 3B • 5
CriteriaPO/qwen2.5-3b-dpo-finegrained-40-vanilla • Text Generation • 3B • 7
CriteriaPO/qwen2.5-3b-dpo-finegrained-20-vanilla • Text Generation • 3B • 5
CriteriaPO/qwen2.5-3b-dpo-finegrained-10-vanilla • Text Generation • 3B • 5
CriteriaPO/qwen2.5-3b-dpo-coarse-20-vanilla • Text Generation • 3B • 8
CriteriaPO/qwen2.5-3b-dpo-coarse-10-vanilla • Text Generation • 3B • 8
CriteriaPO/qwen2.5-3b-dpo-coarse-5-vanilla • Text Generation • 3B • 6
CriteriaPO/qwen2.5-3b-dpo-mini-40-vanilla • Text Generation • 3B • 5
CriteriaPO/qwen2.5-3b-dpo-mini-20-vanilla • Text Generation • 3B • 6
CriteriaPO/qwen2.5-3b-dpo-mini-5-vanilla • Text Generation • 3B • 3
CriteriaPO/qwen2.5-3b-dpo-mini-10-vanilla • Text Generation • 3B • 6
CriteriaPO/qwen2.5-3b-dpo-finegrained • Text Generation • 3B • 72
CriteriaPO/qwen2.5-3b-dpo-coarse • Text Generation • 3B • 33
CriteriaPO/qwen2.5-3b-dpo-mini • Text Generation • 3B • 57
CriteriaPO/qwen2.5-3b-dpo-vanilla • Text Generation • 3B • 57
CriteriaPO/llama3.2-3b-orpo-vanilla-2e • Text Generation • 3B • 24
CriteriaPO/llama3.2-3b-orpo-finegrained-2e • Text Generation • 3B • 19
CriteriaPO/llama3.2-3b-orpo-coarse-2e • Text Generation • 3B • 16
CriteriaPO/llama3.2-3b-dpo-coarse • Text Generation • 3B • 80
CriteriaPO/llama3.2-3b-dpo-finegrained • Text Generation • 3B • 51
CriteriaPO/llama3.2-3b-dpo-vanilla • Text Generation • 3B • 137
CriteriaPO/llama3.2-3b-dpo-mini • Text Generation • 3B • 121
CriteriaPO/qwen2.5-3b-sft-10 • Text Generation • 3B • 187