princeton-nlp
RLMT Experiments
Updated Sep 24
The *RLMT* collection. Coming soon!
princeton-nlp/warm-start__sft__think__Llama-3.1-8B-Instruct • 8B • Updated Sep 22 • 14
princeton-nlp/warm-start__sft__nothink__Qwen2.5-7B-Instruct • 8B • Updated Sep 22 • 16
princeton-nlp/warm-start__sft__think__Llama-3.1-8B • 8B • Updated Sep 22 • 12
princeton-nlp/warm-start__sft__think__Qwen2.5-7B • 8B • Updated Sep 22 • 20
princeton-nlp/warm-start__sft__nothink__Llama-3.1-8B-Instruct • 8B • Updated Sep 22 • 13
princeton-nlp/warm-start__sft__think__Qwen2.5-7B-Instruct • 8B • Updated Sep 22 • 16
princeton-nlp/warm-start__sft__nothink__Llama-3.1-8B • 8B • Updated Sep 22 • 14
princeton-nlp/warm-start__sft__nothink__Qwen2.5-7B • 8B • Updated Sep 22 • 13
princeton-nlp/warm-start__dpo__think__Llama-3.1-8B • 8B • Updated Sep 22 • 11
princeton-nlp/warm-start__dpo__think__Qwen2.5-7B • 8B • Updated Sep 22 • 26
princeton-nlp/warm-start__dpo__think__Llama-3.1-8B-Instruct • 8B • Updated Sep 22 • 14
princeton-nlp/warm-start__dpo__think__Qwen2.5-7B-Instruct • 8B • Updated Sep 22 • 12
princeton-nlp/warm-start__dpo__nothink__Llama-3.1-8B • 8B • Updated Sep 22 • 12
princeton-nlp/warm-start__dpo__nothink__Qwen2.5-7B • 8B • Updated Sep 22 • 12
princeton-nlp/warm-start__dpo__nothink__Llama-3.1-8B-Instruct • 8B • Updated Sep 22 • 11
princeton-nlp/warm-start__dpo__nothink__Qwen2.5-7B-Instruct • 8B • Updated Sep 22 • 12
princeton-nlp/warm-start__ppo__think__Qwen2.5-7B • 8B • Updated Sep 22 • 12
princeton-nlp/warm-start__ppo__think__Llama-3.1-8B-Instruct • 8B • Updated Sep 22 • 12
princeton-nlp/warm-start__ppo__think__Qwen2.5-7B-Instruct • 8B • Updated Sep 22 • 13
princeton-nlp/warm-start__ppo__nothink__Llama-3.1-8B • 8B • Updated Sep 22 • 12
princeton-nlp/warm-start__ppo__nothink__Qwen2.5-7B • 8B • Updated Sep 22 • 12
princeton-nlp/warm-start__ppo__nothink__Llama-3.1-8B-Instruct • 8B • Updated Sep 22 • 13
princeton-nlp/warm-start__ppo__nothink__Qwen2.5-7B-Instruct • 8B • Updated Sep 22 • 12
princeton-nlp/zero__base__think__Llama-3.1-8B • 8B • Updated Sep 22 • 12
princeton-nlp/zero__base__nothink__Llama-3.1-8B • 8B • Updated Sep 22 • 12
princeton-nlp/zero__base__think__Qwen2.5-7B • 8B • Updated Sep 22 • 11
princeton-nlp/zero__base__nothink__Qwen2.5-7B • 8B • Updated Sep 22 • 13
princeton-nlp/zero__dpo__think__Llama-3.1-8B • 8B • Updated Sep 22 • 10
princeton-nlp/zero__dpo__think__Qwen2.5-7B • 8B • Updated Sep 22 • 11
princeton-nlp/zero__dpo__nothink__Llama-3.1-8B • 8B • Updated Sep 22 • 11
princeton-nlp/zero__dpo__nothink__Qwen2.5-7B • 8B • Updated Sep 22 • 11
princeton-nlp/zero__ppo__think__Llama-3.1-8B • 8B • Updated Sep 22 • 12
princeton-nlp/zero__ppo__think__Qwen2.5-7B • 8B • Updated Sep 22 • 11
princeton-nlp/warm-start__ppo__think__Llama-3.1-8B • 8B • Updated Sep 22 • 11
princeton-nlp/zero__ppo__nothink__Qwen2.5-7B • 8B • Updated Sep 22 • 11
princeton-nlp/zero__grpo__think__Llama-3.1-8B • 8B • Updated Sep 22 • 11
princeton-nlp/zero__grpo__think__Qwen2.5-7B • 8B • Updated Sep 22 • 12
princeton-nlp/zero__grpo__nothink__Llama-3.1-8B • 8B • Updated Sep 22 • 11
princeton-nlp/zero__grpo__nothink__Qwen2.5-7B • 8B • Updated Sep 22 • 12
princeton-nlp/rl_tulu3_wildchat-if_prompts • Viewer • Updated Sep 22 • 7.79k • 133 • 3
princeton-nlp/gemini_2.5_flash_0417_sft-data • Viewer • Updated Sep 22 • 6k • 154 • 1
princeton-nlp/warm-start__grpo__nothink__Qwen2.5-7B-Instruct • 8B • Updated Sep 23 • 12
princeton-nlp/warm-start__grpo__nothink__Llama-3.1-8B-Instruct • 8B • Updated Sep 23 • 11
princeton-nlp/warm-start__grpo__nothink__Qwen2.5-7B • 8B • Updated Sep 23 • 11
princeton-nlp/warm-start__grpo__nothink__Llama-3.1-8B • 8B • Updated Sep 23 • 12
princeton-nlp/warm-start__grpo__think__Qwen2.5-7B-Instruct • 8B • Updated Sep 23 • 18
princeton-nlp/warm-start__grpo__think__Llama-3.1-8B-Instruct • 8B • Updated Sep 23 • 23
princeton-nlp/warm-start__grpo__think__Qwen2.5-7B • 8B • Updated Sep 23 • 15
princeton-nlp/warm-start__grpo__think__Llama-3.1-8B • 8B • Updated Sep 23 • 11