Shiyu-Lab/HarnessLLM_SFT_Llama3_3B
4B
•
Updated
•
4
Shiyu-Lab/Inputoutput_SFT_Llama3_3B
4B
•
Updated
•
5
Shiyu-Lab/Inputoutput_SFT_Qwen3_4B
4B
•
Updated
•
5
Shiyu-Lab/HarnessLLM_SFT_Qwen3_4B
4B
•
Updated
•
6
Shiyu-Lab/Inputoutput_RL_Llama3_3B
4B
•
Updated
•
6
Shiyu-Lab/HarnessLLM_RL_Llama3_3B
4B
•
Updated
•
7
Shiyu-Lab/Inputoutput_RL_Qwen3_4B
4B
•
Updated
•
5
Shiyu-Lab/HarnessLLM_RL_Qwen3_4B
4B
•
Updated
•
9
Shiyu-Lab/QwQ-32B-thinkprune-iter2k
Text Generation
•
33B
•
Updated
•
8
Shiyu-Lab/DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-3k
Text Generation
•
2B
•
Updated
•
11
Shiyu-Lab/QwQ-32B-thinkprune-iter3k
Text Generation
•
33B
•
Updated
•
3
Shiyu-Lab/QwQ-32B-thinkprune-2k
Text Generation
•
33B
•
Updated
•
6
Shiyu-Lab/QwQ-32B-thinkprune-3k
Text Generation
•
33B
•
Updated
•
7
Shiyu-Lab/QwQ-32B-thinkprune-4k
Text Generation
•
33B
•
Updated
•
7
Shiyu-Lab/DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-iter2k
Text Generation
•
2B
•
Updated
•
47
•
1
Shiyu-Lab/DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-2k
Text Generation
•
2B
•
Updated
•
12
Shiyu-Lab/DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-4k
Text Generation
•
2B
•
Updated
•
15
Shiyu-Lab/DeepScaleR-1.5B-Preview-thinkprune-4k
Text Generation
•
2B
•
Updated
•
12
Shiyu-Lab/DeepScaleR-1.5B-Preview-thinkprune-3k
Text Generation
•
2B
•
Updated
•
7
Shiyu-Lab/DeepScaleR-1.5B-Preview-thinkprune-2k
Text Generation
•
2B
•
Updated
•
5
Shiyu-Lab/DeepScaleR-1.5B-Preview-thinkprune-iter2k
Text Generation
•
2B
•
Updated
•
8
Shiyu-Lab/DeepScaleR-1.5B-Preview-thinkprune-iter3k
Text Generation
•
2B
•
Updated
•
5
Shiyu-Lab/DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-iter3k
Text Generation
•
2B
•
Updated
•
21
Shiyu-Lab/roberta-base-watermark-embed
0.1B
•
Updated
•
109
Shiyu-Lab/Llama3B-KVLink5
4B
•
Updated
•
8
Shiyu-Lab/Llama1B-KVLink5
1B
•
Updated
•
24
Shiyu-Lab/Prereq-Tune_medical
Updated
Shiyu-Lab/Prereq-Tune_hotpotqa
Updated
Shiyu-Lab/Prereq-Tune_popqa
Updated
Shiyu-Lab/Prereq-Tune_bio
Updated