AI & ML interests
None yet
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-0.0001_neftune_alpha-10 • Text Generation • 1B • Updated • 11
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-0.0001_neftune_alpha-5 • Text Generation • 1B • Updated • 9
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-0.0001_neftune_alpha-0 • Text Generation • 1B • Updated • 9
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-3e-05_neftune_alpha-10 • Text Generation • 1B • Updated • 7
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-3e-05_neftune_alpha-5 • Text Generation • 1B • Updated • 11
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-3e-05_neftune_alpha-0 • Text Generation • 1B • Updated • 8
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-1e-05_neftune_alpha-10 • Text Generation • 1B • Updated • 9
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-1e-05_neftune_alpha-5 • Text Generation • 1B • Updated • 15
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-1e-05_neftune_alpha-0 • Text Generation • 1B • Updated • 8
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-0.0001_neftune_alpha-10 • Text Generation • 1B • Updated • 12
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-0.0001_neftune_alpha-5 • Text Generation • 1B • Updated • 10
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-0.0001_neftune_alpha-0 • Text Generation • 1B • Updated • 7
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-3e-05_neftune_alpha-10 • Text Generation • 1B • Updated • 11
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-3e-05_neftune_alpha-5 • Text Generation • 1B • Updated • 6
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-3e-05_neftune_alpha-0 • Text Generation • 1B • Updated • 7
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-1e-05_neftune_alpha-10 • Text Generation • 1B • Updated • 5
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-1e-05_neftune_alpha-5 • Text Generation • 1B • Updated • 7
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-1e-05_neftune_alpha-0 • Text Generation • 1B • Updated • 7
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-0.0001_neftune_alpha-5 • Text Generation • 1B • Updated • 8
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-0.0001_neftune_alpha-0 • Text Generation • 1B • Updated • 8
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-3e-05_neftune_alpha-10 • Text Generation • 1B • Updated • 6
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-3e-05_neftune_alpha-5 • Text Generation • 1B • Updated • 6
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-3e-05_neftune_alpha-0 • Text Generation • 1B • Updated • 5
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-1e-05_neftune_alpha-10 • Text Generation • 1B • Updated • 9
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-1e-05_neftune_alpha-5 • Text Generation • 1B • Updated • 7
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-1e-05_neftune_alpha-0 • Text Generation • 1B • Updated • 5
Text Classification • 67M • Updated • 5
Text Generation • 0.4B • Updated • 6
Mlxa/TinyStories-8M-DPO-2 • Text Generation • 19.7M • Updated • 6
Text Generation • 19.7M • Updated • 11