Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ShethArihant
/
deepseek-coder-1.3b-instruct_sft-v4-with-setup_3-epochs_ce-0.8_triplet-0.2_lora2
like
0
Transformers
Safetensors
Generated from Trainer
trl
sft
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
deepseek-coder-1.3b-instruct_sft-v4-with-setup_3-epochs_ce-0.8_triplet-0.2_lora2
62.3 MB
1 contributor
History:
2 commits
ShethArihant
Training in progress, step 198
3115b54
verified
15 days ago
.gitattributes
Safe
1.52 kB
initial commit
15 days ago
README.md
1.96 kB
Training in progress, step 198
15 days ago
adapter_config.json
952 Bytes
Training in progress, step 198
15 days ago
adapter_model.safetensors
60 MB
xet
Training in progress, step 198
15 days ago
chat_template.jinja
Safe
1.04 kB
Training in progress, step 198
15 days ago
special_tokens_map.json
Safe
462 Bytes
Training in progress, step 198
15 days ago
tokenizer.json
Safe
2.29 MB
Training in progress, step 198
15 days ago
tokenizer_config.json
Safe
4.3 kB
Training in progress, step 198
15 days ago
training_args.bin
pickle
Detected Pickle imports (10)
"transformers.trainer_utils.IntervalStrategy"
,
"transformers.training_args.OptimizerNames"
,
"transformers.trainer_pt_utils.AcceleratorConfig"
,
"accelerate.state.PartialState"
,
"transformers.trainer_utils.SaveStrategy"
,
"accelerate.utils.dataclasses.DistributedType"
,
"torch.device"
,
"trl.trainer.sft_config.SFTConfig"
,
"transformers.trainer_utils.HubStrategy"
,
"transformers.trainer_utils.SchedulerType"
How to fix it?
6.42 kB
xet
Training in progress, step 198
15 days ago