Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

lapp0
/
distily_bench_obj_cross_v2.15_gpt2

TensorBoard
Safetensors
Distily
gpt2
Generated from Trainer
8-bit precision
bitsandbytes
Model card Files Files and versions
xet
Metrics Training metrics Community
distily_bench_obj_cross_v2.15_gpt2 / logs
82.3 MB
  • 1 contributor
History: 14 commits

This model has 1 file scanned as unsafe.

lapp0's picture
lapp0
End of training
e7564aa verified over 1 year ago
  • copy_teacher_modules=_(_lm_head___True)_, hs_layer_mapper=last, hs_loss_fn=mse, hs_weight=1.0, learning_rate=0.0001, per_device_train_batch_size=4
    Training in progress, step 24750 over 1 year ago
  • copy_teacher_modules=_(_lm_head___True)_, hs_layer_mapper=last, hs_loss_fn=mse, hs_weight=1.0
    Training in progress, step 12375 over 1 year ago
  • dataset_subset=default, dataset_uri=distily_c4_multilingual_1M, learning_rate=0.0001, per_device_train_batch_size=4
    Training in progress, step 24750 over 1 year ago
  • dataset_subset=default, dataset_uri=distily_c4_multilingual_1M
    Training in progress, step 24750 over 1 year ago
  • hs_layer_mapper=last, hs_loss_fn=mse, hs_weight=1.0, learning_rate=0.0001, per_device_train_batch_size=4
    End of training over 1 year ago
  • hs_layer_mapper=last, hs_loss_fn=mse, hs_weight=1.0
    End of training over 1 year ago
  • learning_rate=0.0001, per_device_train_batch_size=4, reinitialize_weights=xavier
    Training in progress, step 24750 over 1 year ago
  • learning_rate=0.0001, per_device_train_batch_size=4
    End of training over 1 year ago
  • completed.flag
    0 Bytes
    End of training over 1 year ago
  • events.out.tfevents.1724126966.02dbb11e2dcc
    5.85 MB
    xet
    End of training over 1 year ago
  • events.out.tfevents.1724131158.02dbb11e2dcc
    578 Bytes
    xet
    End of training over 1 year ago