distilbert-base-uncased Quora Duplicate Pair Detection

This model is a fine-tuned version of distilbert-base-uncased on the quora dataset for detecting duplicate sentences or questions.

Loss: 0.111300
Accuracy: 0.900740
F1 Score: 0.868633
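
Below is a minimal inference sketch using the transformers sequence-classification API. The checkpoint id is a placeholder for wherever this model is hosted, and the label mapping (index 1 = duplicate) is an assumption to verify against the checkpoint's config.json.

```python
# Minimal inference sketch with Hugging Face transformers.
# NOTE: the checkpoint id is a placeholder, and the label mapping
# (index 1 = "duplicate") is an assumption -- check config.json.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

checkpoint = "your-username/distilbert-base-uncased-quora-duplicates"  # placeholder
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForSequenceClassification.from_pretrained(checkpoint)

q1 = "How do I learn Python quickly?"
q2 = "What is the fastest way to learn Python?"

# Encode the two questions as a single sentence pair.
inputs = tokenizer(q1, q2, truncation=True, return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits

probs = torch.softmax(logits, dim=-1)[0]
print(f"P(duplicate) = {probs[1]:.3f}")
```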

Model Description:
DistilBERT is a distilled form of the BERT model. Knowledge distillation during the pre-training phase reduced the size of the BERT model by 40% while retaining 97% of its language understanding abilities and making it 60% faster.

Training and evaluation data:
The model was fine-tuned and evaluated on the "quora" dataset of question pairs labeled as duplicates or not.

Training Hyper Parameters:
learning_rate = 3e-4
per_device_train_batch_size = 32
per_device_eval_batch_size = 32
num_train_epochs = 4
evaluation_strategy = "epoch"
seed = 42
optimizer: Adam with betas=(0.9, 0.999) and epsilon=1e-08
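
A sketch of how these settings map onto a transformers Trainer run follows. Only the TrainingArguments values come from the card; the output directory, data split, and preprocessing are illustrative assumptions. The listed Adam betas and epsilon match the Trainer's default optimizer, so no explicit optimizer configuration is needed.

```python
# Training sketch matching the listed hyperparameters.
# NOTE: the 90/10 split, output_dir, and preprocessing details are
# assumptions; only the TrainingArguments values come from the card.
from datasets import load_dataset
from transformers import (
    AutoModelForSequenceClassification,
    AutoTokenizer,
    Trainer,
    TrainingArguments,
)

tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "distilbert-base-uncased", num_labels=2
)

def tokenize(batch):
    # Each "questions" entry holds the two questions of a pair.
    first = [q["text"][0] for q in batch["questions"]]
    second = [q["text"][1] for q in batch["questions"]]
    enc = tokenizer(first, second, truncation=True)
    enc["label"] = [int(flag) for flag in batch["is_duplicate"]]
    return enc

splits = load_dataset("quora")["train"].train_test_split(test_size=0.1, seed=42)
tokenized = splits.map(
    tokenize, batched=True, remove_columns=splits["train"].column_names
)

args = TrainingArguments(
    output_dir="distilbert-quora-duplicates",  # assumed name
    learning_rate=3e-4,
    per_device_train_batch_size=32,
    per_device_eval_batch_size=32,
    num_train_epochs=4,
    evaluation_strategy="epoch",
    seed=42,
    # Trainer's default AdamW optimizer already uses
    # betas=(0.9, 0.999) and epsilon=1e-08, as listed above.
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=tokenized["train"],
    eval_dataset=tokenized["test"],
    tokenizer=tokenizer,  # enables dynamic padding via the default collator
)
trainer.train()
```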

Training Results:

| Epoch | Training Loss | Validation Loss | Accuracy | F1       |
|-------|---------------|-----------------|----------|----------|
| 1     | 0.271500      | 0.264808        | 0.884909 | 0.844402 |
| 2     | 0.191200      | 0.258109        | 0.896399 | 0.866099 |
| 3     | 0.111300      | 0.315554        | 0.900740 | 0.868633 |