Kundyzka
/

t5-kazakh-qa-informatics-kaz

Question Answering

Model card Files Files and versions

Kundyzka commited on Jan 28

Commit

99327d1

·

verified ·

1 Parent(s): b21d8dd

Update README.md

Files changed (1) hide show

README.md +24 -12

README.md CHANGED Viewed

@@ -5,16 +5,16 @@ datasets:
 language:
 - kk
 metrics:
-  - name: F1
-    type: F1 Score (Validation)
     value: 31.405
-  - name: Exact Match (Validation)
     type: Exact Match
     value: 14.675
-  - name: F1 (Test)
     type: F1 Score
     value: 56.819
-  - name: Exact Match (Test)
     type: Exact Match
     value: 35.454
 base_model:
@@ -33,13 +33,18 @@ This model was developed by **Kundyz Maksutova**, PhD Candidate, as part of rese
 - **Dataset**: `Kundyzka/informatics_kaz`
 - **Language**: Kazakh (`kk`)
 - **Task**: Question Answering
-- **Performance**:
-  - **Validation**:
-    - F1 Score: 31.405
-    - Exact Match: 14.675
-  - **Test**:
-    - F1 Score: 56.819
-    - Exact Match: 35.454
 ### Dataset:
 The `Kundyzka/informatics_kaz` dataset is curated to provide a diverse set of questions and answers in Kazakh, primarily targeting topics in computer science. This dataset ensures the model handles domain-specific terminology effectively.
@@ -54,3 +59,10 @@ This model is designed for answering questions in the Kazakh language, with appl
 - **Domain-Specific Bias**: Performance may drop on topics outside computer science.
 - **Dataset Bias**: Potential biases from the dataset can influence model outputs.
 - **Language Support**: The model is optimized for Kazakh and does not support other languages.

 language:
 - kk
 metrics:
+  - name: F1 (Before Training)
+    type: F1 Score
     value: 31.405
+  - name: Exact Match (Before Training)
     type: Exact Match
     value: 14.675
+  - name: F1 (After Training)
     type: F1 Score
     value: 56.819
+  - name: Exact Match (After Training)
     type: Exact Match
     value: 35.454
 base_model:
 - **Dataset**: `Kundyzka/informatics_kaz`
 - **Language**: Kazakh (`kk`)
 - **Task**: Question Answering
+### Performance:
+This model demonstrates significant improvements after fine-tuning, as shown by the following metrics:
+- **Before Training**:
+  - F1 Score: 31.405
+  - Exact Match (EM): 14.675
+- **After Training**:
+  - F1 Score: 56.819
+  - Exact Match (EM): 35.454
+These metrics highlight the enhanced ability of the model to handle domain-specific questions after training on the `Kundyzka/informatics_kaz` dataset.
 ### Dataset:
 The `Kundyzka/informatics_kaz` dataset is curated to provide a diverse set of questions and answers in Kazakh, primarily targeting topics in computer science. This dataset ensures the model handles domain-specific terminology effectively.
 - **Domain-Specific Bias**: Performance may drop on topics outside computer science.
 - **Dataset Bias**: Potential biases from the dataset can influence model outputs.
 - **Language Support**: The model is optimized for Kazakh and does not support other languages.
+### Tags:
+- `computerscience`
+- `question-answering`
+- `Kazakh`
+This model represents a significant step toward advancing natural language processing tools for low-resource languages like Kazakh. For further details or customization, refer to the model repository.