Update model card with benchmark result plotting
Browse files
README.md
CHANGED
|
@@ -72,4 +72,19 @@ model-index:
|
|
| 72 |
verified: false
|
| 73 |
---
|
| 74 |
# Control-LLM-Llama3.1-8B-Math16
|
| 75 |
-
This is a fine-tuned model of Llama-3.1-8B-Instruct for mathematical tasks on OpenMath2 dataset.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 72 |
verified: false
|
| 73 |
---
|
| 74 |
# Control-LLM-Llama3.1-8B-Math16
|
| 75 |
+
This is a fine-tuned model of Llama-3.1-8B-Instruct for mathematical tasks on OpenMath2 dataset.
|
| 76 |
+
|
| 77 |
+
## Evaluation Results
|
| 78 |
+
Here is an overview of the evaluation results and findings:
|
| 79 |
+
|
| 80 |
+
### Benchmark Result and Catastrophic Forgetting on OpenMath
|
| 81 |
+
The following plot illustrates benchmark result and catastrophic forgetting mitigation on the OpenMath2 dataset.
|
| 82 |
+
|
| 83 |
+

|
| 84 |
+
|
| 85 |
+
### Alignment Comparison
|
| 86 |
+
The plot below highlights the alignment comparison of the model trained with Control LLM and Full Parameter Tuning.
|
| 87 |
+
|
| 88 |
+

|
| 89 |
+
|
| 90 |
+
|