Added results
Browse files
README.md
CHANGED
|
@@ -6,6 +6,18 @@ pipeline_tag: text-generation
|
|
| 6 |
dtype: bfloat16
|
| 7 |
---
|
| 8 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 9 |
# Edit/Disclaimer:
|
| 10 |
Currently the #1 ranked 7B LLM on the LLM Leaderboards, woah!
|
| 11 |
I did not expect that result at all and am in no way a professional when it comes to LLM's or computer science in general,
|
|
|
|
| 6 |
dtype: bfloat16
|
| 7 |
---
|
| 8 |
|
| 9 |
+
|
| 10 |
+
# Results:
|
| 11 |
+
T: 🟦
|
| 12 |
+
Model: CultriX/MistralTrix-v1 📑
|
| 13 |
+
Average: 73.39
|
| 14 |
+
ARC: 72.27
|
| 15 |
+
HellaSwag: 88.33
|
| 16 |
+
MMLU: 65.24
|
| 17 |
+
TruthfulQA: 70.73
|
| 18 |
+
Winogrande: 80.98
|
| 19 |
+
GSM8K: 62.77
|
| 20 |
+
|
| 21 |
# Edit/Disclaimer:
|
| 22 |
Currently the #1 ranked 7B LLM on the LLM Leaderboards, woah!
|
| 23 |
I did not expect that result at all and am in no way a professional when it comes to LLM's or computer science in general,
|