Update README.md (#11)
- Update README.md (48f90eee14522c334ec22be8502aa251b16964bb)
README.md
CHANGED
@@ -244,6 +244,21 @@ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-le
 
 
 
+
+**Leaderboard 2.0:**
+
+| Metric |Value|
+|-------------------|----:|
+|Avg. |26.75|
+|IFEval (0-Shot) |75.12|
+|BBH (3-Shot) |28.27|
+|MATH Lvl 5 (4-Shot)| 7.10|
+|GPQA (0-shot) | 7.38|
+|MuSR (0-shot) |10.92|
+|MMLU-PRO (5-shot) |31.68|
+
+
+
 # Prompt Template
 
 This model uses `ChatML` prompt template:
@@ -320,16 +335,4 @@ outputs = pipeline(
 print(outputs[0]["generated_text"][len(prompt):])
 ```
 
-# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
-Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_MaziyarPanahi__Llama-3-8B-Instruct-v0.8)
-
-| Metric |Value|
-|-------------------|----:|
-|Avg. |26.75|
-|IFEval (0-Shot) |75.12|
-|BBH (3-Shot) |28.27|
-|MATH Lvl 5 (4-Shot)| 7.10|
-|GPQA (0-shot) | 7.38|
-|MuSR (0-shot) |10.92|
-|MMLU-PRO (5-shot) |31.68|
 