nvidia
/

OpenMath2-Llama3.1-70B

Text Generation

text-generation-inference

Model card Files Files and versions

Add text-generation pipeline tag

#3

by nielsr HF Staff - opened Apr 6, 2025

base: refs/heads/main

←

from: refs/pr/3

Discussion Files changed

Files changed (1) hide show

README.md +8 -5

README.md CHANGED Viewed

@@ -1,15 +1,16 @@
 ---
-license: llama3.1
 base_model:
 - meta-llama/Llama-3.1-70B
 datasets:
 - nvidia/OpenMathInstruct-2
 language:
 - en
 tags:
 - nvidia
 - math
-library_name: transformers
 ---
 # OpenMath2-Llama3.1-70B
@@ -23,10 +24,10 @@ The model outperforms [Llama3.1-70B-Instruct](https://huggingface.co/meta-llama/
 | Model | GSM8K | MATH | AMC 2023 | AIME 2024 | Omni-MATH |
 |:---|:---:|:---:|:---:|:---:|:---:|
 | Llama3.1-8B-Instruct | 84.5 | 51.9 | 9/40 | 2/30 | 12.7 |
-| OpenMath2-Llama3.1-8B ([nemo](https://huggingface.co/nvidia/OpenMath2-Llama3.1-8B-nemo) \| [HF](https://huggingface.co/nvidia/OpenMath2-Llama3.1-8B)) | 91.7 | 67.8 | 16/40 | 3/30 | 22.0 |
 | + majority@256 | 94.1 | 76.1 | 23/40 | 3/30 | 24.6 |
 | Llama3.1-70B-Instruct | 95.8 | 67.9 | 19/40 | 6/30 | 19.0 |
-| **OpenMath2-Llama3.1-70B** ([nemo](https://huggingface.co/nvidia/OpenMath2-Llama3.1-70B-nemo) \| [HF](https://huggingface.co/nvidia/OpenMath2-Llama3.1-70B)) | 94.9 | 71.9 | 20/40 | 4/30 | 23.1 |
 | + majority@256 | 96.0 | 79.6 | 24/40 | 6/30 | 27.6 |
 The pipeline we used to produce the data and models is fully open-sourced!
@@ -61,7 +62,9 @@ pipeline = transformers.pipeline(
 messages = [
     {
         "role": "user",
-        "content": "Solve the following math problem. Make sure to put the answer (and only answer) inside \\boxed{}.\n\n" +
         "What is the minimum value of $a^2+6a-7$?"},
 ]

 ---
 base_model:
 - meta-llama/Llama-3.1-70B
 datasets:
 - nvidia/OpenMathInstruct-2
 language:
 - en
+library_name: transformers
+license: llama3.1
 tags:
 - nvidia
 - math
+pipeline_tag: text-generation
 ---
 # OpenMath2-Llama3.1-70B
 | Model | GSM8K | MATH | AMC 2023 | AIME 2024 | Omni-MATH |
 |:---|:---:|:---:|:---:|:---:|:---:|
 | Llama3.1-8B-Instruct | 84.5 | 51.9 | 9/40 | 2/30 | 12.7 |
+| OpenMath2-Llama3.1-8B ([nemo](https://huggingface.co/nvidia/OpenMath2-Llama3.1-8B-nemo) | [HF](https://huggingface.co/nvidia/OpenMath2-Llama3.1-8B)) | 91.7 | 67.8 | 16/40 | 3/30 | 22.0 |
 | + majority@256 | 94.1 | 76.1 | 23/40 | 3/30 | 24.6 |
 | Llama3.1-70B-Instruct | 95.8 | 67.9 | 19/40 | 6/30 | 19.0 |
+| **OpenMath2-Llama3.1-70B** ([nemo](https://huggingface.co/nvidia/OpenMath2-Llama3.1-70B-nemo) | [HF](https://huggingface.co/nvidia/OpenMath2-Llama3.1-70B)) | 94.9 | 71.9 | 20/40 | 4/30 | 23.1 |
 | + majority@256 | 96.0 | 79.6 | 24/40 | 6/30 | 27.6 |
 The pipeline we used to produce the data and models is fully open-sourced!
 messages = [
     {
         "role": "user",
+        "content": "Solve the following math problem. Make sure to put the answer (and only answer) inside \\boxed{}.
+" +
         "What is the minimum value of $a^2+6a-7$?"},
 ]