Paper: BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation (arXiv:2402.10631)

| PPL | arc_easy | arc_challenge | piqa | winogrande | hellaswag | mmlu | QA Avg |
|---|---|---|---|---|---|---|---|
| 4082.93 | 26.26 ± 0.90 | 22.44 ± 1.22 | 52.45 ± 1.17 | 51.78 ± 1.40 | 25.88 ± 0.44 | - | 35.76 |
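
The QA Avg column appears to be the unweighted mean of the five reported task accuracies, with mmlu left out because it has no score. The sketch below checks that reading against the table; the averaging rule is an assumption inferred from the numbers, and `qa_average` is a hypothetical helper name, not part of any evaluation library.

```python
# Accuracies (%) copied from the results table above.
# mmlu is "-" (not reported), so it is excluded from the average.
scores = {
    "arc_easy": 26.26,
    "arc_challenge": 22.44,
    "piqa": 52.45,
    "winogrande": 51.78,
    "hellaswag": 25.88,
}


def qa_average(task_scores):
    """Unweighted mean of the task accuracies, rounded to two decimals."""
    return round(sum(task_scores.values()) / len(task_scores), 2)


qa_avg = qa_average(scores)
print(qa_avg)
```

Under this assumption the computed value matches the 35.76 reported in the table.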
Training method: based on the BitDistiller paper.
Base model: TinyLlama/TinyLlama_v1.1