RedHatAI/Meta-Llama-3.1-405B-Instruct-quantized.w4a16 Text Generation • 409B • Updated Oct 10, 2024 • 236 • 12
RedHatAI/Meta-Llama-3.1-70B-Instruct-quantized.w4a16 Text Generation • 71B • Updated Feb 12, 2025 • 115k • 33
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w4a16 Text Generation • 8B • Updated May 5 • 63.9k • 30
RedHatAI/Mistral-Nemo-Instruct-2407-quantized.w4a16 Text Generation • 12B • Updated Oct 9, 2024 • 857 • 4
RedHatAI/Phi-3-medium-128k-instruct-quantized.w4a16 Text Generation • 14B • Updated Oct 9, 2024 • 1.72k • 3
RedHatAI/Phi-3-mini-128k-instruct-quantized.w4a16 Text Generation • 4B • Updated Oct 9, 2024 • 45 • 1
RedHatAI/Meta-Llama-3-8B-Instruct-quantized.w4a16 Text Generation • 8B • Updated Jul 18, 2024 • 1.21k • 2
RedHatAI/Meta-Llama-3-70B-Instruct-quantized.w4a16 Text Generation • 71B • Updated Aug 29, 2024 • 17 • 2
RedHatAI/Mistral-7B-Instruct-v0.3-quantized.w4a16 Text Generation • 7B • Updated Mar 13, 2025 • 224 • 2