FP8 Please

#1
by valdanito - opened

For NVIDIA H800 🥺

Sure thing!
Just uploaded it. It should also be compatible with FP8 kv cache quantization in vLLM!
numind/NuExtract3-FP8
https://huggingface.co/numind/NuExtract3-FP8

SorenDreano changed discussion status to closed

Sign up or log in to comment