Active filters: ONNX
microsoft/Phi-3-small-8k-instruct-onnx-cuda
Text Generation
• Updated • 53
• 14
Intel/gpt-j-6B-int8-dynamic-inc
Text Generation
• Updated • 36
• 16
Intel/gpt-j-6B-int8-static-inc
Text Generation
• Updated • 40
• 9
Text-to-Image
• Updated • 1
Text-to-Image
• Updated • 5
Text-to-Image
• Updated • 2
vgorce/distilbert-base-multi-cased-ner
Token Classification
• Updated • 8
• 1
Image Segmentation
• Updated • 3
Updated • 15
yilunzhang/all-mpnet-base-v2-onnx
Sentence Similarity
• Updated • 2
Feature Extraction
• Updated • 133
• 2
microsoft/Phi-3-mini-4k-instruct-onnx
Text Generation
• Updated • 323
• 146
microsoft/Phi-3-mini-128k-instruct-onnx
Text Generation
• Updated • 105
• 193
renwoshin/Phi-3-mini-128k-instruct-onnx-tf
Text Generation
• Updated • 18
• 1
Xenova/Phi-3-mini-4k-instruct
Text Generation
• Updated • 698
• 21
Xenova/Phi-3-mini-4k-instruct_fp16
Text Generation
• Updated • 177
• 5
microsoft/Phi-3-mini-4k-instruct-onnx-web
Text Generation
• Updated • 191
• 27
FusionQuill/Phi-3-mini-128k-instruct-onnx
Text Generation
• Updated • 3
microsoft/Phi-3-medium-4k-instruct-onnx-cpu
Text Generation
• Updated • 48
• 7
microsoft/Phi-3-medium-4k-instruct-onnx-cuda
Text Generation
• Updated • 56
• 11
microsoft/Phi-3-medium-4k-instruct-onnx-directml
Text Generation
• Updated • 46
• 10
microsoft/Phi-3-medium-128k-instruct-onnx-cpu
Text Generation
• Updated • 65
• 14
microsoft/Phi-3-medium-128k-instruct-onnx-cuda
Text Generation
• Updated • 47
• 24
microsoft/Phi-3-medium-128k-instruct-onnx-directml
Text Generation
• Updated • 49
• 6
microsoft/Phi-3-vision-128k-instruct-onnx-cpu
Text Generation
• Updated • 37
• 29
microsoft/Phi-3-vision-128k-instruct-onnx-cuda
Text Generation
• Updated • 24
• 27
microsoft/Phi-3-vision-128k-instruct-onnx-directml
Text Generation
• Updated • 19
• 8
microsoft/mistral-7b-instruct-v0.2-ONNX
Text Generation
• Updated • 9
• 6
luweigen/Llama-3-8B-Instruct-int4-onnx-directml
Text Generation
• Updated • 4
EmbeddedLLM/llama-2-7b-chat-int4-onnx-directml
Text Generation
• Updated • 9
• 1