Instructions to use microsoft/bloom-deepspeed-inference-int8 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use microsoft/bloom-deepspeed-inference-int8 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("feature-extraction", model="microsoft/bloom-deepspeed-inference-int8")# Load model directly from transformers import AutoTokenizer, AutoModel tokenizer = AutoTokenizer.from_pretrained("microsoft/bloom-deepspeed-inference-int8") model = AutoModel.from_pretrained("microsoft/bloom-deepspeed-inference-int8") - Notebooks
- Google Colab
- Kaggle
File size: 135 Bytes
a804055 | 1 2 3 4 | version https://git-lfs.github.com/spec/v1
oid sha256:d1ad7cf22501554295a53e68597c7b58630957e1bf2189d30b7aabfd6e5da930
size 5551123341
|