Non-English Embeddings and Models
updated
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
Paper
•
2211.05100
•
Published
•
35
Contrastive Language-Image Pre-training for the Italian Language
Paper
•
2108.08688
•
Published
•
2
IT5: Large-scale Text-to-text Pretraining for Italian Language
Understanding and Generation
Paper
•
2203.03759
•
Published
•
5
Spanish Pre-trained BERT Model and Evaluation Data
Paper
•
2308.02976
•
Published
•
3
German FinBERT: A German Pre-trained Language Model
Paper
•
2311.08793
•
Published
•
3
German Text Embedding Clustering Benchmark
Paper
•
2401.02709
•
Published
•
6
AfroDigits: A Community-Driven Spoken Digit Dataset for African
Languages
Paper
•
2303.12582
•
Published
•
21
Text Generation
•
7B
•
Updated
•
8.95k
•
68
Updated
•
247
•
24
Aya Model: An Instruction Finetuned Open-Access Multilingual Language
Model
Paper
•
2402.07827
•
Published
•
48
Viewer
•
Updated
•
206k
•
3.77k
•
329
CohereLabs/c4ai-command-r-v01
Text Generation
•
35B
•
Updated
•
12.5k
•
1.1k