Whisper Models Dutch Language Collection This repo contains Dutch Whisper models fine-tuned on CV and synthetic data, with different filtering options • 11 items • Updated Sep 16, 2025 • 2
Whisper Models Portuguese Language Collection This repo contains Whisper models trained on different data subsets: Common Voice 17 (CV_17), Synthetic (generated by OpenAI) + CV17, and Synthetic only. • 13 items • Updated 13 days ago • 2
Parallel Sentences Datasets Collection Each of these datasets has "english" and "non_english" columns. They can be used to make embedding models multilingual. • 14 items • Updated Dec 10, 2025 • 21
Seamless: Multilingual Expressive and Streaming Speech Translation Paper • 2312.05187 • Published Dec 8, 2023 • 14