view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 16 days ago • 854
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs Paper • 2404.05719 • Published Apr 8, 2024 • 83
Whisper Collection Whisper models for automatic speech recognition (ASR) and speech translation, quantized for faster inference speeds. • 0 items • Updated about 16 hours ago • 3