Running 76 puzzle 👁 76 Process text inputs through a neural network model and return numerical outputs
AudioX: Diffusion Transformer for Anything-to-Audio Generation Paper • 2503.10522 • Published Mar 13, 2025 • 27
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 284k • 1.57k