10 6 10

Ghosh

Sreyan88

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

nvidia/music-flamingo-2601-hf

authored a paper about 2 months ago

Music Flamingo: Scaling Music Understanding in Audio Language Models

commented on a paper about 2 months ago

Music Flamingo: Scaling Music Understanding in Audio Language Models

View all activity

Organizations

liked a model 2 days ago

nvidia/music-flamingo-2601-hf

Audio-Text-to-Text • 8B • Updated 1 day ago • 158 • 17

authored a paper about 2 months ago

Music Flamingo: Scaling Music Understanding in Audio Language Models

Paper • 2511.10289 • Published Nov 13, 2025 • 10

commented a paper about 2 months ago

Music Flamingo: Scaling Music Understanding in Audio Language Models

Paper • 2511.10289 • Published Nov 13, 2025 • 10 •

authored 3 papers 3 months ago

Audio Flamingo Sound-CoT Technical Report: Improving Chain-of-Thought Reasoning in Sound Understanding

Paper • 2508.11818 • Published Aug 15, 2025

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

Paper • 2510.15870 • Published Oct 17, 2025 • 89

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

Paper • 2510.15870 • Published Oct 17, 2025 • 89

upvoted a paper 3 months ago

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

Paper • 2510.15870 • Published Oct 17, 2025 • 89

liked a Space 4 months ago

GPT-OSS-120B on AMD MI300X

💻

329

gpt-oss-120b on AMD MI300X GPUs

updated a collection 4 months ago

Audio

Collection

liked a dataset 5 months ago

gamma-lab-umd/MMAU-Pro

Viewer • Updated Aug 28, 2025 • 5.31k • 338 • 12

authored a paper 5 months ago

MMAU-Pro: A Challenging and Comprehensive Benchmark for Holistic Evaluation of Audio General Intelligence

Paper • 2508.13992 • Published Aug 19, 2025 • 7

commented a paper 5 months ago

MMAU-Pro: A Challenging and Comprehensive Benchmark for Holistic Evaluation of Audio General Intelligence

Paper • 2508.13992 • Published Aug 19, 2025 • 7 •

liked a model 5 months ago

nvidia/audio-flamingo-2-SoundCoT

Audio-Text-to-Text • Updated Aug 28, 2025 • 9

New activity in nvidia/AudioSkills 5 months ago

BBC-Sound-Effect duration doesn't match.

#5 opened 5 months ago by

WhaleDolphin

commented a paper 6 months ago

Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models

Paper • 2507.08128 • Published Jul 10, 2025 • 10 •

liked a model 6 months ago

nvidia/audio-flamingo-3

Audio-Text-to-Text • Updated Nov 28, 2025 • 718 • 140

authored 2 papers 6 months ago

Multi-Domain Audio Question Answering Toward Acoustic Content Reasoning in The DCASE 2025 Challenge

Paper • 2505.07365 • Published May 12, 2025

Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models

Paper • 2507.08128 • Published Jul 10, 2025 • 10

commented a paper 6 months ago

Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models

Paper • 2507.08128 • Published Jul 10, 2025 • 10 •

commented a paper 10 months ago

Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities

Paper • 2503.03983 • Published Mar 6, 2025 • 26 •

Ghosh

AI & ML interests

Recent Activity

Organizations

Sreyan88's activity

GPT-OSS-120B on AMD MI300X

BBC-Sound-Effect duration doesn't match.