Hao Fei
scofield7419
AI & ML interests
Multimodal Learning, Large Language Model, Vision and Language, Natural Language Processing, Structural Modeling
Recent Activity
authored
a paper
about 16 hours ago
JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation
upvoted
a
paper
2 days ago
JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation
authored
a paper
about 2 months ago
UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist