Video-CoM
Collection
Video-CoM: Interactive Video Reasoning via Chain of Manipulations • 2 items
Natural Language Processing, Machine Learning, and Computer Vision
Paper Circle: An Open-source Multi-agent Research Discovery and Analysis Framework
LinguDistill: Recovering Linguistic Ability in Vision-Language Models via Selective Cross-Modal Distillation