Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
yeliudev 's Collections
VideoMind
UniPixel
E.T. Bench
R2-Tuning

VideoMind

updated 10 days ago

[ICLR 2026] VideoMind: A Chain-of-LoRA Agent for Temporal-Grounded Video Reasoning

Upvote
3

  • Running on Zero
    37

    VideoMind 2B

    💡
    37

    A Chain-of-LoRA Agent for Temporal-Grounded Video Reasoning


  • yeliudev/VideoMind-2B

    Video-Text-to-Text • Updated 10 days ago • 16 • 2

  • yeliudev/VideoMind-7B

    Video-Text-to-Text • Updated 10 days ago • 17 • 4

  • yeliudev/VideoMind-Dataset

    Preview • Updated 10 days ago • 3.3k • 11

  • yeliudev/VideoMind-2B-FT-QVHighlights

    Video-Text-to-Text • Updated 10 days ago • 5 • 1

  • VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning

    Paper • 2503.13444 • Published Mar 17, 2025 • 17
Upvote
3
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs