Pathology-CoT: Learning Visual Chain-of-Thought Agent from Expert Whole Slide Image Diagnosis Behavior Paper • 2510.04587 • Published Oct 6, 2025 • 2
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss Paper • 2512.23447 • Published 3 days ago • 83
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation Paper • 2512.23576 • Published 3 days ago • 59
Towards a Visual-Language Foundation Model for Computational Pathology Paper • 2307.12914 • Published Jul 24, 2023 • 1
SlideChat: A Large Vision-Language Assistant for Whole-Slide Pathology Image Understanding Paper • 2410.11761 • Published Oct 15, 2024 • 3
Multistain Pretraining for Slide Representation Learning in Pathology Paper • 2408.02859 • Published Aug 5, 2024 • 1
Agentic Systems in Radiology: Design, Applications, Evaluation, and Challenges Paper • 2510.09404 • Published Oct 10, 2025 • 1
Towards Generalist Foundation Model for Radiology Paper • 2308.02463 • Published Aug 4, 2023 • 1
MedKLIP: Medical Knowledge Enhanced Language-Image Pre-Training in Radiology Paper • 2301.02228 • Published Jan 5, 2023 • 1
A foundation model utilizing chest CT volumes and radiology reports for supervised-level zero-shot detection of abnormalities Paper • 2403.17834 • Published Mar 26, 2024 • 3
RadVLM: A Multitask Conversational Vision-Language Model for Radiology Paper • 2502.03333 • Published Feb 5, 2025 • 1
Visual Prompt Engineering for Medical Vision Language Models in Radiology Paper • 2408.15802 • Published Aug 28, 2024 • 1
view article Article VideoMamba: State Space Model for Efficient Video Understanding Mar 16, 2024 • 2
view article Article Faster Stable Diffusion with Core ML on iPhone, iPad, and Mac Jun 15, 2023 • 6
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 15 days ago • 90