SwiftTailor: Efficient 3D Garment Generation with Geometry Image Representation Paper • 2603.19053 • Published 17 days ago • 2
ViDoRe V3: A Comprehensive Evaluation of Retrieval Augmented Generation in Complex Real-World Scenarios Paper • 2601.08620 • Published Jan 13 • 12
Surfer 2: The Next Generation of Cross-Platform Computer Use Agents Paper • 2510.19949 • Published Oct 22, 2025 • 38
ViDoRe Benchmark V2: Raising the Bar for Visual Retrieval Paper • 2505.17166 • Published May 22, 2025 • 1
ModernVBERT: Towards Smaller Visual Document Retrievers Paper • 2510.01149 • Published Oct 1, 2025 • 33
Inherently Faithful Attention Maps for Vision Transformers Paper • 2506.08915 • Published Jun 10, 2025 • 3
Seeing Voices: Generating A-Roll Video from Audio with Mirage Paper • 2506.08279 • Published Jun 9, 2025 • 27
MixerMDM: Learnable Composition of Human Motion Diffusion Models Paper • 2504.01019 • Published Apr 1, 2025 • 18
Post: Made a HF Dataset editor a la gg sheets here: lhoestq/dataset-spreadsheets
With Dataset Spreadsheets:
✏️ Edit datasets in the UI
🔗 Share link with collaborators
🐍 Use locally in DuckDB or Python
Available for the 100,000+ parquet datasets on HF :)
Are Vision Language Models Texture or Shape Biased and Can We Steer Them? Paper • 2403.09193 • Published Mar 14, 2024 • 9
On the Interplay of Convolutional Padding and Adversarial Robustness Paper • 2308.06612 • Published Aug 12, 2023
An Extended Study of Human-like Behavior under Adversarial Training Paper • 2303.12669 • Published Mar 22, 2023
The Power of Linear Combinations: Learning with Random Convolutions Paper • 2301.11360 • Published Jan 26, 2023
Does Medical Imaging learn different Convolution Filters? Paper • 2210.13799 • Published Oct 25, 2022
Adversarial Robustness through the Lens of Convolutional Filters Paper • 2204.02481 • Published Apr 5, 2022
CNN Filter DB: An Empirical Investigation of Trained Convolutional Filters Paper • 2203.15331 • Published Mar 29, 2022