Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval Paper • 2408.00441 • Published Aug 1, 2024 • 1
LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation Paper • 2303.12343 • Published Mar 22, 2023 • 2
ZClip: Adaptive Spike Mitigation for LLM Pre-Training Paper • 2504.02507 • Published Apr 3, 2025 • 88
Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding Paper • 2506.16035 • Published Jun 19, 2025 • 89