2 11

JaeminKim

kjm981995

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

DINO-SAE: DINO Spherical Autoencoder for High-Fidelity Image Reconstruction and Generation

upvoted a paper 8 months ago

Fork-Merge Decoding: Enhancing Multimodal Understanding in Audio-Visual Large Language Models

authored a paper 8 months ago

Optical-Flow Guided Prompt Optimization for Coherent Video Generation

View all activity

Organizations

None yet

upvoted a paper 4 days ago

DINO-SAE: DINO Spherical Autoencoder for High-Fidelity Image Reconstruction and Generation

Paper • 2601.22904 • Published 8 days ago • 13

upvoted a paper 8 months ago

Fork-Merge Decoding: Enhancing Multimodal Understanding in Audio-Visual Large Language Models

Paper • 2505.20873 • Published May 27, 2025 • 9

authored 2 papers 8 months ago

Optical-Flow Guided Prompt Optimization for Coherent Video Generation

Paper • 2411.15540 • Published Nov 23, 2024

Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs

Paper • 2505.19075 • Published May 25, 2025 • 21

upvoted 2 papers 8 months ago

Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment

Paper • 2505.18600 • Published May 24, 2025 • 48

Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs

Paper • 2505.19075 • Published May 25, 2025 • 21

commented a paper 8 months ago

Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs

Paper • 2505.19075 • Published May 25, 2025 • 21 •

upvoted a paper 10 months ago

Zero4D: Training-Free 4D Video Generation From Single Video Using Off-the-Shelf Video Diffusion Model

Paper • 2503.22622 • Published Mar 28, 2025 • 18

upvoted 2 papers 11 months ago

PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity

Paper • 2503.07677 • Published Mar 10, 2025 • 86

Reangle-A-Video: 4D Video Generation as Video-to-Video Translation

Paper • 2503.09151 • Published Mar 12, 2025 • 32

upvoted a paper about 1 year ago

Free^2Guide: Gradient-Free Path Integral Control for Enhancing Text-to-Video Generation with Large Vision-Language Models

Paper • 2411.17041 • Published Nov 26, 2024 • 13

commented a paper about 1 year ago

Free$^2$Guide: Gradient-Free Path Integral Control for Enhancing Text-to-Video Generation with Large Vision-Language Models

Paper • 2411.17041 • Published Nov 26, 2024 • 13 •

authored a paper about 1 year ago

Free$^2$Guide: Gradient-Free Path Integral Control for Enhancing Text-to-Video Generation with Large Vision-Language Models

Paper • 2411.17041 • Published Nov 26, 2024 • 13

upvoted 3 papers over 1 year ago

TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation

Paper • 2410.05591 • Published Oct 8, 2024 • 13

ViBiDSampler: Enhancing Video Interpolation Using Bidirectional Diffusion Sampler

Paper • 2410.05651 • Published Oct 8, 2024 • 12

VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide

Paper • 2410.04364 • Published Oct 6, 2024 • 29

JaeminKim

AI & ML interests

Recent Activity

Organizations

kjm981995's activity