7 14 2

Zhuoyang Zhang PRO

zhuoyang20

https://zhuoyangz.com

AI & ML interests

Efficient AI computing

Recent Activity

upvoted a paper 17 days ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

upvoted a paper about 1 month ago

Flash-KMeans: Fast and Memory-Efficient Exact K-Means

authored a paper about 2 months ago

Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model

View all activity

Organizations

upvoted a paper 17 days ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published 20 days ago • 143

upvoted a paper about 1 month ago

Flash-KMeans: Fast and Memory-Efficient Exact K-Means

Paper • 2603.09229 • Published Mar 10 • 82

upvoted a paper 2 months ago

Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization

Paper • 2602.02958 • Published Feb 3 • 34

upvoted a paper 5 months ago

VLASH: Real-Time VLAs via Future-State-Aware Asynchronous Inference

Paper • 2512.01031 • Published Nov 30, 2025 • 26

upvoted a paper 6 months ago

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Paper • 2510.09608 • Published Oct 10, 2025 • 53

upvoted a paper 7 months ago

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26, 2025 • 189

upvoted a paper 8 months ago

MolmoAct: Action Reasoning Models that can Reason in Space

Paper • 2508.07917 • Published Aug 11, 2025 • 45

upvoted a paper 9 months ago

GR-3 Technical Report

Paper • 2507.15493 • Published Jul 21, 2025 • 47

upvoted 3 papers 10 months ago

Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation

Paper • 2507.01957 • Published Jul 2, 2025 • 23

Radial Attention: O(nlog n) Sparse Attention with Energy Decay for Long Video Generation

Paper • 2506.19852 • Published Jun 24, 2025 • 42

SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity

Paper • 2506.16500 • Published Jun 19, 2025 • 17

upvoted a paper about 1 year ago

LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

Paper • 2502.14866 • Published Feb 20, 2025 • 13

upvoted a paper over 1 year ago

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published Dec 5, 2024 • 60

upvoted a paper about 2 years ago

EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss

Paper • 2402.05008 • Published Feb 7, 2024 • 23

Zhuoyang Zhang PRO

AI & ML interests

Recent Activity

Organizations

zhuoyang20's activity