daily_review
updated
LCM-LoRA: A Universal Stable-Diffusion Acceleration Module
Paper
• 2311.05556
• Published
• 87
LongAlign: A Recipe for Long Context Alignment of Large Language Models
Paper
• 2401.18058
• Published
• 24
Efficient Tool Use with Chain-of-Abstraction Reasoning
Paper
• 2401.17464
• Published
• 21
Transfer Learning for Text Diffusion Models
Paper
• 2401.17181
• Published
• 17
DocLLM: A layout-aware generative language model for multimodal document
understanding
Paper
• 2401.00908
• Published
• 189
VL-GPT: A Generative Pre-trained Transformer for Vision and Language
Understanding and Generation
Paper
• 2312.09251
• Published
• 10
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual
Perception
Paper
• 2401.16158
• Published
• 20
SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents
Paper
• 2401.10935
• Published
• 5
WebVoyager: Building an End-to-End Web Agent with Large Multimodal
Models
Paper
• 2401.13919
• Published
• 32
AgentBoard: An Analytical Evaluation Board of Multi-turn LLM Agents
Paper
• 2401.13178
• Published
Small LLMs Are Weak Tool Learners: A Multi-LLM Agent
Paper
• 2401.07324
• Published
• 3
AUTOACT: Automatic Agent Learning from Scratch via Self-Planning
Paper
• 2401.05268
• Published
• 4
EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty
Paper
• 2401.15077
• Published
• 20
CreativeSynth: Creative Blending and Synthesis of Visual Arts based on
Multimodal Diffusion
Paper
• 2401.14066
• Published
• 11
MaLA-500: Massive Language Adaptation of Large Language Models
Paper
• 2401.13303
• Published
• 12
In-Context Language Learning: Architectures and Algorithms
Paper
• 2401.12973
• Published
• 4
E^2-LLM: Efficient and Extreme Length Extension of Large Language Models
Paper
• 2401.06951
• Published
• 26
Fast Inference of Mixture-of-Experts Language Models with Offloading
Paper
• 2312.17238
• Published
• 7
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts
for Instruction Tuning on General Tasks
Paper
• 2401.02731
• Published
• 3
OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models
Paper
• 2402.01739
• Published
• 28
A Closer Look into Mixture-of-Experts in Large Language Models
Paper
• 2406.18219
• Published
• 17
Octo-planner: On-device Language Model for Planner-Action Agents
Paper
• 2406.18082
• Published
• 48
Paper
• 2412.15115
• Published
• 377
LearnLM: Improving Gemini for Learning
Paper
• 2412.16429
• Published
• 22
MH-MoE:Multi-Head Mixture-of-Experts
Paper
• 2411.16205
• Published
• 26
Multi-Head Mixture-of-Experts
Paper
• 2404.15045
• Published
• 60
Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models
Paper
• 2511.08577
• Published
• 108
MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism
Paper
• 2511.11373
• Published
• 14
Language Models that Think, Chat Better
Paper
• 2509.20357
• Published
• 1
AgentEvolver: Towards Efficient Self-Evolving Agent System
Paper
• 2511.10395
• Published
• 4
Efficient Reasoning via Reward Model
Paper
• 2511.09158
• Published