28 41

Beren Meng

Beren1128

qingyumeng1128@gmail.com

AI & ML interests

Explainable NLP Missing Data Imputation LLMOps

Recent Activity

upvoted a paper 4 days ago

PaperBanana: Automating Academic Illustration for AI Scientists

upvoted a paper 4 days ago

Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

upvoted a paper 4 days ago

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

View all activity

Organizations

upvoted 3 papers 4 days ago

upvoted an article 2 months ago

Article

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

May 7, 2024

•

118

upvoted an article 3 months ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025

•

604

upvoted a paper 3 months ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 299

upvoted an article 4 months ago

Article

PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs

Jan 24, 2025

•

upvoted an article 7 months ago

Article

Mixture of Experts Explained

Dec 11, 2023

•

1.08k

upvoted 2 papers 7 months ago

MemOS: A Memory OS for AI System

Paper • 2507.03724 • Published Jul 4, 2025 • 159

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17, 2025 • 261

upvoted an article 7 months ago

Article

Introducing smolagents: simple agents that write actions in code.

Dec 31, 2024

•

1.17k

upvoted a paper 12 months ago

Towards Universal Soccer Video Understanding

Paper • 2412.01820 • Published Dec 2, 2024 • 11

upvoted an article about 1 year ago

Article

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

Mar 22, 2024

•

128

upvoted a collection about 1 year ago

Qwen Papers

Collection

8 items • Updated Feb 13, 2025 • 2

upvoted 2 papers about 1 year ago

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Paper • 2401.06066 • Published Jan 11, 2024 • 59

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Paper • 2401.02954 • Published Jan 5, 2024 • 53

upvoted a collection about 1 year ago

Deepseek Papers

Collection

Deepseek papers collection • 31 items • Updated about 10 hours ago • 325

upvoted 3 articles over 1 year ago

Article

How to generate text: using different decoding methods for language generation with Transformers

Mar 1, 2020

•

291

Article

AI Watermarking 101: Tools and Techniques

Feb 26, 2024

•

Article

Vision Language Models Explained

Apr 11, 2024

•

521

Beren Meng

AI & ML interests

Recent Activity

Organizations

Beren1128's activity

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

We Got Claude to Fine-Tune an Open Source LLM

PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs

Mixture of Experts Explained

Introducing smolagents: simple agents that write actions in code.

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

How to generate text: using different decoding methods for language generation with Transformers

AI Watermarking 101: Tools and Techniques

Vision Language Models Explained