Wang Ting An

Tim0207

TingAnWang

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

upvoted a paper 3 months ago

Agent Learning via Early Experience

upvoted a paper 3 months ago

Less is More: Recursive Reasoning with Tiny Networks

View all activity

Organizations

None yet

upvoted a paper 4 days ago

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

Paper • 2512.19673 • Published 11 days ago • 60

upvoted 8 papers 3 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 270

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 501

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30, 2025 • 538

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10, 2025 • 660

liked 2 models 4 months ago

Qwen/Qwen2.5-Math-PRM-7B

Text Classification • 8B • Updated Jan 17, 2025 • 37.1k • 80

BounharAbdelaziz/Qwen2.5-3B-GRPO-Math-GSM8K

Text Generation • 3B • Updated Jun 25, 2025 • 37 • 1

upvoted a paper 4 months ago

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22, 2025 • 160

upvoted an article 5 months ago

Article

CPU Optimized Embeddings with 🤗 Optimum Intel and fastRAG

Mar 15, 2024

•

upvoted a paper 5 months ago

Learning to Skip the Middle Layers of Transformers

Paper • 2506.21103 • Published Jun 26, 2025 • 18

upvoted an article 5 months ago

Article

BM25 for Python: Achieving high performance while simplifying dependencies with BM25S⚡

Jul 9, 2024

•

upvoted a paper 5 months ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6, 2025 • 188

upvoted a paper about 1 year ago

Medical SAM 2: Segment medical images as video via Segment Anything Model 2

Paper • 2408.00874 • Published Aug 1, 2024 • 52

updated a model over 1 year ago

Tim0207/distilbert-base-uncased-finetuned-imdb

Fill-Mask • 67M • Updated Jun 16, 2024 • 4