Fine Tuning
updated
Fine-Tuning Language Models from Human Preferences
Paper
•
1909.08593
•
Published
•
3
PromptCoT: Synthesizing Olympiad-level Problems for Mathematical
Reasoning in Large Language Models
Paper
•
2503.02324
•
Published
How Difficulty-Aware Staged Reinforcement Learning Enhances LLMs'
Reasoning Capabilities: A Preliminary Experimental Study
Paper
•
2504.00829
•
Published
GPG: A Simple and Strong Reinforcement Learning Baseline for Model
Reasoning
Paper
•
2504.02546
•
Published
•
2
RL of Thoughts: Navigating LLM Reasoning with Inference-time
Reinforcement Learning
Paper
•
2505.14140
•
Published
•
1
SRPO: A Cross-Domain Implementation of Large-Scale Reinforcement
Learning on LLM
Paper
•
2504.14286
•
Published
•
2
General-Reasoner: Advancing LLM Reasoning Across All Domains
Paper
•
2505.14652
•
Published
•
24
SWE-agent: Agent-Computer Interfaces Enable Automated Software
Engineering
Paper
•
2405.15793
•
Published
•
7
VideoCAD: A Large-Scale Video Dataset for Learning UI Interactions and
3D Reasoning from CAD Software
Paper
•
2505.24838
•
Published
CAD-Recode: Reverse Engineering CAD Code from Point Clouds
Paper
•
2412.14042
•
Published
•
6
FlexiDreamer: Single Image-to-3D Generation with FlexiCubes
Paper
•
2404.00987
•
Published
•
23
CADCrafter: Generating Computer-Aided Design Models from Unconstrained
Images
Paper
•
2504.04753
•
Published
•
1
Neural Kernel Surface Reconstruction
Paper
•
2305.19590
•
Published
MaTe3D: Mask-guided Text-based 3D-aware Portrait Editing
Paper
•
2312.06947
•
Published
SWE-Dev: Building Software Engineering Agents with Training and
Inference Scaling
Paper
•
2506.07636
•
Published
•
1