Running 1.48k Big Code Models Leaderboard 📈 1.48k Explore and compare code generation models on a leaderboard
Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming Paper • 2402.14261 • Published Feb 22, 2024 • 10