Global PIQA A physical commonsense reasoning benchmark for 100+ languages, written in collaboration with 300+ researchers from 65 countries. Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published Oct 28, 2025 • 19 mrlbenchmarks/global-piqa-nonparallel Viewer • Updated Oct 29, 2025 • 11.6k • 2.78k • 32
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published Oct 28, 2025 • 19
Multilingual Leaderboards Leaderboards for languages other than English Runtime error 74 La Leaderboard 🌸 74 Evaluate open LLMs in the languages of LATAM and Spain. Running on CPU Upgrade 124 Open Chinese LLM Leaderboard 🏆 124 Explore and submit LLM benchmarks Running on CPU Upgrade 173 Open Arabic LLM Leaderboard 🏆 173 Track, rank and evaluate open Arabic LLMs and chatbots Running 40 OpenLLM French leaderboard 🇫🇷 🥇 40 Explore and submit LLM benchmarks
Running on CPU Upgrade 173 Open Arabic LLM Leaderboard 🏆 173 Track, rank and evaluate open Arabic LLMs and chatbots
Global PIQA A physical commonsense reasoning benchmark for 100+ languages, written in collaboration with 300+ researchers from 65 countries. Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published Oct 28, 2025 • 19 mrlbenchmarks/global-piqa-nonparallel Viewer • Updated Oct 29, 2025 • 11.6k • 2.78k • 32
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published Oct 28, 2025 • 19
Multilingual Leaderboards Leaderboards for languages other than English Runtime error 74 La Leaderboard 🌸 74 Evaluate open LLMs in the languages of LATAM and Spain. Running on CPU Upgrade 124 Open Chinese LLM Leaderboard 🏆 124 Explore and submit LLM benchmarks Running on CPU Upgrade 173 Open Arabic LLM Leaderboard 🏆 173 Track, rank and evaluate open Arabic LLMs and chatbots Running 40 OpenLLM French leaderboard 🇫🇷 🥇 40 Explore and submit LLM benchmarks
Running on CPU Upgrade 173 Open Arabic LLM Leaderboard 🏆 173 Track, rank and evaluate open Arabic LLMs and chatbots