Dialogue Is Not Enough to Make a Communicative BabyLM (But Neither Is Developmentally Inspired Reinforcement Learning) Paper • 2510.20358 • Published Oct 23, 2025
BLiSS 1.0: Evaluating Bilingual Learner Competence in Second Language Small Language Models Paper • 2510.19419 • Published Oct 22, 2025 • 1
Teacher Demonstrations in a BabyLM's Zone of Proximal Development for Contingent Multi-Turn Interaction Paper • 2510.20411 • Published Oct 23, 2025 • 2
Are they lovers or friends? Evaluating LLMs' Social Reasoning in English and Korean Dialogues Paper • 2510.19028 • Published Oct 21, 2025 • 7
BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data Paper • 2510.10159 • Published Oct 11, 2025 • 3
BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data Paper • 2510.10159 • Published Oct 11, 2025 • 3
Looking to Learn: Token-wise Dynamic Gating for Low-Resource Vision-Language Modelling Paper • 2510.08470 • Published Oct 9, 2025 • 1
Pico: A Modular Framework for Hypothesis-Driven Small Language Model Research Paper • 2509.16413 • Published Sep 19, 2025 • 1
Meta-Pretraining for Zero-Shot Cross-Lingual Named Entity Recognition in Low-Resource Philippine Languages Paper • 2509.02160 • Published Sep 2, 2025 • 1
Instructing Large Language Models for Low-Resource Languages: A Systematic Study for Basque Paper • 2506.07597 • Published Jun 9, 2025
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper • 2506.20920 • Published Jun 26, 2025 • 75
Lessons from the Trenches on Reproducible Evaluation of Language Models Paper • 2405.14782 • Published May 23, 2024 • 1