CCI4.0: A Bilingual Pretraining Dataset for Enhancing Reasoning in Large Language Models Paper • 2506.07463 • Published Jun 9 • 10
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 • 222