Luth: Efficient French Specialization for Small Language Models and Cross-Lingual Transfer
2510.05846v1
cs.CL, I.2.7
2025-10-09
Authors:
Maxence Lasbordes, Sinoué Gad
Abstract
The landscape of Large Language Models (LLMs) remains predominantly
English-centric, resulting in a significant performance gap for other major
languages, such as French, especially in the context of Small Language Models
(SLMs). Existing multilingual models demonstrate considerably lower performance
in French compared to English, and research on efficient adaptation methods for
French remains limited. To address this, we introduce Luth, a family
of French-specialized SLMs: through targeted post-training on curated,
high-quality French data, our models outperform all open-source counterparts of
comparable size on multiple French benchmarks while retaining their original
English capabilities. We further show that strategic model merging enhances
performance in both languages, establishing Luth as a new state of the art for
French SLMs and a robust baseline for future French-language research.