Do LLMs Understand Romanian Driving Laws? A Study on Multimodal and Fine-Tuned Question Answering
2509.23715v1
cs.CL, cs.LG
2025-10-01
Авторы:
Eduard Barbu, Adrian Marius Dumitran
Abstract
Ensuring that both new and experienced drivers master current traffic rules
is critical to road safety. This paper evaluates Large Language Models (LLMs)
on Romanian driving-law QA with explanation generation. We release a
1{,}208-question dataset (387 multimodal) and compare text-only and multimodal
SOTA systems, then measure the impact of domain-specific fine-tuning for Llama
3.1-8B-Instruct and RoLlama 3.1-8B-Instruct. SOTA models perform well, but
fine-tuned 8B models are competitive. Textual descriptions of images outperform
direct visual input. Finally, an LLM-as-a-Judge assesses explanation quality,
revealing self-preference bias. The study informs explainable QA for
less-resourced languages.
Ссылки и действия
Дополнительные ресурсы: