Fine-tuning of Large Language Models for Constituency Parsing Using a Sequence to Sequence Approach
2510.16604v1
cs.CL, 68T50, I.2.7; I.2.6
2025-10-22
Авторы:
Francisco Jose Cortes Delgado, Eduardo Martinez Gracia, Rafael Valencia Garcia
Abstract
Recent advances in natural language processing with large neural models have
opened new possibilities for syntactic analysis based on machine learning. This
work explores a novel approach to phrase-structure analysis by fine-tuning
large language models (LLMs) to translate an input sentence into its
corresponding syntactic structure. The main objective is to extend the
capabilities of MiSintaxis, a tool designed for teaching Spanish syntax.
Several models from the Hugging Face repository were fine-tuned using training
data generated from the AnCora-ES corpus, and their performance was evaluated
using the F1 score. The results demonstrate high accuracy in phrase-structure
analysis and highlight the potential of this methodology.