SAM-LLM: Interpretable Lane Change Trajectory Prediction via Parametric Finetuning

2509.03462v1 cs.AI, cs.CV, cs.RO 2025-09-05

Authors:

Zhuo Cao, Yunxiao Shi, Min Xu

Summary

The paper presents SAM-LLM, a new hybrid model that combines the contextual reasoning of Large Language Models (LLMs) with the physical precision of kinematic models to predict lane change trajectories in autonomous driving. The core idea is to finetune an LLM to predict the key physical parameters of a trajectory (lateral displacement, maneuver duration, initial lateral velocity, and longitudinal velocity change) instead of outputting raw coordinates. This approach yields a complete, continuous, and physically plausible trajectory model that is interpretable and resource-efficient, reducing output size by 80% compared to coordinate-based methods. The model reaches an intention prediction accuracy of 98.73%, matching traditional LLM predictors while additionally allowing its results to be explained.

Abstract

This work introduces SAM-LLM, a novel hybrid architecture that bridges the gap between the contextual reasoning of Large Language Models (LLMs) and the physical precision of kinematic lane change models for autonomous driving. The system is designed for interpretable lane change trajectory prediction by finetuning an LLM to output the core physical parameters of a trajectory model instead of raw coordinates. For lane-keeping scenarios, the model predicts discrete coordinates, but for lane change maneuvers, it generates the parameters for an enhanced Sinusoidal Acceleration Model (SAM), including lateral displacement, maneuver duration, initial lateral velocity, and longitudinal velocity change. This parametric approach yields a complete, continuous, and physically plausible trajectory model that is inherently interpretable and computationally efficient, achieving an 80% reduction in output size compared to coordinate-based methods. The SAM-LLM achieves a state-of-the-art overall intention prediction accuracy of 98.73%, demonstrating performance equivalent to traditional LLM predictors while offering significant advantages in explainability and resource efficiency.
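To illustrate how a handful of predicted parameters can stand in for a full list of coordinates, the following is a minimal sketch of the textbook sinusoidal lateral-acceleration profile, not the paper's enhanced SAM. It rests on several assumptions: the lateral acceleration is taken as a single sine period, a_y(t) = (2πD/T²)·sin(2πt/T), which integrates to y(t) = D·(t/T − sin(2πt/T)/(2π)); the longitudinal velocity change is assumed to be applied as a constant acceleration over the maneuver; and the initial lateral velocity is omitted because the abstract does not say how the enhanced model uses it. The names `LaneChangeParams`, `sam_position`, and `v_x0` are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass
class LaneChangeParams:
    """Hypothetical container for parameters an LLM might predict."""
    lateral_displacement: float  # D, metres
    duration: float              # T, seconds
    longitudinal_dv: float       # change in longitudinal speed over T, m/s


def sam_position(p: LaneChangeParams, v_x0: float, t: float) -> tuple[float, float]:
    """Return (x, y) at time t, clamped to [0, T].

    Lateral motion: closed-form double integral of a one-period sine
    acceleration (textbook form, assumed here).
    Longitudinal motion: constant acceleration dv / T (an assumption).
    """
    t = min(max(t, 0.0), p.duration)
    phase = 2.0 * math.pi * t / p.duration
    y = p.lateral_displacement * (t / p.duration - math.sin(phase) / (2.0 * math.pi))
    a_x = p.longitudinal_dv / p.duration
    x = v_x0 * t + 0.5 * a_x * t * t
    return x, y


# Illustrative values only: a 3.5 m lane change over 5 s, speeding up by 2 m/s.
params = LaneChangeParams(lateral_displacement=3.5, duration=5.0, longitudinal_dv=2.0)
samples = [sam_position(params, v_x0=20.0, t=0.5 * k) for k in range(11)]
```

Because the trajectory is a closed-form function of time, it can be sampled at any resolution after prediction, which is what makes a parametric output both compact and continuous.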

