CoIRL-AD: Collaborative-Competitive Imitation-Reinforcement Learning in Latent World Models for Autonomous Driving

2510.12560v1 cs.CV, cs.LG, cs.RO 2025-10-16

Авторы:

Xiaoji Zheng, Ziyuan Yang, Yanhao Chen, Yuhang Peng, Yuanrong Tang, Gengyuan Liu, Bokui Chen, Jiangtao Gong

Abstract

End-to-end autonomous driving models trained solely with imitation learning (IL) often suffer from poor generalization. In contrast, reinforcement learning (RL) promotes exploration through reward maximization but faces challenges such as sample inefficiency and unstable convergence. A natural solution is to combine IL and RL. Moving beyond the conventional two-stage paradigm (IL pretraining followed by RL fine-tuning), we propose CoIRL-AD, a competitive dual-policy framework that enables IL and RL agents to interact during training. CoIRL-AD introduces a competition-based mechanism that facilitates knowledge exchange while preventing gradient conflicts. Experiments on the nuScenes dataset show an 18% reduction in collision rate compared to baselines, along with stronger generalization and improved performance on long-tail scenarios. Code is available at: https://github.com/SEU-zxj/CoIRL-AD.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

CoIRL-AD: Collaborative-Competitive Imitation-Reinforcement Learning in Latent World Models for Autonomous Driving

Авторы:

Abstract

Ссылки и действия

Связанные статьи

MM-ACT: Learn from Multimodal Parallel Generation to Act

Flux4D: Flow-based Unsupervised 4D Reconstruction

Fast Post-Hoc Confidence Fusion for 3-Class Open-Set Aerial Object Detection

M2H: Multi-Task Learning with Efficient Window-Based Cross-Task Attention for Mo...

EReLiFM: Evidential Reliability-Aware Residual Flow Meta-Learning for Open-Set D...

Навигация