Authors:
Gabriel Diaz, Lucky Li, Wenhao Zhang
Abstract:
Reinforcement Learning (RL) has emerged as a powerful framework for
sequential decision-making in dynamic environments, particularly when system
parameters are unknown. This paper investigates RL-based control for
entropy-regularized Linear Quadratic control (LQC) problems with multiplicative
noise over an infinite time horizon. First, we adapt the Regularized Policy
Gradient (RPG) algorithm to stochastic optimal control settings, proving that
despite the non-convexity of the problem, RPG conve...