📊 Статистика дайджестов

Всего дайджестов: 34022 Добавлено сегодня: 0

Последнее обновление: сегодня

📄 Asymptotically optimal reinforcement learning in Block Markov Decision Processes

2025-10-17

Авторы:

Thomas van Vuren, Fiona Sloothaak, Maarten G. Wolf, Jaron Sanders

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

The curse of dimensionality renders Reinforcement Learning (RL) impractical in many real-world settings with exponentially large state and action spaces. Yet, many environments exhibit exploitable structure that can accelerate learning. To formalize this idea, we study RL in Block Markov Decision Processes (BMDPs). BMDPs model problems with large observation spaces, but where transition dynamics are fully determined by latent states. Recent advances in clustering methods have enabled the efficie...

ID: 2510.13748v1 cs.LG, math.PR, stat.ML, 90C40, 62H30, 60J20

arXiv PDF