📊 Статистика дайджестов

Всего дайджестов: 34022 Добавлено сегодня: 82

Последнее обновление: сегодня

📄 Predictive Safety Shield for Dyna-Q Reinforcement Learning

2025-11-27

Авторы:

Jin Pin, Krasowski Hanna, Vanneaux Elena

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Obtaining safety guarantees for reinforcement learning is a major challenge to achieve applicability for real-world tasks. Safety shields extend standard reinforcement learning and achieve hard safety guarantees. However, existing safety shields commonly use random sampling of safe actions or a fixed fallback controller, therefore disregarding future performance implications of different safe actions. In this work, we propose a predictive safety shield for model-based reinforcement learning agen...

ID: 2511.21531v1 cs.LG, cs.AI, cs.RO, eess.SY

arXiv PDF

📄 Statistically Assuring Safety of Control Systems using Ensembles of Safety Filters and Conformal Prediction

2025-11-15

Авторы:

Ihab Tabbara, Yuxuan Yang, Hussein Sibai

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Safety assurance is a fundamental requirement for deploying learning-enabled autonomous systems. Hamilton-Jacobi (HJ) reachability analysis is a fundamental method for formally verifying safety and generating safe controllers. However, computing the HJ value function that characterizes the backward reachable set (BRS) of a set of user-defined failure states is computationally expensive, especially for high-dimensional systems, motivating the use of reinforcement learning approaches to approximat...

ID: 2511.07899v1 cs.LG, cs.AI, cs.RO, eess.SY

arXiv PDF