Learning Parameterized Skills from Demonstrations

2510.24095v1 cs.LG, cs.AI, cs.RO 2025-10-30

Авторы:

Vedant Gupta, Haotian Fu, Calvin Luo, Yiding Jiang, George Konidaris

Abstract

We present DEPS, an end-to-end algorithm for discovering parameterized skills from expert demonstrations. Our method learns parameterized skill policies jointly with a meta-policy that selects the appropriate discrete skill and continuous parameters at each timestep. Using a combination of temporal variational inference and information-theoretic regularization methods, we address the challenge of degeneracy common in latent variable models, ensuring that the learned skills are temporally extended, semantically meaningful, and adaptable. We empirically show that learning parameterized skills from multitask expert demonstrations significantly improves generalization to unseen tasks. Our method outperforms multitask as well as skill learning baselines on both LIBERO and MetaWorld benchmarks. We also demonstrate that DEPS discovers interpretable parameterized skills, such as an object grasping skill whose continuous arguments define the grasp location.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

Learning Parameterized Skills from Demonstrations

Авторы:

Abstract

Ссылки и действия

Связанные статьи

Forecasting in Offline Reinforcement Learning for Non-stationary Environments

Leveraging LLMs for reward function design in reinforcement learning control tas...

Are LLMs The Way Forward? A Case Study on LLM-Guided Reinforcement Learning for ...

Harnessing Bounded-Support Evolution Strategies for Policy Refinement

Dynamic Sparsity: Challenging Common Sparsity Assumptions for Learning World Mod...

Навигация