ADAM Optimization with Adaptive Batch Selection

2512.06795v1 stat.ML, cs.LG 2025-12-09

Authors:

Gyu Yeol Kim, Min-hwan Oh

Abstract

Adam is a widely used optimizer in neural network training due to its adaptive learning rate. However, because different data samples influence model updates to varying degrees, treating them equally can lead to inefficient convergence. To address this, prior work proposed using a bandit framework to adapt the sampling distribution so that samples are selected adaptively. While promising, the bandit-based variant of Adam suffers from limited theoretical guarantees. In this paper, we introduce Adam with Combinatorial Bandit Sampling (AdamCB), which integrates combinatorial bandit techniques into Adam to resolve these issues. AdamCB is able to fully utilize feedback from multiple samples at once, enhancing both theoretical guarantees and practical performance. Our regret analysis shows that AdamCB achieves faster convergence than Adam-based methods, including the previous bandit-based variant. Numerical experiments demonstrate that AdamCB consistently outperforms existing methods.
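To make the general idea concrete, the following is a minimal sketch of Adam combined with bandit-style adaptive sampling. It is not the AdamCB algorithm from the paper: it assumes an Exp3-style exponential-weights update, draws each mini-batch with replacement, and reweights gradients by 1 / (n * p_i) so the batch gradient stays unbiased, whereas a combinatorial bandit selects the batch as a whole. The function name `adam_with_bandit_sampling`, the clipped gradient-norm reward, and all hyperparameter values are hypothetical choices made for illustration.

```python
import math
import random


def adam_with_bandit_sampling(xs, ys, steps=500, batch_size=4, lr=0.05,
                              eta=0.01, mix=0.2, seed=0):
    """Fit y ~ w*x + b with Adam, sampling examples from a learned distribution.

    Illustrative simplification: samples are drawn with replacement and each
    gradient is scaled by 1 / (n * p_i * batch_size) to remain unbiased.
    """
    rng = random.Random(seed)
    n = len(xs)
    log_w = [0.0] * n              # one bandit arm (log-weight) per sample
    theta = [0.0, 0.0]             # parameters (w, b)
    m = [0.0, 0.0]                 # Adam first-moment estimate
    v = [0.0, 0.0]                 # Adam second-moment estimate
    beta1, beta2, eps = 0.9, 0.999, 1e-8
    p = [1.0 / n] * n

    for t in range(1, steps + 1):
        # Sampling distribution: softmax of the weights mixed with uniform
        # exploration, so every probability is at least mix / n.
        top = max(log_w)
        expw = [math.exp(lw - top) for lw in log_w]
        total = sum(expw)
        p = [(1 - mix) * e / total + mix / n for e in expw]

        batch = rng.choices(range(n), weights=p, k=batch_size)

        grad = [0.0, 0.0]
        for i in batch:
            err = theta[0] * xs[i] + theta[1] - ys[i]
            g = (err * xs[i], err)             # gradient of 0.5 * err**2
            scale = 1.0 / (n * p[i] * batch_size)
            grad[0] += scale * g[0]
            grad[1] += scale * g[1]
            # Bandit feedback (assumed form): clipped gradient norm as the
            # reward, importance-weighted by 1 / p_i as in Exp3.
            reward = min(math.hypot(g[0], g[1]), 1.0)
            log_w[i] += eta * reward / p[i]

        # Standard Adam update with bias correction.
        for j in range(2):
            m[j] = beta1 * m[j] + (1 - beta1) * grad[j]
            v[j] = beta2 * v[j] + (1 - beta2) * grad[j] ** 2
            m_hat = m[j] / (1 - beta1 ** t)
            v_hat = v[j] / (1 - beta2 ** t)
            theta[j] -= lr * m_hat / (math.sqrt(v_hat) + eps)

    return theta, p


if __name__ == "__main__":
    xs = [i / 10 for i in range(-10, 10)]
    ys = [2 * x + 1 for x in xs]
    theta, p = adam_with_bandit_sampling(xs, ys)
    print("w, b =", theta)
    print("largest sampling probability =", max(p))
```

In this sketch, samples that recently produced large gradients receive a higher sampling probability, while the uniform mixing term keeps every sample reachable and bounds the importance weights. The returned `p` is the sampling distribution used in the final step.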
