📊 Digest statistics

Total digests: 34022. Added today: 82.

Last updated: today

📄 Vector-valued self-normalized concentration inequalities beyond sub-Gaussianity

2025-11-07

Authors:

Diego Martinez-Taboada, Tomas Gonzalez, Aaditya Ramdas

Annotation:

The study of self-normalized processes plays a crucial role in a wide range of applications, from sequential decision-making to econometrics. While the behavior of self-normalized concentration has been widely investigated for scalar-valued processes, vector-valued processes remain comparatively underexplored, especially outside of the sub-Gaussian framework. In this contribution, we provide concentration bounds for self-normalized processes with light tails beyond sub-Gaussianity (such as Benne...

ID: 2511.03606v1 stat.ML, cs.LG, math.ST, stat.TH

📄 Optimal Convergence Analysis of DDPM for General Distributions

2025-11-04

Authors:

Yuchen Jiao, Yuchen Zhou, Gen Li

Annotation:

Score-based diffusion models have achieved remarkable empirical success in generating high-quality samples from target data distributions. Among them, the Denoising Diffusion Probabilistic Model (DDPM) is one of the most widely used samplers, generating samples via estimated score functions. Despite its empirical success, a tight theoretical understanding of DDPM -- especially its convergence properties -- remains limited. In this paper, we provide a refined convergence analysis of the DDPM sa...

ID: 2510.27562v1 stat.ML, cs.LG, math.ST, stat.TH

📄 Multimodal Bandits: Regret Lower Bounds and Optimal Algorithms

2025-11-01

Authors:

William Réveillard, Richard Combes

Annotation:

We consider a stochastic multi-armed bandit problem with i.i.d. rewards where the expected reward function is multimodal with at most m modes. We propose the first known computationally tractable algorithm for computing the solution to the Graves-Lai optimization problem, which in turn enables the implementation of asymptotically optimal algorithms for this bandit problem. The code for the proposed algorithms is publicly available at https://github.com/wilrev/MultimodalBandits

ID: 2510.25811v1 stat.ML, cs.LG, math.ST, stat.TH

📄 Complexity Dependent Error Rates for Physics-informed Statistical Learning via the Small-ball Method

2025-10-29

Authors:

Diego Marcondes

Annotation:

Physics-informed statistical learning (PISL) integrates empirical data with physical knowledge to enhance the statistical performance of estimators. While PISL methods are widely used in practice, a comprehensive theoretical understanding of how informed regularization affects statistical properties is still missing. Specifically, two fundamental questions have yet to be fully addressed: (1) what is the trade-off between considering soft penalties versus hard constraints, and (2) what is the sta...

ID: 2510.23149v1 stat.ML, cs.LG, math.ST, stat.TH

📄 Provable test-time adaptivity and distributional robustness of in-context learning

2025-10-29

Authors:

Tianyi Ma, Tengyao Wang, Richard J. Samworth

Annotation:

We study in-context learning problems where a Transformer is pretrained on tasks drawn from a mixture distribution $\pi=\sum_{\alpha\in\mathcal{A}} \lambda_{\alpha} \pi_{\alpha}$, called the pretraining prior, in which each mixture component $\pi_{\alpha}$ is a distribution on tasks of a specific difficulty level indexed by $\alpha$. Our goal is to understand the performance of the pretrained Transformer when evaluated on a different test distribution $\mu$, consisting of tasks of fixed difficul...

ID: 2510.23254v1 stat.ML, cs.LG, math.ST, stat.TH, 62G08, 68T07

📄 Learning Upper Lower Value Envelopes to Shape Online RL: A Principled Approach

2025-10-24

Authors:

Sebastian Reboul, Hélène Halconruy, Randal Douc

Annotation:

We investigate the fundamental problem of leveraging offline data to accelerate online reinforcement learning - a direction with strong potential but limited theoretical grounding. Our study centers on how to learn and apply value envelopes within this context. To this end, we introduce a principled two-stage framework: the first stage uses offline data to derive upper and lower bounds on value functions, while the second incorporates these learned bounds into online algorithms. Our method exten...

ID: 2510.19528v1 stat.ML, cs.LG, math.ST, stat.TH

📄 Non-asymptotic error bounds for probability flow ODEs under weak log-concavity

2025-10-22

Authors:

Gitte Kremling, Francesco Iafrate, Mahsa Taheri, Johannes Lederer

Annotation:

Score-based generative modeling, implemented through probability flow ODEs, has shown impressive results in numerous practical settings. However, most convergence guarantees rely on restrictive regularity assumptions on the target distribution -- such as strong log-concavity or bounded support. This work establishes non-asymptotic convergence bounds in the 2-Wasserstein distance for a general class of probability flow ODEs under considerably weaker assumptions: weak log-concavity and Lipschitz c...

ID: 2510.17608v1 stat.ML, cs.LG, math.ST, stat.TH

📄 The Minimax Lower Bound of Kernel Stein Discrepancy Estimation

2025-10-21

Authors:

Jose Cribeiro-Ramallo, Agnideep Aich, Florian Kalinke, Ashit Baran Aich, Zoltán Szabó

Annotation:

Kernel Stein discrepancies (KSDs) have emerged as a powerful tool for quantifying goodness-of-fit over the last decade, featuring numerous successful applications. To the best of our knowledge, all existing KSD estimators with known rate achieve $\sqrt n$-convergence. In this work, we present two complementary results (with different proof strategies), establishing that the minimax lower bound of KSD estimation is $n^{-1/2}$ and settling the optimality of these estimators. Our first result focus...

ID: 2510.15058v1 stat.ML, cs.LG, math.ST, stat.TH, 62C20 (Primary) 46E22, 62B10 (Secondary), G.3; H.1.1; I.2.6

📄 Transfer Learning for Benign Overfitting in High-Dimensional Linear Regression

2025-10-21

Authors:

Yeichan Kim, Ilmun Kim, Seyoung Park

Annotation:

Transfer learning is a key component of modern machine learning, enhancing the performance of target tasks by leveraging diverse data sources. Simultaneously, overparameterized models such as the minimum-$\ell_2$-norm interpolator (MNI) in high-dimensional linear regression have garnered significant attention for their remarkable generalization capabilities, a property known as benign overfitting. Despite their individual importance, the intersection of transfer learning and MNI remains largely ...

ID: 2510.15337v1 stat.ML, cs.LG, math.ST, stat.TH

📄 On the Optimality of the Median-of-Means Estimator under Adversarial Contamination

2025-10-11

Authors:

Xabier de Juan, Santiago Mazuelas

Annotation:

The Median-of-Means (MoM) is a robust estimator widely used in machine learning that is known to be (minimax) optimal in scenarios where samples are i.i.d. In more grave scenarios, samples are contaminated by an adversary that can inspect and modify the data. Previous work has theoretically shown the suitability of the MoM estimator in certain contaminated settings. However, the (minimax) optimality of MoM and its limitations under adversarial contamination remain unknown beyond the Gaussian cas...

ID: 2510.07867v1 stat.ML, cs.LG, math.ST, stat.TH

Showing 1-10 of 19 entries