📊 Статистика дайджестов
Всего дайджестов: 34022 Добавлено сегодня: 82
Последнее обновление: сегодня
Авторы:
Diego Martinez-Taboada, Tomas Gonzalez, Aaditya Ramdas
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
The study of self-normalized processes plays a crucial role in a wide range
of applications, from sequential decision-making to econometrics. While the
behavior of self-normalized concentration has been widely investigated for
scalar-valued processes, vector-valued processes remain comparatively
underexplored, especially outside of the sub-Gaussian framework. In this
contribution, we provide concentration bounds for self-normalized processes
with light tails beyond sub-Gaussianity (such as Benne...
Авторы:
Yuchen Jiao, Yuchen Zhou, Gen Li
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
Score-based diffusion models have achieved remarkable empirical success in
generating high-quality samples from target data distributions. Among them, the
Denoising Diffusion Probabilistic Model (DDPM) is one of the most widely used
samplers, generating samples via estimated score functions. Despite its
empirical success, a tight theoretical understanding of DDPM -- especially its
convergence properties -- remains limited.
In this paper, we provide a refined convergence analysis of the DDPM sa...
Авторы:
William Réveillard, Richard Combes
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
We consider a stochastic multi-armed bandit problem with i.i.d. rewards where
the expected reward function is multimodal with at most m modes. We propose the
first known computationally tractable algorithm for computing the solution to
the Graves-Lai optimization problem, which in turn enables the implementation
of asymptotically optimal algorithms for this bandit problem. The code for the
proposed algorithms is publicly available at
https://github.com/wilrev/MultimodalBandits
Авторы:
Diego Marcondes
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
Physics-informed statistical learning (PISL) integrates empirical data with
physical knowledge to enhance the statistical performance of estimators. While
PISL methods are widely used in practice, a comprehensive theoretical
understanding of how informed regularization affects statistical properties is
still missing. Specifically, two fundamental questions have yet to be fully
addressed: (1) what is the trade-off between considering soft penalties versus
hard constraints, and (2) what is the sta...
Авторы:
Tianyi Ma, Tengyao Wang, Richard J. Samworth
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
We study in-context learning problems where a Transformer is pretrained on
tasks drawn from a mixture distribution $\pi=\sum_{\alpha\in\mathcal{A}}
\lambda_{\alpha} \pi_{\alpha}$, called the pretraining prior, in which each
mixture component $\pi_{\alpha}$ is a distribution on tasks of a specific
difficulty level indexed by $\alpha$. Our goal is to understand the performance
of the pretrained Transformer when evaluated on a different test distribution
$\mu$, consisting of tasks of fixed difficul...
Авторы:
Sebastian Reboul, Hélène Halconruy, Randal Douc
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
We investigate the fundamental problem of leveraging offline data to
accelerate online reinforcement learning - a direction with strong potential
but limited theoretical grounding. Our study centers on how to learn and apply
value envelopes within this context. To this end, we introduce a principled
two-stage framework: the first stage uses offline data to derive upper and
lower bounds on value functions, while the second incorporates these learned
bounds into online algorithms. Our method exten...
Авторы:
Gitte Kremling, Francesco Iafrate, Mahsa Taheri, Johannes Lederer
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
Score-based generative modeling, implemented through probability flow ODEs,
has shown impressive results in numerous practical settings. However, most
convergence guarantees rely on restrictive regularity assumptions on the target
distribution -- such as strong log-concavity or bounded support. This work
establishes non-asymptotic convergence bounds in the 2-Wasserstein distance for
a general class of probability flow ODEs under considerably weaker assumptions:
weak log-concavity and Lipschitz c...
Авторы:
Jose Cribeiro-Ramallo, Agnideep Aich, Florian Kalinke, Ashit Baran Aich, Zoltán Szabó
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
Kernel Stein discrepancies (KSDs) have emerged as a powerful tool for
quantifying goodness-of-fit over the last decade, featuring numerous successful
applications. To the best of our knowledge, all existing KSD estimators with
known rate achieve $\sqrt n$-convergence. In this work, we present two
complementary results (with different proof strategies), establishing that the
minimax lower bound of KSD estimation is $n^{-1/2}$ and settling the optimality
of these estimators. Our first result focus...
Авторы:
Yeichan Kim, Ilmun Kim, Seyoung Park
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
Transfer learning is a key component of modern machine learning, enhancing
the performance of target tasks by leveraging diverse data sources.
Simultaneously, overparameterized models such as the minimum-$\ell_2$-norm
interpolator (MNI) in high-dimensional linear regression have garnered
significant attention for their remarkable generalization capabilities, a
property known as benign overfitting. Despite their individual importance, the
intersection of transfer learning and MNI remains largely ...
Авторы:
Xabier de Juan, Santiago Mazuelas
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
The Median-of-Means (MoM) is a robust estimator widely used in machine
learning that is known to be (minimax) optimal in scenarios where samples are
i.i.d. In more grave scenarios, samples are contaminated by an adversary that
can inspect and modify the data. Previous work has theoretically shown the
suitability of the MoM estimator in certain contaminated settings. However, the
(minimax) optimality of MoM and its limitations under adversarial contamination
remain unknown beyond the Gaussian cas...
Показано 1 -
10
из 19 записей