📄 Efficiently Generating Correlated Sample Paths from Multi-step Time Series Foundation Models
2025-10-04
Authors:
Ethan Baron, Boris Oreshkin, Ruijun Ma, Hanyu Zhang, Kari Torkkola, Michael W. Mahoney, Andrew Gordon Wilson, Tatiana Konstantinova
Annotation:
Many time series applications require access to multi-step forecast
trajectories in the form of sample paths. Recently, time series foundation
models have leveraged multi-step lookahead predictions to improve the quality
and efficiency of multi-step forecasts. However, these models only predict
independent marginal distributions for each time step, rather than a full joint
predictive distribution. To generate forecast sample paths with realistic
correlation structures, one typically resorts to a...
Authors:
Nicholas Barnfield, Hugo Cui, Yue M. Lu
Annotation:
When and how can an attention mechanism learn to selectively attend to
informative tokens, thereby enabling detection of weak, rare, and sparsely
located features? We address these questions theoretically in a sparse-token
classification model in which positive samples embed a weak signal vector in a
randomly chosen subset of tokens, whereas negative samples are pure noise. In
the long-sequence limit, we show that a simple single-layer attention
classifier can in principle achieve vanishing test...
Authors:
Shuang Liang, Guido Montúfar
Annotation:
We examine gradient descent in matrix factorization and show that under large
step sizes the parameter space develops a fractal structure. We derive the
exact critical step size for convergence in scalar-vector factorization and
show that near criticality the selected minimizer depends sensitively on the
initialization. Moreover, we show that adding regularization amplifies this
sensitivity, generating a fractal boundary between initializations that
converge and those that diverge. The analysis ...
📄 Flow Matching with Semidiscrete Couplings
2025-10-03
Authors:
Alireza Mousavi-Hosseini, Stephen Y. Zhang, Michal Klein, Marco Cuturi
Annotation:
Flow models parameterized as time-dependent velocity fields can generate data
from noise by integrating an ODE. These models are often trained using flow
matching, i.e. by sampling random pairs of noise and target points
$(\mathbf{x}_0,\mathbf{x}_1)$ and ensuring that the velocity field is aligned,
on average, with $\mathbf{x}_1-\mathbf{x}_0$ when evaluated along a segment
linking $\mathbf{x}_0$ to $\mathbf{x}_1$. While these pairs are sampled
independently by default, they can also be selected ...
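
The training recipe described in this abstract can be illustrated with a minimal sketch. It assumes the default independent coupling, a squared-error alignment objective, and time drawn uniformly from [0, 1); the helper names `interpolate` and `flow_matching_loss`, the toy 2-D data, and the `zero_field` baseline are hypothetical and not taken from the paper.

```python
import random

def interpolate(x0, x1, t):
    """Point on the straight segment from x0 (t=0) to x1 (t=1)."""
    return [(1.0 - t) * a + t * b for a, b in zip(x0, x1)]

def flow_matching_loss(velocity, pairs, rng):
    """Monte Carlo estimate of E || v(t, x_t) - (x1 - x0) ||^2.

    Assumed objective: squared error between the velocity field and the
    displacement x1 - x0, evaluated at a random point of the segment.
    """
    total = 0.0
    for x0, x1 in pairs:
        t = rng.random()                          # time in [0, 1)
        x_t = interpolate(x0, x1, t)              # point on the segment
        target = [b - a for a, b in zip(x0, x1)]  # displacement x1 - x0
        pred = velocity(t, x_t)
        total += sum((p - g) ** 2 for p, g in zip(pred, target))
    return total / len(pairs)

rng = random.Random(0)

# Independent coupling: noise and target points are drawn separately.
noise = [[rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)] for _ in range(256)]
data = [[3.0 + 0.1 * rng.gauss(0.0, 1.0), -1.0 + 0.1 * rng.gauss(0.0, 1.0)]
        for _ in range(256)]
pairs = list(zip(noise, data))

# Hypothetical baseline field that predicts zero velocity everywhere.
zero_field = lambda t, x: [0.0, 0.0]
loss = flow_matching_loss(zero_field, pairs, rng)
```

In practice `velocity` would be a trainable model and this loss would be minimized by gradient descent; changing how `pairs` is built is where a non-independent coupling would enter.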
Authors:
Yichi Zhang, Fangzheng Xie, Shu Yang, Chong Wu
Annotation:
In language tasks that require extensive human--model interaction, deploying
a single "best" model for every query can be expensive. To reduce inference
cost while preserving the quality of the responses, a large language model
(LLM) router selects the most appropriate model from a pool of candidates for
each query. A central challenge to training a high-quality router is the
scarcity of reliable supervision. Gold-standard data (e.g., expert-verified
labels or rubric-based scores) provide accura...
Authors:
Jianyu Xu, Vidhi Jain, Bryan Wilder, Aarti Singh
Annotation:
With advances in generative AI, decision-making agents can now dynamically
create new actions during online learning, but action generation typically
incurs costs that must be balanced against potential benefits. We study an
online learning problem where an agent can generate new actions at any time
step by paying a one-time cost, with these actions becoming permanently
available for future use. The challenge lies in learning the optimal sequence
of two-fold decisions: which action to take and w...
Authors:
Jingqi Fan, Canzhe Zhao, Shuai Li, Siwei Wang
Annotation:
In recent years, multi-player multi-armed bandits (MP-MAB) have been
extensively studied due to their wide applications in cognitive radio networks
and Internet of Things systems. While most existing research on MP-MAB focuses
on synchronized settings, real-world systems are often decentralized and
asynchronous, where players may enter or leave the system at arbitrary times,
and do not have a global clock. This decentralized asynchronous setting
introduces two major challenges. First, without a ...
📄 Informed Asymmetric Actor-Critic: Leveraging Privileged Signals Beyond Full-State Access
2025-10-02
Authors:
Daniel Ebi, Gaspard Lambrechts, Damien Ernst, Klemens Böhm
Annotation:
Reinforcement learning in partially observable environments requires agents
to act under uncertainty from noisy, incomplete observations. Asymmetric
actor-critic methods leverage privileged information during training to improve
learning under these conditions. However, existing approaches typically assume
full-state access during training. In this work, we challenge this assumption
by proposing a novel actor-critic framework, called informed asymmetric
actor-critic, that enables conditioning th...
Authors:
Margarita A. Guerrero, Cristian R. Rojas
Annotation:
Counterfactual Explanations (CFEs) interpret machine learning models by
identifying the smallest change to input features needed to change the model's
prediction to a desired output. For classification tasks, CFEs determine how
close a given sample is to the decision boundary of a trained classifier.
Existing methods are often sample-inefficient, requiring numerous evaluations
of a black-box model -- an approach that is both costly and impractical when
access to the model is limited. We propose ...
Authors:
Sven Dummer, Tjeerd Jan Heeringa, José A. Iglesias
Annotation:
Recently, there has been growing interest in characterizing the function
spaces underlying neural networks. While shallow and deep scalar-valued neural
networks have been linked to scalar-valued reproducing kernel Banach spaces
(RKBS), $\mathbb{R}^d$-valued neural networks and neural operator models remain
less understood in the RKBS setting. To address this gap, we develop a general
definition of vector-valued RKBS (vv-RKBS), which inherently includes the
associated reproducing kernel. Our cons...