📊 Статистика дайджестов
Всего дайджестов: 34022 Добавлено сегодня: 0
Последнее обновление: сегодня
Авторы:
Tianyi Ma, Tengyao Wang, Richard J. Samworth
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
We study in-context learning problems where a Transformer is pretrained on
tasks drawn from a mixture distribution $\pi=\sum_{\alpha\in\mathcal{A}}
\lambda_{\alpha} \pi_{\alpha}$, called the pretraining prior, in which each
mixture component $\pi_{\alpha}$ is a distribution on tasks of a specific
difficulty level indexed by $\alpha$. Our goal is to understand the performance
of the pretrained Transformer when evaluated on a different test distribution
$\mu$, consisting of tasks of fixed difficul...
Авторы:
William Réveillard, Vasileios Saketos, Alexandre Proutiere, Richard Combes
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
We introduce and study an online problem arising in question answering
systems. In this problem, an agent must sequentially classify user-submitted
queries represented by $d$-dimensional embeddings drawn i.i.d. from an unknown
distribution. The agent may consult a costly human expert for the correct
label, or guess on her own without receiving feedback. The goal is to minimize
regret against an oracle with free expert access. When the time horizon $T$ is
at least exponential in the embedding dim...
Авторы:
Roxanne Holden, Luana Ruiz
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
Graphons, as limits of graph sequences, provide a framework for analyzing the
asymptotic behavior of graph neural operators. Spectral convergence of sampled
graphs to graphons yields operator-level convergence rates, enabling
transferability analyses of GNNs. This note summarizes known bounds under no
assumptions, global Lipschitz continuity, and piecewise-Lipschitz continuity,
highlighting tradeoffs between assumptions and rates, and illustrating their
empirical tightness on synthetic and real ...
Авторы:
Kyungseon Lee, Kunwoong Kim, Jihu Lee, Dongyoon Yang, Yongdai Kim
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
Algorithmic fairness is a socially crucial topic in real-world applications
of AI.
Among many notions of fairness, subgroup fairness is widely studied when
multiple sensitive attributes (e.g., gender, race, age) are present.
However, as the number of sensitive attributes grows, the number of subgroups
increases accordingly, creating heavy computational burdens and data sparsity
problem (subgroups with too small sizes).
In this paper, we develop a novel learning algorithm for subgroup fairn...
📄 Enforcing Calibration in Multi-Output Probabilistic Regression with Pre-rank Regularization
2025-10-28Авторы:
Naomi Desobry, Elnura Zhalieva, Souhaib Ben Taieb
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
Probabilistic models must be well calibrated to support reliable
decision-making. While calibration in single-output regression is well studied,
defining and achieving multivariate calibration in multi-output regression
remains considerably more challenging. The existing literature on multivariate
calibration primarily focuses on diagnostic tools based on pre-rank functions,
which are projections that reduce multivariate prediction-observation pairs to
univariate summaries to detect specific typ...
Авторы:
Jung-hun Kim, Milan Vojnović, Min-hwan Oh
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
We study the combinatorial semi-bandit problem where an agent selects a
subset of base arms and receives individual feedback. While this generalizes
the classical multi-armed bandit and has broad applicability, its scalability
is limited by the high cost of combinatorial optimization, requiring oracle
queries at every round. To tackle this, we propose oracle-efficient frameworks
that significantly reduce oracle calls while maintaining tight regret
guarantees. For the worst-case linear reward set...
Авторы:
Johann Flemming Gloy, Simon Olsson
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
Flow and diffusion-based models have emerged as powerful tools for scientific
applications, particularly for sampling non-normalized probability
distributions, as exemplified by Boltzmann Generators (BGs). A critical
challenge in deploying these models is their reliance on sample likelihood
computations, which scale prohibitively with system size $n$, often rendering
them infeasible for large-scale problems. To address this, we introduce
$\textit{HollowFlow}$, a flow-based generative model lever...
Авторы:
Diana Cai, Robert M. Gower, David M. Blei, Lawrence K. Saul
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
We introduce a highly expressive yet distinctly tractable family for
black-box variational inference (BBVI). Each member of this family is a
weighted product of experts (PoE), and each weighted expert in the product is
proportional to a multivariate $t$-distribution. These products of experts can
model distributions with skew, heavy tails, and multiple modes, but to use them
for BBVI, we must be able to sample from their densities. We show how to do
this by reformulating these products of expert...
Авторы:
Raheem Karim Hashmani, Garrett W. Merz, Helen Qu, Mariel Pettee, Kyle Cranmer
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
We introduce a framework for generating highly multimodal datasets with
explicitly calculable mutual information between modalities. This enables the
construction of benchmark datasets that provide a novel testbed for systematic
studies of mutual information estimators and multimodal self-supervised
learning techniques. Our framework constructs realistic datasets with known
mutual information using a flow-based generative model and a structured causal
framework for generating correlated latent v...
📄 Testing Most Influential Sets
2025-10-27Авторы:
Lucas Darius Konrad, Nikolas Kuschnig
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
Small subsets of data with disproportionate influence on model outcomes can
have dramatic impacts on conclusions, with a few data points sometimes
overturning key findings. While recent work has developed methods to identify
these most influential sets, no formal theory exists to determine when their
influence reflects genuine problems rather than natural sampling variation. We
address this gap by developing a principled framework for assessing the
statistical significance of most influential se...
Показано 181 -
190
из 564 записей