Authors:
Hamsa Bastani, Osbert Bastani, Bryce McLaughlin
Abstract:
There has been a surge of recent interest in automatically learning policies
to target treatment decisions based on rich individual covariates. A common
approach is to train a machine learning model to predict counterfactual
outcomes, and then select the policy that optimizes the predicted objective
value. In addition, practitioners also want confidence that the learned policy
has better performance than the incumbent policy according to downstream policy
evaluation. However, due to the winner's...
📄 Calibrated Principal Component Regression
2025-10-25
Authors:
Yixuan Florence Wu, Yilun Zhu, Lei Cao, and Naichen Shi
Abstract:
We propose a new method for statistical inference in generalized linear
models. In the overparameterized regime, Principal Component Regression (PCR)
reduces variance by projecting high-dimensional data to a low-dimensional
principal subspace before fitting. However, PCR incurs truncation bias whenever
the true regression vector has mass outside the retained principal components
(PC). To mitigate the bias, we propose Calibrated Principal Component
Regression (CPCR), which first learns a low-vari...
📄 Signature Kernel Scoring Rule as Spatio-Temporal Diagnostic for Probabilistic Forecasting
2025-10-25
Authors:
Archer Dodson, Ritabrata Dutta
Abstract:
Modern weather forecasting has increasingly transitioned from numerical
weather prediction (NWP) to data-driven machine learning forecasting
techniques. While these new models produce probabilistic forecasts to quantify
uncertainty, their training and evaluation may remain hindered by conventional
scoring rules, primarily MSE, which ignore the highly correlated data
structures present in weather and atmospheric systems. This work introduces the
signature kernel scoring rule, grounded in rough pa...
📄 Extreme Event Aware ($η$-) Learning
2025-10-25
Authors:
Kai Chang, Themistoklis P. Sapsis
Abstract:
Quantifying and predicting rare and extreme events persists as a crucial yet
challenging task in understanding complex dynamical systems. Many practical
challenges arise from the infrequency and severity of these events, including
the considerable variance of simple sampling methods and the substantial
computational cost of high-fidelity numerical simulations. Numerous data-driven
methods have recently been developed to tackle these challenges. However, a
typical assumption for the success of th...
Authors:
Yixiao Wang, Zishan Shao, Ting Jiang, Aditya Devarakonda
Abstract:
We present a novel enhanced cyclic coordinate descent (ECCD) framework for
solving generalized linear models with elastic net constraints that reduces
training time in comparison to existing state-of-the-art methods. We redesign
the CD method by performing a Taylor expansion around the current iterate to
avoid nonlinear operations arising in the gradient computation. By introducing
this approximation, we are able to unroll the vector recurrences occurring in
the CD method and reformulate the res...
Authors:
Somayajulu L. N. Dhulipala, Deep Ray, Nicholas Forman
Abstract:
Simulating coupled PDE systems is computationally intensive, and prior
efforts have largely focused on training surrogates on the joint (coupled)
data, which requires a large amount of data. In the paper, we study
compositional diffusion approaches where diffusion models are only trained on
the decoupled PDE data and are composed at inference time to recover the
coupled field. Specifically, we investigate whether the compositional strategy
can be feasible under long time horizons involving a lar...
Authors:
Wei Cao, Shanshan Wang
Abstract:
Expectile regression neural networks (ERNNs) are powerful tools for capturing
heterogeneity and complex nonlinear structures in data. However, most existing
research has primarily focused on fully observed data, with limited attention
paid to scenarios involving censored observations. In this paper, we propose a
data augmentation based ERNNs algorithm, termed DAERNN, for modeling
heterogeneous censored data. The proposed DAERNN is fully data driven, requires
minimal assumptions, and offers subst...
📄 Testing Most Influential Sets
2025-10-25
Authors:
Lucas Darius Konrad, Nikolas Kuschnig
Abstract:
Small subsets of data with disproportionate influence on model outcomes can
have dramatic impacts on conclusions, with a few data points sometimes
overturning key findings. While recent work has developed methods to identify
these "most influential sets", no formal theory exists to determine when
their influence reflects genuine problems rather than natural sampling
variation. We address this gap by developing a principled framework for
assessing the statistical significance of most influen...
Authors:
Federico Lozano-Cuadra, Beatriz Soret, Marc Sanchez Net, Abhishek Cauligi, Federico Rossi
Abstract:
We present a fully decentralized routing framework for multi-robot
exploration missions operating under the constraints of a Lunar Delay-Tolerant
Network (LDTN). In this setting, autonomous rovers must relay collected data to
a lander under intermittent connectivity and unknown mobility patterns. We
formulate the problem as a Partially Observable Markov Decision Problem (POMDP)
and propose a Graph Attention-based Multi-Agent Reinforcement Learning
(GAT-MARL) policy that performs Centralized Trai...
📄 Concentration and excess risk bounds for imbalanced classification with synthetic oversampling
2025-10-25
Authors:
Touqeer Ahmad, Mohammadreza M. Kalan, François Portier, Gilles Stupfler
Abstract:
Synthetic oversampling of minority examples using SMOTE and its variants is a
leading strategy for addressing imbalanced classification problems. Despite the
success of this approach in practice, its theoretical foundations remain
underexplored. We develop a theoretical framework to analyze the behavior of
SMOTE and related methods when classifiers are trained on synthetic data. We
first derive a uniform concentration bound on the discrepancy between the
empirical risk over synthetic minority sa...