📊 Статистика дайджестов

Всего дайджестов: 34022 Добавлено сегодня: 0

Последнее обновление: сегодня

📄 Beating the Winner's Curse via Inference-Aware Policy Optimization

2025-10-25

Авторы:

Hamsa Bastani, Osbert Bastani, Bryce McLaughlin

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

There has been a surge of recent interest in automatically learning policies to target treatment decisions based on rich individual covariates. A common approach is to train a machine learning model to predict counterfactual outcomes, and then select the policy that optimizes the predicted objective value. In addition, practitioners also want confidence that the learned policy has better performance than the incumbent policy according to downstream policy evaluation. However, due to the winner's...

ID: 2510.18161v2 stat.ML, cs.LG, econ.EM

arXiv PDF

📄 Calibrated Principal Component Regression

2025-10-25

Авторы:

Yixuan Florence Wu, Yilun Zhu, Lei Cao and, Naichen Shi

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

We propose a new method for statistical inference in generalized linear models. In the overparameterized regime, Principal Component Regression (PCR) reduces variance by projecting high-dimensional data to a low-dimensional principal subspace before fitting. However, PCR incurs truncation bias whenever the true regression vector has mass outside the retained principal components (PC). To mitigate the bias, we propose Calibrated Principal Component Regression (CPCR), which first learns a low-vari...

ID: 2510.19020v1 stat.ML, cs.LG

arXiv PDF

📄 Signature Kernel Scoring Rule as Spatio-Temporal Diagnostic for Probabilistic Forecasting

2025-10-25

Авторы:

Archer Dodson, Ritabrata Dutta

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Modern weather forecasting has increasingly transitioned from numerical weather prediction (NWP) to data-driven machine learning forecasting techniques. While these new models produce probabilistic forecasts to quantify uncertainty, their training and evaluation may remain hindered by conventional scoring rules, primarily MSE, which ignore the highly correlated data structures present in weather and atmospheric systems. This work introduces the signature kernel scoring rule, grounded in rough pa...

ID: 2510.19110v1 stat.ML, cs.LG, stat.AP

arXiv PDF

📄 Extreme Event Aware ($η$-) Learning

2025-10-25

Авторы:

Kai Chang, Themistoklis P. Sapsis

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Quantifying and predicting rare and extreme events persists as a crucial yet challenging task in understanding complex dynamical systems. Many practical challenges arise from the infrequency and severity of these events, including the considerable variance of simple sampling methods and the substantial computational cost of high-fidelity numerical simulations. Numerous data-driven methods have recently been developed to tackle these challenges. However, a typical assumption for the success of th...

ID: 2510.19161v1 stat.ML, cs.LG, cs.NA, math.DS, math.NA

arXiv PDF

📄 Enhanced Cyclic Coordinate Descent Methods for Elastic Net Penalized Linear Models

2025-10-25

Авторы:

Yixiao Wang, Zishan Shao, Ting Jiang, Aditya Devarakonda

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

We present a novel enhanced cyclic coordinate descent (ECCD) framework for solving generalized linear models with elastic net constraints that reduces training time in comparison to existing state-of-the-art methods. We redesign the CD method by performing a Taylor expansion around the current iterate to avoid nonlinear operations arising in the gradient computation. By introducing this approximation, we are able to unroll the vector recurrences occurring in the CD method and reformulate the res...

ID: 2510.19999v1 stat.ML, cs.LG, cs.MS, cs.NA, math.NA, stat.AP

arXiv PDF

📄 Compositional Generation for Long-Horizon Coupled PDEs

2025-10-25

Авторы:

Somayajulu L. N. Dhulipala, Deep Ray, Nicholas Forman

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Simulating coupled PDE systems is computationally intensive, and prior efforts have largely focused on training surrogates on the joint (coupled) data, which requires a large amount of data. In the paper, we study compositional diffusion approaches where diffusion models are only trained on the decoupled PDE data and are composed at inference time to recover the coupled field. Specifically, we investigate whether the compositional strategy can be feasible under long time horizons involving a lar...

ID: 2510.20141v1 stat.ML, cs.LG

arXiv PDF

📄 Neural Networks for Censored Expectile Regression Based on Data Augmentation

2025-10-25

Авторы:

Wei Cao, Shanshan Wang

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Expectile regression neural networks (ERNNs) are powerful tools for capturing heterogeneity and complex nonlinear structures in data. However, most existing research has primarily focused on fully observed data, with limited attention paid to scenarios involving censored observations. In this paper, we propose a data augmentation based ERNNs algorithm, termed DAERNN, for modeling heterogeneous censored data. The proposed DAERNN is fully data driven, requires minimal assumptions, and offers subst...

ID: 2510.20344v1 stat.ML, cs.LG

arXiv PDF

📄 Testing Most Influential Sets

2025-10-25

Авторы:

Lucas Darius Konrad, Nikolas Kuschnig

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Small subsets of data with disproportionate influence on model outcomes can have dramatic impacts on conclusions, with a few data points sometimes overturning key findings. While recent work has developed methods to identify these \emph{most influential sets}, no formal theory exists to determine when their influence reflects genuine problems rather than natural sampling variation. We address this gap by developing a principled framework for assessing the statistical significance of most influen...

ID: 2510.20372v1 stat.ML, cs.LG, econ.EM, math.ST, stat.ME, stat.TH

arXiv PDF

📄 Learning Decentralized Routing Policies via Graph Attention-based Multi-Agent Reinforcement Learning in Lunar Delay-Tolerant Networks

2025-10-25

Авторы:

Federico Lozano-Cuadra, Beatriz Soret, Marc Sanchez Net, Abhishek Cauligi, Federico Rossi

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

We present a fully decentralized routing framework for multi-robot exploration missions operating under the constraints of a Lunar Delay-Tolerant Network (LDTN). In this setting, autonomous rovers must relay collected data to a lander under intermittent connectivity and unknown mobility patterns. We formulate the problem as a Partially Observable Markov Decision Problem (POMDP) and propose a Graph Attention-based Multi-Agent Reinforcement Learning (GAT-MARL) policy that performs Centralized Trai...

ID: 2510.20436v1 stat.ML, cs.LG

arXiv PDF

📄 Concentration and excess risk bounds for imbalanced classification with synthetic oversampling

2025-10-25

Авторы:

Touqeer Ahmad, Mohammadreza M. Kalan, François Portier, Gilles Stupfler

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Synthetic oversampling of minority examples using SMOTE and its variants is a leading strategy for addressing imbalanced classification problems. Despite the success of this approach in practice, its theoretical foundations remain underexplored. We develop a theoretical framework to analyze the behavior of SMOTE and related methods when classifiers are trained on synthetic data. We first derive a uniform concentration bound on the discrepancy between the empirical risk over synthetic minority sa...

ID: 2510.20472v1 stat.ML, cs.LG

arXiv PDF

Показано 191 - 200 из 564 записей