📊 Статистика дайджестов

Всего дайджестов: 34022 Добавлено сегодня: 82

Последнее обновление: сегодня

📄 High-dimensional Mean-Field Games by Particle-based Flow Matching

2025-12-04

Авторы:

Jiajia Yu, Junghwan Lee, Yao Xie, Xiuyuan Cheng

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Mean-field games (MFGs) study the Nash equilibrium of systems with a continuum of interacting agents, which can be formulated as the fixed-point of optimal control problems. They provide a unified framework for a variety of applications, including optimal transport (OT) and generative models. Despite their broad applicability, solving high-dimensional MFGs remains a significant challenge due to fundamental computational and analytical obstacles. In this work, we propose a particle-based deep Flo...

ID: 2512.01172v1 stat.ML, cs.LG, math.OC

arXiv PDF

📄 Sparse Polyak with optimal thresholding operators for high-dimensional M-estimation

2025-11-26

Авторы:

Tianqi Qiao, Marie Maros

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

We propose and analyze a variant of Sparse Polyak for high dimensional M-estimation problems. Sparse Polyak proposes a novel adaptive step-size rule tailored to suitably estimate the problem's curvature in the high-dimensional setting, guaranteeing that the algorithm's performance does not deteriorate when the ambient dimension increases. However, convergence guarantees can only be obtained by sacrificing solution sparsity and statistical accuracy. In this work, we introduce a variant of Sparse ...

ID: 2511.18167v1 stat.ML, cs.LG, math.OC

arXiv PDF

📄 Splat Regression Models

2025-11-20

Авторы:

Mara Daniels, Philippe Rigollet

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

We introduce a highly expressive class of function approximators called Splat Regression Models. Model outputs are mixtures of heterogeneous and anisotropic bump functions, termed splats, each weighted by an output vector. The power of splat modeling lies in its ability to locally adjust the scale and direction of each splat, achieving both high interpretability and accuracy. Fitting splat models reduces to optimization over the space of mixing measures, which can be implemented using Wasserstei...

ID: 2511.14042v1 stat.ML, cs.LG, math.OC

arXiv PDF

📄 SCOPE: Spectral Concentration by Distributionally Robust Joint Covariance-Precision Estimation

2025-11-20

Авторы:

Renjie Chen, Viet Anh Nguyen, Huifu Xu

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

We propose a distributionally robust formulation for simultaneously estimating the covariance matrix and the precision matrix of a random vector.The proposed model minimizes the worst-case weighted sum of the Frobenius loss of the covariance estimator and Stein's loss of the precision matrix estimator against all distributions from an ambiguity set centered at the nominal distribution. The radius of the ambiguity set is measured via convex spectral divergence. We demonstrate that the proposed di...

ID: 2511.14146v1 stat.ML, cs.LG, math.OC

arXiv PDF

📄 The Tree-SNE Tree Exists

2025-10-21

Авторы:

Jack Kendrick

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

The clustering and visualisation of high-dimensional data is a ubiquitous task in modern data science. Popular techniques include nonlinear dimensionality reduction methods like t-SNE or UMAP. These methods face the `scale-problem' of clustering: when dealing with the MNIST dataset, do we want to distinguish different digits or do we want to distinguish different ways of writing the digits? The answer is task dependent and depends on scale. We revisit an idea of Robinson & Pierce-Hoffman that ex...

ID: 2510.15014v1 stat.ML, cs.LG, math.OC

arXiv PDF

📄 Exact Dynamics of Multi-class Stochastic Gradient Descent

2025-10-19

Авторы:

Elizabeth Collins-Woodfin, Inbar Seroussi

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

We develop a framework for analyzing the training and learning rate dynamics on a variety of high- dimensional optimization problems trained using one-pass stochastic gradient descent (SGD) with data generated from multiple anisotropic classes. We give exact expressions for a large class of functions of the limiting dynamics, including the risk and the overlap with the true signal, in terms of a deterministic solution to a system of ODEs. We extend the existing theory of high-dimensional SGD dyn...

ID: 2510.14074v1 stat.ML, cs.LG, math.OC, math.PR, 60H30

arXiv PDF

📄 Improved Central Limit Theorem and Bootstrap Approximations for Linear Stochastic Approximation

2025-10-16

Авторы:

Bogdan Butyrin, Eric Moulines, Alexey Naumov, Sergey Samsonov, Qi-Man Shao, Zhuo-Song Zhang

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

In this paper, we refine the Berry-Esseen bounds for the multivariate normal approximation of Polyak-Ruppert averaged iterates arising from the linear stochastic approximation (LSA) algorithm with decreasing step size. We consider the normal approximation by the Gaussian distribution with covariance matrix predicted by the Polyak-Juditsky central limit theorem and establish the rate up to order $n^{-1/3}$ in convex distance, where $n$ is the number of samples used in the algorithm. We also prove...

ID: 2510.12375v1 stat.ML, cs.LG, math.OC, math.PR, math.ST, stat.TH, 60F05, 62L20, 62E20

arXiv PDF

📄 Efficient Group Lasso Regularized Rank Regression with Data-Driven Parameter Determination

2025-10-15

Авторы:

Meixia Lin, Meijiao Shi, Yunhai Xiao, Qian Zhang

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

High-dimensional regression often suffers from heavy-tailed noise and outliers, which can severely undermine the reliability of least-squares based methods. To improve robustness, we adopt a non-smooth Wilcoxon score based rank objective and incorporate structured group sparsity regularization, a natural generalization of the lasso, yielding a group lasso regularized rank regression method. By extending the tuning-free parameter selection scheme originally developed for the lasso, we introduce a...

ID: 2510.11546v1 stat.ML, cs.LG, math.OC, math.ST, stat.TH

arXiv PDF

📄 Guaranteed Noisy CP Tensor Recovery via Riemannian Optimization on the Segre Manifold

2025-10-04

Авторы:

Ke Xu, Yuefeng Han

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Recovering a low-CP-rank tensor from noisy linear measurements is a central challenge in high-dimensional data analysis, with applications spanning tensor PCA, tensor regression, and beyond. We exploit the intrinsic geometry of rank-one tensors by casting the recovery task as an optimization problem over the Segre manifold, the smooth Riemannian manifold of rank-one tensors. This geometric viewpoint yields two powerful algorithms: Riemannian Gradient Descent (RGD) and Riemannian Gauss-Newton (RG...

ID: 2510.00569v1 stat.ML, cs.LG, math.OC, math.ST, stat.ME, stat.TH, 90C26 (Primary) 15A69, 62F10, 62J05, 62H25 (Secondary)

arXiv PDF

📄 Sharpness of Minima in Deep Matrix Factorization: Exact Expressions

2025-10-02

Авторы:

Anil Kamber, Rahul Parhi

Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']

Annotation:

Understanding the geometry of the loss landscape near a minimum is key to explaining the implicit bias of gradient-based methods in non-convex optimization problems such as deep neural network training and deep matrix factorization. A central quantity to characterize this geometry is the maximum eigenvalue of the Hessian of the loss, which measures the sharpness of the landscape. Currently, its precise role has been obfuscated because no exact expressions for this sharpness measure were known in...

ID: 2509.25783v1 stat.ML, cs.LG, math.OC

arXiv PDF

Показано 1 - 10 из 18 записей