📊 Статистика дайджестов
Всего дайджестов: 34022 Добавлено сегодня: 82
Последнее обновление: сегодня
Авторы:
Jiajia Yu, Junghwan Lee, Yao Xie, Xiuyuan Cheng
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
Mean-field games (MFGs) study the Nash equilibrium of systems with a continuum of interacting agents, which can be formulated as the fixed-point of optimal control problems. They provide a unified framework for a variety of applications, including optimal transport (OT) and generative models. Despite their broad applicability, solving high-dimensional MFGs remains a significant challenge due to fundamental computational and analytical obstacles. In this work, we propose a particle-based deep Flo...
Авторы:
Tianqi Qiao, Marie Maros
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
We propose and analyze a variant of Sparse Polyak for high dimensional M-estimation problems. Sparse Polyak proposes a novel adaptive step-size rule tailored to suitably estimate the problem's curvature in the high-dimensional setting, guaranteeing that the algorithm's performance does not deteriorate when the ambient dimension increases. However, convergence guarantees can only be obtained by sacrificing solution sparsity and statistical accuracy. In this work, we introduce a variant of Sparse ...
📄 Splat Regression Models
2025-11-20Авторы:
Mara Daniels, Philippe Rigollet
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
We introduce a highly expressive class of function approximators called Splat Regression Models. Model outputs are mixtures of heterogeneous and anisotropic bump functions, termed splats, each weighted by an output vector. The power of splat modeling lies in its ability to locally adjust the scale and direction of each splat, achieving both high interpretability and accuracy. Fitting splat models reduces to optimization over the space of mixing measures, which can be implemented using Wasserstei...
📄 SCOPE: Spectral Concentration by Distributionally Robust Joint Covariance-Precision Estimation
2025-11-20Авторы:
Renjie Chen, Viet Anh Nguyen, Huifu Xu
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
We propose a distributionally robust formulation for simultaneously estimating the covariance matrix and the precision matrix of a random vector.The proposed model minimizes the worst-case weighted sum of the Frobenius loss of the covariance estimator and Stein's loss of the precision matrix estimator against all distributions from an ambiguity set centered at the nominal distribution. The radius of the ambiguity set is measured via convex spectral divergence. We demonstrate that the proposed di...
📄 The Tree-SNE Tree Exists
2025-10-21Авторы:
Jack Kendrick
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
The clustering and visualisation of high-dimensional data is a ubiquitous
task in modern data science. Popular techniques include nonlinear
dimensionality reduction methods like t-SNE or UMAP. These methods face the
`scale-problem' of clustering: when dealing with the MNIST dataset, do we want
to distinguish different digits or do we want to distinguish different ways of
writing the digits? The answer is task dependent and depends on scale. We
revisit an idea of Robinson & Pierce-Hoffman that ex...
Авторы:
Elizabeth Collins-Woodfin, Inbar Seroussi
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
We develop a framework for analyzing the training and learning rate dynamics
on a variety of high- dimensional optimization problems trained using one-pass
stochastic gradient descent (SGD) with data generated from multiple anisotropic
classes. We give exact expressions for a large class of functions of the
limiting dynamics, including the risk and the overlap with the true signal, in
terms of a deterministic solution to a system of ODEs. We extend the existing
theory of high-dimensional SGD dyn...
📄 Improved Central Limit Theorem and Bootstrap Approximations for Linear Stochastic Approximation
2025-10-16Авторы:
Bogdan Butyrin, Eric Moulines, Alexey Naumov, Sergey Samsonov, Qi-Man Shao, Zhuo-Song Zhang
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
In this paper, we refine the Berry-Esseen bounds for the multivariate normal
approximation of Polyak-Ruppert averaged iterates arising from the linear
stochastic approximation (LSA) algorithm with decreasing step size. We consider
the normal approximation by the Gaussian distribution with covariance matrix
predicted by the Polyak-Juditsky central limit theorem and establish the rate
up to order $n^{-1/3}$ in convex distance, where $n$ is the number of samples
used in the algorithm. We also prove...
📄 Efficient Group Lasso Regularized Rank Regression with Data-Driven Parameter Determination
2025-10-15Авторы:
Meixia Lin, Meijiao Shi, Yunhai Xiao, Qian Zhang
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
High-dimensional regression often suffers from heavy-tailed noise and
outliers, which can severely undermine the reliability of least-squares based
methods. To improve robustness, we adopt a non-smooth Wilcoxon score based rank
objective and incorporate structured group sparsity regularization, a natural
generalization of the lasso, yielding a group lasso regularized rank regression
method. By extending the tuning-free parameter selection scheme originally
developed for the lasso, we introduce a...
Авторы:
Ke Xu, Yuefeng Han
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
Recovering a low-CP-rank tensor from noisy linear measurements is a central
challenge in high-dimensional data analysis, with applications spanning tensor
PCA, tensor regression, and beyond. We exploit the intrinsic geometry of
rank-one tensors by casting the recovery task as an optimization problem over
the Segre manifold, the smooth Riemannian manifold of rank-one tensors. This
geometric viewpoint yields two powerful algorithms: Riemannian Gradient Descent
(RGD) and Riemannian Gauss-Newton (RG...
Авторы:
Anil Kamber, Rahul Parhi
Саммари на русском не найдено
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Доступные поля: ['id', 'arxiv_id', 'title', 'authors', 'abstract', 'summary_ru', 'categories', 'published_date', 'created_at']
Annotation:
Understanding the geometry of the loss landscape near a minimum is key to
explaining the implicit bias of gradient-based methods in non-convex
optimization problems such as deep neural network training and deep matrix
factorization. A central quantity to characterize this geometry is the maximum
eigenvalue of the Hessian of the loss, which measures the sharpness of the
landscape. Currently, its precise role has been obfuscated because no exact
expressions for this sharpness measure were known in...
Показано 1 -
10
из 18 записей