Structure-Aware Spectral Sparsification via Uniform Edge Sampling

2510.12669v1 cs.LG, cs.DS 2025-10-16

Авторы:

Kaiwen He, Petros Drineas, Rajiv Khanna

Abstract

Spectral clustering is a fundamental method for graph partitioning, but its reliance on eigenvector computation limits scalability to massive graphs. Classical sparsification methods preserve spectral properties by sampling edges proportionally to their effective resistances, but require expensive preprocessing to estimate these resistances. We study whether uniform edge sampling-a simple, structure-agnostic strategy-can suffice for spectral clustering. Our main result shows that for graphs admitting a well-separated $k$-clustering, characterized by a large structure ratio $\Upsilon(k) = \lambda_{k+1} / \rho_G(k)$, uniform sampling preserves the spectral subspace used for clustering. Specifically, we prove that uniformly sampling $O(\gamma^2 n \log n / \epsilon^2)$ edges, where $\gamma$ is the Laplacian condition number, yields a sparsifier whose top $(n-k)$-dimensional eigenspace is approximately orthogonal to the cluster indicators. This ensures that the spectral embedding remains faithful, and clustering quality is preserved. Our analysis introduces new resistance bounds for intra-cluster edges, a rank-$(n-k)$ effective resistance formulation, and a matrix Chernoff bound adapted to the dominant eigenspace. These tools allow us to bypass importance sampling entirely. Conceptually, our result connects recent coreset-based clustering theory to spectral sparsification, showing that under strong clusterability, even uniform sampling is structure-aware. This provides the first provable guarantee that uniform edge sampling suffices for structure-preserving spectral clustering.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

Structure-Aware Spectral Sparsification via Uniform Edge Sampling

Авторы:

Abstract

Ссылки и действия

Связанные статьи

Dynamic Algorithm for Explainable k-medians Clustering under lp Norm

Limitations of Membership Queries in Testable Learning

Learning-Augmented Online Bipartite Matching in the Random Arrival Order Model

Learning Intersections of Halfspaces under Factorizable Distribution

Tight Differentially Private PCA via Matrix Coherence

Навигация