Fast Agnostic Learners in the Plane
2510.18057v1
cs.DS, cs.CG, cs.LG, stat.ML
2025-10-23
Авторы:
Talya Eden, Ludmila Glinskih, Sofya Raskhodnikova
Abstract
We investigate the computational efficiency of agnostic learning for several
fundamental geometric concept classes in the plane. While the sample complexity
of agnostic learning is well understood, its time complexity has received much
less attention. We study the class of triangles and, more generally, the class
of convex polygons with $k$ vertices for small $k$, as well as the class of
convex sets in a square. We present a proper agnostic learner for the class of
triangles that has optimal sample complexity and runs in time $\tilde
O({\epsilon^{-6}})$, improving on the algorithm of Dobkin and Gunopulos (COLT
`95) that runs in time $\tilde O({\epsilon^{-10}})$. For 4-gons and 5-gons, we
improve the running time from $O({\epsilon^{-12}})$, achieved by Fischer and
Kwek (eCOLT `96), to $\tilde O({\epsilon^{-8}})$ and $\tilde
O({\epsilon^{-10}})$, respectively.
We also design a proper agnostic learner for convex sets under the uniform
distribution over a square with running time $\tilde O({\epsilon^{-5}})$,
improving on the previous $\tilde O(\epsilon^{-8})$ bound at the cost of
slightly higher sample complexity. Notably, agnostic learning of convex sets in
$[0,1]^2$ under general distributions is impossible because this concept class
has infinite VC-dimension. Our agnostic learners use data structures and
algorithms from computational geometry and their analysis relies on tools from
geometry and probabilistic combinatorics. Because our learners are proper, they
yield tolerant property testers with matching running times. Our results raise
a fundamental question of whether a gap between the sample and time complexity
is inherent for agnostic learning of these and other natural concept classes.