Some theoretical improvements on the tightness of PAC-Bayes risk certificates for neural networks

2510.07935v1 cs.LG, cs.IT, math.IT, stat.ML 2025-10-11
Авторы:

Diego García-Pérez, Emilio Parrado-Hernández, John Shawe-Taylor

Abstract

This paper presents four theoretical contributions that improve the usability of risk certificates for neural networks based on PAC-Bayes bounds. First, two bounds on the KL divergence between Bernoulli distributions enable the derivation of the tightest explicit bounds on the true risk of classifiers across different ranges of empirical risk. The paper next focuses on the formalization of an efficient methodology based on implicit differentiation that enables the introduction of the optimization of PAC-Bayesian risk certificates inside the loss/objective function used to fit the network/model. The last contribution is a method to optimize bounds on non-differentiable objectives such as the 0-1 loss. These theoretical contributions are complemented with an empirical evaluation on the MNIST and CIFAR-10 datasets. In fact, this paper presents the first non-vacuous generalization bounds on CIFAR-10 for neural networks.

Ссылки и действия

Связанные статьи

On optimal solutions of classical and sliced Wasserstein GANs with non-Gaussian ...

## Контекст Генерирующая аддитивная сеть (GAN) представляет собой модель, которая призвана аппроксимировать неизвестную ...

2025-09-10

The alpha-beta divergence for real and complex data

## Контекст Область исследования связана с разработкой методов для обработки и анализа сигналов, где ключевой аспект явл...

2025-08-09