RankSEG-RMA: An Efficient Segmentation Algorithm via Reciprocal Moment Approximation

2510.15362v1 stat.ML, cs.CV, cs.LG 2025-10-21
Authors:

Zixun Wang, Ben Dai

Abstract

Semantic segmentation labels each pixel in an image with its corresponding class, and is typically evaluated using the Intersection over Union (IoU) and Dice metrics, which quantify the overlap between predicted and ground-truth segmentation masks. Most existing methods estimate pixel-wise class probabilities, then apply argmax or thresholding to obtain the final prediction. These methods have been shown to generally lead to inconsistent or suboptimal results, as they do not directly maximize segmentation metrics. To address this issue, a novel consistent segmentation framework, RankSEG, has been proposed; it includes RankDice and RankIoU, designed to optimize the Dice and IoU metrics, respectively. Although RankSEG almost guarantees improved performance, it suffers from two major drawbacks. First, it is computationally expensive: RankDice has a complexity of O(d log d) with a substantial constant factor (where d is the number of pixels), while RankIoU has an even higher complexity of O(d^2), which limits its practical application. For instance, on LiTS, prediction with RankSEG takes 16.33 seconds compared to just 0.01 seconds with the argmax rule. Second, RankSEG is only applicable to overlapping segmentation settings, where multiple classes can occupy the same pixel, whereas standard benchmarks typically assume non-overlapping segmentation. In this paper, we overcome these two drawbacks via a reciprocal moment approximation (RMA) of RankSEG, with the following contributions: (i) we improve RankSEG using RMA; the resulting method, RankSEG-RMA, reduces the complexity of both algorithms to O(d) while maintaining comparable performance; (ii) inspired by RMA, we develop a pixel-wise score function that allows efficient implementation for non-overlapping segmentation settings.
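As a small illustration of the two metrics and the thresholding baseline mentioned in the abstract, the sketch below computes Dice and IoU for a flat binary mask. It is not RankSEG or RankSEG-RMA; the function names, the 0.5 cutoff, the toy probabilities, and the convention of returning 1.0 when both masks are empty are assumptions made for this example only.

```python
# Minimal sketch (not the paper's algorithm): Dice and IoU on flat 0/1 masks,
# with the plain thresholding rule used as the baseline prediction.

def dice(pred, truth):
    """Dice = 2|A ∩ B| / (|A| + |B|) for two equal-length 0/1 lists."""
    inter = sum(p & t for p, t in zip(pred, truth))
    total = sum(pred) + sum(truth)
    # Assumed convention: two empty masks count as a perfect match.
    return 1.0 if total == 0 else 2.0 * inter / total

def iou(pred, truth):
    """IoU = |A ∩ B| / |A ∪ B| for two equal-length 0/1 lists."""
    inter = sum(p & t for p, t in zip(pred, truth))
    union = sum(p | t for p, t in zip(pred, truth))
    # Assumed convention: two empty masks count as a perfect match.
    return 1.0 if union == 0 else inter / union

def threshold(probs, cutoff=0.5):
    """Baseline rule: mark a pixel as foreground when its probability >= cutoff."""
    return [1 if p >= cutoff else 0 for p in probs]

# Toy data (hypothetical): four pixels with estimated foreground probabilities.
probs = [0.9, 0.6, 0.4, 0.1]
truth = [1, 1, 1, 0]

pred = threshold(probs)
print(pred, dice(pred, truth), iou(pred, truth))
```

On this toy input the thresholded mask misses the third pixel, so the overlap is 2 pixels out of 2 predicted and 3 true, giving a Dice of 0.8 and an IoU of 2/3.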
