Deep Attention-guided Adaptive Subsampling
2510.12376v1
cs.CV, cs.AI, cs.LG
2025-10-16
Авторы:
Sharath M Shankaranarayana, Soumava Kumar Roy, Prasad Sudhakar, Chandan Aladahalli
Abstract
Although deep neural networks have provided impressive gains in performance,
these improvements often come at the cost of increased computational complexity
and expense. In many cases, such as 3D volume or video classification tasks,
not all slices or frames are necessary due to inherent redundancies. To address
this issue, we propose a novel learnable subsampling framework that can be
integrated into any neural network architecture. Subsampling, being a
nondifferentiable operation, poses significant challenges for direct adaptation
into deep learning models. While some works, have proposed solutions using the
Gumbel-max trick to overcome the problem of non-differentiability, they fall
short in a crucial aspect: they are only task-adaptive and not inputadaptive.
Once the sampling mechanism is learned, it remains static and does not adjust
to different inputs, making it unsuitable for real-world applications. To this
end, we propose an attention-guided sampling module that adapts to inputs even
during inference. This dynamic adaptation results in performance gains and
reduces complexity in deep neural network models. We demonstrate the
effectiveness of our method on 3D medical imaging datasets from MedMNIST3D as
well as two ultrasound video datasets for classification tasks, one of them
being a challenging in-house dataset collected under real-world clinical
conditions.
Ссылки и действия
Дополнительные ресурсы: