Towards more holistic interpretability: A lightweight disentangled Concept Bottleneck Model

2510.15770v1 cs.CV, cs.LG 2025-10-21

Авторы:

Gaoxiang Huang, Songning Lai, Yutao Yue

Abstract

Concept Bottleneck Models (CBMs) enhance interpretability by predicting human-understandable concepts as intermediate representations. However, existing CBMs often suffer from input-to-concept mapping bias and limited controllability, which restricts their practical value, directly damage the responsibility of strategy from concept-based methods. We propose a lightweight Disentangled Concept Bottleneck Model (LDCBM) that automatically groups visual features into semantically meaningful components without region annotation. By introducing a filter grouping loss and joint concept supervision, our method improves the alignment between visual patterns and concepts, enabling more transparent and robust decision-making. Notably, Experiments on three diverse datasets demonstrate that LDCBM achieves higher concept and class accuracy, outperforming previous CBMs in both interpretability and classification performance. By grounding concepts in visual evidence, our method overcomes a fundamental limitation of prior models and enhances the reliability of interpretable AI.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

Towards more holistic interpretability: A lightweight disentangled Concept Bottleneck Model

Авторы:

Abstract

Ссылки и действия

Связанные статьи

Selective Masking based Self-Supervised Learning for Image Semantic Segmentation

Evaluating and Preserving High-level Fidelity in Super-Resolution

DFIR-DETR: Frequency Domain Enhancement and Dynamic Feature Aggregation for Cros...

Understanding Diffusion Models via Code Execution

AutoLugano: A Deep Learning Framework for Fully Automated Lymphoma Segmentation ...

Навигация