Demographic-Agnostic Fairness without Harm
2509.24077v1
cs.LG, stat.ML
2025-10-01
Авторы:
Zhongteng Cai, Mohammad Mahdi Khalili, Xueru Zhang
Abstract
As machine learning (ML) algorithms are increasingly used in social domains
to make predictions about humans, there is a growing concern that these
algorithms may exhibit biases against certain social groups. Numerous notions
of fairness have been proposed in the literature to measure the unfairness of
ML. Among them, one class that receives the most attention is
\textit{parity-based}, i.e., achieving fairness by equalizing treatment or
outcomes for different social groups. However, achieving parity-based fairness
often comes at the cost of lowering model accuracy and is undesirable for many
high-stakes domains like healthcare. To avoid inferior accuracy, a line of
research focuses on \textit{preference-based} fairness, under which any group
of individuals would experience the highest accuracy and collectively prefer
the ML outcomes assigned to them if they were given the choice between various
sets of outcomes. However, these works assume individual demographic
information is known and fully accessible during training. In this paper, we
relax this requirement and propose a novel \textit{demographic-agnostic
fairness without harm (DAFH)} optimization algorithm, which jointly learns a
group classifier that partitions the population into multiple groups and a set
of decoupled classifiers associated with these groups. Theoretically, we
conduct sample complexity analysis and show that our method can outperform the
baselines when demographic information is known and used to train decoupled
classifiers. Experiments on both synthetic and real data validate the proposed
method.
Ссылки и действия
Дополнительные ресурсы: