Artificial Intelligence (AI) finds widespread application across various domains, but it sparks concerns about fairness in its deployment. The prevailing discourse in classification often emphasizes outcome-based metrics comparing sensitive subgroups without consideration of the differential impacts within subgroups. Bias mitigation techniques do not only affect the ranking of pairs of instances across sensitive groups, but often also significantly affect the ranking of instances within these groups. Such changes are hard to explain and raise concerns regarding the validity of the intervention. Unfortunately, these effects remain under the radar in the accuracy-fairness evaluation framework that is usually applied. In this paper, we illustrate the effect of several popular bias mitigation methods, and how their output often does not reflect real-world scenarios.
Building similarity graph...
Analyzing shared references across papers
Loading...
Sofie Goethals
Marco Favier
Toon Calders
ACM Journal on Responsible Computing
University of Antwerp
Building similarity graph...
Analyzing shared references across papers
Loading...
Goethals et al. (Mon,) studied this question.
www.synapsesocial.com/papers/69df2bcae4eeef8a2a6b0bed — DOI: https://doi.org/10.1145/3802540