Fairness shields: Safeguarding against biased decision makers

Cano Cordoba F, Henzinger TA, Könighofer B, Kueffner K, Mallik K. 2025. Fairness shields: Safeguarding against biased decision makers. Proceedings of the AAAI Conference on Artificial Intelligence. AAAI: Conference on Artificial Intelligence vol. 39, 15659–15668.

Conference Paper | Published | English

Scopus indexed

Corresponding author has ISTA affiliation

Abstract
As AI-based decision-makers increasingly influence human lives, it is a growing concern that their decisions may be unfair or biased with respect to people's protected attributes, such as gender and race. Most existing bias prevention measures provide probabilistic fairness guarantees only in the long run, so the decisions may still be biased on any particular decision sequence of fixed length. We introduce *fairness shielding*, where a symbolic decision-maker, the fairness shield, continuously monitors the sequence of decisions of another deployed black-box decision-maker and intervenes so that a given fairness criterion is met while the total intervention cost is minimized. We present four different algorithms for computing fairness shields, among which one guarantees fairness over fixed horizons, and three guarantee fairness periodically after fixed intervals. Given a distribution over future decisions and their intervention costs, our algorithms solve different instances of bounded-horizon optimal control problems with different levels of computational cost and optimality guarantees. Our empirical evaluation demonstrates the effectiveness of these shields in ensuring fairness while maintaining cost efficiency across various scenarios.
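To make the fixed-horizon setting in the abstract concrete, the following is a minimal sketch of a shield computed by backward dynamic programming. It is not the paper's algorithm. It assumes two groups, binary accept/reject decisions, a demographic-parity style criterion (acceptance rates may differ by at most `KAPPA` at the end of the horizon), a known distribution over arrivals and black-box decisions, and a constant cost per overridden decision. All names and parameter values (`T`, `KAPPA`, `P_GROUP`, `P_ACCEPT`, `FLIP_COST`, `value`, `shield`, `simulate`) are hypothetical and chosen only for illustration.

```python
import math
import random
from functools import lru_cache

# All parameters below are illustrative assumptions, not values from the paper.
T = 6                    # length of the fixed horizon
KAPPA = 0.2              # allowed gap between the two groups' acceptance rates
P_GROUP = (0.5, 0.5)     # probability that the next individual is in group 0 / 1
P_ACCEPT = (0.8, 0.3)    # black-box acceptance probability per group (biased)
FLIP_COST = 1.0          # cost of overriding a single decision


def is_fair(state):
    """Check the fairness criterion on the counts (n0, acc0, n1, acc1)."""
    n0, a0, n1, a1 = state
    if n0 == 0 or n1 == 0:
        return True  # convention: a group with no members cannot be disadvantaged
    return abs(a0 / n0 - a1 / n1) <= KAPPA


def step(state, group, decision):
    """Update the counts after issuing `decision` (0 or 1) to `group`."""
    n0, a0, n1, a1 = state
    if group == 0:
        return (n0 + 1, a0 + decision, n1, a1)
    return (n0, a0, n1 + 1, a1 + decision)


def flip_cost(proposed, final):
    return FLIP_COST if proposed != final else 0.0


@lru_cache(maxsize=None)
def value(state):
    """Minimal expected future intervention cost that still ends the horizon fair."""
    if state[0] + state[2] == T:
        return 0.0 if is_fair(state) else math.inf
    total = 0.0
    for g in (0, 1):
        for d in (0, 1):
            p = P_GROUP[g] * (P_ACCEPT[g] if d == 1 else 1.0 - P_ACCEPT[g])
            total += p * min(flip_cost(d, y) + value(step(state, g, y)) for y in (0, 1))
    return total


def shield(state, group, proposed):
    """Return the final decision; ties are broken in favour of not intervening."""
    return min(
        (0, 1),
        key=lambda y: (flip_cost(proposed, y) + value(step(state, group, y)), y != proposed),
    )


def simulate(seed):
    """Run one horizon of the shielded black box; return final counts and total cost."""
    rng = random.Random(seed)
    state, cost = (0, 0, 0, 0), 0.0
    for _ in range(T):
        g = 0 if rng.random() < P_GROUP[0] else 1
        d = 1 if rng.random() < P_ACCEPT[g] else 0
        y = shield(state, g, d)
        cost += flip_cost(d, y)
        state = step(state, g, y)
    return state, cost
```

Under these assumptions, calling `simulate(seed)` for any seed yields a final count vector that satisfies `is_fair`, because the shield never moves into a state from which fairness at the end of the horizon is unreachable. The table built by `value` grows with the number of reachable count vectors, so this brute-force sketch is only practical for short horizons.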
Publishing Year
2025
Date Published
2025-04-11
Proceedings Title
Proceedings of the AAAI Conference on Artificial Intelligence
Publisher
Association for the Advancement of Artificial Intelligence
Acknowledgement
This work is partly supported by the European Research Council under Grant No.: ERC-2020-AdG 101020093. It is also partially supported by the State Government of Styria, Austria – Department Zukunftsfonds Steiermark.
Volume
39
Issue
15
Page
15659-15668
Conference
AAAI: Conference on Artificial Intelligence
Conference Location
Philadelphia, PA, United States
Conference Date
2025-02-25 – 2025-03-04

Cite this

Cano Cordoba F, Henzinger TA, Könighofer B, Kueffner K, Mallik K. Fairness shields: Safeguarding against biased decision makers. In: Proceedings of the AAAI Conference on Artificial Intelligence. Vol 39. Association for the Advancement of Artificial Intelligence; 2025:15659-15668. doi:10.1609/aaai.v39i15.33719
Cano Cordoba, F., Henzinger, T. A., Könighofer, B., Kueffner, K., & Mallik, K. (2025). Fairness shields: Safeguarding against biased decision makers. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 39, pp. 15659–15668). Philadelphia, PA, United States: Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v39i15.33719
Cano Cordoba, Filip, Thomas A Henzinger, Bettina Könighofer, Konstantin Kueffner, and Kaushik Mallik. “Fairness Shields: Safeguarding against Biased Decision Makers.” In Proceedings of the AAAI Conference on Artificial Intelligence, 39:15659–68. Association for the Advancement of Artificial Intelligence, 2025. https://doi.org/10.1609/aaai.v39i15.33719.
F. Cano Cordoba, T. A. Henzinger, B. Könighofer, K. Kueffner, and K. Mallik, “Fairness shields: Safeguarding against biased decision makers,” in Proceedings of the AAAI Conference on Artificial Intelligence, Philadelphia, PA, United States, 2025, vol. 39, no. 15, pp. 15659–15668.
Cano Cordoba, Filip, et al. “Fairness Shields: Safeguarding against Biased Decision Makers.” Proceedings of the AAAI Conference on Artificial Intelligence, vol. 39, no. 15, Association for the Advancement of Artificial Intelligence, 2025, pp. 15659–68, doi:10.1609/aaai.v39i15.33719.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]

Link(s) to Main File(s)
Access Level
Open Access

Sources

arXiv 2412.11994