Fairness shields: Safeguarding against biased decision makers

Cano Cordoba, Filip; Henzinger, Thomas A; Könighofer, Bettina; Kueffner, Konstantin; Mallik, Kaushik

Fairness shields: Safeguarding against biased decision makers

Cano Cordoba F, Henzinger TA, Könighofer B, Kueffner K, Mallik K. 2025. Fairness shields: Safeguarding against biased decision makers. Proceedings of the 39th AAAI Conference on Artificial Intelligence. AAAI: Conference on Artificial Intelligence vol. 39, 15659–15668.

Download (ext.)

https://doi.org/10.48550/arXiv.2412.11994 [Preprint]

DOI

10.1609/aaai.v39i15.33719

Conference Paper | Published | English

Scopus indexed

Author

Cano Cordoba, Filip^ISTA ; Henzinger, Thomas A^ISTA ; Könighofer, Bettina; Kueffner, Konstantin^ISTA ; Mallik, Kaushik^ISTA

Corresponding author has ISTA affiliation

Department

Henzinger_Thomas Group

Grant

Vigilant Algorithmic Monitoring of Software

Abstract

As AI-based decision-makers increasingly influence human lives, it is a growing concern that their decisions may be unfair or biased with respect to people's protected attributes, such as gender and race. Most existing bias prevention measures provide probabilistic fairness guarantees in the long run, and it is possible that the decisions are biased on any decision sequence of fixed length. We introduce *fairness shielding*, where a symbolic decision-maker---the fairness shield---continuously monitors the sequence of decisions of another deployed black-box decision-maker, and makes interventions so that a given fairness criterion is met while the total intervention costs are minimized. We present four different algorithms for computing fairness shields, among which one guarantees fairness over fixed horizons, and three guarantee fairness periodically after fixed intervals. Given a distribution over future decisions and their intervention costs, our algorithms solve different instances of bounded-horizon optimal control problems with different levels of computational costs and optimality guarantees. Our empirical evaluation demonstrates the effectiveness of these shields in ensuring fairness while maintaining cost efficiency across various scenarios.

Publishing Year

2025

Date Published

2025-04-11

Proceedings Title

Proceedings of the 39th AAAI Conference on Artificial Intelligence

Publisher

Association for the Advancement of Artificial Intelligence

Acknowledgement

This work is partly supported by the European Research Council under Grant No.: ERC-2020-AdG 101020093. It is also partially supported by the State Government of Styria, Austria – Department Zukunftsfonds Steiermark.

Volume

Issue

Page

15659-15668

Conference

AAAI: Conference on Artificial Intelligence

Conference Location

Philadelphia, PA, United States

Conference Date

2025-02-25 – 2025-03-04

ISSN

2159-5399

eISSN

2374-3468

IST-REx-ID

19665

Cite this

Cano Cordoba F, Henzinger TA, Könighofer B, Kueffner K, Mallik K. Fairness shields: Safeguarding against biased decision makers. In: Proceedings of the 39th AAAI Conference on Artificial Intelligence. Vol 39. Association for the Advancement of Artificial Intelligence; 2025:15659-15668. doi:10.1609/aaai.v39i15.33719

Cano Cordoba, F., Henzinger, T. A., Könighofer, B., Kueffner, K., & Mallik, K. (2025). Fairness shields: Safeguarding against biased decision makers. In Proceedings of the 39th AAAI Conference on Artificial Intelligence (Vol. 39, pp. 15659–15668). Philadelphia, PA, United States: Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v39i15.33719

Cano Cordoba, Filip, Thomas A Henzinger, Bettina Könighofer, Konstantin Kueffner, and Kaushik Mallik. “Fairness Shields: Safeguarding against Biased Decision Makers.” In Proceedings of the 39th AAAI Conference on Artificial Intelligence, 39:15659–68. Association for the Advancement of Artificial Intelligence, 2025. https://doi.org/10.1609/aaai.v39i15.33719.

F. Cano Cordoba, T. A. Henzinger, B. Könighofer, K. Kueffner, and K. Mallik, “Fairness shields: Safeguarding against biased decision makers,” in Proceedings of the 39th AAAI Conference on Artificial Intelligence, Philadelphia, PA, United States, 2025, vol. 39, no. 15, pp. 15659–15668.

Cano Cordoba, Filip, et al. “Fairness Shields: Safeguarding against Biased Decision Makers.” Proceedings of the 39th AAAI Conference on Artificial Intelligence, vol. 39, no. 15, Association for the Advancement of Artificial Intelligence, 2025, pp. 15659–68, doi:10.1609/aaai.v39i15.33719.

All files available under the following license(s):

Copyright Statement:

This Item is protected by copyright and/or related rights. [...]

Link(s) to Main File(s)

URL

https://doi.org/10.48550/arXiv.2412.11994

Access Level

Open Access

Export

Marked Publications

Open Data ISTA Research Explorer

Sources

arXiv 2412.11994

Search this title in

Google Scholar