

Author: Calders Toon Verwer Sicco
Publisher: Springer Publishing Company
ISSN: 1384-5810
Source: Data Mining and Knowledge Discovery, Vol.21, Iss.2, 2010-09, pp. : 277-292
Disclaimer: Any content in publications that violate the sovereignty, the constitution or regulations of the PRC is not accepted or approved by CNPIEC.
Abstract
In this paper, we investigate how to modify the naive Bayes classifier in order to perform classification that is restricted to be independent with respect to a given sensitive attribute. Such independency restrictions occur naturally when the decision process leading to the labels in the data-set was biased; e.g., due to gender or racial discrimination. This setting is motivated by many cases in which there exist laws that disallow a decision that is partly based on discrimination. Naive application of machine learning techniques would result in huge fines for companies. We present three approaches for making the naive Bayes classifier discrimination-free: (i) modifying the probability of the decision being positive, (ii) training one model for every sensitive attribute value and balancing them, and (iii) adding a latent variable to the Bayesian model that represents the unbiased label and optimizing the model parameters for likelihood using expectation maximization. We present experiments for the three approaches on both artificial and real-life data.
Related content




Naive Bayes for optimal ranking
Journal of Experimental & Theoretical Artificial Intelligence, Vol. 20, Iss. 2, 2008-06 ,pp. :


By Cannon Edward Amini Ata Bender Andreas Sternberg Michael Muggleton Stephen Glen Robert Mitchell John
Journal of Computer-Aided Molecular Design, Vol. 21, Iss. 5, 2007-05 ,pp. :


NUMERIC MAPPING AND LEARNABILITY OF NAIVE BAYES
By ZHANG HARRY
Applied Artificial Intelligence, Vol. 17, Iss. 5-6, 2003-05 ,pp. :


Technical Note: Naive Bayes for Regression
By Frank E.
Machine Learning, Vol. 41, Iss. 1, 2000-10 ,pp. :