Data quality dimensions for fair AI

Camilla Quaresmini, Giuseppe Primiero
Computer Science, Artificial Intelligence, Artificial Intelligence (cs.AI), Logic in Computer Science (cs.LO)
2023-05-10 16:00:00
AI systems are not intrinsically neutral and biases trickle in any type of technological tool. In particular when dealing with people, AI algorithms reflect technical errors originating with mislabeled data. As they feed wrong and discriminatory classifications, perpetuating structural racism and marginalization, these systems are not systematically guarded against bias. In this article we consider the problem of bias in AI systems from the point of view of Information Quality dimensions. We illustrate potential improvements of a bias mitigation tool in gender classification errors, referring to two typically difficult contexts: the classification of non-binary individuals and the classification of transgender individuals. The identification of data quality dimensions to implement in bias mitigation tool may help achieve more fairness. Hence, we propose to consider this issue in terms of completeness, consistency, timeliness and reliability, and offer some theoretical results.
PDF: Data quality dimensions for fair AI.pdf
Empowered by ChatGPT