Interpretability and Transparency in Artificial Intelligence

Publication type: Book Chapter
Publication date: 2022-10-20
Abstract

Artificial Intelligence (AI) systems are frequently thought of as opaque, meaning their performance or logic is inaccessible or incomprehensible to human observers. Models can consist of millions of features connected in a complex web of dependent behaviours. Conveying these internal states and dependencies in a humanly comprehensible way is extremely challenging. Explaining the functionality and behaviour of AI systems in a meaningful and useful way to the people designing, operating, regulating, or affected by their outputs is a complex technical, philosophical, and ethical project. Despite this complexity, principles citing ‘transparency’ or ‘interpretability’ are commonly found in ethical and regulatory frameworks addressing technology. This chapter provides an overview of these concepts and of methods designed to explain how AI works. After reviewing key concepts and terminology, two sets of methods are examined: (1) interpretability methods designed to explain and approximate AI functionality and behaviour; and (2) transparency frameworks meant to help assess and provide information about the development, governance, and potential impact of training datasets, models, and specific applications. These methods are analysed in the context of prior work on explanations in the philosophy of science. The chapter closes by introducing a framework of criteria to evaluate the quality and utility of methods in explainable AI (XAI) and to clarify the open challenges facing the field.
