Machine Learning Interpretability: A Survey on Methods and Metrics
Machine learning systems are becoming increasingly ubiquitous. These systems’s adoption has been expanding, accelerating the shift towards a more algorithmic society, meaning that algorithmically informed decisions have greater potential for significant social impact. However, most of these accurate decision support systems remain complex black boxes, meaning their internal logic and inner workings are hidden to the user and even experts cannot fully understand the rationale behind their predictions. Moreover, new regulations and highly regulated domains have made the audit and verifiability of decisions mandatory, increasing the demand for the ability to question, understand, and trust machine learning systems, for which interpretability is indispensable. The research community has recognized this interpretability problem and focused on developing both interpretable models and explanation methods over the past few years. However, the emergence of these methods shows there is no consensus on how to assess the explanation quality. Which are the most suitable metrics to assess the quality of an explanation? The aim of this article is to provide a review of the current state of the research field on machine learning interpretability while focusing on the societal impact and on the developed methods and metrics. Furthermore, a complete literature review is presented in order to identify future directions of work on this field.
Top-30
Journals
|
10
20
30
40
50
60
|
|
|
IEEE Access
52 publications, 4.23%
|
|
|
Lecture Notes in Computer Science
52 publications, 4.23%
|
|
|
Applied Sciences (Switzerland)
20 publications, 1.63%
|
|
|
Sensors
18 publications, 1.47%
|
|
|
Lecture Notes in Networks and Systems
18 publications, 1.47%
|
|
|
Expert Systems with Applications
17 publications, 1.38%
|
|
|
Communications in Computer and Information Science
14 publications, 1.14%
|
|
|
ACM Computing Surveys
13 publications, 1.06%
|
|
|
Electronics (Switzerland)
12 publications, 0.98%
|
|
|
Proceedings of the ACM on Human-Computer Interaction
10 publications, 0.81%
|
|
|
Neurocomputing
10 publications, 0.81%
|
|
|
Scientific Reports
9 publications, 0.73%
|
|
|
Engineering Applications of Artificial Intelligence
9 publications, 0.73%
|
|
|
Information Fusion
8 publications, 0.65%
|
|
|
SN Computer Science
7 publications, 0.57%
|
|
|
Frontiers in Artificial Intelligence
7 publications, 0.57%
|
|
|
Mathematics
7 publications, 0.57%
|
|
|
Artificial Intelligence
7 publications, 0.57%
|
|
|
Information Sciences
6 publications, 0.49%
|
|
|
Information (Switzerland)
6 publications, 0.49%
|
|
|
Neural Computing and Applications
6 publications, 0.49%
|
|
|
Computers in Biology and Medicine
6 publications, 0.49%
|
|
|
PLoS ONE
6 publications, 0.49%
|
|
|
Artificial Intelligence in Data and Big Data Processing
6 publications, 0.49%
|
|
|
Machine Learning and Knowledge Extraction
5 publications, 0.41%
|
|
|
Diagnostics
5 publications, 0.41%
|
|
|
Aerospace
5 publications, 0.41%
|
|
|
Artificial Intelligence Review
5 publications, 0.41%
|
|
|
AI and Ethics
5 publications, 0.41%
|
|
|
10
20
30
40
50
60
|
Publishers
|
50
100
150
200
250
300
|
|
|
Springer Nature
263 publications, 21.42%
|
|
|
Institute of Electrical and Electronics Engineers (IEEE)
260 publications, 21.17%
|
|
|
Elsevier
231 publications, 18.81%
|
|
|
MDPI
144 publications, 11.73%
|
|
|
Association for Computing Machinery (ACM)
78 publications, 6.35%
|
|
|
Wiley
41 publications, 3.34%
|
|
|
Frontiers Media S.A.
26 publications, 2.12%
|
|
|
Taylor & Francis
20 publications, 1.63%
|
|
|
IGI Global
12 publications, 0.98%
|
|
|
SAGE
9 publications, 0.73%
|
|
|
Cold Spring Harbor Laboratory
9 publications, 0.73%
|
|
|
IOP Publishing
7 publications, 0.57%
|
|
|
Public Library of Science (PLoS)
7 publications, 0.57%
|
|
|
Cambridge University Press
5 publications, 0.41%
|
|
|
Institution of Engineering and Technology (IET)
4 publications, 0.33%
|
|
|
Oxford University Press
4 publications, 0.33%
|
|
|
Emerald
4 publications, 0.33%
|
|
|
American Chemical Society (ACS)
4 publications, 0.33%
|
|
|
AIP Publishing
3 publications, 0.24%
|
|
|
Annual Reviews
3 publications, 0.24%
|
|
|
King Saud University
3 publications, 0.24%
|
|
|
Walter de Gruyter
3 publications, 0.24%
|
|
|
American Society of Civil Engineers (ASCE)
2 publications, 0.16%
|
|
|
Copernicus
2 publications, 0.16%
|
|
|
IOS Press
2 publications, 0.16%
|
|
|
Society of Petroleum Engineers
2 publications, 0.16%
|
|
|
Tech Science Press
2 publications, 0.16%
|
|
|
Ovid Technologies (Wolters Kluwer Health)
2 publications, 0.16%
|
|
|
Hindawi Limited
2 publications, 0.16%
|
|
|
50
100
150
200
250
300
|
- We do not take into account publications without a DOI.
- Statistics recalculated weekly.