From attribution maps to human-understandable explanations through Concept Relevance Propagation
The field of explainable artificial intelligence (XAI) aims to bring transparency to today’s powerful but opaque deep learning models. While local XAI methods explain individual predictions in the form of attribution maps, thereby identifying ‘where’ important features occur (but not providing information about ‘what’ they represent), global explanation techniques visualize what concepts a model has generally learned to encode. Both types of method thus provide only partial insights and leave the burden of interpreting the model’s reasoning to the user. Here we introduce the Concept Relevance Propagation (CRP) approach, which combines the local and global perspectives and thus allows answering both the ‘where’ and ‘what’ questions for individual predictions. We demonstrate the capability of our method in various settings, showcasing that CRP leads to more human interpretable explanations and provides deep insights into the model’s representation and reasoning through concept atlases, concept-composition analyses, and quantitative investigations of concept subspaces and their role in fine-grained decision-making.
Top-30
Journals
|
2
4
6
8
10
12
|
|
|
Lecture Notes in Computer Science
12 publications, 10.81%
|
|
|
Communications in Computer and Information Science
9 publications, 8.11%
|
|
|
Information Fusion
3 publications, 2.7%
|
|
|
IEEE Access
3 publications, 2.7%
|
|
|
Nature Machine Intelligence
3 publications, 2.7%
|
|
|
Information Sciences
2 publications, 1.8%
|
|
|
Machine Learning
2 publications, 1.8%
|
|
|
npj Digital Medicine
2 publications, 1.8%
|
|
|
Computers in Biology and Medicine
2 publications, 1.8%
|
|
|
Scientific Reports
2 publications, 1.8%
|
|
|
Journal of the Franklin Institute
1 publication, 0.9%
|
|
|
Sensors
1 publication, 0.9%
|
|
|
Pattern Recognition
1 publication, 0.9%
|
|
|
Communications Earth & Environment
1 publication, 0.9%
|
|
|
KI - Künstliche Intelligenz
1 publication, 0.9%
|
|
|
IEEE Transactions on Artificial Intelligence
1 publication, 0.9%
|
|
|
i-com
1 publication, 0.9%
|
|
|
International Journal of Human-Computer Interaction
1 publication, 0.9%
|
|
|
ICGA Journal
1 publication, 0.9%
|
|
|
ACM Transactions on Software Engineering and Methodology
1 publication, 0.9%
|
|
|
Nature Communications
1 publication, 0.9%
|
|
|
Computer Methods and Programs in Biomedicine
1 publication, 0.9%
|
|
|
Advanced Intelligent Systems
1 publication, 0.9%
|
|
|
Frontiers in Medicine
1 publication, 0.9%
|
|
|
IEEE Transactions on Pattern Analysis and Machine Intelligence
1 publication, 0.9%
|
|
|
United European Gastroenterology Journal
1 publication, 0.9%
|
|
|
ACM Transactions on Computing for Healthcare
1 publication, 0.9%
|
|
|
IEEE Geoscience and Remote Sensing Magazine
1 publication, 0.9%
|
|
|
IEEE Transactions on Knowledge and Data Engineering
1 publication, 0.9%
|
|
|
2
4
6
8
10
12
|
Publishers
|
5
10
15
20
25
30
35
40
|
|
|
Springer Nature
39 publications, 35.14%
|
|
|
Institute of Electrical and Electronics Engineers (IEEE)
26 publications, 23.42%
|
|
|
Elsevier
15 publications, 13.51%
|
|
|
Association for Computing Machinery (ACM)
9 publications, 8.11%
|
|
|
MDPI
3 publications, 2.7%
|
|
|
Cold Spring Harbor Laboratory
2 publications, 1.8%
|
|
|
Walter de Gruyter
2 publications, 1.8%
|
|
|
SAGE
2 publications, 1.8%
|
|
|
Wiley
2 publications, 1.8%
|
|
|
Taylor & Francis
1 publication, 0.9%
|
|
|
American Geophysical Union
1 publication, 0.9%
|
|
|
Frontiers Media S.A.
1 publication, 0.9%
|
|
|
Copernicus
1 publication, 0.9%
|
|
|
Proceedings of the National Academy of Sciences (PNAS)
1 publication, 0.9%
|
|
|
Public Library of Science (PLoS)
1 publication, 0.9%
|
|
|
MIT Press
1 publication, 0.9%
|
|
|
Pleiades Publishing
1 publication, 0.9%
|
|
|
BMJ
1 publication, 0.9%
|
|
|
5
10
15
20
25
30
35
40
|
- We do not take into account publications without a DOI.
- Statistics recalculated weekly.