From local explanations to global understanding with explainable AI for trees
Scott M Lundberg
1, 2
,
Gabriel Erion
2, 3
,
Hugh Chen
2
,
Alex Degrave
2, 3
,
Jordan M. Prutkin
4
,
Bala Nair
5, 6
,
Ronit Katz
7
,
Jonathan Himmelfarb
7
,
Nisha Bansal
7
,
Su-In Lee
2
1
Microsoft Research, Redmond, USA
|
2
Publication type: Journal Article
Publication date: 2020-01-17
scimago Q1
wos Q1
SJR: 5.876
CiteScore: 37.6
Impact factor: 23.9
ISSN: 25225839
PubMed ID:
32607472
Computer Networks and Communications
Artificial Intelligence
Software
Human-Computer Interaction
Computer Vision and Pattern Recognition
Abstract
Tree-based machine learning models such as random forests, decision trees and gradient boosted trees are popular nonlinear predictive models, yet comparatively little attention has been paid to explaining their predictions. Here we improve the interpretability of tree-based models through three main contributions. (1) A polynomial time algorithm to compute optimal explanations based on game theory. (2) A new type of explanation that directly measures local feature interaction effects. (3) A new set of tools for understanding global model structure based on combining many local explanations of each prediction. We apply these tools to three medical machine learning problems and show how combining many high-quality local explanations allows us to represent global structure while retaining local faithfulness to the original model. These tools enable us to (1) identify high-magnitude but low-frequency nonlinear mortality risk factors in the US population, (2) highlight distinct population subgroups with shared risk characteristics, (3) identify nonlinear interaction effects among risk factors for chronic kidney disease and (4) monitor a machine learning model deployed in a hospital by identifying which features are degrading the model’s performance over time. Given the popularity of tree-based machine learning models, these improvements to their interpretability have implications across a broad set of domains. Tree-based machine learning models are widely used in domains such as healthcare, finance and public services. The authors present an explanation method for trees that enables the computation of optimal local explanations for individual predictions, and demonstrate their method on three medical datasets.
Found
Nothing found, try to update filter.
Found
Nothing found, try to update filter.
Top-30
Journals
|
20
40
60
80
100
120
140
160
|
|
|
Scientific Reports
154 publications, 2.52%
|
|
|
Lecture Notes in Computer Science
71 publications, 1.16%
|
|
|
IEEE Access
66 publications, 1.08%
|
|
|
Nature Communications
66 publications, 1.08%
|
|
|
PLoS ONE
55 publications, 0.9%
|
|
|
Science of the Total Environment
55 publications, 0.9%
|
|
|
Applied Sciences (Switzerland)
52 publications, 0.85%
|
|
|
Remote Sensing
45 publications, 0.74%
|
|
|
Environmental Science & Technology
43 publications, 0.7%
|
|
|
Sustainable Cities and Society
38 publications, 0.62%
|
|
|
Expert Systems with Applications
34 publications, 0.56%
|
|
|
Journal of Hydrology
33 publications, 0.54%
|
|
|
Journal of Medical Internet Research
31 publications, 0.51%
|
|
|
Journal of Hazardous Materials
29 publications, 0.47%
|
|
|
Engineering Applications of Artificial Intelligence
29 publications, 0.47%
|
|
|
SSRN Electronic Journal
29 publications, 0.47%
|
|
|
BMC Medical Informatics and Decision Making
27 publications, 0.44%
|
|
|
Sustainability
26 publications, 0.43%
|
|
|
Journal of Environmental Management
26 publications, 0.43%
|
|
|
Journal of Chemical Information and Modeling
25 publications, 0.41%
|
|
|
Sensors
24 publications, 0.39%
|
|
|
Lecture Notes in Networks and Systems
24 publications, 0.39%
|
|
|
Computers in Biology and Medicine
22 publications, 0.36%
|
|
|
Communications in Computer and Information Science
22 publications, 0.36%
|
|
|
Diagnostics
21 publications, 0.34%
|
|
|
Water Research
21 publications, 0.34%
|
|
|
Journal of Cleaner Production
21 publications, 0.34%
|
|
|
Briefings in Bioinformatics
21 publications, 0.34%
|
|
|
Environmental Research
20 publications, 0.33%
|
|
|
20
40
60
80
100
120
140
160
|
Publishers
|
200
400
600
800
1000
1200
1400
1600
1800
2000
|
|
|
Elsevier
1858 publications, 30.42%
|
|
|
Springer Nature
1108 publications, 18.14%
|
|
|
MDPI
539 publications, 8.83%
|
|
|
Institute of Electrical and Electronics Engineers (IEEE)
423 publications, 6.93%
|
|
|
Wiley
268 publications, 4.39%
|
|
|
Cold Spring Harbor Laboratory
266 publications, 4.36%
|
|
|
American Chemical Society (ACS)
212 publications, 3.47%
|
|
|
Frontiers Media S.A.
197 publications, 3.23%
|
|
|
Taylor & Francis
135 publications, 2.21%
|
|
|
Oxford University Press
118 publications, 1.93%
|
|
|
JMIR Publications
90 publications, 1.47%
|
|
|
Public Library of Science (PLoS)
84 publications, 1.38%
|
|
|
Association for Computing Machinery (ACM)
73 publications, 1.2%
|
|
|
SAGE
57 publications, 0.93%
|
|
|
Royal Society of Chemistry (RSC)
56 publications, 0.92%
|
|
|
Copernicus
49 publications, 0.8%
|
|
|
Ovid Technologies (Wolters Kluwer Health)
42 publications, 0.69%
|
|
|
American Geophysical Union
42 publications, 0.69%
|
|
|
IOP Publishing
38 publications, 0.62%
|
|
|
Hindawi Limited
19 publications, 0.31%
|
|
|
American Association for the Advancement of Science (AAAS)
19 publications, 0.31%
|
|
|
Research Square Platform LLC
19 publications, 0.31%
|
|
|
AIP Publishing
18 publications, 0.29%
|
|
|
Social Science Electronic Publishing
18 publications, 0.29%
|
|
|
Cambridge University Press
14 publications, 0.23%
|
|
|
American Society of Clinical Oncology (ASCO)
13 publications, 0.21%
|
|
|
Emerald
12 publications, 0.2%
|
|
|
American Medical Association (AMA)
12 publications, 0.2%
|
|
|
American Physical Society (APS)
11 publications, 0.18%
|
|
|
200
400
600
800
1000
1200
1400
1600
1800
2000
|
- We do not take into account publications without a DOI.
- Statistics recalculated weekly.
Are you a researcher?
Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
6.1k
Total citations:
6108
Citations from 2025:
2278
(37.3%)
Cite this
GOST |
RIS |
BibTex |
MLA
Cite this
GOST
Copy
Lundberg S. M. et al. From local explanations to global understanding with explainable AI for trees // Nature Machine Intelligence. 2020. Vol. 2. No. 1. pp. 56-67.
GOST all authors (up to 50)
Copy
Lundberg S. M., Erion G., Chen H., Degrave A., Prutkin J. M., Nair B., Katz R., Himmelfarb J., Bansal N., Lee S. From local explanations to global understanding with explainable AI for trees // Nature Machine Intelligence. 2020. Vol. 2. No. 1. pp. 56-67.
Cite this
RIS
Copy
TY - JOUR
DO - 10.1038/s42256-019-0138-9
UR - https://doi.org/10.1038/s42256-019-0138-9
TI - From local explanations to global understanding with explainable AI for trees
T2 - Nature Machine Intelligence
AU - Lundberg, Scott M
AU - Erion, Gabriel
AU - Chen, Hugh
AU - Degrave, Alex
AU - Prutkin, Jordan M.
AU - Nair, Bala
AU - Katz, Ronit
AU - Himmelfarb, Jonathan
AU - Bansal, Nisha
AU - Lee, Su-In
PY - 2020
DA - 2020/01/17
PB - Springer Nature
SP - 56-67
IS - 1
VL - 2
PMID - 32607472
SN - 2522-5839
ER -
Cite this
BibTex (up to 50 authors)
Copy
@article{2020_Lundberg,
author = {Scott M Lundberg and Gabriel Erion and Hugh Chen and Alex Degrave and Jordan M. Prutkin and Bala Nair and Ronit Katz and Jonathan Himmelfarb and Nisha Bansal and Su-In Lee},
title = {From local explanations to global understanding with explainable AI for trees},
journal = {Nature Machine Intelligence},
year = {2020},
volume = {2},
publisher = {Springer Nature},
month = {jan},
url = {https://doi.org/10.1038/s42256-019-0138-9},
number = {1},
pages = {56--67},
doi = {10.1038/s42256-019-0138-9}
}
Cite this
MLA
Copy
Lundberg, Scott M., et al. “From local explanations to global understanding with explainable AI for trees.” Nature Machine Intelligence, vol. 2, no. 1, Jan. 2020, pp. 56-67. https://doi.org/10.1038/s42256-019-0138-9.