Open Access
Open access
страницы 330-343

Feature Ranking from Random Forest Through Complex Network’s Centrality Measures

Тип публикацииBook Chapter
Дата публикации2022-08-28
scimago Q2
SJR0.352
CiteScore2.4
Impact factor
ISSN03029743, 16113349, 18612075, 18612083
Краткое описание
The volume of available data in recent years has rapidly increased. In consequence, datasets commonly end up with many irrelevant features. That increase may disturb human understanding and even lead to poor machine learning models. This research proposes a novel feature ranking method that employs trees from a Random Forest to transform a dataset into a complex network to which centrality measures are applied to rank the features. That process takes place by representing each tree as a graph where all the tree features are vertices on this graph, and the links within the nodes (father $$\rightarrow $$ child) of the tree are represented by a weighted edge between the two respective vertices. The union of all graphs from individual trees leads to the complex network. Then, three centrality measures are applied to rank the features in the complex network. Experiments were performed in eighty-five supervised classification datasets, with a variation in the feature noise level, to evaluate our novel method. Results show that centrality measures in non-oriented complex networks are comparable and may be correlated to the Random Forest’s variable importance ranking algorithm. Vertex strength and eigenvector outperformed the Random Forest in 40% noise datasets, with a not statistically different result at a 95% confidence level.
Для доступа к списку цитирований публикации необходимо авторизоваться.

Топ-30

Журналы

1
BioData Mining
1 публикация, 50%
Transactions in GIS
1 публикация, 50%
1

Издатели

1
Springer Nature
1 публикация, 50%
Wiley
1 публикация, 50%
1
  • Мы не учитываем публикации, у которых нет DOI.
  • Статистика публикаций обновляется еженедельно.

Вы ученый?

Создайте профиль, чтобы получать персональные рекомендации коллег, конференций и новых статей.
Метрики
2
Поделиться
Цитировать
ГОСТ |
Цитировать
Cantão A. H. et al. Feature Ranking from Random Forest Through Complex Network’s Centrality Measures // Lecture Notes in Computer Science. 2022. pp. 330-343.
ГОСТ со всеми авторами (до 50) Скопировать
Cantão A. H., Macedo A., Zhao L., Baranauskas J. A. Feature Ranking from Random Forest Through Complex Network’s Centrality Measures // Lecture Notes in Computer Science. 2022. pp. 330-343.
RIS |
Цитировать
TY - GENERIC
DO - 10.1007/978-3-031-15740-0_24
UR - https://doi.org/10.1007/978-3-031-15740-0_24
TI - Feature Ranking from Random Forest Through Complex Network’s Centrality Measures
T2 - Lecture Notes in Computer Science
AU - Cantão, Adriano Henrique
AU - Macedo, Alessandra
AU - Zhao, Liang
AU - Baranauskas, José Augusto
PY - 2022
DA - 2022/08/28
PB - Springer Nature
SP - 330-343
SN - 0302-9743
SN - 1611-3349
SN - 1861-2075
SN - 1861-2083
ER -
BibTex
Цитировать
BibTex (до 50 авторов) Скопировать
@incollection{2022_Cantão,
author = {Adriano Henrique Cantão and Alessandra Macedo and Liang Zhao and José Augusto Baranauskas},
title = {Feature Ranking from Random Forest Through Complex Network’s Centrality Measures},
publisher = {Springer Nature},
year = {2022},
pages = {330--343},
month = {aug}
}
Ошибка в публикации?