Open Access
,
страницы 330-343
Feature Ranking from Random Forest Through Complex Network’s Centrality Measures
Тип публикации: Book Chapter
Дата публикации: 2022-08-28
scimago Q2
SJR: 0.352
CiteScore: 2.4
Impact factor: —
ISSN: 03029743, 16113349, 18612075, 18612083
Краткое описание
The volume of available data in recent years has rapidly increased. In consequence, datasets commonly end up with many irrelevant features. That increase may disturb human understanding and even lead to poor machine learning models. This research proposes a novel feature ranking method that employs trees from a Random Forest to transform a dataset into a complex network to which centrality measures are applied to rank the features. That process takes place by representing each tree as a graph where all the tree features are vertices on this graph, and the links within the nodes (father
$$\rightarrow $$
child) of the tree are represented by a weighted edge between the two respective vertices. The union of all graphs from individual trees leads to the complex network. Then, three centrality measures are applied to rank the features in the complex network. Experiments were performed in eighty-five supervised classification datasets, with a variation in the feature noise level, to evaluate our novel method. Results show that centrality measures in non-oriented complex networks are comparable and may be correlated to the Random Forest’s variable importance ranking algorithm. Vertex strength and eigenvector outperformed the Random Forest in 40% noise datasets, with a not statistically different result at a 95% confidence level.
Найдено
Ничего не найдено, попробуйте изменить настройки фильтра.
Для доступа к списку цитирований публикации необходимо авторизоваться.
Топ-30
Журналы
|
1
|
|
|
BioData Mining
1 публикация, 50%
|
|
|
Transactions in GIS
1 публикация, 50%
|
|
|
1
|
Издатели
|
1
|
|
|
Springer Nature
1 публикация, 50%
|
|
|
Wiley
1 публикация, 50%
|
|
|
1
|
- Мы не учитываем публикации, у которых нет DOI.
- Статистика публикаций обновляется еженедельно.
Вы ученый?
Создайте профиль, чтобы получать персональные рекомендации коллег, конференций и новых статей.
Метрики
2
Всего цитирований:
2
Цитирований c 2025:
2
(100%)
Цитировать
ГОСТ |
RIS |
BibTex
Цитировать
ГОСТ
Скопировать
Cantão A. H. et al. Feature Ranking from Random Forest Through Complex Network’s Centrality Measures // Lecture Notes in Computer Science. 2022. pp. 330-343.
ГОСТ со всеми авторами (до 50)
Скопировать
Cantão A. H., Macedo A., Zhao L., Baranauskas J. A. Feature Ranking from Random Forest Through Complex Network’s Centrality Measures // Lecture Notes in Computer Science. 2022. pp. 330-343.
Цитировать
RIS
Скопировать
TY - GENERIC
DO - 10.1007/978-3-031-15740-0_24
UR - https://doi.org/10.1007/978-3-031-15740-0_24
TI - Feature Ranking from Random Forest Through Complex Network’s Centrality Measures
T2 - Lecture Notes in Computer Science
AU - Cantão, Adriano Henrique
AU - Macedo, Alessandra
AU - Zhao, Liang
AU - Baranauskas, José Augusto
PY - 2022
DA - 2022/08/28
PB - Springer Nature
SP - 330-343
SN - 0302-9743
SN - 1611-3349
SN - 1861-2075
SN - 1861-2083
ER -
Цитировать
BibTex (до 50 авторов)
Скопировать
@incollection{2022_Cantão,
author = {Adriano Henrique Cantão and Alessandra Macedo and Liang Zhao and José Augusto Baranauskas},
title = {Feature Ranking from Random Forest Through Complex Network’s Centrality Measures},
publisher = {Springer Nature},
year = {2022},
pages = {330--343},
month = {aug}
}
Ошибка в публикации?