Open Access
Lecture Notes in Computer Science, pages 99-114
Classification and Outlier Detection Based on Topic Based Pattern Synthesis
1
Citrix R&D India Pvt. Ltd, India
|
Publication type: Book Chapter
Publication date: 2013-07-11
Journal:
Lecture Notes in Computer Science
Q2
SJR: 0.606
CiteScore: 2.6
Impact factor: —
ISSN: 03029743, 16113349, 18612075, 18612083
Abstract
In several pattern classification problems, we encounter training datasets with an imbalanced class distribution and the presence of outliers, which can hinder the performance of classifiers. In this paper, we propose classification schemes based on the pre-processing of data using Novel Pattern Synthesis (NPS), with the aim to improve performance on such datasets. We provide a formal framework for characterizing the class imbalance and outlier elimination. Specifically, we look into the role of NPS in: Outlier elimination and handling class imbalance problem. In NPS, for every pattern its k-nearest neighbours are found and a weighted average of the neighbours is taken to form a synthesized pattern. It is found that the classification accuracy of minority class increases in the presence of synthesized patterns. However, finding nearest neighbours in high-dimensional datasets is challenging. Hence, we make use of Latent Dirichlet Allocation to reduce the dimensionality of the dataset. An extensive experimental evaluation carried out on 25 real-world imbalanced datasets shows that pre-processing of data using NPS is effective and has a greater impact on the classification accuracy over minority class for imbalanced learning. We also observed that NPS outperforms the state-of-the-art methods for imbalanced classification. Experiments on 9 real-world datasets with outliers, demonstrate that NPS approach not only substantially increases the detection performance, but is also relatively scalable in large datasets in comparison to the state-of-the-art outlier detection methods.
Found
Found
Top-30
Journals
1
|
|
ACM Transactions on Knowledge Discovery from Data
1 publication, 33.33%
|
|
Artificial Intelligence Review
1 publication, 33.33%
|
|
Lecture Notes in Computer Science
1 publication, 33.33%
|
|
1
|
Publishers
1
2
|
|
Springer Nature
2 publications, 66.67%
|
|
Association for Computing Machinery (ACM)
1 publication, 33.33%
|
|
1
2
|
- We do not take into account publications without a DOI.
- Statistics recalculated only for publications connected to researchers, organizations and labs registered on the platform.
- Statistics recalculated weekly.
Are you a researcher?
Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
Cite this
GOST |
RIS |
BibTex
Cite this
GOST
Copy
Kokkula S., Musti N. M. Classification and Outlier Detection Based on Topic Based Pattern Synthesis // Lecture Notes in Computer Science. 2013. pp. 99-114.
GOST all authors (up to 50)
Copy
Kokkula S., Musti N. M. Classification and Outlier Detection Based on Topic Based Pattern Synthesis // Lecture Notes in Computer Science. 2013. pp. 99-114.
Cite this
RIS
Copy
TY - GENERIC
DO - 10.1007/978-3-642-39712-7_8
UR - https://doi.org/10.1007/978-3-642-39712-7_8
TI - Classification and Outlier Detection Based on Topic Based Pattern Synthesis
T2 - Lecture Notes in Computer Science
AU - Kokkula, Samrat
AU - Musti, Narasimha Murty
PY - 2013
DA - 2013/07/11
PB - Springer Nature
SP - 99-114
SN - 0302-9743
SN - 1611-3349
SN - 1861-2075
SN - 1861-2083
ER -
Cite this
BibTex (up to 50 authors)
Copy
@incollection{2013_Kokkula,
author = {Samrat Kokkula and Narasimha Murty Musti},
title = {Classification and Outlier Detection Based on Topic Based Pattern Synthesis},
publisher = {Springer Nature},
year = {2013},
pages = {99--114},
month = {jul}
}