Subsequence kernels-based Arabic text classification

Attia Nehar ¹

Benmessaoud Abdelkader ²

Hadda Cherroun ²

Djelloul Ziadi ³


¹ Université Ziane Achour, Djelfa, Algérie

² Laboratoire d'Informatique et Mathématiques, Université Amar Telidji, Laghouat, Algérie

³ Laboratoire LITIS, EA 4108, Normandie Université, Rouen, France

Publication type: Proceedings Article

Publication date: 2014-11-01

Institute of Electrical and Electronics Engineers (IEEE)

DOI: 10.1109/AICCSA.2014.7073200


Abstract

Kernel methods have enjoyed great success in machine learning. This success is mainly due to their flexibility in handling the high-dimensional feature spaces of complex data such as graphs, trees, or text. In the field of text classification (TC), they have outperformed traditional algorithms. For textual data, several kernels have been introduced (P-spectrum, All-Subsequences, Gap-Weighted Subsequences, ...) to improve the performance of TC systems. In this paper, we build a system for Arabic text classification (ATC) that takes into account the order and co-occurrence of words within a text. Documents are represented by transducers, a particular kind of automata. This representation allows an efficient implementation of the subsequence kernel. An empirical study is conducted to evaluate the ATC system on the large SPA corpus. The results show an improvement in classification precision.
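To make the idea of a gap-weighted subsequence kernel over words concrete, here is a minimal sketch in Python. It is not the transducer-based implementation described in the abstract. It uses the standard textbook dynamic-programming recursion instead, and the function name `gap_weighted_kernel`, the subsequence length `p`, and the decay factor `lam` are illustrative assumptions rather than names or settings taken from the paper.

```python
def gap_weighted_kernel(s, t, p, lam):
    """Gap-weighted subsequence kernel of order p between token sequences s and t.

    Every common (possibly non-contiguous) subsequence of length p contributes
    lam ** (span in s + span in t), so occurrences with gaps are down-weighted.
    Illustrative sketch only: plain dynamic programming, not transducers.
    """
    n, m = len(s), len(t)
    if p < 1 or min(n, m) < p:
        return 0.0

    # dps[i][j]: total weight of common subsequences of the current length
    # whose last token is s[i-1] in s and t[j-1] in t (1-based indices).
    dps = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            if s[i - 1] == t[j - 1]:
                dps[i][j] = lam * lam

    for _ in range(2, p + 1):
        # dp[i][j]: decayed sum of dps over all end positions (k <= i, k' <= j).
        dp = [[0.0] * (m + 1) for _ in range(n + 1)]
        nxt = [[0.0] * (m + 1) for _ in range(n + 1)]
        for i in range(1, n + 1):
            for j in range(1, m + 1):
                dp[i][j] = (dps[i][j]
                            + lam * dp[i - 1][j]
                            + lam * dp[i][j - 1]
                            - lam * lam * dp[i - 1][j - 1])
                if s[i - 1] == t[j - 1]:
                    # Extend every shorter match that ends strictly earlier.
                    nxt[i][j] = lam * lam * dp[i - 1][j - 1]
        dps = nxt

    return sum(map(sum, dps))


# Character-level check: "cat" vs "cat" with p = 2 gives 2*lam**4 + lam**6.
print(gap_weighted_kernel(list("cat"), list("cat"), 2, 0.5))  # 0.140625

# Word-level use, closer to the setting in the abstract (toy sentences).
doc_a = "the market prices rose sharply".split()
doc_b = "the prices rose".split()
print(gap_weighted_kernel(doc_a, doc_b, 2, 0.5))
```

Because the sequences are lists of arbitrary hashable tokens, the same function works on characters or on words. In a classifier, the pairwise kernel values would typically be normalised and collected into a Gram matrix for a kernel machine such as an SVM. That step is outside this sketch.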


