Application of a Hybrid Bi-LSTM-CRF model to the task of Russian Named Entity Recognition

Тип публикацииBook Chapter
Дата публикации2017-11-28
SCImago Q4
SJR0.181
CiteScore1.1
Impact factor
ISSN18650929, 18650937
Краткое описание
Named Entity Recognition (NER) is one of the most common tasks of the natural language processing. The purpose of NER is to find and classify tokens in text documents into predefined categories called tags, such as person names, quantity expressions, percentage expressions, names of locations, organizations, as well as expression of time, currency and others. Although there is a number of approaches have been proposed for this task in Russian language, it still has a substantial potential for the better solutions. In this work, we studied several deep neural network models starting from vanilla Bi-directional Long Short Term Memory (Bi-LSTM) then supplementing it with Conditional Random Fields (CRF) as well as highway networks and finally adding external word embeddings. All models were evaluated across three datasets Gareev’s, Person-1000 and FactRuEval 2016. We found that extension of Bi-LSTM model with CRF significantly increased the quality of predictions. Encoding input tokens with external word embeddings reduced training time and allowed to achieve state of the art for the Russian NER task.
Для доступа к списку цитирований публикации необходимо авторизоваться.

Топ-30

Журналы

1
2
InterCarto InterGIS
2 публикации, 16.67%
Lecture Notes in Electrical Engineering
2 публикации, 16.67%
Scientometrics
1 публикация, 8.33%
Arabian Journal for Science and Engineering
1 публикация, 8.33%
Lecture Notes in Computer Science
1 публикация, 8.33%
Modeling and Analysis of Information Systems
1 публикация, 8.33%
Automatic Control and Computer Sciences
1 публикация, 8.33%
1
2

Издатели

1
2
3
4
5
Springer Nature
5 публикаций, 41.67%
LLC Kartfond
2 публикации, 16.67%
Institute of Electrical and Electronics Engineers (IEEE)
2 публикации, 16.67%
P.G. Demidov Yaroslavl State University
1 публикация, 8.33%
Allerton Press
1 публикация, 8.33%
Association for Computing Machinery (ACM)
1 публикация, 8.33%
1
2
3
4
5
  • Мы не учитываем публикации, у которых нет DOI.
  • Статистика публикаций обновляется еженедельно.

Вы ученый?

Создайте профиль, чтобы получать персональные рекомендации коллег, конференций и новых статей.
 Войти с ORCID
Метрики
12
Поделиться
Цитировать
ГОСТ |
Цитировать
Le T. A. et al. Application of a Hybrid Bi-LSTM-CRF model to the task of Russian Named Entity Recognition // Communications in Computer and Information Science. 2017. pp. 91-103.
ГОСТ со всеми авторами (до 50) Скопировать
Le T. A., Arkhipov M. Y., Burtsev M. S. Application of a Hybrid Bi-LSTM-CRF model to the task of Russian Named Entity Recognition // Communications in Computer and Information Science. 2017. pp. 91-103.
RIS |
Цитировать
TY - GENERIC
DO - 10.1007/978-3-319-71746-3_8
UR - https://doi.org/10.1007/978-3-319-71746-3_8
TI - Application of a Hybrid Bi-LSTM-CRF model to the task of Russian Named Entity Recognition
T2 - Communications in Computer and Information Science
AU - Le, The Anh
AU - Arkhipov, Mikhail Y
AU - Burtsev, Mikhail S.
PY - 2017
DA - 2017/11/28
PB - Springer Nature
SP - 91-103
SN - 1865-0929
SN - 1865-0937
ER -
BibTex
Цитировать
BibTex (до 50 авторов) Скопировать
@incollection{2017_Le,
author = {The Anh Le and Mikhail Y Arkhipov and Mikhail S. Burtsev},
title = {Application of a Hybrid Bi-LSTM-CRF model to the task of Russian Named Entity Recognition},
publisher = {Springer Nature},
year = {2017},
pages = {91--103},
month = {nov}
}
Ошибка в публикации?