том 23 издание 1 страницы 7-19

A Regression Approach to Speech Enhancement Based on Deep Neural Networks

Тип публикацииJournal Article
Дата публикации2015-01-01
scimago Q1
wos Q1
white level БС1
SJR1.061
CiteScore12.4
Impact factor5.1
ISSN23299290, 23299304
Electrical and Electronic Engineering
Computer Science (miscellaneous)
Computational Mathematics
Acoustics and Ultrasonics
Краткое описание
In contrast to the conventional minimum mean square error (MMSE)-based noise reduction techniques, we propose a supervised method to enhance speech by means of finding a mapping function between noisy and clean speech signals based on deep neural networks (DNNs). In order to be able to handle a wide range of additive noises in real-world situations, a large training set that encompasses many possible combinations of speech and noise types, is first designed. A DNN architecture is then employed as a nonlinear regression function to ensure a powerful modeling capability. Several techniques have also been proposed to improve the DNN-based speech enhancement system, including global variance equalization to alleviate the over-smoothing problem of the regression model, and the dropout and noise-aware training strategies to further improve the generalization capability of DNNs to unseen noise conditions. Experimental results demonstrate that the proposed framework can achieve significant improvements in both objective and subjective measures over the conventional MMSE based technique. It is also interesting to observe that the proposed DNN approach can well suppress highly nonstationary noise, which is tough to handle in general. Furthermore, the resulting DNN model, trained with artificial synthesized data, is also effective in dealing with noisy speech data recorded in real-world scenarios without the generation of the annoying musical artifact commonly observed in conventional enhancement methods.
Для доступа к списку цитирований публикации необходимо авторизоваться.
Для доступа к списку профилей, цитирующих публикацию, необходимо авторизоваться.

Топ-30

Журналы

20
40
60
80
100
120
IEEE/ACM Transactions on Audio Speech and Language Processing
104 публикации, 10.25%
Speech Communication
34 публикации, 3.35%
IEEE Access
32 публикации, 3.15%
IEEE Signal Processing Letters
25 публикаций, 2.46%
Applied Acoustics
20 публикаций, 1.97%
Circuits, Systems, and Signal Processing
17 публикаций, 1.67%
Applied Sciences (Switzerland)
14 публикаций, 1.38%
Lecture Notes in Computer Science
13 публикаций, 1.28%
Journal of the Acoustical Society of America
11 публикаций, 1.08%
Computer Speech and Language
11 публикаций, 1.08%
Journal of Intelligent and Fuzzy Systems
10 публикаций, 0.99%
Electronics (Switzerland)
9 публикаций, 0.89%
Digital Signal Processing: A Review Journal
8 публикаций, 0.79%
IEEE Journal on Selected Topics in Signal Processing
8 публикаций, 0.79%
IEEE Transactions on Audio Speech and Language Processing
8 публикаций, 0.79%
Multimedia Tools and Applications
7 публикаций, 0.69%
IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
7 публикаций, 0.69%
Lecture Notes in Networks and Systems
7 публикаций, 0.69%
Eurasip Journal on Audio, Speech, and Music Processing
6 публикаций, 0.59%
International Journal of Speech Technology
6 публикаций, 0.59%
Eurasip Journal on Advances in Signal Processing
6 публикаций, 0.59%
Neural Networks
6 публикаций, 0.59%
Journal of Physics: Conference Series
6 публикаций, 0.59%
Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies
5 публикаций, 0.49%
Sensors
5 публикаций, 0.49%
Lecture Notes in Electrical Engineering
5 публикаций, 0.49%
Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference
4 публикации, 0.39%
IEEE Open Journal of Signal Processing
4 публикации, 0.39%
Communications in Computer and Information Science
4 публикации, 0.39%
20
40
60
80
100
120

Издатели

100
200
300
400
500
600
700
Institute of Electrical and Electronics Engineers (IEEE)
617 публикаций, 60.79%
Springer Nature
114 публикаций, 11.23%
Elsevier
114 публикаций, 11.23%
MDPI
43 публикации, 4.24%
Association for Computing Machinery (ACM)
19 публикаций, 1.87%
Acoustical Society of America (ASA)
11 публикаций, 1.08%
IOP Publishing
10 публикаций, 0.99%
Wiley
10 публикаций, 0.99%
Institute of Electronics, Information and Communications Engineers (IEICE)
10 публикаций, 0.99%
IOS Press
7 публикаций, 0.69%
SAGE
7 публикаций, 0.69%
Institution of Engineering and Technology (IET)
5 публикаций, 0.49%
AIP Publishing
3 публикации, 0.3%
World Scientific
3 публикации, 0.3%
Taylor & Francis
3 публикации, 0.3%
JMIR Publications
2 публикации, 0.2%
Frontiers Media S.A.
2 публикации, 0.2%
Tech Science Press
2 публикации, 0.2%
Hindawi Limited
2 публикации, 0.2%
SAE International
2 публикации, 0.2%
American Geophysical Union
1 публикация, 0.1%
American Physical Society (APS)
1 публикация, 0.1%
ASME International
1 публикация, 0.1%
The Royal Society
1 публикация, 0.1%
Ovid Technologies (Wolters Kluwer Health)
1 публикация, 0.1%
Alexandria University
1 публикация, 0.1%
Public Library of Science (PLoS)
1 публикация, 0.1%
Royal Society of Chemistry (RSC)
1 публикация, 0.1%
Pleiades Publishing
1 публикация, 0.1%
100
200
300
400
500
600
700
  • Мы не учитываем публикации, у которых нет DOI.
  • Статистика публикаций обновляется еженедельно.

Вы ученый?

Создайте профиль, чтобы получать персональные рекомендации коллег, конференций и новых статей.
Метрики
1k
Поделиться
Цитировать
ГОСТ |
Цитировать
Xu Y. et al. A Regression Approach to Speech Enhancement Based on Deep Neural Networks // IEEE/ACM Transactions on Audio Speech and Language Processing. 2015. Vol. 23. No. 1. pp. 7-19.
ГОСТ со всеми авторами (до 50) Скопировать
Xu Y., Du J., DAI L., Lee C. A Regression Approach to Speech Enhancement Based on Deep Neural Networks // IEEE/ACM Transactions on Audio Speech and Language Processing. 2015. Vol. 23. No. 1. pp. 7-19.
RIS |
Цитировать
TY - JOUR
DO - 10.1109/taslp.2014.2364452
UR - https://doi.org/10.1109/taslp.2014.2364452
TI - A Regression Approach to Speech Enhancement Based on Deep Neural Networks
T2 - IEEE/ACM Transactions on Audio Speech and Language Processing
AU - Xu, Yong
AU - Du, Jun
AU - DAI, LI-RONG
AU - Lee, Chin-Hui
PY - 2015
DA - 2015/01/01
PB - Institute of Electrical and Electronics Engineers (IEEE)
SP - 7-19
IS - 1
VL - 23
SN - 2329-9290
SN - 2329-9304
ER -
BibTex |
Цитировать
BibTex (до 50 авторов) Скопировать
@article{2015_Xu,
author = {Yong Xu and Jun Du and LI-RONG DAI and Chin-Hui Lee},
title = {A Regression Approach to Speech Enhancement Based on Deep Neural Networks},
journal = {IEEE/ACM Transactions on Audio Speech and Language Processing},
year = {2015},
volume = {23},
publisher = {Institute of Electrical and Electronics Engineers (IEEE)},
month = {jan},
url = {https://doi.org/10.1109/taslp.2014.2364452},
number = {1},
pages = {7--19},
doi = {10.1109/taslp.2014.2364452}
}
MLA
Цитировать
Xu, Yong, et al. “A Regression Approach to Speech Enhancement Based on Deep Neural Networks.” IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 23, no. 1, Jan. 2015, pp. 7-19. https://doi.org/10.1109/taslp.2014.2364452.
Ошибка в публикации?