Institute of Electrical and Electronics Engineers (IEEE)

IEEE/ACM Transactions on Audio Speech and Language Processing

vol. 23, issue 1, pages 7-19

A Regression Approach to Speech Enhancement Based on Deep Neural Networks

Publication type: Journal Article

Publication date: 2015-01-01

scimago Q1

wos Q1

white level БС1

SJR: 1.061

CiteScore: 12.4

Impact factor: 5.1

ISSN: 2329-9290, 2329-9304

DOI: 10.1109/taslp.2014.2364452

Electrical and Electronic Engineering

Computer Science (miscellaneous)

Computational Mathematics

Acoustics and Ultrasonics

Abstract

In contrast to the conventional minimum mean square error (MMSE)-based noise reduction techniques, we propose a supervised method to enhance speech by means of finding a mapping function between noisy and clean speech signals based on deep neural networks (DNNs). In order to be able to handle a wide range of additive noises in real-world situations, a large training set that encompasses many possible combinations of speech and noise types, is first designed. A DNN architecture is then employed as a nonlinear regression function to ensure a powerful modeling capability. Several techniques have also been proposed to improve the DNN-based speech enhancement system, including global variance equalization to alleviate the over-smoothing problem of the regression model, and the dropout and noise-aware training strategies to further improve the generalization capability of DNNs to unseen noise conditions. Experimental results demonstrate that the proposed framework can achieve significant improvements in both objective and subjective measures over the conventional MMSE based technique. It is also interesting to observe that the proposed DNN approach can well suppress highly nonstationary noise, which is tough to handle in general. Furthermore, the resulting DNN model, trained with artificial synthesized data, is also effective in dealing with noisy speech data recorded in real-world scenarios without the generation of the annoying musical artifact commonly observed in conventional enhancement methods.
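
The abstract names global variance equalization as a remedy for over-smoothed regression output. The following is a minimal sketch of that idea under stated assumptions, not the authors' formulation: it assumes a single, dimension-independent scaling factor `beta = sqrt(GV_ref / GV_est)` and rescales the estimated features around their own mean. The function names, the toy data, and the choice to pool the variance over all dimensions are illustrative assumptions.

```python
from math import sqrt
from statistics import fmean, pvariance


def global_variance(frames):
    """Variance pooled over all frames and all feature dimensions."""
    values = [x for frame in frames for x in frame]
    return pvariance(values)


def equalize_global_variance(estimated, reference):
    """Rescale estimated features around their mean so that their pooled
    variance matches the pooled variance of the reference features.

    Assumption: one scaling factor shared by every feature dimension.
    """
    gv_est = global_variance(estimated)
    gv_ref = global_variance(reference)
    if gv_est == 0.0:
        # A constant estimate cannot be stretched; return a copy unchanged.
        return [list(frame) for frame in estimated]
    beta = sqrt(gv_ref / gv_est)
    mean = fmean([x for frame in estimated for x in frame])
    return [[mean + beta * (x - mean) for x in frame] for frame in estimated]


# Toy data (hypothetical): the "estimated" frames are an over-smoothed
# version of the reference frames, with the same mean but less spread.
reference = [[0.0, 4.0], [8.0, 4.0], [2.0, 6.0]]
estimated = [[3.0, 4.0], [5.0, 4.0], [3.5, 4.5]]
equalized = equalize_global_variance(estimated, reference)
```

In this sketch the reference variance would normally be estimated once from clean training features and then reused at enhancement time, since clean references are not available for unseen noisy input.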

Top 30 publishers

Publisher	Publications	Share
Institute of Electrical and Electronics Engineers (IEEE)	617	60.79%
Springer Nature	114	11.23%
Elsevier	114	11.23%
MDPI	43	4.24%
Association for Computing Machinery (ACM)	19	1.87%
Acoustical Society of America (ASA)	11	1.08%
IOP Publishing	10	0.99%
Wiley	10	0.99%
Institute of Electronics, Information and Communications Engineers (IEICE)	10	0.99%
IOS Press	7	0.69%
SAGE	7	0.69%
Institution of Engineering and Technology (IET)	5	0.49%
AIP Publishing	3	0.3%
World Scientific	3	0.3%
Taylor & Francis	3	0.3%
JMIR Publications	2	0.2%
Frontiers Media S.A.	2	0.2%
Tech Science Press	2	0.2%
Hindawi Limited	2	0.2%
SAE International	2	0.2%
American Geophysical Union	1	0.1%
American Physical Society (APS)	1	0.1%
ASME International	1	0.1%
The Royal Society	1	0.1%
Ovid Technologies (Wolters Kluwer Health)	1	0.1%
Alexandria University	1	0.1%
Public Library of Science (PLoS)	1	0.1%
Royal Society of Chemistry (RSC)	1	0.1%
Pleiades Publishing	1	0.1%

Publications without a DOI are not counted.
Publication statistics are updated weekly.

Cite

GOST

Xu Y. et al. A Regression Approach to Speech Enhancement Based on Deep Neural Networks // IEEE/ACM Transactions on Audio Speech and Language Processing. 2015. Vol. 23. No. 1. pp. 7-19.

GOST with all authors (up to 50)

Xu Y., Du J., Dai L., Lee C. A Regression Approach to Speech Enhancement Based on Deep Neural Networks // IEEE/ACM Transactions on Audio Speech and Language Processing. 2015. Vol. 23. No. 1. pp. 7-19.

RIS

TY  - JOUR
DO  - 10.1109/taslp.2014.2364452
UR  - https://doi.org/10.1109/taslp.2014.2364452
TI  - A Regression Approach to Speech Enhancement Based on Deep Neural Networks
T2  - IEEE/ACM Transactions on Audio Speech and Language Processing
AU  - Xu, Yong
AU  - Du, Jun
AU  - Dai, Li-Rong
AU  - Lee, Chin-Hui
PY  - 2015
DA  - 2015/01/01
PB  - Institute of Electrical and Electronics Engineers (IEEE)
SP  - 7
EP  - 19
IS  - 1
VL  - 23
SN  - 2329-9290
SN  - 2329-9304
ER  -

BibTeX (up to 50 authors)

@article{2015_Xu,
  author = {Yong Xu and Jun Du and Li-Rong Dai and Chin-Hui Lee},
  title = {A Regression Approach to Speech Enhancement Based on Deep Neural Networks},
  journal = {IEEE/ACM Transactions on Audio Speech and Language Processing},
  year = {2015},
  volume = {23},
  publisher = {Institute of Electrical and Electronics Engineers (IEEE)},
  month = {jan},
  url = {https://doi.org/10.1109/taslp.2014.2364452},
  number = {1},
  pages = {7--19},
  doi = {10.1109/taslp.2014.2364452}
}

MLA

Xu, Yong, et al. “A Regression Approach to Speech Enhancement Based on Deep Neural Networks.” IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 23, no. 1, Jan. 2015, pp. 7-19. https://doi.org/10.1109/taslp.2014.2364452.

ISSN: 2329-9290 (Print), 2329-9304 (Electronic)

Top 30 journals

Journal	Publications	Share
IEEE/ACM Transactions on Audio Speech and Language Processing	104	10.25%
Speech Communication	34	3.35%
IEEE Access	32	3.15%
IEEE Signal Processing Letters	25	2.46%
Applied Acoustics	20	1.97%
Circuits, Systems, and Signal Processing	17	1.67%
Applied Sciences (Switzerland)	14	1.38%
Lecture Notes in Computer Science	13	1.28%
Journal of the Acoustical Society of America	11	1.08%
Computer Speech and Language	11	1.08%
Journal of Intelligent and Fuzzy Systems	10	0.99%
Electronics (Switzerland)	9	0.89%
Digital Signal Processing: A Review Journal	8	0.79%
IEEE Journal on Selected Topics in Signal Processing	8	0.79%
IEEE Transactions on Audio Speech and Language Processing	8	0.79%
Multimedia Tools and Applications	7	0.69%
IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences	7	0.69%
Lecture Notes in Networks and Systems	7	0.69%
Eurasip Journal on Audio, Speech, and Music Processing	6	0.59%
International Journal of Speech Technology	6	0.59%
Eurasip Journal on Advances in Signal Processing	6	0.59%
Neural Networks	6	0.59%
Journal of Physics: Conference Series	6	0.59%
Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies	5	0.49%
Sensors	5	0.49%
Lecture Notes in Electrical Engineering	5	0.49%
Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference	4	0.39%
IEEE Open Journal of Signal Processing	4	0.39%
Communications in Computer and Information Science	4	0.39%