Gated recurrent unit predictor model-based adaptive differential pulse code modulation speech decoder

Gardado en:

Detalles Bibliográficos
Publicado en:	EURASIP Journal on Audio, Speech, and Music Processing vol. 2024, no. 1 (Dec 2024), p. 6
Autor Principal:	Sheferaw, Gebremichael Kibret
Outros autores:	Mwangi, Waweru, Kimwele, Michael, Mamuye, Adane
Publicado:	Springer Nature B.V.
Materias:	Speech Algorithms Neural networks Decoding Prediction models Predictions Differential pulse code modulation Pulse code modulation Coding
Acceso en liña:	Citation/Abstract Full Text - PDF
Etiquetas:	Engadir etiqueta Sen Etiquetas, Sexa o primeiro en etiquetar este rexistro!

MARC


LEADER	00000nab a2200000uu 4500
001	2916737247
003	UK-CbPIL
022			\|a 1687-4714
022			\|a 1687-4722
024	7		\|a 10.1186/s13636-023-00325-3 \|2 doi
035			\|a 2916737247
045	2		\|b d20241201 \|b d20241231
084			\|a 130317 \|2 nlm
100	1		\|a Sheferaw, Gebremichael Kibret \|u Jomo Kenyatta University of Agriculture and Technology, School of Computing and Information Technology, Nairobi, Kenya (GRID:grid.411943.a) (ISNI:0000 0000 9146 7108)
245	1		\|a Gated recurrent unit predictor model-based adaptive differential pulse code modulation speech decoder
260			\|b Springer Nature B.V. \|c Dec 2024
513			\|a Journal Article
520	3		\|a Speech coding is a method to reduce the amount of data needs to represent speech signals by exploiting the statistical properties of the speech signal. Recently, in the speech coding process, a neural network prediction model has gained attention as the reconstruction process of a nonlinear and nonstationary speech signal. This study proposes a novel approach to improve speech coding performance by using a gated recurrent unit (GRU)-based adaptive differential pulse code modulation (ADPCM) system. This GRU predictor model is trained using a data set of speech samples from the DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus actual sample and the ADPCM fixed-predictor output speech sample. Our contribution lies in the development of an algorithm for training the GRU predictive model that can improve its performance in speech coding prediction and a new offline trained predictive model for speech decoder. The results indicate that the proposed system significantly improves the accuracy of speech prediction, demonstrating its potential for speech prediction applications. Overall, this work presents a unique application of the GRU predictive model with ADPCM decoding in speech signal compression, providing a promising approach for future research in this field.
653			\|a Speech
653			\|a Algorithms
653			\|a Neural networks
653			\|a Decoding
653			\|a Prediction models
653			\|a Predictions
653			\|a Differential pulse code modulation
653			\|a Pulse code modulation
653			\|a Coding
700	1		\|a Mwangi, Waweru \|u Jomo Kenyatta University of Agriculture and Technology, School of Computing and Information Technology, Nairobi, Kenya (GRID:grid.411943.a) (ISNI:0000 0000 9146 7108)
700	1		\|a Kimwele, Michael \|u Jomo Kenyatta University of Agriculture and Technology, School of Computing and Information Technology, Nairobi, Kenya (GRID:grid.411943.a) (ISNI:0000 0000 9146 7108)
700	1		\|a Mamuye, Adane \|u Addis Ababa University Institute of Technology, School of Information Technology and Engineering, Addis Ababa, Ethiopia (GRID:grid.7123.7) (ISNI:0000 0001 1250 5688)
773	0		\|t EURASIP Journal on Audio, Speech, and Music Processing \|g vol. 2024, no. 1 (Dec 2024), p. 6
786	0		\|d ProQuest \|t Advanced Technologies & Aerospace Database
856	4	1	\|3 Citation/Abstract \|u https://www.proquest.com/docview/2916737247/abstract/embedded/7BTGNMKEMPT1V9Z2?source=fedsrch
856	4	0	\|3 Full Text - PDF \|u https://www.proquest.com/docview/2916737247/fulltextPDF/embedded/7BTGNMKEMPT1V9Z2?source=fedsrch