Speech signal authentication and self-recovery based on DTWT and ADPCM

Bibliographic Details
Published in:	Multimedia Tools and Applications vol. 83, no. 31 (Sep 2024), p. 76341
Main Author:	Quiñonez-Carbajal, Maria T.
Other Authors:	Reyes-Reyes, Rogelio, Ponomaryov, Volodymyr, Cruz-Ramos, Clara, Garcia-Salgado, Beatriz P.
Published:	Springer Nature B.V.
Subjects:	Recovery; Speech; Watermarking; Wavelet transforms; Hash based algorithms; Differential pulse code modulation; Embedding; Pulse code modulation; Multimedia; Signal to noise ratio
Online Access:	Citation/Abstract; Full Text - PDF

MARC


LEADER	00000nab a2200000uu 4500
001	3100356690
003	UK-CbPIL
022			\|a 1380-7501
022			\|a 1573-7721
024	7		\|a 10.1007/s11042-024-18614-0 \|2 doi
035			\|a 3100356690
045	2		\|b d20240901 \|b d20240930
084			\|a 108528 \|2 nlm
100	1		\|a Quiñonez-Carbajal, Maria T. \|u Instituto Politécnico Nacional, ESIME Unidad Culhuacan, México City, México (GRID:grid.418275.d) (ISNI:0000 0001 2165 8782)
245	1		\|a Speech signal authentication and self-recovery based on DTWT and ADPCM
260			\|b Springer Nature B.V. \|c Sep 2024
513			\|a Journal Article
520	3		\|a The digital voice is multimedia content of great importance, given the range of applications where it can be found. This paper addresses the shortcomings of existing voice authentication algorithms, presenting a completely blind speech authentication and recovery method based on fragile watermarking using the Least Significant Bit (LSB) method. This scheme obtains a compressed version of the original speech signal by Adaptive Differential Pulse Code Modulation (ADPCM) coding and the Discrete-Time Wavelet Transform (DTWT). Authentication bits are then generated by the SHA256 hash function, and the watermark is afterward embedded in the last three LSBs of the original audio samples. Experimental results evaluated on five different audio databases, each comprising speech signals recorded in different situations, contexts, and languages, have demonstrated a high embedding payload and imperceptibility of the watermark, obtaining an average Signal-to-Noise Ratio (SNR) value above 40 dB. Furthermore, the proposed method demonstrates a strong ability to accurately locate and restore up to 50% of a speech signal that has been tampered with, using no additional information. Moreover, the recovered speech signal is intelligible and has an SNR value higher than other recovery schemes, justifying the efficiency of the proposed method.
653			\|a Recovery
653			\|a Speech
653			\|a Watermarking
653			\|a Wavelet transforms
653			\|a Hash based algorithms
653			\|a Differential pulse code modulation
653			\|a Embedding
653			\|a Pulse code modulation
653			\|a Multimedia
653			\|a Signal to noise ratio
700	1		\|a Reyes-Reyes, Rogelio \|u Instituto Politécnico Nacional, ESIME Unidad Culhuacan, México City, México (GRID:grid.418275.d) (ISNI:0000 0001 2165 8782)
700	1		\|a Ponomaryov, Volodymyr \|u Instituto Politécnico Nacional, ESIME Unidad Culhuacan, México City, México (GRID:grid.418275.d) (ISNI:0000 0001 2165 8782)
700	1		\|a Cruz-Ramos, Clara \|u Instituto Politécnico Nacional, ESIME Unidad Culhuacan, México City, México (GRID:grid.418275.d) (ISNI:0000 0001 2165 8782)
700	1		\|a Garcia-Salgado, Beatriz P. \|u Instituto Politécnico Nacional, ESIME Unidad Culhuacan, México City, México (GRID:grid.418275.d) (ISNI:0000 0001 2165 8782)
773	0		\|t Multimedia Tools and Applications \|g vol. 83, no. 31 (Sep 2024), p. 76341
786	0		\|d ProQuest \|t ABI/INFORM Global
856	4	1	\|3 Citation/Abstract \|u https://www.proquest.com/docview/3100356690/abstract/embedded/7BTGNMKEMPT1V9Z2?source=fedsrch
856	4	0	\|3 Full Text - PDF \|u https://www.proquest.com/docview/3100356690/fulltextPDF/embedded/7BTGNMKEMPT1V9Z2?source=fedsrch