Speech signal authentication and self-recovery based on DTWT and ADPCM

Guardado en:
Detalles Bibliográficos
Publicado en:Multimedia Tools and Applications vol. 83, no. 31 (Sep 2024), p. 76341
Autor principal: Quiñonez-Carbajal, Maria T.
Otros Autores: Reyes-Reyes, Rogelio, Ponomaryov, Volodymyr, Cruz-Ramos, Clara, Garcia-Salgado, Beatriz P.
Publicado:
Springer Nature B.V.
Materias:
Acceso en línea:Citation/Abstract
Full Text - PDF
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!

MARC

LEADER 00000nab a2200000uu 4500
001 3100356690
003 UK-CbPIL
022 |a 1380-7501 
022 |a 1573-7721 
024 7 |a 10.1007/s11042-024-18614-0  |2 doi 
035 |a 3100356690 
045 2 |b d20240901  |b d20240930 
084 |a 108528  |2 nlm 
100 1 |a Quiñonez-Carbajal, Maria T.  |u Instituto Politécnico Nacional, ESIME Unidad Culhuacan, México City, México (GRID:grid.418275.d) (ISNI:0000 0001 2165 8782) 
245 1 |a Speech signal authentication and self-recovery based on DTWT and ADPCM 
260 |b Springer Nature B.V.  |c Sep 2024 
513 |a Journal Article 
520 3 |a The digital voice is multimedia content of great importance, given the range of applications where it can be found. This paper addresses the shortcomings of existing voice authentication algorithms, presenting a completely blind speech authentication and recovery method based on fragile watermarking using the Least Significant Bit (LSB) method. This scheme obtains a compressed version of the original speech signal by Adaptive Differential Pulse Code Modulation (ADPCM) coding and the Discrete-Time Wavelet Transform (DTWT). Authentication bits are then generated by the SHA256 hash function, and the watermark is afterward embedded in the last three LSBs of the original audio samples. Experimental results evaluated on five different audio databases, each comprising speech signals recorded in different situations, contexts, and languages, have demonstrated a high embedding payload and imperceptibility of the watermark, obtaining an average Signal-to-Noise Ratio (SNR) value above 40dB<inline-graphic xlink:href="11042_2024_18614_Article_IEq1.gif" />. Furthermore, the proposed method demonstrates a strong ability to accurately locate and restore up to 50% of a speech signal that has been tampered with, using no additional information. Moreover, the recovered speech signal is intelligible and has an SNR value higher than other recovery schemes, justifying the efficiency of the proposed method. 
653 |a Recovery 
653 |a Speech 
653 |a Watermarking 
653 |a Wavelet transforms 
653 |a Hash based algorithms 
653 |a Differential pulse code modulation 
653 |a Embedding 
653 |a Pulse code modulation 
653 |a Multimedia 
653 |a Signal to noise ratio 
700 1 |a Reyes-Reyes, Rogelio  |u Instituto Politécnico Nacional, ESIME Unidad Culhuacan, México City, México (GRID:grid.418275.d) (ISNI:0000 0001 2165 8782) 
700 1 |a Ponomaryov, Volodymyr  |u Instituto Politécnico Nacional, ESIME Unidad Culhuacan, México City, México (GRID:grid.418275.d) (ISNI:0000 0001 2165 8782) 
700 1 |a Cruz-Ramos, Clara  |u Instituto Politécnico Nacional, ESIME Unidad Culhuacan, México City, México (GRID:grid.418275.d) (ISNI:0000 0001 2165 8782) 
700 1 |a Garcia-Salgado, Beatriz P.  |u Instituto Politécnico Nacional, ESIME Unidad Culhuacan, México City, México (GRID:grid.418275.d) (ISNI:0000 0001 2165 8782) 
773 0 |t Multimedia Tools and Applications  |g vol. 83, no. 31 (Sep 2024), p. 76341 
786 0 |d ProQuest  |t ABI/INFORM Global 
856 4 1 |3 Citation/Abstract  |u https://www.proquest.com/docview/3100356690/abstract/embedded/7BTGNMKEMPT1V9Z2?source=fedsrch 
856 4 0 |3 Full Text - PDF  |u https://www.proquest.com/docview/3100356690/fulltextPDF/embedded/7BTGNMKEMPT1V9Z2?source=fedsrch