Efficient Audio Watermarking via Advanced Signal Processing: EMD–SVD Decomposition and Intelligent Embedding for Secure Multimedia

Yasmin Makki Mohialden; Mohammad Mosleh; Jamal N. Hasoon; Reihaneh Khorsand

doi:10.57647/ijm2c.2027.1701.08

Yasmin Makki Mohialden¹
Mohammad Mosleh*²
Jamal N. Hasoon³
Reihaneh Khorsand¹

¹ Department of Computer Engineering, Isf.C., Islamic Azad University, Isfahan, Iran
² Department of Computer Engineering, Dez.C., Islamic Azad University, Dezful, Iran
³ Computer Science Department, College of Science, Mustansiriyah University, Baghdad, Iraq

Received: 10-12-2025

Revised: 12-06-2026

Accepted: 13-06-2026

Published Online: 20-06-2026

This work is licensed under a Creative Commons Attribution 4.0 International License.

How to Cite

Makki Mohialden, Y., Mosleh, M., N. Hasoon, J., & Khorsand, R. (2025). Efficient Audio Watermarking via Advanced Signal Processing: EMD–SVD Decomposition and Intelligent Embedding for Secure Multimedia. International Journal of Mathematical Modelling & Computations. https://doi.org/10.57647/ijm2c.2027.1701.08

Abstract

Audio watermarking is an important technique for copyright protection, multimedia authentication, and secure content distribution. However, many existing methods still struggle to balance imperceptibility, robustness, and embedding capacity, mainly because watermark insertion often follows fixed rules that do not sufficiently reflect the local behaviour of audio signals. This paper proposes an adaptive audio watermarking framework in which each processing stage is designed to support a specific decision in the embedding and extraction pipeline. First, empirical mode decomposition (EMD) is used to decompose each audio frame into intrinsic mode functions (IMFs), providing a signal-adaptive representation of the non-stationary host audio. Then, a 1D convolutional neural network (1D-CNN) extracts representative features from these components. Based on these features, K-Means++ clustering identifies stable and perceptually suitable IMFs with favourable energy and variance characteristics. The watermark is embedded in the SVD domain by modifying the dominant singular values through a 2-bit quantization strategy, which improves payload capacity while preserving audio quality. Finally, an XGBoost classifier learns the selected embedding locations and supports blind watermark extraction. Experiments on four audio genres show that the proposed method achieves an average SNR of 45.1 dB, ODG of −0.28, embedding capacity of 2350 bps, and perfect extraction under no-attack conditions with BER = 0 and NC = 1.0. The method also maintains low BER and high NC under StirMark and common signal-processing attacks, making it suitable for secure audio distribution and copyright protection.
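
The abstract does not spell out the quantizer, so the following is only a minimal sketch of one common way to carry 2 bits in a dominant singular value (quantization index modulation), not the authors' implementation. The function names, the step size `DELTA`, and the toy `frame` matrix are all assumptions made for illustration, and power iteration stands in for a full SVD to keep the example dependency-free.

```python
import math

def matvec(A, x):
    """Multiply matrix A (list of rows) by vector x."""
    return [sum(a * b for a, b in zip(row, x)) for row in A]

def dominant_triplet(A, iters=100):
    """Estimate the largest singular value and its vectors by power iteration."""
    At = [list(col) for col in zip(*A)]
    v = [1.0 / math.sqrt(len(At))] * len(At)
    for _ in range(iters):
        w = matvec(At, matvec(A, v))  # one power-iteration step on A^T A
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    Av = matvec(A, v)
    sigma = math.sqrt(sum(x * x for x in Av))
    u = [x / sigma for x in Av]
    return sigma, u, v

def embed_symbol(A, symbol, delta):
    """Embed a 2-bit symbol (0..3) by moving the dominant singular value
    to the nearest lattice point of the form (4k + symbol) * delta."""
    sigma, u, v = dominant_triplet(A)
    k = round((sigma - symbol * delta) / (4 * delta))
    shift = (4 * k + symbol) * delta - sigma
    # Rank-one update: changes the dominant singular value only, provided
    # it stays the largest one (assumed here; true when sigma >> delta).
    return [[A[i][j] + shift * u[i] * v[j] for j in range(len(v))]
            for i in range(len(u))]

def extract_symbol(A, delta):
    """Blind extraction: only the step size is needed, not the host frame."""
    sigma, _, _ = dominant_triplet(A)
    return round(sigma / delta) % 4

# Hypothetical 4x4 block standing in for a frame of a selected IMF.
frame = [
    [3.1, 6.0, 5.9, 3.0],
    [2.0, 4.1, 4.0, 1.9],
    [1.0, 2.0, 2.1, 1.0],
    [2.1, 3.9, 4.0, 2.0],
]
DELTA = 0.5  # assumed quantization step

marked = [embed_symbol(frame, s, DELTA) for s in range(4)]
recovered = [extract_symbol(m, DELTA) for m in marked]
print(recovered)
```

In this scheme the singular value moves by at most `2 * DELTA`, and a symbol survives any perturbation of the singular value smaller than `DELTA / 2`, so the step size sets the trade-off between imperceptibility and robustness.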

Keywords

Audio Watermarking,
Empirical Mode Decomposition,
Singular Value Decomposition,
XGBoost Classifier,
Copyright Protection

