Speech Signal Enhancement Techniques
Main Article Content
Abstract
Speech enhancement and noise reduction have wide applications in speech processing. They are often employed as preprocessing stage in various applications. The work to be presented in this paper is denoising a single-channel speech signal in the presence of a highly non-stationary background noise in order to improve the perceptible quality and intelligibility of the speech. Real world noise is mostly highly non-stationary and does not affect the speech signal uniformly over the spectrum. This paper explores a set of DFT-based algorithms as single-channel speech enhancement techniques which are as follows: Spectral Subtraction using oversubtraction and spectral floor. Multi-Band Spectral Subtraction (MBSS). Wiener Filter. MMSE of Short-Time Spectral Amplitude (MMSE-STSA) estimator with, and without using SPU modifier. MMSE Log-Spectral Amplitude Estimator with, and without using SPU modifier. Optimally-Modified Log-Spectral Amplitude estimator (OM-LSA). The comparison study results based on subjective and objective tests showed that the Optimally Modified Log-Spectral Amplitude Estimator (OM-LSA) method outperforms all the implemented DFTbased single-channel speech enhancement algorithms
Article Details
References
noise estimation. Proc. IEEE Int. Conf.
Acoust. Speech Signal Processing, 629-
632. [6] Ephraim Y. and Malah D, “Speech enhancement using a minimum mean- square error short-time spectral amplitude
estimator,” IEEE Trans. Acoustics, Speech, Signal Processing, vol. ASSP-32,
pp. 1109-1121, Dec. 1984.
[7] Boll, S.F., “Suppression of acoustic noise in speech using spectral subtraction”, IEEE Trans. on Acoust., Speech, Signal Proc., Vol. ASSP-27, No.2, pp.113-12 April 1979. [8] Middleton D. and Esposito R, “Simulta- neous optimum detection and estimation of signals in noise.” IEEE Trans. Inform. Theory, vol. IT-14, pp. 434444, May 1968 H. L. [9] Cohen L “Optimal Speech Enhancement Under Signal Presence Uncertainty Using Log-Spectral Amplitude Estimator,” La- mar Signal Processing Ltd.2003. [10] Soon L Y., Koh S. N. and Yeo C. K. “Improved Noise Suppression Filter Us ing Self Adaptive Estimator of Probab ity of Speech Absence,” Signal Pro- cessing, vol. 75, pp. 151-159, 1999. [11] Martin R, Wittke Land Jax P., “Optmi- zed Estimation of Spectral Parameters for the Coding of Noisy Speech,"in Proc. Int. Conf. Acoustics, Speech and Signal Pro- cessing, ICASSP 2000, pp. 1479-1482. [12] Cohen L and Berdugo B., “Speech En- hancement for Non-Stationary Noise En- vironments,” to appear in Signal Pro- cessing. [13] Cohen L and Berdugo B., “Speech En- hancement for Non-Stationary Noise En- vironments,” to appear in Signal Pro- cessing. [14] Hu Y. and Loizou P. “Subjective evalua- tion and comparison of speech enhance- ment algorithms,” Speech Communica- tion, 49, 588-601. [15] Sovka, P, “Extended Spectral Substraction:Description and Preliminary Results” [Research Report]. R95-2. Pra- gue, CTU, Faculty of Electrical Engineer- ing 1995. pp 15.