Automatic Speech Recognition Errors Detection And Correction: A Review

Main Article Content

Rahhal Errattahi
Asmaa El Hannani
Hassan Ouahmane

Abstract

Even though Automatic Speech Recognition (ASR) has matured to the point of commercial applications, high error rate in some speech recognition domains remain as one of the main impediment factors to the wide adoption of speech technology, and especially for continuous large vocabulary speech recognition applications. The persistent presence of ASR errors have intensified the need to find alternative techniques to automatically detect and correct such errors. The correction of the transcription errors is very crucial not only to improve the speech recognition accuracy, but also to avoid the propagation of the errors to the subsequent language processing modules such as machine translation. In this paper, basic principles of ASR evaluation are first summarized, and then the state of the current ASR errors detection and correction research is reviewed. We focus on emerging techniques using word error rate metric.


 

Article Details

How to Cite
Errattahi, R., El Hannani, A., & Ouahmane, H. (2016). Automatic Speech Recognition Errors Detection And Correction: A Review. AL-Lisaniyyat, 22(2), 40-43. https://doi.org/10.61850/allj.v22i2.372
Section
Articles

References

HTK Hidden Markov Model Toolkit , Speech recognition Toolkit avalaible at : hhtp://www.htk.eng.cam.ac.uk.
ITU-T Recommandation G.729, Coding of speech at 8 kbit/s using conjugate-structure algebraic-code-excited linear prediction CS-ACELP ,1996.
H.Yong, Z.Jiang ,Imlementation of ITU-T G729 Speech Code in IP Telephony Gateway ,Wuhan university Journal of natural sciences , vol.5, pp.159-163,2000.
B.Milner ,B, Semmani ,Robust speech Recognition Over Networks, IEEE International Conference Acoustics,Speech , And signal processing, pp.1791-1794, vol.3,2000.
Recommandation UIT-T G.711 , A high quality low-complexity algorithm for packet loss concealment with G.711, September 1999.
J.Wiley , Volp voice and fax signal processing , published simultaneously in Canada ,p.592,2008.
K.Nakamura, An Improvement ofG.711 PLC Using sinusoidal model proceedings of the IEEE The International conference on computer as a toll,pp.1670-1673,2005.
P.C.X. Sommen and J.A.K.S. Jayasinghe, On Frequency Domain Adaptive Filters using the Overlap-add Method , IEEE Philips Research Laboraories,pp.28-30,1988.