Automatic Speech Recognition Errors Detection And Correction: A Review

Rahhal Errattahi; Asmaa El Hannani; Hassan Ouahmane

doi:10.61850/allj.v22i2.372

pdf

Published: May 30, 2016

DOI: https://doi.org/10.61850/allj.v22i2.372

Keywords:

Automatic Speech Recognition- ASR Error Detection- ASR Error Correction- ASR evaluation

Rahhal Errattahi

University of Chouaib Doukkali El Jadida

Asmaa El Hannani

University of Chouaib Doukkali El Jadida

Hassan Ouahmane

University of Chouaib Doukkali El Jadida

Abstract

Even though Automatic Speech Recognition (ASR) has matured to the point of commercial applications, high error rate in some speech recognition domains remain as one of the main impediment factors to the wide adoption of speech technology, and especially for continuous large vocabulary speech recognition applications. The persistent presence of ASR errors have intensified the need to find alternative techniques to automatically detect and correct such errors. The correction of the transcription errors is very crucial not only to improve the speech recognition accuracy, but also to avoid the propagation of the errors to the subsequent language processing modules such as machine translation. In this paper, basic principles of ASR evaluation are first summarized, and then the state of the current ASR errors detection and correction research is reviewed. We focus on emerging techniques using word error rate metric.

How to Cite

Errattahi, R., El Hannani, A., & Ouahmane, H. (2016). Automatic Speech Recognition Errors Detection And Correction: A Review. AL-Lisaniyyat, 22(2), 40-43. https://doi.org/10.61850/allj.v22i2.372

Issue

Vol. 22 No. 2 (2016): v22i22016

Section

Articles

References

HTK Hidden Markov Model Toolkit , Speech recognition Toolkit avalaible at : hhtp://www.htk.eng.cam.ac.uk.
ITU-T Recommandation G.729, Coding of speech at 8 kbit/s using conjugate-structure algebraic-code-excited linear prediction CS-ACELP ,1996.
H.Yong, Z.Jiang ,Imlementation of ITU-T G729 Speech Code in IP Telephony Gateway ,Wuhan university Journal of natural sciences , vol.5, pp.159-163,2000.
B.Milner ,B, Semmani ,Robust speech Recognition Over Networks, IEEE International Conference Acoustics,Speech , And signal processing, pp.1791-1794, vol.3,2000.
Recommandation UIT-T G.711 , A high quality low-complexity algorithm for packet loss concealment with G.711, September 1999.
J.Wiley , Volp voice and fax signal processing , published simultaneously in Canada ,p.592,2008.
K.Nakamura, An Improvement ofG.711 PLC Using sinusoidal model proceedings of the IEEE The International conference on computer as a toll,pp.1670-1673,2005.
P.C.X. Sommen and J.A.K.S. Jayasinghe, On Frequency Domain Adaptive Filters using the Overlap-add Method , IEEE Philips Research Laboraories,pp.28-30,1988.

Article Sidebar

Main Article Content

Abstract

Article Details

References