Tagging Arabic Word Categories in Arabic Linguistic Corpus and Its Importance in Arabic Grammar Processing
Abstract
Even of all the results that have been achieved in the field of automatic processing of natural languages, especially in English language, the process of its development still attracts many linguistic and computer research efforts; In this context, Arabic tongue still needs more giving and efforts in computational linguistic research, on both theoretical or applied sides. from here, this study raises the following problem: What are the possibilities offered by corpus linguistics research to advance the automated processing of the Arabic language? And what are the practical benefits of annotating and marking up the Arabic corpora with parts of speech in building Arabic grammatical processors? The study aims, in particular, to find out some ways to tagging and annotating Arabic corpora, and show the importance of automatically identifying the parts of Arabic speech in building the grammar processor; and generally, it tries to highlight the benefits of researches on corpus linguistics in the automatic processing of Arabic tongue. The research will then describe the methods of annotating Arabic corpora and how it can be invested in automation at the grammatical level.