Automatic Morphological Generation for the Indexing of Arabic Speech Recordings

Shaalan, K., H. Talhami, and I. Kamel, "Automatic Morphological Generation for the Indexing of Arabic Speech Recordings", The International Journal of Computer Processing of Oriental Languages (IJCPOL), vol. 20, no. 1, pp. 1–14, 2007. copy at


This paper presents a novel Arabic morphological generator (AMG) for Modern Standard Arabic (MSA) which is designed and implemented using Prolog. The AMG is used to generate inflected forms of words used for the indexing of Arabic audio. These words are also the relevant terms in the Arab authority system (library information retrieval system) used in this study. The AMG generates inflected Arabic words from the root according to pre-specified morphological features that can be extended as needed. The Arabic word is represented as a feature structure which is handled through unification during the morphological generation process. The inflected forms can then be inserted automatically into a speech recognition grammar which is used to identify these words in an audio sequence or utterance.



Related External Link

ijcpol2.pdf167.5 KB