Summary: | Stemming is the conflation of all variations of a word to a single form, called the root or stem. It plays a vital role in natural language processing and understanding. As with other languages, Arabic needs an effective stemming algorithm. Arabic has rich and complex morphological word structures and rules. An Arabic stemming algorithm based on morphological rules has been developed; to enhance its effectiveness, a dictionary of root words is used to determine the correct stems. The Arabic stemming algorithm developed by Al-Omari is studied, and a new algorithm is proposed to improve its performance. The improvements relate to the order in which the dictionary is looked up and the order in which the morphological rules are applied.
|