A rule and template based stemming algorithm for Arabic language

Bibliographic Details
Published in: International Journal of Mathematical Models and Methods in Applied Sciences
Authors: Sembok T.M.T.; Ata B.M.A.; Bakar Z.A.
Format: Article
Language: English
Published: 2011
Online Access: https://www.scopus.com/inward/record.uri?eid=2-s2.0-79960365881&partnerID=40&md5=cb180952d587613c8e6ecdbf93bb9d6e
Description
Summary: Stemming is the conflation of all variants of a word to a single form, called the root or stem. It plays a vital role in natural language processing and understanding. As with other languages, Arabic needs an effective stemming algorithm. Arabic has rich and complex morphological word structures and rules. An Arabic stemming algorithm based on morphological rules has been developed, and a dictionary of root words is used to determine the correct stems and so enhance its effectiveness. The Arabic stemming algorithm developed by Al-Omari is studied, and a new algorithm is proposed to improve its performance. The improvements relate to the order in which the dictionary is looked up and the order in which the morphological rules are applied.
ISSN: 1998-0140
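
Illustrative note: the summary describes a stemmer that combines affix rules, word templates, and a root dictionary, and whose results depend on the order of lookups and rule application. The following is only a minimal sketch of that general idea, not the algorithm from the article or from Al-Omari. The root list, affix lists, templates, and the function names `match_template` and `stem` are all hypothetical and chosen purely for illustration.

```python
# Toy data (assumed for illustration, not taken from the article).
ROOTS = {"كتب", "درس", "علم"}
PREFIXES = ["وال", "بال", "ال", "و"]   # tried in this order, longest first
SUFFIXES = ["ات", "ون", "ين", "ة"]
# In a template, the letters ف ع ل stand for the three root consonants;
# every other letter must match the word literally.
TEMPLATES = ["فاعل", "مفعول", "مفعل", "فعال"]
PLACEHOLDERS = set("فعل")


def match_template(word, template):
    """Return the letters at the placeholder positions, or None if no match."""
    if len(word) != len(template):
        return None
    root = []
    for w, t in zip(word, template):
        if t in PLACEHOLDERS:
            root.append(w)
        elif w != t:
            return None
    return "".join(root)


def stem(word):
    """Return a dictionary root for `word`, or None if none is found."""
    # Step 1: look the unmodified word up in the dictionary first.
    if word in ROOTS:
        return word
    # Step 2: try affix combinations, starting with "strip nothing".
    prefix_options = [""] + [p for p in PREFIXES if word.startswith(p)]
    suffix_options = [""] + [s for s in SUFFIXES if word.endswith(s)]
    for p in prefix_options:
        for s in suffix_options:
            candidate = word[len(p):len(word) - len(s)]
            if len(candidate) < 3:
                continue  # too short to hold a three-letter root
            # Dictionary lookup before template matching.
            if candidate in ROOTS:
                return candidate
            for template in TEMPLATES:
                root = match_template(candidate, template)
                if root in ROOTS:
                    return root
    return None


print(stem("الكاتب"))
print(stem("مكتوب"))
```

In this sketch the dictionary acts as a filter: a template match is accepted only if the extracted letters form a known root. Reordering `PREFIXES`, `SUFFIXES`, or `TEMPLATES`, or moving the dictionary check, changes which candidate is found first, which is the kind of ordering effect the summary refers to.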