1.1. NE applications
1.1.1 Information Retrieval: This is the task of retrieving data or documents according to a search input query, …show more content…
In such systems, all places and locations stored in the system database with their geographical coordinates are NEs.
1.2. Arabic language aspects and challenges
“Arabic is a language of rich morphology and complex syntax” [Al-Sughaiyer and Al-Kharashi 2004]. It is classified into three main types: Classical Arabic; which is the language of Islam that used for over 1500 years. Modern Standard Arabic; which is one of the six official languages of United Nations, and most of Arabic NLP researches are focused on Colloquial Arabic; which is the spoken Arabic language. It is irregular and differs among countries and regions.
The task of Arabic NER is challenging due to the following Arabic language features:
• Lack of capitalization: Capitalization feature is not existing in Arabic language script, unlike other languages such as English, in which capital letter used to recognize NEs. The absence of this feature makes Arabic NER more difficult by the fact that most of Arabic NEs are indistinguishable from common nouns and adjectives. For example: the Arabic location word الزرقاء (city in Jordan) can be used as an adjective (refers to something with blue color). This type of ambiguity commonly resolved by analyzing the context surrounding the NE. [Shaalan