Resources


Arabic WordNet :
Arabic WordNet Description: This improved version is an extension of the original Arabic Wordnet (http://globalwordnet.org/arabic-wordnet/awn-browser/), it was enriched by new verbs, nouns including the broken plurals that is a specific form for Arabic words.

Citation: L. Abouenour, K. Bouzoubaa and P. Rosso, "On the evaluation and improvement of Arabic WordNet coverage and usability," Language Resources and Evaluation, vol. 47, n° 13, pp. 891-917, 2013.
Arabic characters lexicon :
LMF version Description: An LMF conformant XML-based file containing all Arabic characters (letters, vowels and punctuations). Each character described with a description, different displays (isolated, at the beginning, middle and the end of a word), a codification (Unicode, others could be added later), and two transliterations (Buckwalter and wiki)

Citation: T. Loukili, K. Bouzoubaa, "Structuration et Standardisation des ressources linguistiques de l'Arabe - cas de l'alphabet, préfixes et suffixes", Journées Doctorales en Technologies de l'Information et Communication, Tangier, Morocco, 7/ 2011
XML version Description: A XML-based file containing all Arabic characters (letters, vowels and punctuations). Each character described with a description, different displays (isolated, at the beginning, middle and the end of a word), a codification (Unicode, others could be added later), and two transliterations (Buckwalter and wiki)

Citation: T. Loukili, K. Bouzoubaa, "Structuration et Standardisation des ressources linguistiques de l'Arabe - cas de l'alphabet, préfixes et suffixes", Journées Doctorales en Technologies de l'Information et Communication, Tangier, Morocco, 7/ 2011
Arabic clitics :
Enclitics Description: A XML-based file containing all Arabic enclitics
Proclitics Description: A XML-based file containing all Arabic proclitics
Arabic Stop-words lexicon :
Special nouns Description: An XML-based file containing Arabic Stop-words respecting nouns syntax; particle nouns, signal nouns, separated pronouns and connected nouns

Citation:

- Driss Namly and al. "Development of Arabic particles lexicon using the LMF framework". Colloque pour les Etudiants Chercheurs en Traitement Automatique du Langage Naturel et ses applications (CEC-TAL 2015). Sousse - Tunisie, le 23-25 Mars 2015.

- Driss Namly and al. "A Complex Arabic stop-words list design". The Second National Doctoral Symposium On Arabic Language Engineering (JDILA'2015) ENSA of Fez USMBA, 28-29 October 2015.
Special verbs Description: An XML-based file containing Arabic Stop-words respecting verbs syntax

Citation:

- Driss Namly and al. "Development of Arabic particles lexicon using the LMF framework". Colloque pour les Etudiants Chercheurs en Traitement Automatique du Langage Naturel et ses applications (CEC-TAL 2015). Sousse - Tunisie, le 23-25 Mars 2015.

- Driss Namly and al. "A Complex Arabic stop-words list design". The Second National Doctoral Symposium On Arabic Language Engineering (JDILA'2015) ENSA of Fez USMBA, 28-29 October 2015.
Particles Description: An XML-based file containing Arabic particles

Citation:

- Driss Namly and al. "Development of Arabic particles lexicon using the LMF framework". Colloque pour les Etudiants Chercheurs en Traitement Automatique du Langage Naturel et ses applications (CEC-TAL 2015). Sousse - Tunisie, le 23-25 Mars 2015.

- Driss Namly and al. "A Complex Arabic stop-words list design". The Second National Doctoral Symposium On Arabic Language Engineering (JDILA'2015) ENSA of Fez USMBA, 28-29 October 2015.
"Al wassit" Arabic dictionary :
LMF version
(Waiting for validation)
Description: An LMF conformant XML-based file containing the electronic version of al wassit dictionary. An Arabic monolingual dictionary accomplished by the Academy of the Arabic Language in Cairo
XML version
(Waiting for validation)
Description: An XML-based file containing the electronic version of al wassit dictionary. An Arabic monolingual dictionary accomplished by the Academy of the Arabic Language in Cairo
Contemporary Arabic dictionary :
LMF version
(Waiting for validation)
Description: An LMF conformant XML-based file containing the electronic version of al logha al arabia al moassira (Contemporary Arabic) dictionary. An Arabic monolingual dictionary accomplished by Ahmed Mukhtar Abdul Hamid Omar (deceased: 1424) with the help of a working group

Citation: Driss Namly, Karim Bouzoubaa. "LMF conversion of an editorial dictionary: the case of the Contemporary Arabic dictionary". Journée d’étude Ressources langagières de l’arabe pour le TAL : construction, standardisation, gestion et exploitation, 26 Novembre 2015 Institut d’Etudes et de Recherches pour l’Arabisation, Rabat
XML version
(Waiting for validation)
Description: An XML-based file containing the electronic version of al logha al arabia al moassira (Contemporary Arabic) dictionary. An Arabic monolingual dictionary accomplished by Ahmed Mukhtar Abdul Hamid Omar (deceased: 1424) with the help of a working group
CLEF-TREC Q/A Questions :
Excel version Description: List of 2264 questions + answers of CLEF and TREC, translated to Arabic

Citation: Abouenour L., Bouzoubaa K., Rosso P. "On the Evaluation and Improvement of Arabic WordNet Coverage and Usability", Languages Resources and Evaluation, Springer Netherlands 10.1007/s10579-013-9237-0 6/ 2013
Morphological evaluation corpus :
Evaluation corpus Description: An annotated corpus dedicated to the benchmark and evaluation of Arabic morphological analyzers. It consists of 100 words with all their possible analysis. The corpus contains several morphological information such as stem, pattern, root, lemma, etc.
NAFIS Gold Standard Corpus :
NAFIS Gold Standard Corpus Description: Normalized Arabic Fragments for Inestimable Stemming (NAFIS) is an Arabic stemming gold standard corpus composed by a collection of texts, selected to be representative of Arabic stemming tasks and manually annotated.

Citation: Driss Namly, Rachida Tajmout, Karim Bouzoubaa, Lahsen. Abouenour. "NAFIS: A Gold Standard Corpus for Arabic Stemmers Evaluation". International Business Information Management Association (IBIMA), November 2016 Seville, Spain

                                                                 Copyright © 2012 IBTIKARAT research group| Mohammadia School of Engineers | Mohamed V University Agdal | Rabat-Morocco | Contact Us