Wiktionnaire's Wikicode GLAWIfied: a Workable French Machine-Readable Dictionary
Nabil Hathout and
Proceedings of the 10th International Language Resources and Evaluation Conference
Nicoletta Calzolari (Conference Chair), Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asunción Moreno, Jan Odijk, Stelios Piperidis.
European Language Resources Association
N. Hathout and F. Sajous (2016).
Wiktionnaire's Wikicode GLAWIfied: a Workable French Machine-Readable Dictionary.
Proceedings of the 10th International Language Resources and Evaluation Conference (LREC 2016), pp. 1369-1376, Portorož, Slovenia.
French, Machine-Readable Dictionary, Free Lexical Resource, Wiktionary, Wiktionnaire
GLAWI is a free, large-scale and versatile Machine-Readable Dictionary (MRD) that has been extracted from the French language edition of Wiktionary, called Wiktionnaire. In (Sajous and Hathout, 2015), we introduced GLAWI, gave the rationale behind the creation of this lexicographic resource and described the extraction process, focusing on the conversion and standardization of the heterogeneous data provided by this collaborative dictionary. In the current article, we describe the content of GLAWI and illustrate how it is structured.
We also suggest various applications, ranging from linguistic studies and NLP applications to psycholinguistic experimentation. All of them can take advantage of the diversity of the lexical knowledge available in GLAWI. Besides this diversity and its extensive lexical coverage, GLAWI is also remarkable because it is the only free lexical resource of contemporary French that contains definitions. This unique material opens the way to the renewal of MRD-based methods, notably the automated extraction and acquisition of semantic relations.