Journal of Engineering and Applied Sciences

Year: 2018
Volume: 13
Issue: 23
Page No. 10092 - 10100

Using Semantic Similarity with Word Embeddings for Arabic Multi-Words Term Extraction

Authors : El-Khadir Lamrani, El Habib Ben Lahmer and Abdelaziz Marzak

Abstract: Identifying and extracting terms from textual sources is an indispensable task in information retrieval and question answering systems. According to experiments, multi-word terms are the best candidates for representing a specific domain in Arabic. In this research, we assume that Multi-Word Terms (MWTs) consist of words with similar contextual representations, and we propose a hybrid method for extracting multi-word terms from Arabic texts that combines linguistic and semantic approaches based on word embeddings. We use linguistic and morphosyntactic analysis of the Arabic language to find candidate terms, and we use cosine similarity between the distributed representations of words to rank those candidates. The proposed methodology has been tested in case studies carried out in the environmental domain, with promising results.
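
The ranking step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how similarities are aggregated over a candidate, so the mean pairwise cosine similarity used here is an assumption, as are the function names (cosine_similarity, score_candidate, rank_candidates) and the three-dimensional toy vectors, which stand in for real Arabic word embeddings.

```python
import math
from itertools import combinations


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # a zero vector has no direction; treat as dissimilar
    return dot / (norm_u * norm_v)


def score_candidate(words, embeddings):
    """Assumed scoring rule: mean cosine similarity over all word pairs.

    Returns 0.0 for single-word candidates or when a word has no embedding.
    """
    pairs = list(combinations(words, 2))
    if not pairs or any(w not in embeddings for w in words):
        return 0.0
    total = sum(cosine_similarity(embeddings[a], embeddings[b]) for a, b in pairs)
    return total / len(pairs)


def rank_candidates(candidates, embeddings):
    """Sort candidate terms from most to least internally similar."""
    return sorted(
        candidates,
        key=lambda cand: score_candidate(cand, embeddings),
        reverse=True,
    )


# Hypothetical toy data: English placeholders instead of Arabic tokens,
# hand-written vectors instead of trained embeddings.
toy_embeddings = {
    "pollution": [0.9, 0.1, 0.0],
    "air": [0.8, 0.2, 0.1],
    "table": [0.0, 0.1, 0.9],
}
toy_candidates = [("air", "table"), ("air", "pollution")]
ranked = rank_candidates(toy_candidates, toy_embeddings)
```

In this toy setting the candidate whose words have close vectors is ranked first, which mirrors the assumption that the words of a multi-word term share similar contextual representations. In practice the candidates would come from the linguistic and morphosyntactic filtering step and the vectors from embeddings trained on an Arabic corpus.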

How to cite this article:

El-Khadir Lamrani, El Habib Ben Lahmer and Abdelaziz Marzak, 2018. Using Semantic Similarity with Word Embeddings for Arabic Multi-Words Term Extraction. Journal of Engineering and Applied Sciences, 13: 10092-10100.
