Journal of Engineering and Applied Sciences

Year: 2018
Volume: 13
Issue: 23
Page No. 10092 - 10100

Using Semantic Similarity with Word Embeddings for Arabic Multi-Words Term Extraction

Authors : El-Khadir Lamrani, El Habib Ben Lahmer and Abdelaziz Marzak

References

Al Khatib, K. and A. Badarneh, 2010. Automatic extraction of Arabic multi-word terms. Proceedings of the 2010 International Multiconference on Computer Science and Information Technology (IMCSIT), October 18-20, 2010, IEEE, Wisla, Poland, ISBN:978-83-60810-27-9, pp: 411-418.

Belguith, L., L. Baccour and G. Mourad, 2005. [Segmentation of Arabic texts based on the contextual analysis of punctuation and certain particles]. Proceedings of the 12th Annual International Conference on Automatic Processing of Natural Languages, June 6-10, 2005, TALN, Dourdan, France, pp: 451-456 (In French).

Boulaknadel, S., B. Daille and A. Driss, 2008. [Acabit: A tool for extracting complex terms]. Proceedings of the 2008 Symposium on Act of the New Information Technologies: Opportunities for Lamazighe, November 24-25, 2008, IRCAM, Paris, France, pp: 75-82.

Boulaknadel, S., B. Daille and D. Aboutajdine, 2008. Multi-word term indexing for Arabic document retrieval. Proceedings of the IEEE International Symposium on Computers and Communications (ISCC'08), July 6-9, 2008, IEEE, Marrakech, Morocco, ISBN:978-1-4244-2702-4, pp: 869-873.

Bounhas, I. and Y. Slimani, 2009. A hybrid approach for Arabic multi-word term extraction. Proceedings of the 2009 International Conference on Natural Language Processing and Knowledge Engineering NLP-KE, September 24-27, 2009, IEEE, Dalian, China, ISBN:978-1-4244-4538-7, pp: 1-8.

Chen, J., C.H. Yeh and R. Chau, 2006. Identifying multi-word terms by text-segments. Proceedings of the 7th International Conference on Web-Age Information Management Workshops, June 17-19, 2006, IEEE, Hong Kong, China, pp: 19-19.

Church, K. and P. Hanks, 1990. Word association norms, mutual information and lexicography. Comput. Linguist., 16: 22-29.
CrossRef  |  

Daille, B., 1994. [Mixed Approach for Terminology Extraction: Lexical Statistics and Linguistic Filters Voorkant]. Paris Diderot University, Paris, France, Pages: 228 (In French).

Diab, M.T., 2007. Improved Arabic base phrase chunking with a new enriched POS tag set. Proceedings of the 2007 International Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources, June 28, 2007, Association for Computational Linguistics, Stroudsburg, Pennsylvania, USA., pp: 89-96.

Frantzi, K., S. Ananiadou and H. Mima, 2000. Automatic recognition of multi-word terms: The C-value/NC-value method. Int. J. Digit. Lib., 3: 115-130.
CrossRef  |  Direct Link  |  

Hajic, J., O. Smrz, T. Buckwalter and H. Jin, 2005. Feature-based tagger of approximations of functional Arabic morphology. Proceedings of the 4th International Workshop on Treebanks and Linguistic Theories (TLT), December 10, 2005, Universitat de Barcelona, Barcelona, Spain, pp: 1-54.

Jacquemin, C., 1999. Syntagmatic and paradigmatic representations of term variation. Proceedings of the 37th Annual International Meeting on Association for Computational Linguistics on Computational Linguistics, June 20-26, 1999, Association for Computational Linguistics, Stroudsburg, Pennsylvania, USA., ISBN:1-55860-609-3, pp: 341-348.

Jacquemin, C., J.L. Klavans and E. Tzoukermann, 1997. Expansion of multi-word terms for indexing and retrieval using morphology and syntax. Proceedings of the 35th Annual Meeting and 8th International Conference on the Association for Computational Linguistics and European Chapter of the Association for Computational Linguistics, July 7-12, 1997, Association for Computational Linguistics, Stroudsburg, Pennsylvania, USA., pp: 24-31.

Korkontzelos, I., I.P. Klapaftis and S. Manandhar, 2008. Reviewing and evaluating automatic term recognition techniques. Proceedings of the 6th International Conference on Advances in Natural Language Processing, August 25-27, 2008, Springer, Gothenburg, Sweden, ISBN:978-3-540-85286-5, pp: 248-259.

Lamrani, E.K., A. Marzak and H. Ballaoui, 2014. Mixed method for extraction of domain terminology from text: Linguistic and statistical filtering. Proceedings of the IEEE 2014 3rd International Conference on Colloquium in Information Science and Technology (CIST), October 20-22, 2014, IEEE, Tetouan, Morocco, ISBN:978-1-4799-5979-2, pp: 291-295.

Mahdaouy, A.E., S.E. Ouatik and E. Gaussier, 2014. A study of association measures and their combination for Arabic MWT extraction. Proceedings of the 10th International Conference on Terminology and Artificial Intelligence, September 10, 2014, Cornell University, Ithaca, New York, USA., pp: 1-8.

Mikolov, T., I. Sutskever, K. Chen, G.S. Corrado and J. Dean, 2013. Distributed Representations of Words and Phrases and their Compositionality. In: Advances in Neural Information Processing Systems, Burges, C.J.C., L. Bottou, M. Welling, Z. Ghahramani and K.Q. Weinberger (Eds.). Curran Associates Inc., New York, USA., pp: 3111-3119.

Mikolov, T., K. Chen, G. Corrado and J. Dean, 2013. Efficient estimation of word representations in vector space. J. English Lit., 1: 1-12.
Direct Link  |  

Mikolov, T., W.T. Yih and G. Zweig, 2013. Linguistic regularities in continuous space word representations. Proc. NAACLHLT., 13: 746-751.
Direct Link  |  

SanJuan, E., J. Dowdall, F. Ibekwe-SanJuan and F. Rinaldi, 2005. A symbolic approach to automatic multiword term structuring. Comput. Speech Lang., 19: 524-542.
CrossRef  |  Direct Link  |  

Zahran, M.A., A. Magooda, A.Y. Mahgoub, H. Raafat and M. Rashwan et al., 2015. Word representations in vector space and their applications for Arabic. Proceedings of the 16th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing’15), April 14-20, 2015, Springer, Cairo, Egypt, ISBN:978-3-319-18110-3, pp: 430-443.

Zhang, W., T. Yoshida and X. Tang, 2007. Text classification using multi-word features. Proceedings of the IEEE International Conference on Systems, Man and Cybernetics (ISIC’07), October 7-10, 2007, IEEE, Montreal, Quebec, Canada, ISBN:978-1-4244-0990-7, pp: 3519-3524.

Zhang, W., T. Yoshida and X. Tang, 2008. A study on multi-word extraction from Chinese documents. Proceedings of the International Conference on Asia-Pacific Web, April 26-28, 2008, Springer, Berlin, Germany, ISBN978-3-540-89375-2, pp: 42-53.

Design and power by Medwell Web Development Team. © Medwell Publishing 2024 All Rights Reserved