International Journal of Soft Computing

Year: 2019
Volume: 14
Issue: 2
Page No. 44 - 52

Text Document Clustering using Hashing Deep Learning Method

Authors : Nahrain A. Swidan,, Shawkat K. Guirguis and Omar G. Abood

References

Aggarwal, C.C., 2018. Machine Learning for Text. Springer, Berlin, Germany, ISBN: 978-3-319-73531-3, Pages: 293.

Al-Asadi, T.A., A.J. Obaid, R. Hidayat and A.A. Ramli, 2017. A survey on web mining techniques and applications. Int. J. Adv. Sci. Eng. Inform. Technol., 7: 1178-1184.
CrossRef  |  Direct Link  |  

Allahyari, M., S. Pouriyeh, M. Assefi, S. Safaei, E.D. Trippe, J.B. Gutierrez and K. Kochut, 2017. A brief survey of text mining: Classification, clustering and extraction techniques. Comput. Lang., Vol. 1,

Choi, B. and Z. Yao, 2005. Web Page Classification. In: Foundations and Advances in Data Mining, Chu W. and T.Y. Lin (Eds.). Springer, Berlin, Germany, ISBN: 978-3-540-25057-9, pp: 221-274.

Christanti, V.M. and D.S. Naga, 2018. Fast and accurate spelling correction using trie and damerau-levenshtein distance bigram. Telkomnika, 16: 827-833.
CrossRef  |  Direct Link  |  

Dhuliawala, S., D. Kanojia and P. Bhattacharyya, 2016. Slangnet: A wordnet like resource for English slang. Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC’16), May 23-28, 2016, Portoroz, Slovenia, pp: 4329-4332.

Dziadek, J., A. Henriksson and M. Duneld, 2017. Improving Terminology Mapping in Clinical Text with Context-Sensitive Spelling Correction. In: Informatics for Health: Connected Citizen-Led Wellness and Population Health, Randell, R., R. Cornet and C. McCowan (Eds.). IOS Press, Amsterdam, Netherlands, ISBN: 978-1-61499-752-8, pp: 241-245.

Gupta, G. and S. Malhotra, 2015. Text documents tokenization for word frequency count using rapid miner (taking resume as an example). Int. J. Comput. Appl., 975: 24-26.
Direct Link  |  

Gupta, V. and G.S. Lehal, 2009. A survey of text mining techniques and applications. J. Emerg. Technol. Web Intell., 1: 60-76.
Direct Link  |  

Herrouz, A., C. Khentout and M. Djoudi, 2013. Overview of web content mining tools. Intl. J. Adv. Res. Comput. Sci. Software Eng., 3: 375-385.

Hinton, G.E., 2012. A Practical Guide to Training Restricted Boltzmann Machines. In: Neural Networks: Tricks of the Trade, Montavon, G., G.B. Orr and K.R. Muller (Eds.). Springer, Berlin, Germany, ISBN: 978-3-642-35288-1, pp: 599-619.

Hussein, M.K. and M.H. Mousa, 2010. An effective web mining algorithm using link analysis. Int. J. Comput. Sci. Inf. Technol. (IJCSIT.), 1: 190-197.
Direct Link  |  

Jain, S., R. Rawat and B. Bhandari, 2017. A survey paper on techniques and applications of web usage mining. Proceedings of the 2017 International Conference on Emerging Trends in Computing and Communication Technologies (ICETCCT’17), November 17-18, 2017, IEEE, Dehradun, India, pp: 1-6.

Johnson, F. and S.K. Gupta, 2012. Web content mining techniques: A survey. Intl. J. Comput. Appl., Vol. 47,

Lee, H., R. Grosse, R. Ranganath and A.Y. Ng, 2009. Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations. Proceedings of the 26th Annual International Conference on Machine Learning, June 14-18, 2009, ACM, Montreal, Quebec, Canada, ISBN:978-1-60558-516-1, pp: 609-616.

Lopez-Sanchez, D., A.G. Arrieta and J.M. Corchado, 2017. Deep neural networks and transfer learning applied to multimedia web mining. Proceedings of the International Symposium on Distributed Computing and Artificial Intelligence, June 21-23, 2017, Springer, Berlin, Germany, pp: 124-131.

Lopez-Sanchez, D., A.G. Arrieta and J.M. Corchado, 2019. Visual content-based web page categorization with deep transfer learning and metric learning. Neurocomputing, 338: 418-431.
CrossRef  |  Direct Link  |  

Lou, Z. and C. Zhang, 2017. A data selection framework for K-means algorithm to mine high precision clusters. Proceedings of the 2017 13th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD’17), July 29-31, 2017, IEEE, Guilin, China, pp: 1651-1657.

Mawardi, V.C., N. Susanto and D.S. Naga, 2018. Spelling correction for text documents in bahasa Indonesia using finite state automata and levinshtein distance method. MATEC. Web Conf., Vol. 164,

Mughal, M.J.H., 2018. Data mining: Web data mining techniques, tools and algorithms: An overview. Int. J. Adv. Comput. Sci. Appl. (IJACSA.), 9: 208-215.
Direct Link  |  

Pahwa, B., S. Taruna and N. Kasliwal, 2018. Sentiment analysis-strategy for text pre-processing. Int. J. Comput. Appl., 180: 15-18.
CrossRef  |  Direct Link  |  

Palma, M. and S. Zhou, 2017. A web scraper for forums: Navigation and text extraction methods. B.A. Thesis, KTH Royal Institute of Technology, Stockholm, Sweden.

Pandia, M., S.K. Pani, S.K. Padhi, L. Panigrahy and R. Ramakrishna, 2011. A review of trends in research on web mining. Int. J. Instrum. Control Autom. (IJICA.), 1: 37-41.
Direct Link  |  

Phyu, A.P. and K.K. Wai, 2019. Study on web content extraction techniques. Int. J. Trend Sci. Res. Dev. (IJTSRD.), 3: 2235-2238.
Direct Link  |  

Saif, H., M. Fernandez, Y. He and H. Alani, 2014. On stopwords, filtering and data sparsity for sentiment analysis of twitter. Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014) Vol. 5, May 26-31, 2014, Curran Associates, Inc., Reykjavik, Iceland, ISBN:978-1-63266-621-5, pp: 1610-1617.

Sebastiani, F., 2002. Machine learning in automated text categorization. ACM Comput. Surveys, 34: 1-47.
CrossRef  |  Direct Link  |  

Sharma, A.K. and P.C. Gupta, 2012. Study & Analysis of Web Content Mining Tools to Improve Techniques of Web Data Mining. Int. J. Adv. Res. Comput. Eng. Technol., 1: 287-293.
Direct Link  |  

Siddiqui, A.T. and S. Al Jahdali, 2013. Web mining techniques in e-commerce applications. Int. J. Comput. Applic., 69: 39-43.
Direct Link  |  

Silwattananusarn, T. and K. Tuamsuk, 2012. Data mining and its applications for knowledge management: A literature review from 2007 to 2012. J. Data Mining Knowledge Manage. Process, 2: 345-351.

Song, M.H., S.Y. Lim, D.J. Kang and S.J. Lee, 2005. Automatic classification of web pages based on the concept of domain ontology. Proceedings of the 12th Asia-Pacific Software Engineering Conference (APSEC’05), December 15-17, 2005, IEEE, Taipei, Taiwan, pp: 1-7.

Srivastava, J., R. Cooley, M. Deshpande and P.N. Tan, 2000. Web usage mining: Discovery and applications of usage patterns from web data. ACM SIGKDD Explorat., 1: 12-23.
CrossRef  |  Direct Link  |  

Srivastava, T., P. Desikan and V. Kumar, 2005. Web Mining-Concepts, Applications and Research Directions. In: Foundations and Advances in Data Mining, Chu W. and T.Y. Lin (Eds.). Springer, Berlin, Germany, ISBN: 978-3-540-25057-9, pp: 275-307.

Srividya, M., D. Anandhi and M.I. Ahmed, 2013. Web mining and its categories-a survey. Int. J. Eng. Comput. Sci. (IJECS.), 2: 1338-1345.

Tang, C., C.X. Ling, X. Zhou, N. Cercone and X. Li, 2008. Advanced Data Mining and Applications: 4th International Conference, ADMA. Springer, Berlin, Germany, ISBN: 978-3-540-88192-6, Pages: 759.

Verma, T., R. Renu and D. Gaur, 2014. Tokenization and filtering process in RapidMiner. Int. J. Applied Inf. Syst., 7: 16-18.
Direct Link  |  

Wu, Y.C., 2016. Language independent web news extraction system based on text detection framework. Inf. Sci., 342: 132-149.
Direct Link  |  

Yang, B., X. Fu, N.D. Sidiropoulos and M. Hong, 2017. Towards K-means-friendly spaces: Simultaneous deep learning and clustering. Proceedings of the 34th International Conference on Machine Learning (ICML’17) Vol. 70, August 6-11, 2017, Sydney, Australia, pp: 3861-3870.

Design and power by Medwell Web Development Team. © Medwell Publishing 2024 All Rights Reserved