Journal of Engineering and Applied Sciences

Year: 2018
Volume: 13
Issue: 9 SI
Page No. 7074 - 7080

A Technical Study on Feature Ranking Techniques and Classification Algorithms

Authors : Wareesa Sharif, Noor Azah Samsudin, Mustafa Mat Deris and Shamsul Kamal Ahmad Khalid

References

Azam, N. and J. Yao, 2012. Comparison of term frequency and document frequency based feature selection metrics in text categorization. Expert Syst. Appl., 39: 4760-4768.
Direct Link  |  

Babu, R.L. and S. Vijayan, 2016. Wrapper based feature selection in semantic medical information retrieval. J. Med. Imaging Health Inf., 6: 802-805.
Direct Link  |  

Baccianella, S., A. Esuli and F. Sebastiani, 2013. Using micro-documents for feature selection: The case of ordinal text classification. Expert Syst. Appl., 4: 4687-4696.
Direct Link  |  

Bachrach, R.G., A. Navot and N. Tishby, 2004. Margin based feature selection-theory and algorithms. Proceedings of the 21st ACM International Conference on Machine Learning, July 04-08, 2004, ACM, Banff, Alberta, ISBN:1-58113-838-5, pp: 1-43.

Chang, F., J. Guo, W. Xu and K. Yao, 2015. A feature selection method to handle imbalanced data in text classification. J. Digital Inf. Manage., 13: 169-175.

Cortes, C. and V. Vapnik, 1995. Support-vector networks. Mach. Learn., 20: 273-297.
CrossRef  |  Direct Link  |  

Feng, G., J. Guo, B.Y. Jing and T. Sun, 2015. Feature subset selection using naive Bayes for text classification. Pattern Recognit. Lett., 65: 109-115.
Direct Link  |  

Forman, G., 2003. An extensive empirical study of feature selection metrics for text classification. J. Machine Learn. Res., 3: 1289-1305.

Gnana, D.A.A., S. Appavu and E.J. Leavline, 2016. Literature review on feature selection methods for high-dimensional data. Methods, 136: 1-10.
CrossRef  |  

Javed, K., H.A. Babri and M. Saeed, 2012. Feature selection based on class-dependent densities for high-dimensional binary data. Knowl. Data Eng. IEEE. Trans., 24: 465-477.
CrossRef  |  Direct Link  |  

Javed, K., S. Maruf and H.A. Babri, 2015. A two-stage Markov blanket based feature selection algorithm for text classification. Neurocomputing, 157: 91-104.
Direct Link  |  

Joachims, T., 1998. Text categorization with support vector machines: Learning with many relevant features. Proceedings of the 10th European Conference on Machine Learning, Chemnitz, Germany, April 21-23, 1998, Springer, Berlin, Heidelberg, pp: 137-142.

Karabulut, E.M., S.A. Ozel and T. Ibrikci, 2012. A comparative study on the effect of feature selection on classification accuracy. Procedia Technol., 1: 323-327.
Direct Link  |  

Korde, V. and C.N. Mahender, 2012. Text classification and classifiers: A survey. Intl. J. Artif. Intell. Appl., 3: 85-99.
Direct Link  |  

Kulkarni, A.R., V. Tokekar and P. Kulkarni, 2012. Identifying context of text documents using Naïve Bayes classification and Apriori association rule mining. Proceedings of the 6th IEEE International Conference on Software Engineering (CONSEG), September 5-7, 2012, IEEE, Indore, India, ISBN:978-1-4673-2174-7, pp: 1-4.

Lee, C. and G.G. Lee, 2006. Information gain and divergence-based feature selection for machine learning-based text categorization. Inform. Process. Manage., 42: 155-165.
CrossRef  |  Direct Link  |  

Li, Y. and C. Chen, 2012. Research on the feature selection techniques used in text classification. Proceeding of the 2012 9th International Conference on Fuzzy Systems and Knowledge Discovery, May 29-31, 2012, IEEE, Xi'an, China, ISBN: 978-1-4673-0024-7, pp: 725-729.

Manning, C.D., P. Raghavan and H. Schutze, 2008. An Introduction to Information Retrieval. Cambridge University Press, Cambridge, UK., ISBN-13: 9780521865715, pp: 482.

Meenakshi, R.V.S., 2013. Structured data extraction from the deep web. IOSR. J. Comput. Eng., 8: 93-96.

Mladenic, D. and M. Grobelnik, 1999. Feature selection for unbalanced class distribution and naive bayes. Proceedings of the 16th International Conference on Machine Learning, June 27-30, 1999, Morgan Kaufmann, San Francisco, CA., USA., pp: 258-267.

Nidhi and V. Gupta, 2011. Recent trends in text classification techniques. Int. J. Comput. Applic., 35: 45-51.
Direct Link  |  

Paul, A., 2014. Effect of imbalanced data on document classification algorithms. Ph.D Thesis, Auckland University of Technology, Auckland, New Zealand.

Rehman, A., K. Javed, H.A. Babri and M. Saeed, 2015. Relative discrimination criterion-a novel feature ranking method for text data. Expert Syst. Appl., 42: 3670-3681.
Direct Link  |  

Siddiqui, M.A., 2016. An empirical evaluation of text classification and feature selection methods. Artif. Intell. Res., 5: 70-81.
Direct Link  |  

Tian, C., Y. Wang, X. Lin, J. Lin and J. Hong, 2016. Research on high-dimensional data reduction. Intl. J. Database Theor. Appl., 9: 87-96.

Ting, S.L., W.H. Ip and A.H. Tsang, 2011. Is naive bayes a good classifier for document classification?. Int. J. Software Eng. Its Appl., 5: 37-46.
Direct Link  |  

Uysal, A.K. and S. Gunal, 2012. A novel probabilistic feature selection method for text classification. Knowl. Based Syst., 36: 226-235.
CrossRef  |  

Uysal, A.K., 2016. An improved global feature selection scheme for text classification. Expert Syst. Appl., 43: 82-92.
Direct Link  |  

Wang, F., Y. Zhang, H. Xiao, L. Kuang and Y. Lai, 2015. Enhancing stock price prediction with a hybrid approach based extreme learning machine. Proceedings of the IEEE International Conference on Data Mining Workshop (ICDMW), November 14-17, 2015, IEEE, Atlantic City, New Jersey, ISBN:978-1-4673-8494-0, pp: 1568-1575.

Yang, J., Y. Liu, X. Zhu, Z. Liu and X. Zhang, 2012. A new feature selection based on comprehensive measurement both in inter-category and intra-category for text categorization. Inf. Process. Manage., 48: 741-754.
Direct Link  |  

Design and power by Medwell Web Development Team. © Medwell Publishing 2024 All Rights Reserved