Asian Journal of Information Technology

Year: 2016
Volume: 15
Issue: 16
Page No. 2887 - 2896

Hierarchical Sampling Techniques for Imbalanced Datasets

Authors : S. Lavanya and S. Palaniswami

References

Barua, S., M.M. Islam, X. Yao and K. Murase, 2014. MWMOTE-majority weighted minority oversampling technique for imbalanced data set learning. IEEE Trans. Knowledge Data Eng., 26: 405-425.
CrossRef  |  

Batista, G.E.A.P.A., R.C. Prati and M.C. Monard, 2004. A study of the behavior of several methods for balancing machine learning training data. ACM SIGKDD Explorations Newsl., 6: 20-29.
CrossRef  |  Direct Link  |  

Chen, S., H. He and E.A. Garcia, 2010. RAMOBoost: Ranked minority oversampling in boosting. IEEE Trans. Neural Networks, 21: 1624-1642.
CrossRef  |  

Cieslak, D.A. and N.V. Chawla, 2008. Start globally, optimize locally, predict globally: Improving performance on imbalanced data. Proceedings of the 2008 8th IEEE International Conference on Data Mining, December 15-19, 2008, Pisa, pp: 143-152.

Clearwater, S.H. and E.G. Stern, 1991. A rule-learning program in high energy physics event classification. Comput. Phys. Comm., 67: 159-182.
CrossRef  |  Direct Link  |  

Fawcett, T. and F. Provost, 1997. Adaptive fraud detection. Data Mining Knowledge Discovery, 3: 291-316.
CrossRef  |  Direct Link  |  

Freund, Y. and R.E. Schapire, 1996. Experiments with a new boosting algorithm. Proceedings of the 13th International Conference on Machine Learning, July 3-6, 1996, Bari, Italy, pp: 148-156.

Freund, Y. and R.E. Schapire, 1997. A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci., 55: 119-139.
CrossRef  |  Direct Link  |  

Han, H., W.Y. Wang and B.H. Mao, 2005. Borderline-SMOTE: A new over-sampling method in imbalanced data sets learning. Proceedings of the International Conference on Intelligent Computing, August 23-26, 2005, Hefei, China, pp: 878-887.

He, H. and E.A. Garcia, 2009. Learning from imbalanced data. IEEE Trans. Knowledge Data Eng., 21: 1263-1284.
CrossRef  |  

He, H., Y. Bai, E.A. Garcia and S. Li, 2008. ADASYN: Adaptive synthetic sampling approach for imbalanced learning. Proceedings of the IEEE International Joint Conference on Neural Networks, June 1-8, 2008, Hong Kong, pp: 1322-1328.

Holte, R.C., L.E. Acker and B.W. Porter, 1989. Concept learning and the problem of small disjuncts. Proceedings of the International Joint Conference on Articial Intelligence, July 16-22, 1989, Morgan Kaufmann, pp: 813-818.

Japkowicz, N. and S. Stephen, 2002. The class imbalance problem: A systematic study. Intell. Data Anal., 6: 429-449.
Direct Link  |  

Japkowicz, N., C. Myers and M. Gluck, 1995. A novelty detection approach to classification. Proceedings of the 14th International Joint Conference on Artificial Intelligence, August 20-25, 1995, Canada, pp: 518-523.

Kubat, M. and S. Matwin, 1997. Addressing the curse of imbalanced training sets: One-sided selection. Proceedings of the 14th International Conference on Machine Learning, July 8-12, 1997, Nashville, Tennessee, USA, pp: 179-186.

Kubat, M., R.C. Holte and S. Matwin, 1998. Machine learning for the detection of oil spills in satellite radar images. Mach. Learning, 30: 195-215.
CrossRef  |  Direct Link  |  

Lewis, D.D. and J. Catlett, 1994. Heterogeneous uncertainty sampling for supervised learning. Proceedings of the 11th International Conference on Machine Learning, (ICML'94), Morgan Kaufmann, pp: 148-156.

Ling, C.X. and C. Li, 1998. Data mining for direct marketing: Problems and solutions. Proceedings of the 3rd International Conference on Knowledge Discovery and Data Mining, August 27-31, 1998, New York, USA., pp: 73-79.

Liu, X.Y., J. Wu and Z.H. Zhou, 2006. Exploratory undersampling for class-imbalance learning. Proceedings of the IEEE International Conference on Data Mining, December 18-22, 2006, Hong Kong, pp: 965-969.

Liu, X.Y., J. Wu and Z.H. Zhou, 2009. Exploratory under sampling for class imbalance learning. Proceedings of the 15th International Conference on Auditory Display, May 18-21, 2009, Copenhagen, Denmark, pp: 919-926.

Mani, I. and I. Zhang, 2003. KNN approach to unbalanced data distributions: A case study involving information extraction. Proceedings of the Workshop on Learning from Imbalanced Datasets, (WLID'03), Washington, DC., USA -.

Murphy, P.M. and D.W. Aha, 1994. UCI repository of machine learning databases. Department of Information and Computer Science, University of California, Irvine, CA.

Quinlan, J.R., 1986. Induction of decision trees. Mach. Learn., 1: 81-106.
CrossRef  |  

Tomek, I., 1976. Two modifications of CNN. IEEE Trans. Syst. Man Cybern., 6: 769-772.
Direct Link  |  

Wang, S. and X. Yao, 2012. Multiclass imbalance problems: Analysis and potential solutions. IEEE Trans. Syst. Man Cybernet., 42: 1119-1130.
CrossRef  |  

Weiss, G.M., 2004. Mining with rarity: A unifying framework. ACM SIGKDD Explorations Newsl., 6: 7-19.
CrossRef  |  Direct Link  |  

Wu, J., H. Xiong, P. Wu and J. Chen, 2007. Local decomposition for rare class analysis. Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 12-15, 2007, San Jose, CA., pp: 814-823.

Design and power by Medwell Web Development Team. © Medwell Publishing 2024 All Rights Reserved