Asian Journal of Information Technology

Year: 2010
Volume: 9
Issue: 2
Page No. 54 - 61

Automatic Segmentation and Classification of Audio Broadcast Data

Authors: P. Dhanalakshmi, S. Palanivel and V. Ramalingam

References

Abu-El-Quran, A.R., R.A. Goubran and A.D.C. Chan, 2006. Security monitoring using microphone arrays and audio classification. IEEE Trans. Instrum. Measur., 55: 1025-1032.

Ajmera, J., I. McCowan and H. Bourlard, 2003. Speech/music segmentation using entropy and dynamism features in a HMM classification framework. Speech Commun., 40: 351-363.

Eronen, A.J., V.T. Peltonen, J.T. Tuomi, A.P. Klapuri and S. Fagerlund et al., 2006. Audio-based context recognition. IEEE Trans. Audio Speech Lang. Process., 14: 321-329.

Esmaili, S., S. Krishnan and K. Raahemifar, 2004. Content based audio classification and retrieval using joint time-frequency analysis. IEEE Int. Conf. Acoust. Speech Signal Process., 5: 665-668.

Guo, G. and S.Z. Li, 2003. Content-based audio classification and retrieval by support vector machines. IEEE Trans. Neural Networks, 14: 308-315.

Haykin, S., 2001. Neural Networks: A Comprehensive Foundation. Pearson Education, Asia.

Huang, R. and J.H.L. Hansen, 2006. Advances in unsupervised audio classification and segmentation for the Broadcast news and NGSW corpora. IEEE Trans. Audio Speech Lang. Process., 14: 907-919.

Jiang, H., J. Bai, S. Zhang and B. Xu, 2005. SVM-based audio scene classification. Proceeding of the IEEE, pp: 131-136.

Jothilakshmi, S., V. Ramalingam and S. Palanivel, 2009. Speaker diarization using auto associative neural networks. Eng. Appl. Artif. Intell., 22: 667-675.

Kiranyaz, S., A.F. Qureshi and M. Gabbouj, 2006. A generic audio classification and segmentation approach for multimedia indexing and retrieval. IEEE Trans. Speech Audio Process., 14: 1062-1081.

Li, D., I.K. Sethi, N. Dimitrova and T. McGee, 2001. Classification of general audio data for content-based retrieval. Pattern Recogn. Lett., 22: 533-544.

Li, S.Z., 2000. Content-based audio classification and retrieval using the nearest feature line method. IEEE Trans. Speech Audio Process., 8: 619-625.

Lin, C.C., S.H. Chen, T.K. Truong and Y. Chang, 2005. Audio classification and categorization based on wavelets and support vector machine. IEEE Trans. Speech Audio Process., 13: 644-651.

Lu, L., H.J. Zhang and S.Z. Li, 2003. Content-based audio classification and segmentation by using support vector machines. Multimed. Syst., 8: 482-492.

McConaghy, T., H. Leung, E. Bosse and V. Varadan, 2003. Classification of audio radar signals using radial basis function neural networks. IEEE Trans. Instrum. Measur., 52: 1771-1779.

Mubarak, O.M., E. Ambikairajah and J. Epps, 2005. Analysis of an MFCC-based audio indexing system for efficient coding of multimedia sources. IEEE Int. Conf. Acoustics Speech Signal Process., 2: 619-622.

Panagiotakis, C. and G. Tziritas, 2005. A speech/music discriminator based on RMS and zero-crossings. IEEE Trans. Multimed., 7: 155-156.

Rabiner, L. and B. Juang, 2003. Fundamentals of Speech Recognition. Pearson Education, Singapore.

Rajapakse, M. and L. Wyse, 2005. Generic audio classification using a hybrid model based on GMMs and HMM. Proceedings of IEEE 11th International Multimedia Modelling Conference, Jan. 12-14, Melbourne, Australia, pp: 53-58.

Umapathy, K., S. Krishnan and R.K. Rao, 2007. Audio signal feature extraction and classification using local discriminant bases. IEEE Trans. Audio Speech Lang. Process., 15: 1236-1246.

Umapathy, K., S. Krishnan and S. Jimaa, 2005. Multigroup classification of audio signals using time frequency parameters. IEEE Trans. Multimed., 7: 308-315.

Xu, C., N.C. Maddage and X. Shao, 2005. Automatic music classification and summarization. IEEE Trans. Speech Audio Process., 13: 441-450.

Yegnanarayana, B. and S. Kishore, 2002. AANN: An alternative to GMM for pattern recognition. Neural Networks, 15: 459-469.

Yegnanarayana, B., 1999. Artificial Neural Networks. Prentice Hall of India, New Delhi.

Yegnanarayana, B., S. Gangashetty and S. Palanivel, 2002. Autoassociative Neural Network Models for Pattern Recognition Tasks in Speech and Image. In: Soft Computing Approach to Pattern Recognition and Image Processing, Ghosh, A. and S.K. Pal (Eds.). World Scientific Publishing Co. Pvt. Ltd., Singapore, pp: 283-305.
