Asian Journal of Information Technology

Year: 2005
Volume: 4
Issue: 3
Page No. 41 - 48

A Hybrid Machine Learning Approach for Extracting Information from WWW

Authors : Zhi Cai Kun Yu, Xufa Wang and Qingsheng Cai

Abstract: This paper presents a hybrid machine learning approach to extract information from WWW. It applies structure analysis to improve the extraction accuracy, with 96.5% average precision and 96.7% average recall for static web page, and 100% precision and recall for dynamic web page. Furthermore, the working time is short (< 800 ms) and the number of learning examples is small (< 4) due to little user participation. Our results prove that this approach offers the attractive advantageous of fast, convenient and high-accuracy requirements of practical applications.

How to cite this article:

Kun Yu, Zhi Cai , Xufa Wang and Qingsheng Cai , 2005. A Hybrid Machine Learning Approach for Extracting Information from WWW . Asian Journal of Information Technology, 4: 41-48.

Design and power by Medwell Web Development Team. © Medwell Publishing 2024 All Rights Reserved