Abstract: Our semantic web application integrates images, structured and semi-structured data from various web-pages (HTML, XML) using semantic web technologies. Users can query an integrate view of these sources. The data sources integrated are heterogeneous not only in terms of data but also in terms of how the applications analyze the pages. Before achieving this vision, however, we must address several challenges. We need technologies to extract the data from different web pages (using machine learning techniques), a record linkage system for integrating data (images or structured data) from multiple web-pages to a single entity , a mediator system providing the uniform access to various web-pages, sophisticated analysis of the individual web-pages and efficient query reformulation and execution . In addition a semantic web-based system must recognize when different objects at different pages denote the same real world entity.
Reshmy.K.R , S.K.Srivatsa and Rajeevulla Mohammed , 2005. Semantic Retrieval of Heterogeneous Data. Asian Journal of Information Technology, 4: 1159-1169.