Skip to main content

Research Repository

Advanced Search

A method for automating the extraction of specialized information from the web

Lin, Ling; Liotta, Antonio; Hippisley, Andrew

Authors

Ling Lin

Antonio Liotta

Andrew Hippisley



Abstract

The World Wide Web can be viewed as a gigantic distributed database including millions of interconnected hosts some of which publish information via web servers or peer-to-peer systems. We present here a novel method for the extraction of semantically rich information from the web in a fully automated fashion. We illustrate our approach via a proof-of-concept application which scrutinizes millions of web pages looking for clues as to the trend of the Chinese stock market. We present the outcomes of a 210-day long study which indicates a strong correlation between the information retrieved by our prototype and the actual market behavior.

Citation

Lin, L., Liotta, A., & Hippisley, A. (2005, December). A method for automating the extraction of specialized information from the web. Presented at International Conference on Computational and Information Science 2005, Xi'an, China

Presentation Conference Type Conference Paper (Published)
Conference Name International Conference on Computational and Information Science 2005
Start Date Dec 15, 2005
End Date Dec 19, 2005
Publication Date 2005
Deposit Date Dec 3, 2019
Publisher Springer
Pages 489-494
Series Title Lecture Notes in Computer Science
Series Number 3801
Series ISSN 0302-9743
Book Title Computational Intelligence and Security International Conference, CIS 2005, Xi’an, China, December 15-19, 2005, Proceedings Part I
ISBN 978-3-540-30818-8
DOI https://doi.org/10.1007/11596448_72
Keywords information extraction; Web; automation; information retrieval
Public URL http://researchrepository.napier.ac.uk/Output/1995940