A method for automating the extraction of specialized information from the web

Lin, Ling; Liotta, Antonio; Hippisley, Andrew

doi:10.1007/11596448_72

A method for automating the extraction of specialized information from the web

Lin, Ling; Liotta, Antonio; Hippisley, Andrew

Authors

Ling Lin

Antonio Liotta

Andrew Hippisley

Abstract

The World Wide Web can be viewed as a gigantic distributed database including millions of interconnected hosts some of which publish information via web servers or peer-to-peer systems. We present here a novel method for the extraction of semantically rich information from the web in a fully automated fashion. We illustrate our approach via a proof-of-concept application which scrutinizes millions of web pages looking for clues as to the trend of the Chinese stock market. We present the outcomes of a 210-day long study which indicates a strong correlation between the information retrieved by our prototype and the actual market behavior.

Citation

Lin, L., Liotta, A., & Hippisley, A. (2005, December). A method for automating the extraction of specialized information from the web. Presented at International Conference on Computational and Information Science 2005, Xi'an, China

Presentation Conference Type	Conference Paper (Published)
Conference Name	International Conference on Computational and Information Science 2005
Start Date	Dec 15, 2005
End Date	Dec 19, 2005
Publication Date	2005
Deposit Date	Dec 3, 2019
Publisher	Springer
Pages	489-494
Series Title	Lecture Notes in Computer Science
Series Number	3801
Series ISSN	0302-9743
Book Title	Computational Intelligence and Security International Conference, CIS 2005, Xi’an, China, December 15-19, 2005, Proceedings Part I
ISBN	978-3-540-30818-8
DOI	https://doi.org/10.1007/11596448_72
Keywords	information extraction; Web; automation; information retrieval
Public URL	http://researchrepository.napier.ac.uk/Output/1995940

Downloadable Citations

HTML

BIB

RTF