Ling Lin
A method for automating the extraction of specialized information from the web
Lin, Ling; Liotta, Antonio; Hippisley, Andrew
Authors
Antonio Liotta
Andrew Hippisley
Abstract
The World Wide Web can be viewed as a gigantic distributed database including millions of interconnected hosts some of which publish information via web servers or peer-to-peer systems. We present here a novel method for the extraction of semantically rich information from the web in a fully automated fashion. We illustrate our approach via a proof-of-concept application which scrutinizes millions of web pages looking for clues as to the trend of the Chinese stock market. We present the outcomes of a 210-day long study which indicates a strong correlation between the information retrieved by our prototype and the actual market behavior.
Citation
Lin, L., Liotta, A., & Hippisley, A. (2005, December). A method for automating the extraction of specialized information from the web. Presented at International Conference on Computational and Information Science 2005, Xi'an, China
Presentation Conference Type | Conference Paper (Published) |
---|---|
Conference Name | International Conference on Computational and Information Science 2005 |
Start Date | Dec 15, 2005 |
End Date | Dec 19, 2005 |
Publication Date | 2005 |
Deposit Date | Dec 3, 2019 |
Publisher | Springer |
Pages | 489-494 |
Series Title | Lecture Notes in Computer Science |
Series Number | 3801 |
Series ISSN | 0302-9743 |
Book Title | Computational Intelligence and Security International Conference, CIS 2005, Xi’an, China, December 15-19, 2005, Proceedings Part I |
ISBN | 978-3-540-30818-8 |
DOI | https://doi.org/10.1007/11596448_72 |
Keywords | information extraction; Web; automation; information retrieval |
Public URL | http://researchrepository.napier.ac.uk/Output/1995940 |
Downloadable Citations
About Edinburgh Napier Research Repository
Administrator e-mail: repository@napier.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search