Dr Taoxin Peng T.Peng@napier.ac.uk
Lecturer
Dr Taoxin Peng T.Peng@napier.ac.uk
Lecturer
Lin Li
Prof Jessie Kennedy J.Kennedy@napier.ac.uk
Enhanced Associate
Abstract—There is a growing awareness that the high quality of string matching is a key to a variety of applications, such as data integration, text and web mining, information retrieval, search engine. In such applications, matching names is one of the popular tasks. There are a number of name matching techniques available. Unfortunately, there is no existing name matching technique that performs the best in all situations. Different techniques perform differently in different situations. Therefore, a problem that every researcher or a practitioner has to face is how to select an appropriate technique for a given dataset. This paper analyzes and evaluates a set of popular name matching techniques on several carefully designed different datasets. The experimental comparison confirms the statement that there is no clear best technique. Some suggestions have been presented, which can be used as guidance for researchers and practitioners to select an appropriate name matching technique in a given dataset.
Peng, T., Li, L., & Kennedy, J. (2011). An evaluation of name matching techniques. In Proceedings of 2nd Annual International Conference on Business Intelligence and Data Warehousing
Start Date | Jun 27, 2011 |
---|---|
End Date | Jun 28, 2011 |
Publication Date | 2011 |
Deposit Date | Mar 6, 2012 |
Peer Reviewed | Peer Reviewed |
Book Title | Proceedings of 2nd Annual International Conference on Business Intelligence and Data Warehousing |
ISBN | 978-981-08-9266-1 |
Keywords | String matching; data integration; data mining; information retrieval; name matching techniques; |
Public URL | http://researchrepository.napier.ac.uk/id/eprint/4994 |
Multi-Objective Evolutionary Optimisation for Prototype-Based Fuzzy Classifiers
(2022)
Journal Article
Semantic-Aware Real-Time Correlation Tracking Framework for UAV Videos
(2020)
Journal Article
A tool for generating synthetic data
(2018)
Conference Proceeding
Visualization of Online Datasets
(2017)
Journal Article
Visualization of Online Datasets
(2017)
Conference Proceeding
About Edinburgh Napier Research Repository
Administrator e-mail: repository@napier.ac.uk
This application uses the following open-source libraries:
Apache License Version 2.0 (http://www.apache.org/licenses/)
Apache License Version 2.0 (http://www.apache.org/licenses/)
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Advanced Search