Dr Md Zia Ullah M.Ullah@napier.ac.uk
Lecturer
Understanding the users’ search intents through mining query subtopic is a challenging task and a prerequisite step for search diversification. This paper proposes mining query subtopic by exploiting the word embedding and short-text similarity measure. We extract candidate subtopic from multiple sources and introduce a new way of ranking based on a new novelty estimation that faithfully represents the possible search intents of the query. To estimate the subtopic relevance, we introduce new semantic features based on word embedding and bipartite graph based ranking. To estimate the novelty of a subtopic, we propose a method by combining the contextual and categorical similarities. Experimental results on NTCIR subtopic mining datasets turn out that our proposed approach outperforms the baselines, known previous methods, and the official participants of the subtopic mining tasks.
Ullah, M. Z., Chy, A. N., & Aono, M. (2016). Query Subtopic Mining Exploiting Word Embedding for Search Result Diversification. In Information Retrieval Technology: 12th Asia Information Retrieval Societies Conference, AIRS 2016, Beijing, China, November 30 – December 2, 2016, Proceedings (308-314). https://doi.org/10.1007/978-3-319-48051-0_24
Conference Name | 12th Asia Information Retrieval Societies Conference |
---|---|
Conference Location | Beijing, China |
Start Date | Nov 30, 2016 |
End Date | Dec 2, 2016 |
Online Publication Date | Oct 15, 2016 |
Publication Date | 2016 |
Deposit Date | Mar 13, 2023 |
Publisher | Springer |
Pages | 308-314 |
Series Title | Lecture Notes in Computer Science |
Series ISSN | 1611-3349 |
Book Title | Information Retrieval Technology: 12th Asia Information Retrieval Societies Conference, AIRS 2016, Beijing, China, November 30 – December 2, 2016, Proceedings |
DOI | https://doi.org/10.1007/978-3-319-48051-0_24 |
Keywords | Subtopic mining, Word embedding, Diversification, Novelty |
Defining an Optimal Configuration Set for Selective Search Strategy - A Risk-Sensitive Approach
(2021)
Conference Proceeding
Forward and backward feature selection for query performance prediction
(2020)
Conference Proceeding
Studying the variability of system setting effectiveness by data analytics and visualization
(2019)
Conference Proceeding
Information nutritional label and word embedding to estimate information check-worthiness
(2019)
Conference Proceeding
About Edinburgh Napier Research Repository
Administrator e-mail: repository@napier.ac.uk
This application uses the following open-source libraries:
Apache License Version 2.0 (http://www.apache.org/licenses/)
Apache License Version 2.0 (http://www.apache.org/licenses/)
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Advanced Search