Romain Deveaud
Learning to adaptively rank document retrieval system configurations
Deveaud, Romain; Mothe, Josiane; Ullah, Md Zia; Nie, Jian-Yun
Abstract
Modern Information Retrieval (IR) systems have become more and more complex, involving a large number of parameters. For example, a system may choose from a set of possible retrieval models (BM25, language model, etc.), or various query expansion parameters, whose values greatly influence the overall retrieval effectiveness. Traditionally, these parameters are set at a system level based on training queries, and the same parameters are then used for different queries. We observe that it may not be easy to set all these parameters separately, since they can be dependent. In addition, a global setting for all queries may not best fit all individual queries with different characteristics. The parameters should be set according to these characteristics. In this article, we propose a novel approach to tackle this problem by dealing with the entire system configurations (i.e., a set of parameters representing an IR system behaviour) instead of selecting a single parameter at a time. The selection of the best configuration is cast as a problem of ranking different possible configurations given a query. We apply learning-to-rank approaches for this task. We exploit both the query features and the system configuration features in the learning-to-rank method so that the selection of configuration is query dependent. The experiments we conducted on four TREC ad hoc collections show that this approach can significantly outperform the traditional method to tune system configuration globally (i.e., grid search) and leads to higher effectiveness than the top performing systems of the TREC tracks. We also perform an ablation analysis on the impact of different features on the model learning capability and show that query expansion features are among the most important for adaptive systems.
Citation
Deveaud, R., Mothe, J., Ullah, M. Z., & Nie, J. (2019). Learning to adaptively rank document retrieval system configurations. ACM transactions on information systems, 37(1), Article 3. https://doi.org/10.1145/3231937
Journal Article Type | Article |
---|---|
Acceptance Date | Jun 1, 2018 |
Online Publication Date | Oct 30, 2018 |
Publication Date | 2019-01 |
Deposit Date | Mar 13, 2023 |
Journal | ACM Transactions on Information Systems (TOIS) |
Print ISSN | 1046-8188 |
Publisher | Association for Computing Machinery |
Peer Reviewed | Peer Reviewed |
Volume | 37 |
Issue | 1 |
Article Number | 3 |
DOI | https://doi.org/10.1145/3231937 |
You might also like
Defining an Optimal Configuration Set for Selective Search Strategy - A Risk-Sensitive Approach
(2021)
Conference Proceeding
Forward and backward feature selection for query performance prediction
(2020)
Conference Proceeding
Studying the variability of system setting effectiveness by data analytics and visualization
(2019)
Conference Proceeding
Information nutritional label and word embedding to estimate information check-worthiness
(2019)
Conference Proceeding