Josiane Mothe
Selective Query Processing: A Risk-Sensitive Selection of Search Configurations
Mothe, Josiane; Ullah, Md. Zia
Abstract
In information retrieval systems, search parameters are optimized to ensure high effectiveness based on a set of past searches, and these optimized parameters are then used as the system configuration for all subsequent queries. A better approach, however, would be to adapt the parameters to fit the query at hand. Selective query expansion is one such approach, in which the system decides automatically whether or not to expand the query, resulting in two possible system configurations. This approach was recently extended to include many other parameters, leading to many possible system configurations from which the system automatically selects the best one on a per-query basis. One problem with this approach is system training, which requires evaluating each training query with every possible configuration. In real-world systems, so many parameters and possible values must be evaluated that this approach is impractical, especially when the system must be updated frequently, as is the case for commercial search engines. In general, the more configurations, the greater the effectiveness when configuration selection is appropriate, but also the greater the risk of decreasing effectiveness when an inappropriate configuration is selected. To determine the ideal configurations to use on a per-query basis in real-world systems, we developed a method in which a restricted number of possible configurations is pre-selected and then used in a meta-search engine that decides the best search configuration on a per-query basis. We define a risk-sensitive approach for configuration pre-selection that considers the risk-reward trade-off between the number of configurations kept and system effectiveness. We define two alternative risk functions to apply to different goals. For final configuration selection, the decision is based on query feature similarities.
We compare two alternative risk functions on two query types, ad hoc and diversity, and compare these to more sophisticated machine learning-based methods. We find that a relatively small number of configurations (20) selected by our risk-sensitive model is sufficient to obtain results close to the best achievable results for each query. Effectiveness is increased by about 15% according to the P@10 and nDCG@10 evaluation metrics when compared to traditional grid search using a single configuration, and by about 20% when compared to learning to rank documents. Our risk-sensitive approach works for both diversity- and ad hoc-oriented searches. Moreover, the similarity-based selection method outperforms the more sophisticated approaches. Thus, we demonstrate the feasibility of developing per-query information retrieval systems, which will guide future research in this direction.
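The two stages described in the abstract (risk-sensitive pre-selection of a few configurations, then similarity-based selection per query) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the paper's two risk functions, so the sketch substitutes a generic wins-minus-weighted-losses trade-off against a baseline, and it assumes cosine similarity over query feature vectors with a single nearest neighbour. All names (`risk_reward`, `preselect`, `select_config`) and the toy scores are hypothetical.

```python
from math import sqrt

def risk_reward(scores, baseline, alpha=1.0):
    """Stand-in trade-off (an assumption, not the paper's risk function):
    per-query gains over the baseline minus (1 + alpha) times the losses."""
    wins = sum(max(s - b, 0.0) for s, b in zip(scores, baseline))
    losses = sum(max(b - s, 0.0) for s, b in zip(scores, baseline))
    return (wins - (1.0 + alpha) * losses) / len(scores)

def preselect(config_scores, baseline, k, alpha=1.0):
    """Keep the k configurations with the best risk-reward value."""
    ranked = sorted(
        config_scores,
        key=lambda c: risk_reward(config_scores[c], baseline, alpha),
        reverse=True,
    )
    return ranked[:k]

def cosine(u, v):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def select_config(query_vec, train_vecs, config_scores, kept):
    """Find the most similar training query, then return the kept
    configuration that scored best on that training query."""
    nearest = max(range(len(train_vecs)),
                  key=lambda i: cosine(query_vec, train_vecs[i]))
    return max(kept, key=lambda c: config_scores[c][nearest])

# Toy data: effectiveness of three hypothetical configurations on three
# training queries, plus a single-configuration baseline.
config_scores = {
    "A": [0.5, 0.2, 0.4],
    "B": [0.3, 0.6, 0.4],
    "C": [0.4, 0.1, 0.1],
}
baseline = [0.4, 0.3, 0.4]
train_vecs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

kept = preselect(config_scores, baseline, k=2)
choice = select_config([0.9, 0.1], train_vecs, config_scores, kept)
```

With the toy data, configuration "C" is dropped at pre-selection because it only loses against the baseline, and a new query whose features resemble the first training query is routed to "A". Raising `alpha` penalises losses more heavily, which favours safer configurations over high-variance ones.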
Citation
Mothe, J., & Ullah, M. Z. (2024). Selective Query Processing: A Risk-Sensitive Selection of Search Configurations. ACM Transactions on Information Systems, 42(1). https://doi.org/10.1145/3608474
Journal Article Type | Article |
---|---|
Acceptance Date | May 5, 2023 |
Online Publication Date | Aug 21, 2023 |
Publication Date | 2024-01 |
Deposit Date | Jun 16, 2023 |
Publicly Available Date | Aug 21, 2023 |
Print ISSN | 1046-8188 |
Electronic ISSN | 1558-2868 |
Publisher | Association for Computing Machinery (ACM) |
Peer Reviewed | Peer Reviewed |
Volume | 42 |
Issue | 1 |
DOI | https://doi.org/10.1145/3608474 |
Keywords | Information systems, Retrieval effectiveness, Learning to rank, Evaluation of retrieval results, Information retrieval query processing, Information retrieval, Adaptive information retrieval, Query driven parameterisation, Search engine |
Related Public URLs | https://doi.org/10.48550/arXiv.2305.18311 |
Files
Selective Query Processing: A Risk-Sensitive Selection Of System Configurations (accepted version)
(1.9 Mb)
PDF