Dr Kevin Sim K.Sim@napier.ac.uk
Lecturer
Dr Kevin Sim K.Sim@napier.ac.uk
Lecturer
Prof Emma Hart E.Hart@napier.ac.uk
Professor
Dr Quentin Renau Q.Renau@napier.ac.uk
Research Fellow
Coupling Large Language Models (LLMs) with Evolutionary Algorithms has recently shown significant promise as a technique to design new heuristics that outperform existing methods, particularly in the field of combinatorial optimisation. An escalating arms race is both rapidly producing new heuristics and improving the efficiency of the processes evolving them. However, driven by the desire to quickly demonstrate the superiority of new approaches, evaluation of the new heuristics produced for a specific domain is often cursory: testing on very few datasets in which instances all belong to a specific class from the domain , and on few instances per class. Taking bin-packing as an example, to the best of our knowledge we conduct the first rigorous benchmarking study of new LLM-generated heuristics, comparing them to well-known existing heuristics across a large suite of benchmark instances using three performance metrics. For each heuristic, we then evolve new instances 'won' by the heuristic and perform an instance space analysis to understand where in the feature space each heuristic performs well. We show that most of the LLM heuristics do not generalise well when evaluated across a broad range of benchmarks in contrast to existing simple heuris-tics, and suggest that any gains from generating very specialist heuristics that only work in small areas of the instance space need to be weighed carefully against the considerable cost of generating these heuristics.
Sim, K., Hart, E., & Renau, Q. (2025, April). Beyond the Hype: Benchmarking LLM-Evolved Heuristics for Bin Packing. Presented at EvoSTAR 2025, Trieste, Italy
Presentation Conference Type | Conference Paper (published) |
---|---|
Conference Name | EvoSTAR 2025 |
Start Date | Apr 23, 2025 |
End Date | Apr 25, 2025 |
Acceptance Date | Jan 10, 2025 |
Deposit Date | Feb 3, 2025 |
Publisher | Springer |
Peer Reviewed | Peer Reviewed |
Keywords | Large Language Models, Automated Design of Heuristics, Benchmarking, Combinatorial Optimisation |
Public URL | http://researchrepository.napier.ac.uk/Output/4105443 |
External URL | https://www.evostar.org/2025/ |
This file is under embargo due to copyright reasons.
Contact repository@napier.ac.uk to request a copy for personal use.
A hyper-heuristic ensemble method for static job-shop scheduling.
(2016)
Journal Article
A research agenda for metaheuristic standardization.
(2015)
Presentation / Conference Contribution
A Lifelong Learning Hyper-heuristic Method for Bin Packing
(2015)
Journal Article
On Constructing Ensembles for Combinatorial Optimisation
(2017)
Journal Article
Use of machine learning techniques to model wind damage to forests
(2018)
Journal Article
About Edinburgh Napier Research Repository
Administrator e-mail: repository@napier.ac.uk
This application uses the following open-source libraries:
Apache License Version 2.0 (http://www.apache.org/licenses/)
Apache License Version 2.0 (http://www.apache.org/licenses/)
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search