Beyond the Hype: Benchmarking LLM-Evolved Heuristics for Bin Packing

Sim, Kevin; Hart, Emma; Renau, Quentin

doi:10.1007/978-3-031-90065-5_24

Beyond the Hype: Benchmarking LLM-Evolved Heuristics for Bin Packing

Sim, Kevin; Hart, Emma; Renau, Quentin

Authors

Dr Kevin Sim K.Sim@napier.ac.uk
Lecturer

Prof Emma Hart E.Hart@napier.ac.uk
Professor

Dr Quentin Renau Q.Renau@napier.ac.uk
Research Fellow

Abstract

Coupling Large Language Models (LLMs) with Evolutionary Algorithms has recently shown significant promise as a technique to design new heuristics that outperform existing methods, particularly in the field of combinatorial optimisation. An escalating arms race is both rapidly producing new heuristics and improving the efficiency of the processes evolving them. However, driven by the desire to quickly demonstrate the superiority of new approaches, evaluation of the new heuristics produced for a specific domain is often cursory: testing on very few datasets in which instances all belong to a specific class from the domain , and on few instances per class. Taking bin-packing as an example, to the best of our knowledge we conduct the first rigorous benchmarking study of new LLM-generated heuristics, comparing them to well-known existing heuristics across a large suite of benchmark instances using three performance metrics. For each heuristic, we then evolve new instances 'won' by the heuristic and perform an instance space analysis to understand where in the feature space each heuristic performs well. We show that most of the LLM heuristics do not generalise well when evaluated across a broad range of benchmarks in contrast to existing simple heuris-tics, and suggest that any gains from generating very specialist heuristics that only work in small areas of the instance space need to be weighed carefully against the considerable cost of generating these heuristics.

Citation

Sim, K., Hart, E., & Renau, Q. (2025, April). Beyond the Hype: Benchmarking LLM-Evolved Heuristics for Bin Packing. Presented at EvoSTAR 2025, Trieste, Italy

Presentation Conference Type	Conference Paper (published)
Conference Name	EvoSTAR 2025
Start Date	Apr 23, 2025
End Date	Apr 25, 2025
Acceptance Date	Jan 10, 2025
Online Publication Date	Apr 17, 2025
Publication Date	2025
Deposit Date	Feb 3, 2025
Publicly Available Date	Apr 18, 2026
Publisher	Springer
Peer Reviewed	Peer Reviewed
Pages	386-402
Series Title	Lecture Notes in Computer Science
Series Number	15613
Series ISSN	0302-9743
Book Title	Applications of Evolutionary Computation
ISBN	978-3-031-90064-8
DOI	https://doi.org/10.1007/978-3-031-90065-5_24
Keywords	Large Language Models, Automated Design of Heuristics, Benchmarking, Combinatorial Optimisation
Public URL	http://researchrepository.napier.ac.uk/Output/4105443
External URL	https://www.evostar.org/2025/