TY - GEN
T1 - A Problem with the Current Methodology for Comparing Search Algorithms and a Proposed Solution
AU - Barley, Mike
AU - de Kriek, Natasha
AU - Franco, Santiago
AU - Garcia-Olaya, Angel
AU - Hartill, Tim
AU - Triggs, Christopher
AU - Zwart, Henry
AU - Alcazar, Vidal
AU - Riddle, Patricia
PY - 2025/5/16
Y1 - 2025/5/16
N2 - This paper explores how incompletely described tie-break policies can invalidate the experimental results reported in papers on optimal bidirectional heuristic search (BiHS). Experiments usually use a single implementation of an algorithm with its specific tie-break policy. When the tie-breaks are insufficiently described, we show that the results can be irreproducible, vary dramatically under different implementations, and lead to misleading assessments of an algorithm’s performance. To ensure reproducible and representative results, papers should either provide a description of the algorithm’s implementation, i.e., the complete tie-break policy, or alternatively, give results as a summary statistic representative of all possible tie-break implementations. We developed a software tool for this purpose.
M3 - Conference contribution
BT - International Symposium on Combinatorial Search (SoCS), 2025
ER -