Understanding generalization behavior of interval-tuned Irace configurations across sample sizes
Determine the reasons underlying the observed size-dependent performance crossover of Irace-tuned (p2, p3) parameter pairs for 3-dimensional Kronecker point sets (with p1 = 1/n), where configurations tuned on specific n-intervals (e.g., INTERVAL_5 and INTERVAL_9) generalize well up to roughly 1000–1200 points before being surpassed by the IRACE_1500 configuration. Characterize how the parameter-to-discrepancy landscape changes with n, and explain why certain training intervals yield superior cross-n performance.
References
Figure \ref{fig:irace_chunck_complete} shows that the curves are closely packed when n is small, suggesting that most configurations perform similarly at small sizes. Performance differences become more noticeable as the number of points increases. For example, INTERVAL_5 and INTERVAL_9 consistently outperform most other interval configurations and remain competitive with IRACE_1500 up to about 1000–1200 points; beyond this range, IRACE_1500 shows a clear improvement over them. We do not fully understand why this happens.
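To make the comparison across n concrete, the following minimal sketch builds a 3-dimensional Kronecker point set with p1 = 1/n and evaluates one fixed (p2, p3) pair at several sizes. Several things here are assumptions rather than details from the source: the index range i = 0, ..., n-1, the function names, and the demo parameter values, which are placeholders and not tuned Irace configurations. The quality measure is also a stand-in: the source does not state which discrepancy it uses, so the sketch uses the L2 star discrepancy computed with Warnock's closed-form formula, chosen only because it is cheap to evaluate exactly.

```python
import math

# Placeholder parameters for illustration only (NOT Irace-tuned values):
# fractional parts of the golden ratio and of sqrt(2).
P2_DEMO = (math.sqrt(5.0) - 1.0) / 2.0
P3_DEMO = math.sqrt(2.0) - 1.0


def kronecker_points(n, p2, p3):
    """Return n points (i*p1 mod 1, i*p2 mod 1, i*p3 mod 1) with p1 = 1/n.

    Assumption: indices run over i = 0, ..., n-1.
    """
    return [(i / n, (i * p2) % 1.0, (i * p3) % 1.0) for i in range(n)]


def l2_star_discrepancy(points):
    """L2 star discrepancy via Warnock's formula, in O(n^2 * d) time.

    This is a proxy measure; the source may use a different discrepancy.
    """
    n = len(points)
    d = len(points[0])
    # Term from integrating the squared box volume over [0, 1]^d.
    volume_term = 3.0 ** (-d)
    # Cross term between the point set and the box volume.
    cross_term = sum(math.prod(1.0 - x * x for x in p) for p in points)
    # Pairwise term between all points, including each point with itself.
    pair_term = sum(
        math.prod(1.0 - max(a, b) for a, b in zip(p, q))
        for p in points
        for q in points
    )
    squared = volume_term - (2.0 ** (1 - d) / n) * cross_term + pair_term / n**2
    # Guard against tiny negative values caused by floating-point rounding.
    return math.sqrt(max(squared, 0.0))


def discrepancy_curve(p2, p3, sizes):
    """Map each n in sizes to the discrepancy of the (p2, p3) point set."""
    return {n: l2_star_discrepancy(kronecker_points(n, p2, p3)) for n in sizes}


if __name__ == "__main__":
    # Small sizes keep the pure-Python O(n^2) evaluation fast.
    for n, value in discrepancy_curve(P2_DEMO, P3_DEMO, [50, 100, 200]).items():
        print(n, value)
```

Calling `discrepancy_curve` for two different (p2, p3) pairs over the same list of sizes and comparing the values size by size is one way to locate a crossover like the one described above, under the stated assumptions.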