Grid search and random search are widely used techniques for hyperparameter tuning in machine learning, especially when gradient information is unavailable. In these methods, a finite set of candidate configurations is evaluated, and the best-performing one is selected. We propose a simple and computationally inexpensive refinement of this paradigm: instead of selecting a single best point, we form a weighted average of the evaluated configurations, where the weights are chosen using an importance sampling scheme inspired by the Laplace principle. This scheme can be implemented as a post-processing step on top of a random search, with no additional function evaluations. We also propose an iterative variant, where the sampling distributions are chosen adaptively to generate new candidate points around the previous estimate, in the spirit of Evolution Strategy (ES) methods. In a general non-convex setting, we show that, after n evaluations, the error of the proposed methods is of smaller order than n -2/(d+2) . This compares favorably to random search or grid search rates of n -1/d as soon as d > 2. We illustrate the practical benefits of this averaging strategy on several examples.
Building similarity graph...
Analyzing shared references across papers
Loading...
Radu-Alexandru Dragomir
François Portier
Victor Priser
Centre de Recherche en Économie et Statistique
Centre for Research in Engineering Surface Technology
École Nationale de la Statistique et de l'Analyse de l'Information
Building similarity graph...
Analyzing shared references across papers
Loading...
Dragomir et al. (Fri,) studied this question.
www.synapsesocial.com/papers/69d8930e6c1944d70ce041c8 — DOI: https://doi.org/10.48550/arxiv.2604.02882