Curve Fitting Assessor is a LPA(learning, predicting, assessing) algorithm. It stops a pending trial X at step S if the prediction of final epoch's performance is worse than the best final performance in the trial history.
In this algorithm, we use 12 curves to fit the learning curve, the large set of parametric curve models are chosen from reference paper. The learning curves' shape coincides with our prior knowlwdge about the form of learning curves: They are typically increasing, saturating functions.
We combine all learning curve models into a single, more powerful model. This combined model is given by a weighted linear combination:
where the new combined parameter vector
Assuming additive a Gaussian noise and the noise parameter is initialized to its maximum likelihood estimate.
We determine the maximum probability value of the new combined parameter vector by learing the historical data. Use such value to predict the future trial performance, and stop the inadequate experiments to save computing resource.
Concretely,this algorithm goes through three stages of learning, predicting and assessing.
-
Step1: Learning. We will learning about the trial history of the current trial and determine the \xi at Bayesian angle. First of all, We fit each curve using the least squares method(implement by
fit_theta
) to save our time. After we obtained the parameters, we filter the curve and remove the outliers(implement byfilter_curve
). Finally, we use the MCMC sampling method(implement bymcmc_sampling
) to adjust the weight of each curve. Up to now, we have dertermined all the parameters in \xi. -
Step2: Predicting. Calculates the expected final result accuracy(implement by
f_comb
) at target position(ie the total number of epoch) by the \xi and the formula of the combined model. -
Step3: If the fitting result doesn't converge, the predicted value will be
None
, in this case we returnAssessResult.Good
to ask for future accuracy information and predict again. Furthermore, we will get a positive value bypredict()
function, if this value is strictly greater than the best final performance in history *THRESHOLD
(default value = 0.95), returnAssessResult.Good
, otherwise, returnAssessResult.Bad
The figure below is the result of our algorithm on MNIST trial history data, where the green point represents the data obtained by Assessor, the blue point represents the future but unknown data, and the red line is the Curve predicted by the Curve fitting assessor.
To use Curve Fitting Assessor, you should add the following spec in your experiment's YAML config file:
assessor:
builtinAssessorName: Curvefitting
classArgs:
# (required)The total number of epoch.
# We need to know the number of epoch to determine which point we need to predict.
epoch_num: 20
# (optional) choice: maximize, minimize
* The default value of optimize_mode is maximize
optimize_mode: maximize
# (optional) In order to save our computing resource, we start to predict when we have more than only after receiving start_step number of reported intermediate results.
* The default value of start_step is 6.
start_step: 6
# (optional) The threshold that we decide to early stop the worse performance curve.
# For example: if threshold = 0.95, optimize_mode = maximize, best performance in the history is 0.9, then we will stop the trial which predict value is lower than 0.95 * 0.9 = 0.855.
* The default value of threshold is 0.95.
# Kindly reminds that if you choose minimize mode, please adjust the value of threshold >= 1.0 (e.g threshold=1.1)
threshold: 0.95
# (optional) The gap interval between Assesor judgements.
# For example: if gap = 2, start_step = 6, then we will assess the result when we get 6, 8, 10, 12...intermedian result.
* The default value of gap is 1.
gap: 1
The assessor has a lot of different files, functions and classes. Here we will only give most of those files a brief introduction:
curvefunctions.py
includes all the function expression and default parameters.modelfactory.py
includes learning and predicting, the corresponding calculation part is also implemented here.curvefitting_assessor.py
is a assessor which receives the trial history and assess whether to early stop the trial.
- Further improve the accuracy of the prediction and test it on more models.