optimizationalgorithmbehavior modeling...
TRANSCRIPT
![Page 1: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/1.jpg)
Optimization Algorithm BehaviorModeling:
A Study on the Travelling SalesmanProblem
Qi QiSupervisor: Prof. Thomas Weise
University of Science andTechnology of China
2018 - 3 - 30
![Page 2: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/2.jpg)
Outline
1 Reseach Background2 Methodology
ModelingCurve Fitting MethodResults
Predict of Unseen Instances PerformancePredicton of Future Progress
2 / 18
![Page 3: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/3.jpg)
Outline
1 Reseach Background
2 Methodology
3 / 18
![Page 4: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/4.jpg)
Anytime Algorithm
1 The solution quality of anytime algorithm improves stepby step.
2 Anytime algorithm can provide approximate solution forproblems at anytime during their run.
3 Many optimization algorithms belong to anytimealgorithm like EA, local search algorithms.
3 / 18
![Page 5: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/5.jpg)
Existing Methods
reportedresult
solu
tion q
ualit
y(w
ors
e)
(better)
runtime
Method , , andA B C
4 / 18
![Page 6: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/6.jpg)
Existing Methods
point oftermination
Method , , andA B C
solu
tion q
ualit
y(w
ors
e)
(better)
runtime
4 / 18
![Page 7: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/7.jpg)
Existing Methods
4 / 18
![Page 8: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/8.jpg)
Existing Methodssolu
tion q
ualit
y(w
ors
e)
(better)
Algorithm A
Algorithm B
runtime
terminate at time :B is better than A
x
terminate at time :A is better than B
y
4 / 18
![Page 9: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/9.jpg)
What We Want to do
1 We investigate the time-quality relationship ofoptimization processes.
2 Analyze algorithms and problems based on the informationgained.
3 Provide high-level information for the algorithms based onthe whole runtime behavior of algorithms.
5 / 18
![Page 10: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/10.jpg)
Outline
1 Reseach Background
2 MethodologyModelingPredict of Unseen Instances PerformancePredicton of Future Progress
6 / 18
![Page 11: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/11.jpg)
Outline
1 Reseach Background
2 MethodologyModelingCurve Fitting MethodResults
Predict of Unseen Instances PerformancePredicton of Future Progress
6 / 18
![Page 12: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/12.jpg)
Proposed Models
0
20
40
60
1e+01 1e+03 1e+05 1e+07
FEs
Fitn
ess
Val
ue
Dynamic Performance
6 / 18
![Page 13: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/13.jpg)
Model Propose
Table: The formulas for the four S-shaped models
Model Shortcut FormulaDecay DCM A+B exp(CxD)Logistic LGM A+ B/(1 + exp(C log(x) +D))Gompertz GPM A+B exp(C ln(x+D))Exp-linear EPM A+B exp(C exp(Dx))
7 / 18
![Page 14: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/14.jpg)
LGMP Behavior
A ∈ [0, 50] B ∈ [50, 100]
C ∈ [−0.1,−0.01] D ∈ [0.3, 3]
8 / 18
![Page 15: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/15.jpg)
Table: The meaning of the model parameters of the positive-B-shapeof the Logistic model (LGMP).
Para. Model Parameters Algorithm Perfor-mance
A Vertical offset ofcurves
The best performanceof algorithms can get
B Vertical range ofcurves
Algorithms’ initial so-lution quality
C/D Steepness of curves Learning rates of algo-rithms
9 / 18
![Page 16: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/16.jpg)
Curve Fitting Method
Cost function of fitting:
minA,B,C,D
1
n
n∑
i=1
(f(xi;A,B,C,D)− yi)2
yi(1)
1 Optimization: Levenberg-Marquardt algorithm2 LM with Intelligent initialization strategy
Solve non-linear equations.Limit the range value according to datasets.Randomly generated based on Gaussian distribution.Multiple Restarts.
9 / 18
![Page 17: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/17.jpg)
Table
1 2 5 10 20 50 100 200
0.0
0.5
1.0
1.5
2.0
FEs
Obj
ectiv
e V
alue
ts10r10f/symmetric/fri26
Parameter-Based Model
1 2 5 10 20 50 100 200
0.0
0.5
1.0
1.5
2.0
FEs
Obj
ectiv
e V
alue
ts10r10f/symmetric/fri26
ANN
10 / 18
![Page 18: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/18.jpg)
Outline
1 Reseach Background
2 MethodologyModelingCurve Fitting MethodResults
Predict of Unseen Instances PerformancePredicton of Future Progress
11 / 18
![Page 19: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/19.jpg)
Existing Method Shortcomings
1 Algorithm selection is very important to solve problems:Computational ExpensiveOnly Predict Arbitrary Runtime
2 Contribution:Predict the full runtime of behaviors of new instancesSelect algorithms based on computational budgets
11 / 18
![Page 20: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/20.jpg)
Ideas
1 If we can get the predicted parameters A, B, C, D, that isthe whole runtime behavior of algorithms
Instances Features Model ParametersPredictive Models
Whole runtime behavior
12 / 18
![Page 21: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/21.jpg)
Ideas
1 If we can get the predicted parameters A, B, C, D, that isthe whole runtime behavior of algorithms
Instances Features Model ParametersPredictive Models
Whole runtime behavior
12 / 18
![Page 22: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/22.jpg)
Ideas
1 If we can get the predicted parameters A, B, C, D, that isthe whole runtime behavior of algorithms
Instances Features Model ParametersPredictive Models
Whole runtime behavior
12 / 18
![Page 23: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/23.jpg)
Process of Parameter Prediction
1 Feature Selection:Person Correlation (Top N)Spearman Correlation (Top N)MIC Correlation(Top N)Model-based Select(Top N)
2 Ranking: Frequency of features in all correlationmethod(Top N).
3 Prediction Model: Neural Networks with different hiddennodes and layers.
13 / 18
![Page 24: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/24.jpg)
Results
1 5 10 50 100 500
05
1015
Runtime
Obj
ect V
alue
ea2048+4096ehillClimber
mnsts
All Predict Instances
1 5 10 50 100 500
0.0
0.5
1.0
1.5
Runtime
Obj
ect V
alue
ma128+256lkma16+64mnsma16+64tspaco3,10lk10mnspaco3,10mnspaco3,10tspaco5,10fsmtbb
Subset of All redict Instances
14 / 18
![Page 25: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/25.jpg)
Outline
1 Reseach Background
2 MethodologyModelingCurve Fitting MethodResults
Predict of Unseen Instances PerformancePredicton of Future Progress
15 / 18
![Page 26: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/26.jpg)
WMCMethod
1 Representing algorithm behavior as function can compute,for any point in time, which solution quality the algorithmlikely obtained
2 Used for prediction future progress of algorithms in therunning optimization process.
3 (traint, testt) = (50, 100) stands for predicting thecomplete algorithm behavior (50, 100] during the processof first 50 time interval data collected.
15 / 18
![Page 27: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/27.jpg)
WMCMethod
qp =m∑
i=1
wiqi (2)
where qp is the prediction value of quality q in testt FE interval,andm is the number of models, which be 8 in our case. wi is theweights of each model, which inverse ratio with the Φi , that is:
wi =1
m− 1
m∑
i=1
(1−Φi∑m
i=1Φi
) (3)
16 / 18
![Page 28: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/28.jpg)
ResultsTable: The prediction results between ANN and WMC in hill climberalgorithms
WMC ANNInstance 10100 50100 1001000 10100 50100 1001000burma14 0.00257 0.00268 0.00268 0.00119 0.00210 0.00161ulysses16 0.00024 0.00079 0.00192 0.00119 0.00060 0.00178gr17 0.00036 0.00069 0.00082 0.00035 0.00109 0.00067gr21 0.00182 0.00427 0.00427 0.00686 0.00488 0.00437ulysses22 0.00419 0.00098 0.00154 0.00167 0.00096 0.00153gr24 0.00671 0.00146 0.00183 0.00287 0.00095 0.00125fri26 0.00710 0.00127 0.00188 0.00736 0.00100 0.00048bayg29 0.03006 0.00186 0.00193 0.01200 0.00123 0.00144bays29 0.02468 0.00147 0.00001 0.00850 0.00140 0.00005dantzig42 0.04709 0.00127 0.00287 0.03903 0.00050 0.00051swiss42 0.03742 0.00127 0.00154 0.04487 0.00057 0.00123att48 0.13983 0.00115 0.00113 0.08355 0.00052 0.00059gr48 0.10958 0.00219 0.00190 0.10147 0.00083 0.00140hk48 0.07905 0.00186 0.00588 0.11502 0.00068 0.00147eil51 0.07627 0.00085 0.00234 0.10236 0.00032 0.00139
17 / 18
![Page 29: OptimizationAlgorithmBehavior Modeling ...iao.hfuu.edu.cn/images/publications/QWB2018OABMASOTTSP...paco3,10lk10mns paco3,10mns paco3,10ts paco5,10fsm tbb SubsetofAllredictInstances](https://reader036.vdocuments.net/reader036/viewer/2022071501/6121191b244ba1514d245dde/html5/thumbnails/29.jpg)
Thanks and Questions!
18 / 18