
Review of Active Learning via Bayesian Optimization for Materials Discovery
jupyter notebook: https://nanohub.org/resources/bayesopt

Input: Molecule candidates(SMILES library)
Output/Objective: Molecules w/ desired oxidation potential

(1) SMILES string -> feature vectors(select features using PCA)
(2) [Step 1] Initialize the model
(3) Make prediction on the remaining dataset (990 molecules)
(4) [Step 2] Acquisition Function Evaluation: determine which molecules should be evalated next(molecule that has chance to have maximum oxidation potential)
(5) [Step 3, Evaluation process] Perform DFT simulation on that particular molecule
(6) After the simulation the new molecule is made -> ADD the new molecule back to the original training set(10 molecules -> 11 molecules)
Depends on the evaluation process(Step 3)
Martin Fitzner "Industrial view on Bayesian optimization A perfect match for the low/no-data regime"
Patrick Rinke: Active Materials Exploration and Characterization with Bayesian Optimization
Bayesian Optimization (Bayes Opt): Easy explanation of popular hyperparameter tuning method