Category:Machine-learned force fields: Difference between revisions
(Created page with "'''Machine-learned force fields''' from ab-initio molecular dynamics (MD) allow capturing the underlying physics from first principles and sti...") |
|||
(11 intermediate revisions by 4 users not shown) | |||
Line 1: | Line 1: | ||
'''Machine-learned force fields''' | '''Machine-learned force fields''' used in combination with ab-initio [[:Category:Molecular dynamics|molecular dynamics]] (MD) allow capturing the underlying physics from first principles and still reach long simulation times relatively cheaply. Generally, an ab-initio MD step is computationally expensive due to the [[:Category:Electronic minimization|quantum-mechanical treatment of the electrons]], e.g., within density-functional theory (DFT). In fully classical MD calculations, '''force fields''' are used to evaluate the force acting on each atom instead of DFT. These interatomic potentials are traditionally based on experimental observation and empirical inclusion of known forces, such as the van-der-Waals force, electrostatic charges, etc. Therefore, the quality of the force field depends on how well interactions in the specific system are known. | ||
VASP offers '''machine learning force fields on-the-fly''' to overcome these two issues. Namely, the high computational cost required for ab-initio MD and the empirical knowledge necessary to construct a force field the traditional way. To learn more about on-the-fly machine learning read about the [[Machine learning force field: Theory|theory of on-the-fly machine learning force fields]] or about the [[Machine learning force field calculations: Basics| setup of a basic calculation]]! Also, check out the description on [[Best practices for machine-learned force fields|best practices on how to construct, test, and retrain force fields]], as well as the [https://www.vasp.at/tutorials/latest/md/part2/ basic tutorial that provides hands-on experience for silicon] and this tutorial: {{TAG|Liquid Si - MLFF}}. | VASP offers '''machine learning force fields on-the-fly''' to overcome these two issues. Namely, the high computational cost required for ab-initio MD and the empirical knowledge necessary to construct a force field the traditional way. To learn more about on-the-fly machine learning read about the [[Machine learning force field: Theory|theory of on-the-fly machine learning force fields]] or about the [[Machine learning force field calculations: Basics| setup of a basic calculation]]! Also, check out the description on [[Best practices for machine-learned force fields|best practices on how to construct, test, and retrain force fields]], as well as the [https://www.vasp.at/tutorials/latest/md/part2/ basic tutorial that provides hands-on experience for silicon] and this tutorial: {{TAG|Liquid Si - MLFF}}. | ||
Line 5: | Line 5: | ||
== Theory == | == Theory == | ||
VASP uses a Bayesian-learning algorithm for on-the-fly machine learning. The total energy and forces are predicted based on the machine-learned force field at each time step of the MD simulation. If the Bayesian error exceeds a certain threshold an ab-initio calculation is performed, where electrons are treated quantum mechanically. Subsequently, | VASP uses a Bayesian-learning algorithm for on-the-fly machine learning. The total energy and forces are predicted based on the machine-learned force field at each time step of the MD simulation. If the Bayesian error estimate exceeds a certain threshold an ab-initio calculation is performed, where electrons are treated quantum mechanically. Subsequently, previously untrained atomic environments are identified and added to a set of so-called local reference configurations. This set serves as a comparison database for future force field predictions. With new information in the form of total energy and atomic forces obtained from the triggered ab-initio calculation the machine-learned force field is updated and the MD simulation continues. In this way the force field is iteratively improved in the course of the MD simulation. Ideally, it will reach such high prediction quality that no more ab-initio calculations are required. | ||
For details on the algorithm, check out the [[Machine learning force field: Theory|theory article about machine learning force fields]]. | For details on the algorithm, check out the [[Machine learning force field: Theory|theory article about machine learning force fields]]. | ||
Line 20: | Line 20: | ||
=== Input === | === Input === | ||
Depending on whether the calculation is training, retraining, or applying the force field (see {{TAG|ML_MODE}}), VASP may require the following input files in addition to the usual [[:Category:Input files|input files]] ({{TAG|INCAR}}, {{TAG|POSCAR}}, etc.): | |||
*{{TAG|ML_AB}} Ab-initio training data. | |||
*{{TAG|ML_FF}} Force-field parameters. | |||
=== Output === | === Output === | ||
The machine-learning–force-field method generates the following [[:Category:Output files|output files]]: | |||
*{{TAG|ML_LOGFILE}} Main output file. | |||
*{{TAG|ML_ABN}} Generated ab-initio training data. It is used as {{FILE|ML_AB}} file to restart a calculation. | |||
*{{TAG|ML_REG}} Summary of regression results. | |||
*{{TAG|ML_HIS}} Summary of the histogram data. | |||
*{{TAG|ML_FFN}} Force-field parameters. It is used as {{FILE|ML_FF}} file to restart a calculation. | |||
*{{TAG|ML_EATOM}} Local atomic energies. | |||
*{{TAG|ML_HEAT}} Local heat flux. | |||
=== Hyperparameters === | |||
Hyperparameters are user-defined parameters of the MLFF model that will not be optimized during training of the MLFF. For instance, cutoff radii ({{TAG|ML_RCUT1}}, {{TAG|ML_RCUT2}}), the normalization and weighting of ab-initio training data ({{TAG|ML_IWEIGHT}}, {{TAG|ML_WTOTEN}}, {{TAG|ML_WTIFOR}}, {{TAG|ML_WTSIF}}) | |||
, threshold and the total number of descriptors in the sparsification ({{TAG|ML_EPS_LOW}}, {{TAG|ML_RDES_SPARSDES}}). We recommend optimizing these in cross-validation by means of systematically varying the parameters and refitting the force field ({{TAG|ML_MODE}}=refit). In principle, it is necessary to do that for all hyperparameters available in VASP to obtain the best MLFF model. In practice, usually the default values are a good guess. The default parameters of VASP were optimized on a selected set of bulk materials and on a molecular data base. | |||
---- | ---- | ||
[[Category:VASP|Machine-learned force fields]][[Category:Ionic minimization|Ionic minimization]][[Category:Molecular dynamics|Molecular dynamics]][[Category:Forces|Forces]] | [[Category:VASP|Machine-learned force fields]][[Category:Ionic minimization|Ionic minimization]][[Category:Molecular dynamics|Molecular dynamics]][[Category:Forces|Forces]] |
Latest revision as of 09:31, 4 October 2023
Machine-learned force fields used in combination with ab-initio molecular dynamics (MD) allow capturing the underlying physics from first principles and still reach long simulation times relatively cheaply. Generally, an ab-initio MD step is computationally expensive due to the quantum-mechanical treatment of the electrons, e.g., within density-functional theory (DFT). In fully classical MD calculations, force fields are used to evaluate the force acting on each atom instead of DFT. These interatomic potentials are traditionally based on experimental observation and empirical inclusion of known forces, such as the van-der-Waals force, electrostatic charges, etc. Therefore, the quality of the force field depends on how well interactions in the specific system are known.
VASP offers machine learning force fields on-the-fly to overcome these two issues. Namely, the high computational cost required for ab-initio MD and the empirical knowledge necessary to construct a force field the traditional way. To learn more about on-the-fly machine learning read about the theory of on-the-fly machine learning force fields or about the setup of a basic calculation! Also, check out the description on best practices on how to construct, test, and retrain force fields, as well as the basic tutorial that provides hands-on experience for silicon and this tutorial: Liquid Si - MLFF.
Theory
VASP uses a Bayesian-learning algorithm for on-the-fly machine learning. The total energy and forces are predicted based on the machine-learned force field at each time step of the MD simulation. If the Bayesian error estimate exceeds a certain threshold an ab-initio calculation is performed, where electrons are treated quantum mechanically. Subsequently, previously untrained atomic environments are identified and added to a set of so-called local reference configurations. This set serves as a comparison database for future force field predictions. With new information in the form of total energy and atomic forces obtained from the triggered ab-initio calculation the machine-learned force field is updated and the MD simulation continues. In this way the force field is iteratively improved in the course of the MD simulation. Ideally, it will reach such high prediction quality that no more ab-initio calculations are required.
For details on the algorithm, check out the theory article about machine learning force fields.
How to
All related INCAR tags and input/output files begin with the prefix ML_. To learn about machine learning force fields, visit:
- Machine learning force field calculations: Basics.
- Best practices for machine-learned force fields include how to construct, test, and retrain force fields.
- Basic tutorial that provides hands-on experience for silicon.
- Liquid Si - MLFF.
Input
Depending on whether the calculation is training, retraining, or applying the force field (see ML_MODE), VASP may require the following input files in addition to the usual input files (INCAR, POSCAR, etc.):
Output
The machine-learning–force-field method generates the following output files:
- ML_LOGFILE Main output file.
- ML_ABN Generated ab-initio training data. It is used as ML_AB file to restart a calculation.
- ML_REG Summary of regression results.
- ML_HIS Summary of the histogram data.
- ML_FFN Force-field parameters. It is used as ML_FF file to restart a calculation.
- ML_EATOM Local atomic energies.
- ML_HEAT Local heat flux.
Hyperparameters
Hyperparameters are user-defined parameters of the MLFF model that will not be optimized during training of the MLFF. For instance, cutoff radii (ML_RCUT1, ML_RCUT2), the normalization and weighting of ab-initio training data (ML_IWEIGHT, ML_WTOTEN, ML_WTIFOR, ML_WTSIF) , threshold and the total number of descriptors in the sparsification (ML_EPS_LOW, ML_RDES_SPARSDES). We recommend optimizing these in cross-validation by means of systematically varying the parameters and refitting the force field (ML_MODE=refit). In principle, it is necessary to do that for all hyperparameters available in VASP to obtain the best MLFF model. In practice, usually the default values are a good guess. The default parameters of VASP were optimized on a selected set of bulk materials and on a molecular data base.
Pages in category "Machine-learned force fields"
The following 70 pages are in this category, out of 70 total.
M
- Machine learning force field calculations: Basics
- Machine learning force field: Theory
- ML AB
- ML ABN
- ML AFILT2
- ML CDOUB
- ML CSIG
- ML CSLOPE
- ML CTIFOR
- ML CX
- ML DESC TYPE
- ML EATOM
- ML EATOM REF
- ML EPS LOW
- ML EPS REG
- ML FF
- ML FFN
- ML HEAT
- ML HIS
- ML IAFILT2
- ML IALGO LINREG
- ML ICOUPLE
- ML ICRITERIA
- ML IERR
- ML IREG
- ML ISCALE TOTEN
- ML ISTART
- ML IWEIGHT
- ML LAFILT2
- ML LBASIS DISCARD
- ML LCOUPLE
- ML LEATOM
- ML LERR
- ML LFAST
- ML LHEAT
- ML LMAX2
- ML LMLFF
- ML LOGFILE
- ML LSPARSDES
- ML LUSE NAMES
- ML MB
- ML MB MIN
- ML MCONF
- ML MCONF NEW
- ML MHIS
- ML MODE
- ML MRB1
- ML MRB2
- ML NATOM COUPLED
- ML NHYP
- ML NMDINT
- ML NRANK SPARSDES
- ML OUTBLOCK
- ML OUTPUT MODE
- ML RCOUPLE
- ML RCUT1
- ML RCUT2
- ML RDES SPARSDES
- ML REG
- ML SCLC CTIFOR
- ML SIGV0
- ML SIGW0
- ML SION1
- ML SION2
- ML W1
- ML WTIFOR
- ML WTOTEN
- ML WTSIF