Volume 46, Issue 1 e70016
RESEARCH ARTICLE

Practical Machine Learning Strategies. I. Correcting the MMFF Molecular Mechanics Model to More Accurately Provide Conformational Energy Differences in Flexible Organic Molecules

Thomas Hehre

Thomas Hehre

Wavefunction, Inc., Irvine, California, USA

Search for more papers by this author
Philip E. Klunzinger

Philip E. Klunzinger

Wavefunction, Inc., Irvine, California, USA

Search for more papers by this author
Bernard Deppmeier

Bernard Deppmeier

Wavefunction, Inc., Irvine, California, USA

Search for more papers by this author
William Ohlinger

William Ohlinger

Wavefunction, Inc., Irvine, California, USA

Search for more papers by this author
Warren Hehre

Corresponding Author

Warren Hehre

Wavefunction, Inc., Irvine, California, USA

Correspondence:

Warren Hehre ([email protected])

Search for more papers by this author
First published: 05 January 2025
Citations: 4

ABSTRACT

A correction to the MMFF molecular mechanics model, based on a neural network trained to reproduce conformer energy differences obtained from ωB97X-V/6-311+G(2df,2p)[6-311G*]//MMFF calculations is described. It is supported for molecules containing H, C, N, O, F, S, Cl, and Br. The correction adds only slightly to the cost of MMFF, and the resulting corrected model is several orders of magnitude faster than ωB97X-V/6-311+G(2df,2p)[6-311G*]. It properly identifies the lowest energy conformer for 82% of the molecules in a test set of flexible organic molecules (3553 total conformers), compared with 38% for MMFF. While the corrected MMFF model cannot be expected to provide sufficiently accurate Boltzmann weights for use in spectra and property calculations on flexible molecules, it is able to reduce the number of “reasonable” conformers that need to be passed on to more rigorous computational models, that can.

Data Availability Statement

The data that support the findings of this study are openly available in github at https://github.com/wavefun-dev/mmff-correct.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.