Evaluating and Improving the Viability of Machine Learning to Solve Chemical Problems

Folmsbee, Dakota Lee (2022) Evaluating and Improving the Viability of Machine Learning to Solve Chemical Problems. Doctoral Dissertation, University of Pittsburgh. (Unpublished)

This is the latest version of this item.

Preview

PDF
Download (6MB) | Preview

Abstract

While improvements in computer processing have allowed for increasingly faster quantum mechanical (QM) calculations, the need for alternative techniques to accelerate computer-accelerated material design continues to grow. Screening methods have tackled this through methods that search chemical space more efficiently but often use faster, albeit less accurate methods for evaluation due to the large number of calculations conducted. Machine learning (ML) has shown promise as a potential surrogate for time-consuming quantum mechanical calculations, such as density functional and first-principles method, that would lend these screening methods a fast and accurate approach to evaluation.

This work sets out to determine the viability of ML methods through multiple tests. The ranking of thermally accessible conformations was conducted to establish ML's capacity to differentiate small energy differences compared to other established methods. The performance of ML methods was found to be equivalent to that of semi-empirical methods in both accuracy and evaluation time, demonstrating promise for future improvements of ML models. Next, ML's understanding of chemical physics was tested by analyzing the short and long-range interactions that occur with bond compressing and stretching as well as the effect of steric hindrance of dihedral angles. The work demonstrated the extent the training set has on the model as short and long-range interactions not present in the set became apparent in the testing of the models. Additionally, the inclusion of torsion sampling in the ANI-2 training exemplifies why more robust training sets are needed for more accurate ML methods.

Current work on ML indicates a strong need for additional diversity in training data. Initial work done on comparing experimental crystallographic geometry and gas-phase computed conformer torsional preferences examine the possible use of a quantum-based ETKDG, QTDG, for future conformer training set generation for expanding existing training sets. Future work on expanding data sets is crucial for ML performance as ML methods are very reliant on the scope of the training set. Incomplete training sets that do not appropriately represent chemical space diminish the applicability of ML to solve chemical problems.

Citation/Export:
Social Networking:	Share \|

Details

Item Type:

University of Pittsburgh ETD

Status:

Unpublished

Creators/Authors:

Creators	Email	Pitt Username	ORCID
Folmsbee, Dakota Lee	dlf57@pitt.edu	dlf57	0000-0002-4094-233X

ETD Committee:

Title	Member	Email Address	Pitt Username	ORCID
Committee Chair	Hutchison, Geoffrey R	ghutchis@pitt.edu	ghutchis	0000-0002-1757-1980
Committee Member	Jordan, Kenneth	jordan@pitt.edu	jordan
Committee Member	Liu, Peng	pengliu@pitt.edu	pengliu	0000-0002-8188-632X
Committee Member	Koes, David R	dkoes@pitt.edu	dkoes	0000-0002-6892-6614

Date:

6 June 2022

Date Type:

Publication

Defense Date:

11 February 2022

Approval Date:

6 June 2022

Submission Date:

14 March 2022

Access Restriction:

1 year -- Restrict access to University of Pittsburgh for a period of 1 year.

Number of Pages:

138

Institution:

University of Pittsburgh

Schools and Programs:

Dietrich School of Arts and Sciences > Chemistry

Degree:

PhD - Doctor of Philosophy

Thesis Type:

Doctoral Dissertation

Refereed:

Yes

Uncontrolled Keywords:

machine learning, quantum chemistry

Date Deposited:

06 Jun 2022 15:58

Last Modified:

06 Jun 2023 05:15

URI:

http://d-scholarship.pitt.edu/id/eprint/42626

Available Versions of this Item

Evaluating and Improving the Viability of Machine Learning to Solve Chemical Problems. (deposited UNSPECIFIED)
- Evaluating and Improving the Viability of Machine Learning to Solve Chemical Problems. (deposited 06 Jun 2022 15:58) [Currently Displayed]

Metrics

Monthly Views for the past 3 years

Plum Analytics

Actions (login required)

View Item

My Account

Search

Browse

Information

Evaluating and Improving the Viability of Machine Learning to Solve Chemical Problems

Abstract

Share

Details

Available Versions of this Item

Metrics

Monthly Views for the past 3 years

Plum Analytics

Actions (login required)

Connect with us

Send Comments or Questions

Feeds