Doctoral Exit Seminar: Machine learning models for molecular based functional organic materials
Organic semiconductors (OSC) are of interest for a wide range of flexible optoelectronics applications, including transistors, solar cells, and sensors, to name a few. Despite their promise, the design and optimization of OSC pose significant challenges due to the complexity of the structures of the molecular building blocks, varied packing configurations of these building blocks in the solid state, which impacts the optical and electronic response, and sensitivity of the solid-state packing to material processing conditions. Accurately predicting the solid-state properties of OSC traditionally requires high-level quantum mechanical methods. These methods, however, can be computationally demanding, particularly for large molecules or when there is interest in extensive material screenings. Overcoming this computational bottleneck is essential to enabling the efficient design of OSC, which would reduce the experimental trial-and-error approach used in material discovery. Moreso, the holy grail of computational study is to be able to accurately and efficiently predict the molecular packing configurations and associated properties of OSC. This dissertation aims to address some of these challenges by developing computational approaches that leverage machine learning (ML) models to accelerate the study of OSC. ML promises to facilitate faster material screening and optimization by offering an alternative to direct quantum mechanical calculations. Specifically, this dissertation describes the development of ML models for intermolecular interactions, including noncovalent interactions (NCI) and electronic couplings (EC). Conventional quantum mechanical methods used to investigate OSC are introduced, and ML approaches are reviewed. The dissertation then discusses the generation of large, high-quality datasets for NCI from symmetry-adapted perturbation theory (SAPT), and the development of ML models to efficiently predict NCI. An active learning approach for the high-throughput derivation of optimal training sets for NCI predictions is then developed, and the training set is used to train new ML models. Finally, ML models to predict EC from three-dimensional (3D) molecular dimer geometries are implemented for the rapid, on-the-fly prediction of ECs across thermally sampled conformations obtained through molecular dynamics (MD) simulations to enable rapid materials characterization during simulation. Ultimately, this dissertation presents a framework that integrates ML with quantum mechanical insights, offering a scalable solution to accelerate OSC discovery and optimization.
KEYWORDS: Organic Semiconductors (OSC), Density Functional Theory (DFT), Symmetry-Adapted Perturbation Theory (SAPT), Noncovalent Interactions (NCI), Electronic Couplings (EC), Machine Learning (ML).