Explainable graph neural networks for organic cages.



Yuan, Qi ORCID: 0000-0002-7675-2642, Szczypiński, Filip T ORCID: 0000-0003-3174-8532 and Jelfs, Kim E ORCID: 0000-0001-7683-7630
(2022) Explainable graph neural networks for organic cages. Digital discovery, 1 (2). pp. 127-138.

Access the full-text of this item by clicking on the Open Access link.

Abstract

The development of accurate and explicable machine learning models to predict the properties of topologically complex systems is a challenge in materials science. Porous organic cages, a class of polycyclic molecular materials, have potential application in molecular separations, catalysis and encapsulation. For most applications of porous organic cages, having a permanent internal cavity in the absence of solvent, a property termed "shape persistence" is critical. Here, we report the development of Graph Neural Networks (GNNs) to predict the shape persistence of organic cages. Graph neural networks are a class of neural networks where the data, in our case that of organic cages, are represented by graphs. The performance of the GNN models was measured against a previously reported computational database of organic cages formed through a range of [4 + 6] reactions with a variety of reaction chemistries. The reported GNNs have an improved prediction accuracy and transferability compared to random forest predictions. Apart from the improvement in predictive power, we explored the explicability of the GNNs by computing the integrated gradient of the GNN input. The contribution of monomers and molecular fragments to the shape persistence of the organic cages could be quantitatively evaluated with integrated gradients. With the added explicability of the GNNs, it was possible not only to accurately predict the property of organic materials, but also to interpret the predictions of the deep learning models and provide structural insights for the discovery of future materials.

Item Type: Article
Divisions: Faculty of Science and Engineering > School of Physical Sciences
Depositing User: Symplectic Admin
Date Deposited: 16 Feb 2023 17:33
Last Modified: 16 Feb 2023 17:34
DOI: 10.1039/d1dd00039j
Open Access URL: https://doi.org/10.1039/D1DD00039J
Related URLs:
URI: https://livrepository.liverpool.ac.uk/id/eprint/3168463