Assessing the utility of CASP14 models for molecular replacement



Millan, Claudia, Keegan, Ronan M, Pereira, Joana, Sammito, Massimo D, Simpkin, Adam J, McCoy, Airlie J, Lupas, Andrei N, Hartmann, Marcus D, Rigden, Daniel J ORCID: 0000-0002-7565-8937 and Read, Randy J
(2021) Assessing the utility of CASP14 models for molecular replacement. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 89 (12). pp. 1752-1769.

Access the full-text of this item by clicking on the Open Access link.

Abstract

The assessment of CASP models for utility in molecular replacement is a measure of their use in a valuable real-world application. In CASP7, the metric for molecular replacement assessment involved full likelihood-based molecular replacement searches; however, this restricted the assessable targets to crystal structures with only one copy of the target in the asymmetric unit, and to those where the search found the correct pose. In CASP10, full molecular replacement searches were replaced by likelihood-based rigid-body refinement of models superimposed on the target using the LGA algorithm, with the metric being the refined log-likelihood-gain (LLG) score. This enabled multi-copy targets and very poor models to be evaluated, but a significant further issue remained: the requirement of diffraction data for assessment. We introduce here the relative-expected-LLG (reLLG), which is independent of diffraction data. This reLLG is also independent of any crystal form, and can be calculated regardless of the source of the target, be it X-ray, NMR or cryo-EM. We calibrate the reLLG against the LLG for targets in CASP14, showing that it is a robust measure of both model and group ranking. Like the LLG, the reLLG shows that accurate coordinate error estimates add substantial value to predicted models. We find that refinement by CASP groups can often convert an inadequate initial model into a successful MR search model. Consistent with findings from others, we show that the AlphaFold2 models are sufficiently good, and reliably so, to surpass other current model generation strategies for attempting molecular replacement phasing.

Item Type: Article
Uncontrolled Keywords: CASP, likelihood, model refinement, molecular replacement, structure prediction
Divisions: Faculty of Health and Life Sciences
Faculty of Health and Life Sciences > Institute of Systems, Molecular and Integrative Biology
Depositing User: Symplectic Admin
Date Deposited: 23 Dec 2021 08:34
Last Modified: 18 Jan 2023 21:18
DOI: 10.1002/prot.26214
Open Access URL: https://doi.org/10.1002/prot.26214
Related URLs:
URI: https://livrepository.liverpool.ac.uk/id/eprint/3145937