To split or not to split: CASP15 targets and their processing into tertiary structure evaluation units



Kryshtafovych, Andriy and Rigden, Daniel J (ORCID: 0000-0002-7565-8937) (2023) To split or not to split: CASP15 targets and their processing into tertiary structure evaluation units. Proteins: Structure, Function, and Bioinformatics, 91 (12). pp. 1558-1570.

629407.pdf - Author Accepted Manuscript
Available under License Creative Commons Attribution.

ToSplitOrNotToSplit_v3.pdf - Author Accepted Manuscript
Available under License Creative Commons Attribution.

Abstract

This study presents the processing of CASP15 targets into evaluation units (EUs) and the assignment of those EUs to evolution-based prediction classes. The targets were first split into structural domains based on compactness and similarity to other proteins. Models were then evaluated against these domains and their combinations. Domains were joined into larger EUs if predictors' performance on the combined units was similar to that on the individual domains; if most predictors performed better on the individual domains, the domains were retained as separate EUs. As a result, 112 evaluation units were created from 77 tertiary structure prediction targets. The EUs were assigned to four prediction classes roughly corresponding to target difficulty categories in previous CASPs: TBM (template-based modeling, easy or hard), FM (free modeling), and the TBM/FM overlap category. More than a third of CASP15 EUs were attributed to the historically most challenging FM class, where homology or structural analogy to proteins of known fold cannot be detected.
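
The split-or-join rule described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' procedure: the abstract does not name the accuracy score, the similarity threshold, or what fraction counts as "most predictors". The function name `should_join`, the per-predictor score dictionaries, and the `tolerance` and `min_fraction` parameters are therefore all hypothetical choices made for illustration.

```python
from statistics import mean


def should_join(combined_scores, domain_scores, tolerance=5.0, min_fraction=0.5):
    """Decide whether domains should be merged into one evaluation unit.

    combined_scores: {predictor: score on the combined unit}
    domain_scores:   {predictor: [score on each individual domain]}
    tolerance, min_fraction: hypothetical values; the abstract gives none.
    """
    predictors = combined_scores.keys() & domain_scores.keys()
    if not predictors:
        raise ValueError("no predictors in common")

    # Count predictors that did clearly better on the separate domains
    # than on the combined unit.
    better_on_domains = sum(
        1
        for p in predictors
        if mean(domain_scores[p]) - combined_scores[p] > tolerance
    )

    # Join unless most predictors did better on the individual domains.
    return better_on_domains / len(predictors) <= min_fraction


# Made-up scores on a 0-100 scale (assumed here, not taken from the paper).
similar = should_join(
    {"A": 80, "B": 78, "C": 40},
    {"A": [82, 81], "B": [80, 79], "C": [70, 75]},
)
divergent = should_join(
    {"A": 40, "B": 45, "C": 80},
    {"A": [80, 85], "B": [75, 80], "C": [82, 81]},
)
print(similar)    # True: only one of three predictors gains from splitting
print(divergent)  # False: two of three predictors gain from splitting
```

In the first call only predictor C scores markedly better on the separate domains, so the domains would be joined into one EU; in the second call two of three predictors do, so the domains would stay separate EUs.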

Item Type: Article
Uncontrolled Keywords: CASP15, evaluation units, protein domains, protein structure, protein structure prediction
Divisions: Faculty of Health and Life Sciences
Faculty of Health and Life Sciences > Institute of Systems, Molecular and Integrative Biology
Depositing User: Symplectic Admin
Date Deposited: 21 Mar 2024 08:33
Last Modified: 21 Mar 2024 09:21
DOI: 10.1002/prot.26533
URI: https://livrepository.liverpool.ac.uk/id/eprint/3179496