PSI-MOUSE: Predicting Mouse Pseudouridine Sites From Sequence and Genome-Derived Features



Song, Bowen, Chen, Kunqi, Tang, Yujiao, Ma, Jialin, Meng, Jia ORCID: 0000-0003-3455-205X and Wei, Zhen
(2020) PSI-MOUSE: Predicting Mouse Pseudouridine Sites From Sequence and Genome-Derived Features. EVOLUTIONARY BIOINFORMATICS, 16. 1176934320925752-.

Access the full-text of this item by clicking on the Open Access link.

Abstract

Pseudouridine (Ψ) is the first discovered and the most prevalent posttranscriptional modification, which has been widely studied during the past decades. Pseudouridine was observed in almost all kinds of RNAs and shown to have important biological functions. Currently, the time-consuming and high-cost procedures of experimental approaches limit its uses in real-life Ψ site detection. Alternatively, by taking advantage of the explosive growth of Ψ sequencing data, the computational methods may provide a more cost-effective avenue. To date, the existing mouse Ψ site predictors were all developed based on sequence-derived features, and their performance can be further improved by adding the domain knowledge derived feature. Therefore, it is highly desirable to propose a genomic feature-based computational method to increase the accuracy and efficiency of the identification of Ψ RNA modification in the mouse transcriptome. In our study, a predictive framework PSI-MOUSE was built. Besides the conventional sequence-based features, PSI-MOUSE first introduced 38 additional genomic features derived from the mouse genome, which achieved a satisfactory improvement in the prediction performance, compared with other existing models. Moreover, PSI-MOUSE also features in automatically annotating the putative Ψ sites with diverse types of posttranscriptional regulations (RNA-binding protein [RBP]-binding regions, miRNA-RNA interactions, and splicing sites), which can serve as a useful research tool for the study of Ψ RNA modification in the mouse genome. Finally, 3282 experimentally validated mouse Ψ sites were also collected in a database with customized query functions. For the convenience of academic users, a website was built to provide a user-friendly interface for the query and analysis on the database. The website is freely accessible at www.xjtlu.edu.cn/biologicalsciences/psimouse and http://psimouse.rnamd.com. We introduced the genome-derived features to mouse for the first time, and we achieved a good performance in mouse Ψ site prediction. Compared with the existing state-of-art methods, our newly developed approach PSI-MOUSE obtained a substantial improvement in prediction accuracy, marking the reliable contributions of genomic features for the prediction of RNA modifications in a species other than human.

Item Type: Article
Uncontrolled Keywords: Pseudouridine sites, genomic feature, web-server
Divisions: Faculty of Science and Engineering > School of Electrical Engineering, Electronics and Computer Science
Depositing User: Symplectic Admin
Date Deposited: 17 Aug 2021 07:29
Last Modified: 18 Jan 2023 21:33
DOI: 10.1177/1176934320925752
Open Access URL: https://doi.org/10.1177%2F1176934320925752
Related URLs:
URI: https://livrepository.liverpool.ac.uk/id/eprint/3133787