RetroSnake: A modular pipeline to detect human endogenous retroviruses in genome sequencing data



Kabiljo, R ORCID: 0000-0002-5183-4844, Bowles, H, Marriott, H, Jones, AR, Bouton, CR, Dobson, RJB, Quinn, JP ORCID: 0000-0003-3551-7803, Al Khleifat, A, Swanson, CM, Al-Chalabi, A
et al (show 1 more authors) (2022) RetroSnake: A modular pipeline to detect human endogenous retroviruses in genome sequencing data Iscience, 25 (11). 105289-. ISSN 2589-0042, 2589-0042

[thumbnail of RetroSnake_ A modular pipeline to detect human endogenous retroviruses in genome sequencing data.pdf] PDF
RetroSnake_ A modular pipeline to detect human endogenous retroviruses in genome sequencing data.pdf - Published version

Download (2MB) | Preview

Abstract

Human endogenous retroviruses (HERVs) integrated into the human genome as a result of ancient exogenous infections and currently comprise ∼8% of our genome. The members of the most recently acquired HERV family, HERV-Ks, still retain the potential to produce viral molecules and have been linked to a wide range of diseases including cancer and neurodegeneration. Although a range of tools for HERV detection in NGS data exist, most of them lack wet lab validation and they do not cover all steps of the analysis. Here, we describe RetroSnake, an end-to-end, modular, computationally efficient, and customizable pipeline for the discovery of HERVs in short-read NGS data. RetroSnake is based on an extensively wet-lab validated protocol, it covers all steps of the analysis from raw data to the generation of annotated results presented as an interactive html file, and it is easy to use by life scientists without substantial computational training. Availability and implementation: The Pipeline and an extensive documentation are available on GitHub.

Item Type: Article
Uncontrolled Keywords: Biocomputational method, Bioinformatics, Sequence analysis
Divisions: Faculty of Health & Life Sciences
Faculty of Health & Life Sciences > Inst. Systems, Molec & Integrative Biology > Inst. Systems, Molec & Integrative Biology
Depositing User: Symplectic Admin
Date Deposited: 11 Nov 2022 09:37
Last Modified: 24 Jan 2026 03:56
DOI: 10.1016/j.isci.2022.105289
Related Websites:
URI: https://livrepository.liverpool.ac.uk/id/eprint/3166145
Disclaimer: The University of Liverpool is not responsible for content contained on other websites from links within repository metadata. Please contact us if you notice anything that appears incorrect or inappropriate.