New technologies for high throughput genetic analysis of complex genomes



Gardiner, Laura-Jayne
New technologies for high throughput genetic analysis of complex genomes. PhD thesis, University of Liverpool.

[thumbnail of Full_thesis_edit.pdf] Text
Full_thesis_edit.pdf - Unspecified
Access to this file is embargoed until Unspecified.
After the embargo period this will be available under License Creative Commons Attribution.

Download (23MB) | Request a copy
[thumbnail of GardinerLau_July2014_2002679.pdf] Text
GardinerLau_July2014_2002679.pdf - Unspecified
Available under License Creative Commons Attribution.

Download (23MB)

Abstract

High throughput sequencing can generate hundreds of millions of reads in a single day and is revolutionizing modern genetics. This project aimed to utilize next generation genetic approaches to analyze non-model but important agronomical plant species. A key feature of these species is their complexity. Mapping and SNP calling of these sequencing datasets is fundamental to many downstream analyses that have been implemented here including; mutant identification, comparative analyses between related organisms and epigenetic studies. The first objective in this project involved developing accelerated mutant identification techniques using mapping-by-sequencing analyses that combine whole genome sequencing with genetic mapping. Such methods have largely required a complete reference sequence and are typically implemented on a mapping population with a common mutant phenotype of interest. Here mutant identification was demonstrated on the model diploid plant Arabidopsis thaliana as a proof of principle of the methodology. It was also demonstrated on a simulated hexaploid mutant that was developed using the Arabidopsis reference genome. In species such as wheat, no finished genome reference sequence is available and, due to its large genome size (17 Gb), re-sequencing at sufficient depth of coverage is not practical. Therefore a genomic target enrichment approach was validated and used here to capture the gene rich regions of hexaploid bread wheat, reducing the sequencing cost while still allowing analysis of the majority of wheat’s genic sequence. A pseudo-chromosome based reference sequence was developed from this genic sequence with a long-range order of genes based on synteny of wheat with Brachypodium distachyon. Using the capture probe set for target enrichment followed by next generation sequencing; an early flowering locus was mapped in the diploid wheat Triticum monococcum and in hexaploid bread wheat Triticum aestivum, the stripe rust resistance gene was located. A bespoke pipeline and algorithm was developed for mutant loci identification and the pseudo-chromosome reference was implemented. This novel method will allow widespread application of sliding window mapping-by-sequencing analyses to datasets that are; enriched, lacking a finished reference genome or polyploid. The second main objective of this project involved a study of methylation patterns in wheat utilizing sodium bisulfite treatment, combined with target enrichment. An enrichment system was specifically designed, developed, validated and implemented here to perform one of the first studies of methylation patterns in hexaploid bread wheat across the 3 genomes that used a genome-wide subset of genes and can thus be used to infer genome-wide methylation patterns and observations. This investigation confirmed that differential methylation exists between the A, B and D genomes of wheat and that temperature is capable of altering methylation states.

Item Type: Thesis (PhD)
Additional Information: Appendix files available on disc by request (large size) Date: 2014-07-30 (completed)
Subjects: ?? QH301 ??
?? QH426 ??
Depositing User: Symplectic Admin
Date Deposited: 27 Aug 2015 12:27
Last Modified: 17 Dec 2022 01:33
DOI: 10.17638/02002679
URI: https://livrepository.liverpool.ac.uk/id/eprint/2002679