Data Descriptor: Sequence data and association statistics from 12,940 type 2 diabetes cases and controls

Flannick, Jason, Fuchsberger, Christian, Mahajan, Anubha, Teslovich, Tanya M, Agarwala, Vineeta, Gaulton, Kyle J, Caulkins, Lizz, Koesterer, Ryan, Ma, Clement, Moutsianas, Loukas
et al (show 291 more authors) (2017) Data Descriptor: Sequence data and association statistics from 12,940 type 2 diabetes cases and controls. SCIENTIFIC DATA, 4 (1). 170179-.

Access the full-text of this item by clicking on the Open Access link.


To investigate the genetic basis of type 2 diabetes (T2D) to high resolution, the GoT2D and T2D-GENES consortia catalogued variation from whole-genome sequencing of 2,657 European individuals and exome sequencing of 12,940 individuals of multiple ancestries. Over 27M SNPs, indels, and structural variants were identified, including 99% of low-frequency (minor allele frequency [MAF] 0.1-5%) non-coding variants in the whole-genome sequenced individuals and 99.7% of low-frequency coding variants in the whole-exome sequenced individuals. Each variant was tested for association with T2D in the sequenced individuals, and, to increase power, most were tested in larger numbers of individuals (>80% of low-frequency coding variants in ~82 K Europeans via the exome chip, and ~90% of low-frequency non-coding variants in ~44 K Europeans via genotype imputation). The variants, genotypes, and association statistics from these analyses provide the largest reference to date of human genetic information relevant to T2D, for use in activities such as T2D-focused genotype imputation, functional characterization of variants or genes, and other novel analyses to detect associations between sequence variation and T2D.

Item Type: Article
Uncontrolled Keywords: Humans, Diabetes Mellitus, Type 2, Genetic Variation, White People
Depositing User: Symplectic Admin
Date Deposited: 22 Dec 2017 11:39
Last Modified: 19 Jan 2023 06:46
DOI: 10.1038/sdata.2017.179
Open Access URL:
Related URLs: