AMP T2D-GENES T2D exome sequence analysis: European ancestry

AMP T2D-GENES logo

This dataset represents the European subset of the AMP T2D-GENES exome sequence analysis dataset.

The 13k exome sequence analysis dataset is a subset of the AMP T2D-GENES exome sequence analysis dataset.

Download summary statistics for the SIGMA cohorts (Hispanic ancestry) | README

Publications

Exome sequencing of 20,791 cases of type 2 diabetes and 24,440 controls.
Flannick J, Mercader JM, Fuchsberger C, Udler MS, Mahajan A, et al.
Nature. 2019 May 22. doi: 10.1038/s41586-019-1231-2

Sequence data and association statistics from 12,940 type 2 diabetes cases and controls.
Flannick J, Fuchsberger C, Mahajan A, et al.
Sci Data. 2017 Dec 19;4:170179. doi: 10.1038/sdata.2017.179

The genetic architecture of type 2 diabetes.
Fuchsberger C, Flannick J, Teslovich TM, Mahajan A, Agarwala V, Gaulton KJ, et al.
Nature 2016 Aug 4;536(7614):41-7. doi: 10.1038/nature18642

Association of a low-frequency variant in HNF1A with type 2 diabetes in a Latino population.
SIGMA Type 2 Diabetes Consortium, et al.
JAMA. 2014 Jun 11;311(22):2305-14. doi: 10.1001/jama.2014.6511

Whole-exome sequencing of 2,000 Danish individuals and the role of rare coding variants in type 2 diabetes.
Lohmueller KE, et al.
Am J Hum Genet. 2013 Dec 5;93(6):1072-86. doi: 10.1016/j.ajhg.2013.11.005

Phenotypes

type 2 diabetes

Subjects

Project

Cases

Controls

Cohort (Click to view selection criteria for cases and controls)

Ancestry

GoT2D

1,289

1,251

Multiple cohorts

Finland-United States Investigation of NIDDM Genetics (FUSION) Study

Case selection criteria

Control selection criteria

Unrelated cases selected from FUSION families and stage 2 replication
Samples met 1999 World Health Organization (WHO) criteria of fasting plasma glucose ≥ 7.0 mmol/l or postload glucose during an OGTT ≥ 11.1 mmol/l, by report of diabetes medication use, or based on medical record review
Prioritized FUSION families with ≥ 2 first-degree relatives with T2D; BMI ≥ 18.5kg/m2; case with GWAS data or earliest age at onset, if no GWAS data available
Prioritized FUSION stage 2 replication set with Metabochip data; BMI ≥ 18.5kg/m2; earliest age of onset; age of onset ≥ 35

Unrelated controls with normal glucose tolerance (NGT) based on WHO (1999) definitions: fasting plasma glucose <6.1 mM and 2 hour postload glucose during an OGTT > 7.8 mM
Frequency matched to cases by birth province; BMI ≥ 18.5kg/m2; age ≤ 80
Within each birth province, prioritized samples from stage 2 replication with highest values for age + 2*BMI

Malmö-Botnia Study

Case selection criteria

Control selection criteria

A liability score was generated (Guey LT et al. 2011) which measures risk to T2D in the context of three known risk factors (age at onset, BMI, and gender) in 27,500 individuals drawn from three prospective cohorts: the Malmö Preventive Project, the Scania Diabetes Registry, and the Botnia Study; only BMI and gender used to construct scores for Scania and Botnia studies
Eligible cases limited to individuals between 35 and 60 years of age and with a BMI between 20 and 35
To match for ethnicity, 250 Botnia cases with the most extreme liability scores were selected, while 125 cases were selected from each of the Scania and Malmö studies

Controls selected from the extreme of a liability score distribution, based upon gender, age and BMI at last follow-up visit; only BMI and gender used to construct scores for Malmö study
Eligible controls limited to individuals above 35 years of age at follow-up and with a BMI between 20 and 40
To match for ethnicity, equal numbers of controls were selected from the Botnia and Malmö studies

UK Type 2 Diabetes Genetics Consortium (UKT2D)

Case selection criteria

Control selection criteria

Cases drawn from the Wellcome Trust Case Control Consortium (WTCCC)
Female samples with age of diagnosis ≥ 66 years or BMI ≥ 32kg/m2 excluded; male samples with age of diagnosis ≥ 62 years or BMI ≥ 31kg/m2 excluded
Remaining samples were ranked by age and BMI, and the two ranks multiplied. 356 samples with the lowest values for this rank multiplier were selected for initial inclusion in the study

Unrelated samples selected as controls from the Twins UK study
A twin pair was considered for selection if there was no recorded family history of diabetes, neither twin was ever recorded as impaired glucose tolerant (defined as fasting glucose >6.1mmol/L in any reading), there were available quantitative trait and genetic (GWAs) data, and no evidence of admixture in MDS analysis of GWAs data
From set of qualifying twin pairs, the best control twin was selected from each pair with the lowest ratio of fasting glucose level to BMI across all readings, and further prioritization of the qualifying unrelated samples involved selecting samples that had the lowest fasting glucose to (BMI * age) ratios
Top two principal components were used to perform pairwise sample matching between cases and possible controls, and the best control for each case was selected

KORAgen Study Helmholtz zentrum München (KORA)

Case selection criteria	Control selection criteria
Samples drawn from KORA F3 and F4 Diabetic status validated by doctor or by medication use Cases have ≥ 1 first degree relative with type 2 diabetes (self-reported) Cases have either BMI ≤ 30 and age of onset < 65, or BMI ≤ 33 and age of onset ≤ 60	Controls selected from KORA F4 All controls are normal glucose tolerant: fasting glucose level < 6.1 mmol/l and two hour glucose level after oral glucose tolerance test < 7.8 mmol/l Controls are either > 60 years of age with BMI > 32 or over 65 years of age with BMI > 31

European

T2D-GENES

478

498

Metabolic Syndrome in Men Study (METSIM)

European

T2D-GENES

506

360

Ashkenazi

European

LuCAMP

992

985

Lundbeck Foundation Centre for Applied Medical Genomics in Personalised Disease Prediction, Prevention, and Care

European

T2D-GENES

949

943

Genetics of Diabetes and Audit Research Tayside Study (GoDARTS)

European

T2D-GENES

390

589

Framingham Heart Study (FHS)

European

Projects

Genetics of Type 2 Diabetes (GoT2D) Learn more >

The GoT2D consortium aims to understand the allelic architecture of type 2 diabetes through whole-genome sequencing, high-density SNP genotyping, and imputation. The reference panel based on this work is intended as a comprehensive inventory of low-frequency variants in Europeans, including SNPs, small insertions and deletions, and structural variants.

Lubeck Foundation Centre for Applied Medical Genomics in Personalised Disease Prediction, Prevention and Care (LuCamp) Learn more >

The LuCamp research consortium aims to discover and characterize novel variation in the human genome conferring an increased risk of visceral obesity, type 2 diabetes and hypertension and eventually premature cardiovascular disorders and death; discover and characterize novel variation in the human gut microbiome influencing metabolic and cardiovascular health; investigate how the novel molecular signatures interact mutually and with other risk markers, particularly in health behavior, influencing the risk of developing widespread metabolic disorders; and investigate how these discoveries may help improve metabolic and cardiovascular health in the at-risk population.

Type 2 Diabetes Genetic Exploration by Next-Generation Sequencing in Ethnic Samples (T2D-GENES) Learn more >

T2D-GENES (Type 2 Diabetes Genetic Exploration by Next-generation sequencing in multi-Ethnic Samples) is a large collaborative effort to find genetic variants that influence risk of type 2 diabetes. With funding from NIDDK, the group is pursuing three projects: (1) deep whole-exome sequencing in 10,000 people from five ethnicities (African-American, East Asian, South Asian, European, and Hispanic); (2) deep whole-genome sequencing of 600 individuals selected from extended Mexican American pedigrees; and (3) a trans-ethnic fine-mapping "mega-meta-analysis."

Overview of analysis and results

Two single-variant association analyses were conducted for each of the 25 sample sub-groups. For both analyses, all all non-reference alleles at multiallelic sites into a single “non-reference” allele. In the first analysis, all (including related) samples were analyzed using the EMMAX test, as implemented in the EPACTS software package, using the GRM computed from the ancestry-specific ancestry variants. Covariates for sequencing technology were included in the model where appropriate, but covariates for PCs of genetic ancestry, age, sex, or BMI were not included.

In the second analysis, unrelated samples were analyzed via the Firth logistic regression test, also as implemented in EPACTS; covariates for sequencing technology and for PCs of genetic ancestry (computed from the ancestry-specific ancestry variants) were included in the model. The number of PCs included varied by subgroup; to select the PCs to be included, we T2D status was regressed on sequencing technology and the first ten PCs. Any PC that demonstrated nominal (p<0.05) association with T2D, as well as all higher-order PCs, were included in the model.

For each of the 25×2=50 single-variant analyses, QQ plots of variant association statistics were inspected, and the stringency of the variant filters was increased if the distribution of association statistics appeared poorly calibrated. A 25-group fixed-effect inverse-variance weighted meta-analysis was then conducted for each of the Firth and EMMAX tests, using METAL. EMMAX results were used for association p-values and Firth results for effect size estimates.

Dataset ID

ExSeq_52k_eu