Sequencing of high-complexity DNA pools for identification of nucleotide and structural variants in regions associated with complex traits

G Zaboli, A Ameur, W Igl, A Johansson, C Hayward, V Vitart, S Campbell, L Zgaga, O Polasek, G Schmitz, Cornelia Duijn, Ben Oostra, P Pramstaller, A Hicks, T Meitinger, I Rudan, A Wright, JF Wilson, H Campbell, U Gyllensten

Research output: Contribution to journalArticleAcademicpeer-review

6 Citations (Scopus)

Abstract

We have used targeted genomic sequencing of high-complexity DNA pools based on long-range PCR and deep DNA sequencing by the SOLiD technology. The method was used for sequencing of 286 kb from four chromosomal regions with quantitative trait loci (QTL) influencing blood plasma lipid and uric acid levels in DNA pools of 500 individuals from each of five European populations. The method shows very good precision in estimating allele frequencies as compared with individual genotyping of SNPs (r(2) = 0.95, P < 10(-16)). Validation shows that the method is able to identify novel SNPs and estimate their frequency in high-complexity DNA pools. In our five populations, 17% of all SNPs and 61% of structural variants are not available in the public databases. A large fraction of the novel variants show a limited geographic distribution, with 62% of the novel SNPs and 59% of novel structural variants being detected in only one of the populations. The large number of population-specific novel SNPs underscores the need for comprehensive sequencing of local populations in order to identify the causal variants of human traits. European Journal of Human Genetics (2012) 20, 77-83; doi:10.1038/ejhg.2011.138; published online 3 August 2011
Original languageUndefined/Unknown
Pages (from-to)77-83
Number of pages7
JournalEuropean Journal of Human Genetics
Volume20
Issue number1
DOIs
Publication statusPublished - 2012

Research programs

  • EMC MGC-02-96-01
  • EMC NIHES-01-64-02

Cite this