Generalized estimating equations for genome-wide association studies using longitudinal phenotype data

CM Sitlani, KM Rice, T Lumley, B McKnight, LA Cupples, CL Avery, Raymond Noordam, Bruno Stricker, EA Whitsel, BM Psaty

Research output: Contribution to journalArticleAcademicpeer-review

36 Citations (Scopus)


Many longitudinal cohort studies have both genome-wide measures of genetic variation and repeated measures of phenotypes and environmental exposures. Genome-wide association study analyses have typically used only cross-sectional data to evaluate quantitative phenotypes and binary traits. Incorporation of repeated measures may increase power to detect associations, but also requires specialized analysis methods. Here, we discuss one such methodgeneralized estimating equations (GEE)in the contexts of analysis of main effects of rare genetic variants and analysis of gene-environment interactions. We illustrate the potential for increased power using GEE analyses instead of cross-sectional analyses. We also address challenges that arise, such as the need for small-sample corrections when the minor allele frequency of a genetic variant and/or the prevalence of an environmental exposure is low. To illustrate methods for detection of gene-drug interactions on a genome-wide scale, using repeated measures data, we conduct single-study analyses and meta-analyses across studies in three large cohort studies participating in the Cohorts for Heart and Aging Research in Genomic Epidemiology consortiumthe Atherosclerosis Risk in Communities study, the Cardiovascular Health Study, and the Rotterdam Study. Copyright (c) 2014 John Wiley & Sons, Ltd.
Original languageUndefined/Unknown
Pages (from-to)118-130
Number of pages13
JournalStatistics in Medicine
Issue number1
Publication statusPublished - 2015

Research programs

  • EMC NIHES-01-64-02

Cite this