genetics studies of comorbidity

69
Genetics Studies of Comorbidity Heping Zhang Department of Epidemiology and Public Health Yale University School of Medicine Presented at Science at the Edge Michigan State University January 27, 2012 Heping Zhang (C 2 S 2 , Yale University) MSU 1 / 67

Upload: others

Post on 24-Mar-2022

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Genetics Studies of Comorbidity

Genetics Studies of Comorbidity

Heping Zhang

Department of Epidemiology and Public HealthYale University School of Medicine

Presented atScience at the Edge

Michigan State University

January 27, 2012

Heping Zhang (C2S2, Yale University) MSU 1 / 67

Page 2: Genetics Studies of Comorbidity

Collaborators

Dr. Xiang Chen, St. Jude HospitalDr. Kelly Cho, Harvard UniversityDr. Yuan Jiang, Yale UniversityDr. Ching-Ti Liu, Boston UniversityDr. Xueqin Wang, Sun Yat-Sen University, ChinaDr. Wensheng Zhu, Northeastern Normal University, China

Heping Zhang (C2S2, Yale University) MSU 2 / 67

Page 3: Genetics Studies of Comorbidity

Outline

1 BackgroundComorbidityDisorders, Genes and Covariates

2 Weighted Association TestGeneralized Kendall’s TauAsymptotic Distribution and PowerApplication to WTCCC Bipolar Disorder Data

3 Maximum Weighted Association TestAsymptotic DistributionSimulations

4 Analysis of ComorbidityApplication to COGA Family DataApplication to SAGE GWAS Data

5 Conclusions

Heping Zhang (C2S2, Yale University) MSU 3 / 67

Page 4: Genetics Studies of Comorbidity

Outline

1 BackgroundComorbidityDisorders, Genes and Covariates

2 Weighted Association TestGeneralized Kendall’s TauAsymptotic Distribution and PowerApplication to WTCCC Bipolar Disorder Data

3 Maximum Weighted Association TestAsymptotic DistributionSimulations

4 Analysis of ComorbidityApplication to COGA Family DataApplication to SAGE GWAS Data

5 Conclusions

Heping Zhang (C2S2, Yale University) MSU 4 / 67

Page 5: Genetics Studies of Comorbidity

Comorbidity

Multiple disorders or illnesses occur in the same person,simultaneously or sequentially

Source: www.depressioncell.com; www.depressiondodging.com

Heping Zhang (C2S2, Yale University) MSU 5 / 67

Page 6: Genetics Studies of Comorbidity

Comorbidity

Source: aasets.lifehack.org

Heping Zhang (C2S2, Yale University) MSU 6 / 67

Page 7: Genetics Studies of Comorbidity

Comorbidity

What Is Comorbidity?

from the director:

Comorbidity is a topic that our stakeholders––patients, family members, health care professionals, and others––frequently ask about. It is also a topic about which we have insufficient information, so it remains a research priority for NIDA. This Research Report provides information on the state of the science in this area. Although a variety of diseases commonly co-occur with drug abuse and addiction (e.g., HIV, hepatitis C, cancer, cardiovascular disease), this report focuses only on the comorbidity of drug use disorders and other mental illnesses.*

To help explain this comorbidity, we need to first recognize that drug addiction is a mental illness. It is a complex brain disease characterized by compulsive, at times uncontrollable drug craving, seeking, and use despite devastating consequences—behaviors that stem from drug-induced changes in brain structure and function. These changes occur in some of the same brain areas that are disrupted in other mental disorders, such as depression, anxiety, or schizophrenia. It is therefore not surprising that population surveys show a high rate of co-occurrence, or comorbidity, between drug addiction and other mental illnesses. While we cannot always prove a connection or causality, we do know that certain mental disorders are established risk factors for subsequent drug abuse—and vice versa.

It is often difficult to disentangle the overlapping symptoms of drug addiction and other mental illnesses, making diagnosis and treatment complex. Correct diagnosis is critical to ensuring appropriate and effective treatment. Ignorance of or failure to treat a comorbid disorder can jeopardize a patient’s chance of recovery. We hope that our enhanced understanding of the common genetic, environmental, and neural bases of these disorders—and the dissemination of this information—will lead to improved treatments for comorbidity and will diminish the social stigma that makes patients reluctant to seek the treatment they need.

Nora D. Volkow, M.D. Director National Institute on Drug Abuse

Comorbidity: Addiction and Other Mental Illnesses

Is there a relationship between childhood ADHD and later drug abuse? See page 2.

When two disorders or illnesses occur in the same person, simultaneously or sequentially, they are described as comorbid. Comorbidity also

implies interactions between the illnesses that affect the course and prognosis of both.

*Since the focus of this report is on comorbid drug use disorders and other mental illnesses, the terms “mental illness” and “mental disorders” will refer here to disorders other than substance use disorders, such as depression, schizophrenia, anxiety, and mania. The terms “dual diagnosis,” “mentally ill chemical abuser,” and “co-occurrence” are also used to refer to drug use disorders that are comorbid with other mental illnesses.

continued inside

Dr. Volkow, Director, NIDA:Comorbidity is a topic thatour stakeholders–patients,family members, health careprofessionals, andothers–frequently ask about.It is also a topic about whichwe have insufficientinformation, so it remains aresearch priority for NIDA.Source: www.nida.nih.gov

Heping Zhang (C2S2, Yale University) MSU 7 / 67

Page 8: Genetics Studies of Comorbidity

Possible Mechanisms for Comorbidity

Mental disorder⇒ drug use disorderDrug use disorder⇒ mental disorder

Common etiology⇒{

mental disorderdrug use disorder

Source: www.nida.nih.gov

Heping Zhang (C2S2, Yale University) MSU 8 / 67

Page 9: Genetics Studies of Comorbidity

Possible Mechanisms for Comorbidity

Mental disorder⇒ drug use disorderDrug use disorder⇒ mental disorder

Common etiology⇒{

mental disorderdrug use disorder

Source: www.nida.nih.gov

Heping Zhang (C2S2, Yale University) MSU 8 / 67

Page 10: Genetics Studies of Comorbidity

Possible Mechanisms for Comorbidity

Mental disorder⇒ drug use disorderDrug use disorder⇒ mental disorder

Common etiology⇒{

mental disorderdrug use disorder

Source: www.nida.nih.gov

Heping Zhang (C2S2, Yale University) MSU 8 / 67

Page 11: Genetics Studies of Comorbidity

Outline

1 BackgroundComorbidityDisorders, Genes and Covariates

2 Weighted Association TestGeneralized Kendall’s TauAsymptotic Distribution and PowerApplication to WTCCC Bipolar Disorder Data

3 Maximum Weighted Association TestAsymptotic DistributionSimulations

4 Analysis of ComorbidityApplication to COGA Family DataApplication to SAGE GWAS Data

5 Conclusions

Heping Zhang (C2S2, Yale University) MSU 9 / 67

Page 12: Genetics Studies of Comorbidity

Genotypes and Covariates

Source: en.wikipedia.org; 2010.census.gov

Heping Zhang (C2S2, Yale University) MSU 10 / 67

Page 13: Genetics Studies of Comorbidity

Disorders, Genes and Covariates

Covariates: interact or confound genetic effectsFailure to account for covariates: bias or reduced power

Heping Zhang (C2S2, Yale University) MSU 11 / 67

Page 14: Genetics Studies of Comorbidity

Study Design

Population-based studies

Family-based studies

Heping Zhang (C2S2, Yale University) MSU 12 / 67

Page 15: Genetics Studies of Comorbidity

Outline

1 BackgroundComorbidityDisorders, Genes and Covariates

2 Weighted Association TestGeneralized Kendall’s TauAsymptotic Distribution and PowerApplication to WTCCC Bipolar Disorder Data

3 Maximum Weighted Association TestAsymptotic DistributionSimulations

4 Analysis of ComorbidityApplication to COGA Family DataApplication to SAGE GWAS Data

5 Conclusions

Heping Zhang (C2S2, Yale University) MSU 13 / 67

Page 16: Genetics Studies of Comorbidity

Notation and Hypothesis

n study subjects, from a population-based study or family-basedstudyFor each subject:

A vector of traits T = (T(1), . . . ,T(p))′

Marker genotype MParental marker genotypes Mpa (only available in a family-basedstudy)A vector of covariates Z = (Z(1), . . . ,Z(l))′

Null hypothesis: no association between marker alleles and anylinked locus that influences traits T

Heping Zhang (C2S2, Yale University) MSU 14 / 67

Page 17: Genetics Studies of Comorbidity

Typical Response

Fagerstrom Test for Nicotine Dependence

Heping Zhang (C2S2, Yale University) MSU 15 / 67

Page 18: Genetics Studies of Comorbidity

Multivariate Distributions

∏t

nt!∏a nt,a!

∏a

pnt,at,a

exp {−12(x− µ)′Σ−1(x− µ)}√

(2π)n|Σ|

Heping Zhang (C2S2, Yale University) MSU 16 / 67

Page 19: Genetics Studies of Comorbidity

Kendall’s Tau

A nonparametric statistic measuring the rank correlation betweentwo variablesPairs of observations: {(Xi,Yi) : i = 1, . . . , n}(Xi,Yi) and (Xj,Yj):

Concordant, if Xi − Xj and Yi − Yj have the same signDisconcordant, if Xi − Xj and Yi − Yj have the different sign

Kendall’s tau:τ = 2(A− B)/{n(n− 1)}

A and B: numbers of concordant and disconcordant pairsOr

τ =

(n2

)−1∑i<j

sign{(Xi − Xj)(Yi − Yj)}

Heping Zhang (C2S2, Yale University) MSU 17 / 67

Page 20: Genetics Studies of Comorbidity

Generalized Kendall’s Tau

Fij = {f1(T(1)i − T(1)

j ), . . . , fp(T(p)i − T(p)

j )}′

fk(·): identity function for a quantitative or binary traitfk(·): sign function for an ordinal trait

Dij = Ci − Cj. C: number of any chosen allele in marker genotypeM

Genaralized Kendall’s tau (Zhang, Liu and Wang, 2010):

U =

(n2

)−1∑i<j

DijFij

Special cases:FBAT-GEE (Lange et al. 2003)Test for a single ordinal trait (Wang, Ye and Zhang, 2006)

Heping Zhang (C2S2, Yale University) MSU 18 / 67

Page 21: Genetics Studies of Comorbidity

Outline

1 BackgroundComorbidityDisorders, Genes and Covariates

2 Weighted Association TestGeneralized Kendall’s TauAsymptotic Distribution and PowerApplication to WTCCC Bipolar Disorder Data

3 Maximum Weighted Association TestAsymptotic DistributionSimulations

4 Analysis of ComorbidityApplication to COGA Family DataApplication to SAGE GWAS Data

5 Conclusions

Heping Zhang (C2S2, Yale University) MSU 19 / 67

Page 22: Genetics Studies of Comorbidity

Hypothesis with Covariates

New null hypothesis: no association between marker alleles andany linked locus that influences traits T conditional on covariates Z

Heping Zhang (C2S2, Yale University) MSU 20 / 67

Page 23: Genetics Studies of Comorbidity

Weighted Test

A weight function w(Zi,Zj) imposes a relatively large weight whenZi is close to Zj, and a relatively small weight when Zi and Zj arefar awayWeighted U-statistic:

S =

(n2

)−1∑i<j

DijFijw(Zi,Zj)

Weighted test statistic:

χ2τ = {S− E0(S)}′Var−1

0 (S){S− E0(S)}

Heping Zhang (C2S2, Yale University) MSU 21 / 67

Page 24: Genetics Studies of Comorbidity

Weight Function–I: Distance

Write Z = (Zco′,Zca′)′, with Zco for the continuous covariates andZca for the categorical covariates

w(Zi,Zj; h, q) = Wh(‖Zcoi − Zco

j ‖)Wq{I(Zcai 6= Zca

j )}

For example,Wh(u) = exp(−u2/2h2), h > 0,

Wq(v) = (1− q)I(v = 0) + qI(v = 1), 0 ≤ q ≤ 0.5

Weighted U-statistic (called fixed-(h, q) U-statistic):

S(h, q) =

(n2

)−1∑i<j

DijFijw(Zi,Zj; h, q)

Heping Zhang (C2S2, Yale University) MSU 22 / 67

Page 25: Genetics Studies of Comorbidity

Weight Function–II: Propensity Score

Propensity score: probability of a unit being assigned to aparticular treatment given a set of covariatesCausal effect analysis: match subjects according to theirpropensity scores (Rosenbaum and Rubin, 1984)Genomic propensity score: p(z) = {p1(z), p2(z)}′,pc(z) = P(C = c|Z = z)

Genetic association analysis: match subjects according to theirgenomic propensity scoresWeight function:

w(Zi,Zj) = Wh{‖p(Zi)− p(Zj)‖},

with Wh(u) = exp(−u2/2h2), h > 0

Heping Zhang (C2S2, Yale University) MSU 23 / 67

Page 26: Genetics Studies of Comorbidity

Asymptotic Distribution: Setting

Treating the offspring genotype as randomConditioning on all phenotypes and parental genotypes (ifavailable)Eliminates the assumptions about phenotype distribution, geneticmodel and parental genotype distributionRobust and less prone to population stratificationIn addition, conditioning on covariates

Heping Zhang (C2S2, Yale University) MSU 24 / 67

Page 27: Genetics Studies of Comorbidity

Asymptotic Distribution: Null Hypothesis

When n→∞,

Var−1/20 {S(h, q)}[S(h, q)− E0{S(h, q)}] D−→ N(0, Ip)

Fixed-(h, q) test statistic:

χ2τ (h, q)

D−→ χ2p

Mean and variance:

E0{S(h, q)} =2

n− 1

n∑i=1

uiE0(Ci|Mpai ,Zi),

Var0{S(h, q)} =4

(n− 1)2

n∑i=1

n∑i=1

uiu′jCov0(Ci,Cj|Mpai ,M

paj ,Zi,Zj),

with ui = n−1∑nj=1 Fijw(Zi,Zj; h, q)

Heping Zhang (C2S2, Yale University) MSU 25 / 67

Page 28: Genetics Studies of Comorbidity

Asymptotic Distribution: Power

Under the alternative hypothesis,

χ2τ (h, q) ∼

p∑i=1

eiχ21(φi)

∆µ = µ1 − µ0 ≡ E1{S(h, q)} − E0{S(h, q)}Σ0 = Var0{S(h, q)}Σ1 = Var1{S(h, q)}e1 ≥ · · · ≥ ep ≥ 0: eigenvalues of Σ1/2

1 Σ−10 Σ

1/21

φi = ∆µ2i

∆µi: ith component of ∆µ = QΣ−1/21 ∆µ

Q: an orthonormal matrix, QΣ1/21 Σ−1

0 Σ1/21 Q′ = diag(e1, . . . , ep)

Heping Zhang (C2S2, Yale University) MSU 26 / 67

Page 29: Genetics Studies of Comorbidity

Factors Determining the Power

The conditional power P: P = P{∑p

i=1 eiχ21(φi) ≥ qχ2

p(1− α)

}Taking a family-based study as an example,

µ1 =2

n− 1

n∑i=1

uiE(Ci|Ti,Zi,Mpai )

Σ1 =4

(n− 1)2

n∑i=1

n∑j=1

uiu′jCov(Ci,Cj|Ti,Tj,Zi,Zj,Mpai ,M

paj )

By Bayes’ theorem, P(C = c|T,Z,Mpa) = P(T|C=c,Z)P(C=c|Mpa)∑c′ P(T|C=c′,Z)P(C=c′|Mpa)

Penetrance: P(T|C = c,Z)Allele frequency: P(C = c|Mpa)

Heping Zhang (C2S2, Yale University) MSU 27 / 67

Page 30: Genetics Studies of Comorbidity

Power Approximation

Using the result from Liu et al. (2009), we have

Theorem

P ≈ P{χ2l (ν) ≥ q∗},

where l, ν, and q∗ depend on µ1 and Σ1.

Heping Zhang (C2S2, Yale University) MSU 28 / 67

Page 31: Genetics Studies of Comorbidity

Outline

1 BackgroundComorbidityDisorders, Genes and Covariates

2 Weighted Association TestGeneralized Kendall’s TauAsymptotic Distribution and PowerApplication to WTCCC Bipolar Disorder Data

3 Maximum Weighted Association TestAsymptotic DistributionSimulations

4 Analysis of ComorbidityApplication to COGA Family DataApplication to SAGE GWAS Data

5 Conclusions

Heping Zhang (C2S2, Yale University) MSU 29 / 67

Page 32: Genetics Studies of Comorbidity

WTCCC Bipolar Disorder Data

Collected by Wellcome Trust Case-Control Consortium (WTCCC,2007, Nature)

Phenotype: 1998 cases/3004 controls of bipolar disorderGenotype: genotyped by Affymetrix GeneChip 500K arraysCovariates: gender, age at recruitment

Our method: weighted test using propensity score approach(h = 1)Methods for comparison: non-weighted test and logisticregressionStrong association: p-value < 5× 10−7; moderate association:5× 10−7 < p-value < 10−5

Heping Zhang (C2S2, Yale University) MSU 30 / 67

Page 33: Genetics Studies of Comorbidity

Manhattan Plot: Comparison of Three Methods

Heping Zhang (C2S2, Yale University) MSU 31 / 67

Page 34: Genetics Studies of Comorbidity

GWAS Results

Chr. SNP Position Non-weighted Weighted Logistic Regression6 rs9378249 31435680 1.21e-8 1.39e-8 1.71e-9

16 rs420259 23541527 8.51e-9 6.59e-8 3.33e-916 rs2387823 51445620 2.90e-6 1.30e-7 1.77e-616 rs1344485 51469833 1.78e-6 1.79e-7 1.41e-616 rs11647459 51473252 2.93e-6 2.76e-7 1.89e-617 rs12938916 53221286 4.80e-7 1.11e-6 8.89e-720 rs4815603 3720527 3.00e-6 1.42e-5 4.80e-720 rs37612181 3724175 1.13e-6 3.27e-6 2.16e-7

Heping Zhang (C2S2, Yale University) MSU 32 / 67

Page 35: Genetics Studies of Comorbidity

Propensity Scores: Estimation

Chr. SNP Gender AgeCoefficient p-value Coefficient p-value

16 rs2387823 0.0021 0.970 0.0031 0.90216 rs1344485 -0.0028 0.959 -0.0049 0.85016 rs11647459 0.0004 0.994 0.0038 0.882

Heping Zhang (C2S2, Yale University) MSU 33 / 67

Page 36: Genetics Studies of Comorbidity

Propensity Scores: Histograms

rs2387823 : histogram of propensity score 1

score[, 1]

Fre

quen

cy

0.4860 0.4865 0.4870

050

010

0015

00

rs2387823 : histogram of propensity score 2

score[, 2]

Fre

quen

cy

0.191 0.192 0.193 0.194

050

010

0015

00

rs1344485 : histogram of propensity score 1

score[, 1]

Fre

quen

cy

0.475 0.476 0.477 0.478 0.479

050

010

0015

0020

00

rs1344485 : histogram of propensity score 2

score[, 2]

Fre

quen

cy

0.146 0.147 0.148 0.149 0.150 0.151

050

010

0015

00rs11647459 : histogram of propensity score 1

score[, 1]

Fre

quen

cy

0.4715 0.4720 0.4725 0.4730 0.4735 0.4740 0.4745

050

010

0015

0020

00

rs11647459 : histogram of propensity score 2

score[, 2]

Fre

quen

cy

0.1450 0.1455 0.1460 0.1465 0.1470 0.1475 0.1480 0.1485

050

010

0015

0020

00

Heping Zhang (C2S2, Yale University) MSU 34 / 67

Page 37: Genetics Studies of Comorbidity

Significant Region

29kb region: 7 strongly linked SNPsHaplotype association p-value: 2.64× 10−7 by weighted test

Heping Zhang (C2S2, Yale University) MSU 35 / 67

Page 38: Genetics Studies of Comorbidity

Interactions Between rs420259 and New SNPs

Model Variable Coefficient p-valuers420259 -0.77415 6.43e-8

(1) rs9378249 -0.60212 3.88e-8rs420259 × rs9378249 0.37602 0.332rs420259 -0.63671 1.51e-3

(2) rs2387823 -0.18834 1.73e-5rs420259 × rs2387823 -0.10661 0.561rs420259 -0.62866 1.10e-3

(3) rs1344485 -0.19898 1.14e-5rs420259 × rs1344485 -0.10653 0.575rs420259 -0.67070 4.43e-4

(4) rs11647459 -0.19508 1.53e-5rs420259 × rs11647459 -0.07443 0.693

Heping Zhang (C2S2, Yale University) MSU 36 / 67

Page 39: Genetics Studies of Comorbidity

Interactions Among New SNPs

Model Variable Coefficient p-valuers9378249 -0.49915 1.77e-3

(1) rs2387823 -0.18455 4.62e-5rs9378249 × 2387823 -0.10876 0.461rs9378249 -0.49551 1.07e-3

(2) rs1344485 -0.20266 1.58e-5rs9378249 × 1344485 -0.14683 0.344rs9378249 -0.48910 1.13e-3

(3) rs11647459 -0.19580 2.74e-5rs9378249 × rs11647459 -0.14960 0.333

Heping Zhang (C2S2, Yale University) MSU 37 / 67

Page 40: Genetics Studies of Comorbidity

Outline

1 BackgroundComorbidityDisorders, Genes and Covariates

2 Weighted Association TestGeneralized Kendall’s TauAsymptotic Distribution and PowerApplication to WTCCC Bipolar Disorder Data

3 Maximum Weighted Association TestAsymptotic DistributionSimulations

4 Analysis of ComorbidityApplication to COGA Family DataApplication to SAGE GWAS Data

5 Conclusions

Heping Zhang (C2S2, Yale University) MSU 38 / 67

Page 41: Genetics Studies of Comorbidity

Maximum Weighted Test

Fixed-(h, q) test: how to choose optimal parameters h and q?Choose a grid of h and q values and maximize the weighted teststatistic over those choices{h1, . . . , hL1}: pre-specified grid points of h

{q1, . . . , qL2}: pre-specified grid points of q

χ2τ,max = max

1≤l1≤L1,1≤l2≤L2χ2τ (hl1 , ql2)

Approximate the optimal weighting scheme, yielding the strongestassociation measure

Heping Zhang (C2S2, Yale University) MSU 39 / 67

Page 42: Genetics Studies of Comorbidity

Resampling Approach

Population-based studies: restricted permutation in Yu et al.(2010)Family-based studies: children’s genotypes solely determined bytheir parents’ marker alleles, resample the children’s genotype byMendelian lawsCalculate M resampling test statistics χ2

τ,max,1, . . . , χ2τ,max,M using M

resampled dataResampling p-value: the proportion of the resampling teststatistics that exceed our observed test statistic, i.e.,M−1∑M

m=1 I(χ2τ,max,m ≥ χ2

τ,max)

Heping Zhang (C2S2, Yale University) MSU 40 / 67

Page 43: Genetics Studies of Comorbidity

Asymptotic Distribution: Joint Distribution

Equivalently,χ2τ,max = max

1≤l1≤L1,1≤l2≤L2‖Rl1,l2‖

2

R = Var−1/20D (S){S− E0(S)}

S = {S′(h1, q1), . . . ,S′(hL1 , qL2)}′Var0D(S) = diag[Var0{S(h1, q1)}, . . . ,Var0{S(hL1 , qL2)}]: the diagonalblocks of Var0(S)

Var−1/20 (S){S− E0(S)} D−→ N(0, IpL1L2)

R = Var−1/20D (S)Var1/2

0 (S)G, G ∼ N(0, IpL1L2)

Heping Zhang (C2S2, Yale University) MSU 41 / 67

Page 44: Genetics Studies of Comorbidity

Asymptotic Distribution: Uniform Approximation

TheoremAssume that the eigenvalues of Var0D(S) and Var0(S) are uniformlybounded from both above and below, i.e., there exist two positivenumbers c and C such that c ≤ λmin{Var0D(S)} ≤ λmax{Var0D(S)} ≤ Cand c ≤ λmin{Var0(S)} ≤ λmax{Var0(S)} ≤ C uniformly for all n, whereλmin and λmax denote the smallest and largest eigenvalues respectively.Then for any x ∈ R, as n→∞,

supx∈R

∣∣∣P(χ2τ,max ≤ x

)− P

(max

1≤l1≤L1,1≤l2≤L2‖Rl1,l2‖

2 ≤ x)∣∣∣→ 0.

Heping Zhang (C2S2, Yale University) MSU 42 / 67

Page 45: Genetics Studies of Comorbidity

Outline

1 BackgroundComorbidityDisorders, Genes and Covariates

2 Weighted Association TestGeneralized Kendall’s TauAsymptotic Distribution and PowerApplication to WTCCC Bipolar Disorder Data

3 Maximum Weighted Association TestAsymptotic DistributionSimulations

4 Analysis of ComorbidityApplication to COGA Family DataApplication to SAGE GWAS Data

5 Conclusions

Heping Zhang (C2S2, Yale University) MSU 43 / 67

Page 46: Genetics Studies of Comorbidity

Aims

Compare the performance of:Maximum weighted testNon-weighted test

Compare the performance of:Maximum weighted testOther covariate-adjusted tests

Heping Zhang (C2S2, Yale University) MSU 44 / 67

Page 47: Genetics Studies of Comorbidity

Simulation I: Data Generation

Generate the parents’ disease and marker genotypes via thehaplotype frequenciesGiven the parental genotypes, generate the offspring genotypeusing 1cM between the two lociTwo covariates are generated: Zco ∼ N(1, 2)

Without confounder: P(Zca = 1) = 1− P(Zca = 0) = 0.7With confounder: logit{P(Zca = 1)} = 0.5Mfa + 0.5Mmo

Bivariate ordinal traits are generated according to random effectsproportional odds model:

logit{P(T(j) ≤ k)} = αj,k + βgG + βcoZco + βcaZca + Uj,

with k = 1, . . . ,Kj, j = 1, 2, and (U1,U2) ∼ N(0,Σ)

Heping Zhang (C2S2, Yale University) MSU 45 / 67

Page 48: Genetics Studies of Comorbidity

Simulation I: Parameters

Number of categories: K1 = 3 and K2 = 4

(α1,1, α1,2) = (−0.5,−0.3), (α2,1, α2,2, α2,3) = (−0.5,−0.3,−0.1)

βg = 2.0

βco = βca = 0, 0.5, 1.0, 1.5, 2.0

Σ =

(1 0.25

0.25 1

)The grid of h is {C1(C2/C1)(l1/(L1−1)) : l1 = 0, . . . ,L1 − 1}, withC1 = 0.05, C2 = 10, L1 = 8

The grid of q is {0.5l2/(L2 − 1) : l2 = 0, . . . ,L2 − 1}, with L2 = 5

Heping Zhang (C2S2, Yale University) MSU 46 / 67

Page 49: Genetics Studies of Comorbidity

Type I Error of Maximum Weighted Test

Significance LevelConfounder n α = 0.05 α = 0.01 α = 0.001

No 200 0.0466 0.0090 0.0006400 0.0512 0.0097 0.0010

Yes 200 0.0469 0.0084 0.0007400 0.0462 0.0090 0.0011

Heping Zhang (C2S2, Yale University) MSU 47 / 67

Page 50: Genetics Studies of Comorbidity

Power Comparison: Without Confounder

Covariate effectn α Method 0.0 0.5 1.0 1.5 2.0

200 0.05 Weighted 0.681 0.521 0.372 0.275 0.222Non-weighted 0.726 0.522 0.306 0.189 0.135

0.01 Weighted 0.432 0.281 0.161 0.099 0.071Non-weighted 0.491 0.283 0.128 0.064 0.041

0.001 Weighted 0.160 0.082 0.036 0.017 0.011Non-weighted 0.223 0.097 0.028 0.011 0.006

400 0.05 Weighted 0.948 0.848 0.685 0.551 0.448Non-weighted 0.960 0.838 0.565 0.348 0.233

0.01 Weighted 0.846 0.658 0.441 0.297 0.213Non-weighted 0.877 0.643 0.321 0.154 0.084

0.001 Weighted 0.563 0.337 0.164 0.091 0.054Non-weighted 0.671 0.361 0.115 0.040 0.018

Heping Zhang (C2S2, Yale University) MSU 48 / 67

Page 51: Genetics Studies of Comorbidity

Power Comparison: With Confounder

Covariate effectn α Method 0.0 0.5 1.0 1.5 2.0

200 0.05 Weighted 0.695 0.557 0.405 0.310 0.250Non-weighted 0.728 0.528 0.308 0.194 0.139

0.01 Weighted 0.452 0.305 0.181 0.120 0.087Non-weighted 0.495 0.288 0.129 0.066 0.041

0.001 Weighted 0.165 0.090 0.046 0.024 0.015Non-weighted 0.224 0.095 0.032 0.011 0.006

400 0.05 Weighted 0.951 0.867 0.718 0.593 0.493Non-weighted 0.961 0.834 0.573 0.363 0.251

0.01 Weighted 0.854 0.682 0.483 0.345 0.250Non-weighted 0.875 0.645 0.332 0.170 0.094

0.001 Weighted 0.572 0.355 0.196 0.111 0.069Non-weighted 0.665 0.364 0.129 0.044 0.020

Heping Zhang (C2S2, Yale University) MSU 49 / 67

Page 52: Genetics Studies of Comorbidity

Simulation II: Other Covariate-Adjusted Methods

FBAT-GEE (Lange et al. 2003) adjusting for covariates:Fit the regression model g(E[T(j)]) = αj + λ′jZ, with g(·) anappropriate link functionReplace the original traits T(j) with the residuals T(j) − g−1(αj + λ′jZ)in the FBAT-GEE test statistic

Ordinal trait test (Wang, Ye and Zhang, 2006):Deal with a single ordinal trait at a timeApply the Bonferroni correction for multiple trait testing

Heping Zhang (C2S2, Yale University) MSU 50 / 67

Page 53: Genetics Studies of Comorbidity

Simulation II: Data Generation

Continuous covariate: Zco ∼ N(1, 2)

Bivariate quantitative traits:

Y(j) = µ+ βgG + βcoZco + εj, j = 1, 2,

with (ε1, ε2)′ ∼ 2φ2(x;Σ)Φ(α′x), x ∈ R2 (bivariate skew normaldistribution)Bivariate ordinal traits: discretizing quantitative traits by (50%,67%) and (33%, 54%, 75%) percentiles

α = (5, 5) and Σ =

(1 0.25

0.25 1

)µ = 0, βg = βco = 0.8

Heping Zhang (C2S2, Yale University) MSU 51 / 67

Page 54: Genetics Studies of Comorbidity

Power Comparison

Significance Leveln Method α = 0.05 α = 0.01 α = 0.001

200 χ2τ,max 0.640 0.381 0.134

FBAT-GEE 0.608 0.355 0.137Wang et al.’s Test 0.448 0.236 0.081

400 χ2τ,max 0.930 0.815 0.518

FBAT-GEE 0.902 0.758 0.499Wang et al.’s Test 0.775 0.590 0.320

600 χ2τ,max 0.991 0.961 0.817

FBAT-GEE 0.982 0.938 0.787Wang et al.’s Test 0.925 0.807 0.585

Heping Zhang (C2S2, Yale University) MSU 52 / 67

Page 55: Genetics Studies of Comorbidity

Outline

1 BackgroundComorbidityDisorders, Genes and Covariates

2 Weighted Association TestGeneralized Kendall’s TauAsymptotic Distribution and PowerApplication to WTCCC Bipolar Disorder Data

3 Maximum Weighted Association TestAsymptotic DistributionSimulations

4 Analysis of ComorbidityApplication to COGA Family DataApplication to SAGE GWAS Data

5 Conclusions

Heping Zhang (C2S2, Yale University) MSU 53 / 67

Page 56: Genetics Studies of Comorbidity

Collaborative Studies on Genetics of Alcoholism

A large scale study to map alcohol dependence susceptible genes

Heping Zhang (C2S2, Yale University) MSU 54 / 67

Page 57: Genetics Studies of Comorbidity

COGA Data

The data include 143 families with a total of 1,614 individualsMultiple Traits:

ALDX1 (the severity of the alcohol dependence): pure unaffected,never drunk, unaffected with some symptoms, and affectedMaxDrink (maximum number of drinks in a 24 hour period): 0-9,10-19, 20-29, and more than 30 drinksTimeDrink (spent so much time drinking, had little time for anythingelse): “no”, “yes and lasted less than a month”, and “yes and lastedfor one month or longer”

Genotypes: markers on chromosome 7Covariates: age at interview and gender

Heping Zhang (C2S2, Yale University) MSU 55 / 67

Page 58: Genetics Studies of Comorbidity

Results

D7S

1790

D7S

513

D7S

1802

D7S

629

D7S

673

D7S

1838

NP

Y2

D7S

817

D7S

2846

D7S

521

D7S

691

D7S

478

D7S

679

D7S

665

D7S

1830

D7S

3046

D7S

1870

D7S

1797

D7S

820

D7S

821

D7S

1796

D7S

1799

D7S

1817

D7S

2847

D7S

490

D7S

1809

D7S

1804

D7S

509

D7S

1824

D7S

794

D7S

1805

0.0

0.5

1.0

1.5

2.0

2.5

3.0

3.5

0 7.5 21.4 30.1 48.3 56.8 64.3 73.9 88 94.2 107.5 116.6 127.7 143.1 156.2 171.7 187

−lo

g(p−

valu

e)

Distance (cM)

adjustedunadjusted

Heping Zhang (C2S2, Yale University) MSU 56 / 67

Page 59: Genetics Studies of Comorbidity

Outline

1 BackgroundComorbidityDisorders, Genes and Covariates

2 Weighted Association TestGeneralized Kendall’s TauAsymptotic Distribution and PowerApplication to WTCCC Bipolar Disorder Data

3 Maximum Weighted Association TestAsymptotic DistributionSimulations

4 Analysis of ComorbidityApplication to COGA Family DataApplication to SAGE GWAS Data

5 Conclusions

Heping Zhang (C2S2, Yale University) MSU 57 / 67

Page 60: Genetics Studies of Comorbidity

Study of Addiction: Genetics and Environment (SAGE)

The data were from SAGE (Bierut et al. 2010), a case-controlstudy of mostly unrelated individuals aimed at identifying geneticassociations for addiction.We included 4,121 subjects for whom the addiction to the sixcategories of substances and genomewide SNP data (ILLUMINAHuman 1M platform) were available.We defined the outcome as to whether a subject was addicted tosubstances in at least two of the six addiction categories (nicotine,alcohol, marijuana, cocaine, opiates or others).

Heping Zhang (C2S2, Yale University) MSU 58 / 67

Page 61: Genetics Studies of Comorbidity

Peak SNPs in PKNOX2 in White Women

SNP P-value Odds Ratiors1426153 (G) 1.84E-06 1.66rs11220015(A) 1.97E-06 1.65rs11602925(G) 1.24E-06 1.67rs750338(C) 4.22E-07 1.63rs12273605(T) 3.83E-06 1.71rs10893365(C) 2.27E-07 1.72rs10893366(T) 6.87E-07 1.70rs12284594(G) 7.13E-08 1.77

Heping Zhang (C2S2, Yale University) MSU 59 / 67

Page 62: Genetics Studies of Comorbidity

Peak SNPs in PKNOX2 in White Women for Individual Substances

SNP Nicotine Alcohol Marijuana Cocaine Opiates Othersrs1426153 (G) 0.0159 5.75E-5 7E-4 3E-4 0.0113 1E-4rs11220015(A) 0.0163 6.86E-5 0.0010 3E-4 0.0037 4.18E-5rs11602925(G) 0.0136 4.24E-5 7E-4 3E-4 0.0059 5.29E-5rs750338(C) 0.0491 4.26E-5 0.0013 2E-4 0.0112 2.22E-5rs12273605(T) 0.0921 3E-4 3.53E-5 1E-4 0.0680 3.11E-5rs10893365(C) 0.0411 1.72E-5 8.58E-6 2.91E-5 0.0699 2.58E-5rs10893366(T) 0.0621 1.37E-5 8.80E-6 8.63E-5 0.0905 5.35E-5rs12284594(G) 0.0239 1.97E-6 8.54E-6 4.39E-5 0.0533 2.45E-5

Heping Zhang (C2S2, Yale University) MSU 60 / 67

Page 63: Genetics Studies of Comorbidity

Result

PKNOX2 is a novel TALE homeodomain-encoding gene, locatedat 11q24 in humansIt functions as a nuclear transcription factor indicated by itsstructure and sub-cellular localizationOne of the cis-regulated genes for alcohol addiction in mice(Mulligan et al. 2006)Confirmed by multiple studies

Heping Zhang (C2S2, Yale University) MSU 61 / 67

Page 64: Genetics Studies of Comorbidity

Conclusions: Method

Developed a nonparametric weighted test to adjust for covariatesthat accommodates multiple traitsProvided its asymptotic distribution and analytical powercalculationRefined the weighted test by proposing the idea of maximumweighting over the grid points of parametersProposed an asymptotic approach to assess its significance

Heping Zhang (C2S2, Yale University) MSU 62 / 67

Page 65: Genetics Studies of Comorbidity

Conclusions: Application

WTCCC bipolar disorder data: not only confirmed SNP rs420259on Chromosome 16 reported by the WTCCC (2007), but alsoidentified two regions (rs9378249 on chr 6; rs12938916 on chr 17)at the genome-wide significance levelThe identified haplotype block is near the RPGRIP1L gene thatwas reported to be associated with bipolar disorder (O’Donovan etal., 2008; Riley et al., 2009)COGA data: confirmed and strengthened the top signal; providedevidences for the advantage of maximum weighted test overnon-weighted testSAGE data: identified PKNOX2 for addiction, which has beenconfirmed by other studies

Heping Zhang (C2S2, Yale University) MSU 63 / 67

Page 66: Genetics Studies of Comorbidity

Other Ongoing/Future Work

Incorporating genetic prior information into a current studyGenetic association analysis for rare variantsNonparametric test for gene-environment interactionsGenetic test for multiple trait covariance structure

Heping Zhang (C2S2, Yale University) MSU 64 / 67

Page 67: Genetics Studies of Comorbidity

Acknowledgment

Supported by grant R01DA016750 from National Institute on DrugAbuseThe SAGE data were obtained from dbGaP(http://www.ncbi.nlm.nih.gov)The COGA data were provided by COGAThe views expressed here are those of the authors.

Heping Zhang (C2S2, Yale University) MSU 65 / 67

Page 68: Genetics Studies of Comorbidity

References

W. Zhu and H. Zhang (2009) Why do we test multiple traits in geneticassociation studies? (with discussion). J. Korean Statist. Soc., 38:1-10.

H. Zhang, C.-T. Liu and X. Wang (2010) An association test for multipletraits based on the generalized Kendall’s tau. J. Amer. Statist. Assoc.,105:473-481.

Y. Jiang and H. Zhang (2011) Propensity Score-Based NonparametricTest Revealing Genetic Variants Underlying Bipolar Disorder. Genet.Epidemiol., 35:125-132.

H. Zhang (2011) Statistical Analysis in Genetic Studies of MentalIllnesses, Statistical Science, in press.

W. Zhu, Y. Jiang, and H. Zhang (2011) Covariate-Adjusted AssociationTests and Power Calculations Based on the Generalized Kendall’s Tau.J. Amer. Statist. Assoc.

Heping Zhang (C2S2, Yale University) MSU 66 / 67

Page 69: Genetics Studies of Comorbidity

Heping Zhang (C2S2, Yale University) MSU 67 / 67