9. estimating survival distribution for a ph model

12
9. Estimating Survival Distribution for a PH Model

Upload: others

Post on 13-Nov-2021

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 9. Estimating Survival Distribution for a PH Model

9. Estimating Survival Distribution for a PH Model

Page 2: 9. Estimating Survival Distribution for a PH Model

Objective:

โ€ข Estimating the survival distribution for individuals with a certain combination of covariates.

PH model assumption:

โ€ข ๐œ† ๐‘ก ๐‘ง = ๐œ†0(๐‘ก)exp(๐›ฝ๐‘‡๐‘)

โ€ข Given Z=๐‘งโˆ—, it is easily to derive the relationship between S(t|z) and covariates as the following function:

๐‘† ๐‘ก ๐‘งโˆ— = ๐‘’โˆ’ 0๐‘ก๐œ† ๐‘ข ๐‘งโˆ— ๐‘‘๐‘ข = ๐‘’โˆ’ 0

๐‘ก๐œ†0 ๐‘ข ๐‘ง

โˆ— ๐‘’๐‘ฅ๐‘(๐›ฝ๐‘‡๐‘งโˆ—)๐‘‘ ๐‘ข = ๐‘’โˆ’๐‘’๐‘ฅ๐‘(๐›ฝ๐‘‡๐‘งโˆ—)๐›ฌ0(๐‘ก)

This means in order to estimate S(t|๐‘ = ๐‘งโˆ—), we only need to estimate ๐›ฝ and ฮ›0(๐‘ก), where ๐›ฝ can be estimated by MPLE.

Another Goal of the COX model

Page 3: 9. Estimating Survival Distribution for a PH Model

Estimating ๐œฆ๐ŸŽ(๐ญ)

โ€ข The same logic of deriving Nelson-Aalen estimate of the cumulative hazard function in one sample problem will be used.

โ€ข Nelson-Aalen estimate for ฮ› ๐‘ก = ฯƒ๐‘ฅ<๐‘ก๐‘‘๐‘(๐‘ฅ)

๐‘Œ(๐‘ฅ)

โ€ข In the one-sample problem, all individuals in the sample have the same hazard of failing, implying the same cause-specific hazard. However, in a proportional hazard model, the individuals in the sample do not have hazard of failing at time x but rather have a hazard which depends on their covariate values. That is, for ๐‘–๐‘กโ„Ž

individual with covariate ๐‘๐‘– = (๐‘๐‘–1 , โ€ฆ , ๐‘๐‘–๐‘ž)๐‘‡, has hazard

๐œ†๐‘– ๐‘ก = ๐œ†0(๐‘ก)exp(๐›ฝ๐‘‡๐‘๐‘–)

Page 4: 9. Estimating Survival Distribution for a PH Model

Estimating ๐œฆ๐ŸŽ(๐ญ) โ€ข ๐‘‘๐‘๐‘– ๐‘ฅ |๐น ๐‘ฅ ~Bin ๐‘Œ๐‘– x , ฯ€๐‘– x ,where ฯ€๐‘– x โ‰ˆ ๐œ†๐‘–(๐‘ฅ)ฮ”๐‘ฅ

โ†’ ๐ธ[๐‘‘๐‘๐‘– ๐‘ฅ |๐น(๐‘ฅ)] = ๐‘Œ๐‘–๐œ†๐‘–(๐‘ฅ)ฮ”๐‘ฅ

= ๐œ†0(๐‘ฅ)exp(๐›ฝ๐‘‡๐‘๐‘–) ๐‘Œ๐‘–ฮ”๐‘ฅ

โ€ข ๐‘‘๐‘ ๐‘ฅ = ฯƒ๐‘–=1๐‘› ๐‘‘๐‘๐‘– ๐‘ฅ

โ†’ ๐ธ ๐‘‘๐‘ ๐‘ฅ ๐น ๐‘ฅ = ๐ธ[ฯƒ๐‘–=1๐‘› ๐‘‘๐‘๐‘– ๐‘ฅ |๐น(๐‘ฅ)]

= ฯƒ๐‘–=1๐‘› ๐ธ[๐‘‘๐‘๐‘– ๐‘ฅ |๐น(๐‘ฅ)]

= ฯƒ๐‘–=1๐‘› ๐œ†0(๐‘ฅ)exp(๐›ฝ

๐‘‡๐‘๐‘–) ๐‘Œ๐‘–ฮ”๐‘ฅ

= ๐œ†0(๐‘ฅ) ฮ”๐‘ฅ ฯƒ๐‘–=1๐‘› exp(๐›ฝ๐‘‡๐‘๐‘–) ๐‘Œ๐‘–(๐‘ฅ)

โ€ข Therefore we estimate ๐œ†0(๐‘ฅ) ฮ”๐‘ฅ by using

๐‘‘๐‘(๐‘ฅ)

ฯƒ๐‘–=1๐‘› exp(๐›ฝ๐‘‡๐‘๐‘–) ๐‘Œ๐‘–(๐‘ฅ)

Page 5: 9. Estimating Survival Distribution for a PH Model

Estimating ๐œฆ๐ŸŽ(๐ญ)

โ€ข ฮ›0(๐‘ก) โ‰ˆ ฯƒ๐‘ฅ<๐‘ก ๐œ†0(๐‘ฅ) ฮ”๐‘ฅ

โ†’ ฮ›0 (๐‘ก) = ฯƒ๐‘ฅ<๐‘ก๐‘‘๐‘(๐‘ฅ)

ฯƒ๐‘–=1๐‘› exp(๐›ฝ๐‘‡๐‘๐‘–) ๐‘Œ๐‘–(๐‘ฅ)

โ€ข Specific, if all the ๐›ฝโ€ฒ๐‘  were equal to zero, then the

previous formula would reduce to ฯƒ๐‘ฅ<๐‘ก๐‘‘๐‘(๐‘ฅ)

๐‘Œ(๐‘ฅ), giving

us back the Nelson-Aalen estimator.

Page 6: 9. Estimating Survival Distribution for a PH Model

Property of ๐œฆ๐ŸŽ(๐ญ)โ€ข ฮ›0(๐‘ก)is approximately unbiased for ฮ›0(๐‘ก)

Proof: ๐ธ(ฯƒ๐‘ฅ<๐‘ก(๐‘‘๐‘(๐‘ฅ)

ฯƒ๐‘–=1๐‘› exp(๐›ฝ๐‘‡๐‘๐‘–) ๐‘Œ๐‘–(๐‘ฅ)

))

= ฯƒ๐‘ฅ<๐‘ก๐ธ[๐‘‘๐‘(๐‘ฅ)

ฯƒ๐‘–=1๐‘› exp(๐›ฝ๐‘‡๐‘๐‘–) ๐‘Œ๐‘–(๐‘ฅ)

]

= ฯƒ๐‘ฅ<๐‘ก๐ธ{๐ธ๐‘‘๐‘ ๐‘ฅ

ฯƒ๐‘–=1๐‘› exp(๐›ฝ๐‘‡๐‘๐‘–) ๐‘Œ๐‘– ๐‘ฅ

๐น ๐‘ฅ }

since ฯƒ๐‘–=1๐‘› exp(๐›ฝ๐‘‡๐‘๐‘–) ๐‘Œ๐‘– ๐‘ฅ is fixed conditional on F(x)

๐ธ๐‘‘๐‘ ๐‘ฅ

ฯƒ๐‘–=1๐‘› exp(๐›ฝ๐‘‡๐‘๐‘–) ๐‘Œ๐‘– ๐‘ฅ

๐น ๐‘ฅ =๐ธ[๐‘‘๐‘(๐‘ฅ)|๐น(๐‘ฅ)]

ฯƒ๐‘–=1๐‘› exp(๐›ฝ๐‘‡๐‘๐‘–) ๐‘Œ๐‘– ๐‘ฅ

=๐ธ(ฯƒ๐‘‘๐‘๐‘– ๐‘ฅ |๐น(๐‘ฅ))

ฯƒ๐‘–=1๐‘› exp(๐›ฝ๐‘‡๐‘๐‘–) ๐‘Œ๐‘– ๐‘ฅ

=ฯƒ๐œ†0(๐‘ฅ)exp(๐›ฝ

๐‘‡๐‘๐‘–) ๐‘Œ๐‘–ฮ”๐‘ฅ

ฯƒ๐‘–=1๐‘› exp(๐›ฝ๐‘‡๐‘๐‘–) ๐‘Œ๐‘– ๐‘ฅ

= ๐œ†0 ๐‘ฅ ฮ”๐‘ฅ

๐‘†๐‘œ, ๐‘ฅ<๐‘ก

๐ธ{๐ธ๐‘‘๐‘ ๐‘ฅ

ฯƒ๐‘–=1๐‘› exp(๐›ฝ๐‘‡๐‘๐‘–) ๐‘Œ๐‘– ๐‘ฅ

๐น ๐‘ฅ } =๐‘ฅ<๐‘ก

๐œ†0 ๐‘ฅ ฮ”๐‘ฅ โ‰ˆ ฮ›0 (๐‘ก)

Page 7: 9. Estimating Survival Distribution for a PH Model

Estimate Survival Distribution

โ€ข Estimate the survival distribution forindividuals with a certain combination ofcovariates ๐‘ง0 (for a randomly sampled subject).

โ€ข ๐œ† ๐‘ก ๐‘ = ๐œ†0 ๐‘ก ๐‘’๐‘ฅ๐‘ ๐›ฝ๐‘‡๐‘

โ€ข ๐œ† ๐‘ก ๐‘ = ๐‘ง0 = ๐œ†0 ๐‘ก ๐‘’๐‘ฅ๐‘ ๐›ฝ๐‘‡๐‘ง0

Page 8: 9. Estimating Survival Distribution for a PH Model

Estimate Survival Function

โ€ข ๐‘† ๐‘ก ๐‘ = ๐‘ง0 = ๐‘’โˆ’ฮ› ๐‘ก ๐‘ = ๐‘ง0

โ€ข ฮ› ๐‘ก ๐‘ = ๐‘ง0 = 0๐‘ก๐œ†0 ๐‘ก ๐‘’๐‘ฅ๐‘ ๐›ฝ๐‘‡๐‘ง0 ๐‘‘๐‘ข

= ๐‘’๐‘ฅ๐‘ ๐›ฝ๐‘‡๐‘ง0 เถฑ0

๐‘ก

๐œ†0 ๐‘ก ๐‘‘๐‘ข = ๐‘’๐‘ฅ๐‘ ๐›ฝ๐‘‡๐‘ง0 ฮ›0 ๐‘ก

โ€ข แˆ˜๐‘† ๐‘ก ๐‘ = ๐‘ง0 = ๐‘’โˆ’๐‘’๐‘ฅ๐‘๐›ฝ๐‘‡๐‘ง0 ฮ›0 ๐‘ก

( แˆ˜๐›ฝ is the MPLE of ๐›ฝ)

Page 9: 9. Estimating Survival Distribution for a PH Model

แˆ˜๐‘† ๐‘ก ๐‘ = ๐‘0โ€ข Asymptotic Gaussian distribution

โ€ข ๐ธ แˆ˜๐‘† ๐‘ก ๐‘ = ๐‘0 โ†’๐‘Ž๐‘† ๐‘ก ๐‘ = ๐‘0

โ€ข เทž๐‘ฃ๐‘Ž๐‘Ÿ แˆ˜๐‘† ๐‘ก ๐‘ = ๐‘0 = แˆ˜๐‘†2 ๐‘ก ๐‘ = ๐‘0 ๐‘’2๐›ฝ๐‘‡๐‘ง0 ๐‘„1 ๐‘ก + ๐‘„2 ๐‘ก; ๐‘0

โ€ข ๐‘„1 ๐‘ก = ฯƒ๐‘ก๐‘–

๐‘‘๐‘–

๐‘Š2 ๐‘ก๐‘–;๐›ฝ, where ๐‘‘๐‘– is the number of deaths at time ๐‘ก๐‘– ,

๐‘Š ๐‘ก๐‘–; แˆ˜๐›ฝ = ฯƒ๐‘—๐œ–๐‘… ๐‘ก๐‘–๐‘’๐›ฝ๐‘‡๐‘ง๐‘—, ๐‘… ๐‘ก๐‘– = {๐‘—|๐‘Œ๐‘— ๐‘ก๐‘– = 1}

โ€ข ๐‘„2 ๐‘ก; ๐‘0 = ๐‘„3 ๐‘ก; ๐‘0๐‘ก ๐‘‰๐‘Ž๐‘Ÿ แˆ˜๐›ฝ ๐‘„3 ๐‘ก; ๐‘0

๐‘„3 ๐‘ก; ๐‘0 =

ฯƒ๐‘ก๐‘–โ‰ค๐‘ก๐‘Š 1 ๐‘ก๐‘–;๐›ฝ

๐‘Š ๐‘ก๐‘–;๐›ฝโˆ’ ๐‘01

๐‘‘๐‘–

๐‘Š ๐‘ก๐‘–;๐›ฝ

โ‹ฎ

ฯƒ๐‘ก๐‘–โ‰ค๐‘ก๐‘Š ๐‘ ๐‘ก๐‘–;๐›ฝ

๐‘Š ๐‘ก๐‘–;๐›ฝโˆ’ ๐‘0๐‘

๐‘‘๐‘–

๐‘Š ๐‘ก๐‘–;๐›ฝ

, ๐‘Š ๐‘˜ ๐‘ก๐‘–; แˆ˜๐›ฝ = ฯƒ๐‘—๐œ–๐‘… ๐‘ก๐‘–๐‘๐‘—๐‘˜๐‘’

๐›ฝ๐‘‡๐‘ง๐‘—

Link CL. Confidence intervals for the survival function using Cox's proportional-hazard model with covariates. Biometrics. 1984, 40(3):601-9.

Page 10: 9. Estimating Survival Distribution for a PH Model

Note:

โ€ข ๐‘„2 reflects the uncertainty in the estimation process

โ€ข ๐‘„3 is large when ๐‘ง0 is far from the average covariate in the risk set

โ€ข Confidence interval of Survival Function

แˆ˜๐‘† ๐‘ก ๐‘ = ๐‘0 ยฑ ๐‘ง๐›ผ2

เทž๐‘ฃ๐‘Ž๐‘Ÿ แˆ˜๐‘† ๐‘ก ๐‘ = ๐‘0

Page 11: 9. Estimating Survival Distribution for a PH Model

Example

โ€ข Data on 90 males with larynx cancer

โ€ข Variables โ€“

โ€ข Stage of disease (stages 1 to 4)

โ€ข Age at diagnosis of larynx cancer

โ€ข Time of death or on-study time in months

โ€ข Year of diagnosis of larynx cancer

โ€ข Death Indicator (0=alive, 1=dead)

See SAS output

Page 12: 9. Estimating Survival Distribution for a PH Model

R Codelibrary(survival);larynx <- read.table(file="data_chap9_larynx.txt", skip=11,

col.names=c("stage", "time", "age", "year", "status"));ageCat = larynx$age>60;stage.age = as.factor(larynx$stage+(larynx$age>60)*4);larynx = cbind(larynx, ageCat, cat=stage.age);larynx = larynx[order(larynx$stage, larynx$ageCat), ]larynx.ph = coxph(Surv(time,status) ~ stage+ageCat, data=larynx, ties='breslow')

stage.age2 = larynx[match(levels(larynx$cat), larynx$cat),c("stage", "ageCat")];s <- summary(survfit(larynx.ph, newdata=stage.age2));

cols=rep(c("black","red","green", "blue"),2); ltys = rep(c(1,2), each=4);plot(0,0,type="n", xlab="Survival time",ylab="Survival probabilities",xlim=c(0,8),ylim=c(0,1))for (i in 1:8) lines(s$time, s$surv[,i], lty=ltys[i], col=cols[i]);legend("bottomleft", legend=paste("Cat ",1:8),lty=ltys, col = cols, cex=0.9,title.adj=0.2);