This summarizes some very basic concepts in survival analysis from this amazing tutorial, which helps me a lot.
Chapter 3: Estimiating the Survival or Hazard Function (Parametric)
There are two ways of estimating survival/hazard functions
- parametric model for based on a particular density function
- empirical estimate of survival function (non-parametric)
If without censoring, the emperical is simply the proportion of people dying no earlier that time . It is censoring patients that complicates this problem.
Some Parametric Survival Distribution (Continuos)
The exponential distribution
This is a constant hazard model, with only one parameter.
The Weibull distribution
It is a generalization of exponential model
where os the scale parameter and is the shape parameter.
- : constant hazard
- : decreasing hazard
- : increasing hazard
The Rayleigh distribution
Another 2-parameter generalization of exponential.
Other models
Compound exponential, log-normal, log-logistic
Summary of Parametric Model
Good sides of parametric model
- easy to estimate and inference
- simple forms of , , and
- qualitative shape of hazard function
Other tips
- With adequacy of fit to a dataset, one can usually distinguish one-parameter and two-parameter models
- Without a lot of data, hard to distinguish between the fits of 2-par models