This summarizes some very basic concepts in survival analysis from this amazing tutorial, which helps me a lot.

Chapter 3: Estimiating the Survival or Hazard Function (Parametric)

There are two ways of estimating survival/hazard functions

  • parametric model for based on a particular density function
  • empirical estimate of survival function (non-parametric)

If without censoring, the emperical is simply the proportion of people dying no earlier that time . It is censoring patients that complicates this problem.

Some Parametric Survival Distribution (Continuos)

The exponential distribution

This is a constant hazard model, with only one parameter.

The Weibull distribution

It is a generalization of exponential model

where os the scale parameter and is the shape parameter.

  • : constant hazard
  • : decreasing hazard
  • : increasing hazard

The Rayleigh distribution

Another 2-parameter generalization of exponential.

Other models

Compound exponential, log-normal, log-logistic

Summary of Parametric Model

Good sides of parametric model

  • easy to estimate and inference
  • simple forms of , , and
  • qualitative shape of hazard function

Other tips

  • With adequacy of fit to a dataset, one can usually distinguish one-parameter and two-parameter models
  • Without a lot of data, hard to distinguish between the fits of 2-par models

Comments