Jump to content

Logistic regression

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by 83.103.227.115 (talk) at 15:02, 13 November 2006. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Logistic regression is a statistical regression model for binary dependent variables. It can be considered as a generalized linear model that utilizes the logit as its link function, and has binomially distributed errors.

The model takes the form

where

The logarithm of the odds (probability divided by one minus the probability) of the outcome is modelled as a linear function of the explanatory variables, to . This can be written equivalently as

The interpretation of the parameter estimates is as a multiplicative effect on the odds ratio. In the case of a dichotomous explanatory variable, for instance sex, (the antilog of ) is the estimate of the odds-ratio of having the outcome for, say, males compared with females.

The parameters are usually estimated by maximum likelihood.

Extensions of the model exist to cope with multi-category dependent variables and ordinal dependent variables.

See also

References

  • Agresti, Alan: Categorical Data Analysis. New York: Wiley, 1990.
  • Amemiya, T., 1985, Advanced Econometrics, Harvard University Press.
  • Hosmer, D. W. and S. Lemeshow: Applied logistic regression. New York; Chichester, Wiley, 2000.