Kolmogorov–Smirnov test
The Kolmogorov-Smirnov test is used to determine whether two empirical distributions are different or whether an empirical distribution differs from a theoretical distribution.
The empirical cumulative distribution for N observations yi is defined as E(x) = Σ i (yi < x). The two one-sided Kolmogorov-Smirnov test statistics statistics are given by
- DN+ = MAX( E(x) - F(x) )
- DN- = MAX( F(x) - E(x) )
where F(x) is the theoretical distribution or another empirical distribution. Knuth gives a detailed description of how to analyze the significance of this pair of statistics. Many people use MAX(DN+, DN-) instead, but the distribution of this statistic is more difficult to deal with.
Note that when the underlying independent variable is cyclic as with day of the year or day of the week, then Kuiper's test is more appropriate. Numerical Recipes is again a good source of information on this.
External links
- http://www.itl.nist.gov/div898/handbook/eda/section3/eda35g.htm - A lovely explanation of the one-sided KS test
- http://www.io.com/~ritter/JAVASCRP/NORMCHIK.HTM - JavaScript code that implements both the one-sided and two-sided tests.
- As always, Numerical Recipes ISBN: 0521431085 is a prime resource for this sort of thing (see http://www.nr.com/nronline_switcher.html for a discussion).