Homoscedasticity

Figure: plot of random data showing homoscedasticity.

In statistics, a sequence or a vector of random variables is homoscedastic /ˌhoʊmoʊskəˈdæstɪk/ if all random variables in the sequence or vector have the same finite variance. This is also known as homogeneity of variance. The complementary notion is called heteroscedasticity. The spellings homoskedasticity and heteroskedasticity are also frequently used.[1]
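Stated as a formula (a direct restatement of the prose definition above, with \sigma^2 denoting the common finite variance), a sequence of random variables X_1, X_2, \ldots is homoscedastic when

    Var(X_i) = \sigma^2 < \infty  for all i,

and heteroscedastic when the variance Var(X_i) = \sigma_i^2 is allowed to change with i.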

The assumption of homoscedasticity simplifies mathematical and computational treatment. Serious violations of homoscedasticity (assuming that data are homoscedastic when in reality they are heteroscedastic /ˌhɛtəroʊskəˈdæstɪk/) may result in overestimating the goodness of fit as measured by the Pearson correlation coefficient.

Assumptions of a regression model

As used in describing simple linear regression analysis, one assumption of the fitted model (to ensure that the least-squares estimators are each a best linear unbiased estimator of the respective population parameters, by the Gauss–Markov theorem) is that the standard deviations of the error terms are constant and do not depend on the x-value. Consequently, each probability distribution for y (response variable) has the same standard deviation regardless of the x-value (predictor). In short, this assumption is homoscedasticity. Homoscedasticity is not required for the estimates to be unbiased, consistent, and asymptotically normal.[2]
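In the notation of the simple linear model y_i = \beta_0 + \beta_1 x_i + \varepsilon_i (written out here for concreteness; the paragraph above describes it only in words), the assumption reads

    Var(\varepsilon_i \mid x_i) = \sigma^2  for every observation i,

i.e. the conditional variance of the error term is a constant that does not depend on x_i. Under heteroscedasticity it becomes Var(\varepsilon_i \mid x_i) = \sigma_i^2, typically a function of x_i.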

Testing

Residuals can be tested for homoscedasticity using the Breusch–Pagan test, which regresses the squared residuals on the independent variables. Because the Breusch–Pagan test is sensitive to departures from normality, the Koenker–Bassett or 'generalized Breusch–Pagan' test is used for general purposes. Groupwise heteroscedasticity can be tested with the Goldfeld–Quandt test.
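As an illustration of how such a test is commonly run in practice (a minimal sketch, not part of the original article, assuming Python with numpy and statsmodels; the simulated data and variable names are purely illustrative):

    import numpy as np
    import statsmodels.api as sm
    from statsmodels.stats.diagnostic import het_breuschpagan

    # Simulated example: the noise standard deviation grows with x,
    # so the data are heteroscedastic by construction.
    rng = np.random.default_rng(0)
    x = rng.uniform(0, 10, size=200)
    y = 2.0 + 0.5 * x + rng.normal(scale=0.5 + 0.3 * x)

    X = sm.add_constant(x)          # design matrix with an intercept
    results = sm.OLS(y, X).fit()

    # Breusch-Pagan test: regresses the squared residuals on the regressors.
    # Returns (LM statistic, LM p-value, F statistic, F p-value); a small
    # p-value is evidence against the null hypothesis of homoscedasticity.
    lm_stat, lm_pvalue, f_stat, f_pvalue = het_breuschpagan(results.resid, results.model.exog)
    print(f"Breusch-Pagan LM p-value: {lm_pvalue:.4f}")

The Goldfeld–Quandt test mentioned above is also available in the same statsmodels module, as het_goldfeldquandt.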

Homoscedastic distributions

Two or more normal distributions, N(\mu_i, \Sigma_i), are homoscedastic if they share a common covariance (or correlation) matrix, \Sigma_i = \Sigma_j for all i, j. Homoscedastic distributions are especially useful in deriving statistical pattern recognition and machine learning algorithms. One popular example is Fisher's linear discriminant analysis.
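A brief sketch (not in the original article) of why the shared covariance matters for Fisher's linear discriminant analysis: for two classes with densities N(\mu_1, \Sigma) and N(\mu_2, \Sigma) and equal priors, the quadratic terms in the log-likelihood ratio cancel, leaving a decision function that is linear in x:

    \log \frac{p(x \mid \mu_1, \Sigma)}{p(x \mid \mu_2, \Sigma)}
      = (\mu_1 - \mu_2)^\top \Sigma^{-1} x - \tfrac{1}{2}\left(\mu_1^\top \Sigma^{-1} \mu_1 - \mu_2^\top \Sigma^{-1} \mu_2\right).

If the covariance matrices differed across classes, the x^\top \Sigma_i^{-1} x terms would not cancel and the resulting decision boundary would be quadratic rather than linear.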

The concept of homoscedasticity can be applied to distributions on spheres.[3]

References

  1. For the Greek etymology of the term, see McCulloch, J. Huston (1985). "On Heteros*edasticity". Econometrica 53 (2): 483. JSTOR 1911250.
  2. Achen, Christopher H.; Shively, W. Phillips (1995). Cross-Level Inference. University of Chicago Press. pp. 47–48. ISBN 9780226002194.
  3. Hamsici, Onur C.; Martinez, Aleix M. (2007). "Spherical-Homoscedastic Distributions: The Equivalency of Spherical and Normal Distributions in Classification". Journal of Machine Learning Research 8: 1583–1623.