Standard score

"Standardize" redirects here. For industrial and technical standards, see Standardization.

For Fisher z-transformation in statistics, see Fisher transformation. For Z-values in ecology, see Z-value. For z-transformation to complex number domain, see Z-transform. For Z-factor in high-throughput screening, see Z-factor. For Z-score financial analysis tool, see Altman Z-score.

Figure: Comparison of the various grading methods in a normal distribution, including standard deviations, cumulative percentages, percentile equivalents, Z-scores, T-scores, and stanines (standard nines) with the percentage in each stanine.

In statistics, the standard score is the signed number of standard deviations an observation or datum is above the mean. A positive standard score indicates a datum above the mean, while a negative standard score indicates a datum below the mean. It is a dimensionless quantity obtained by subtracting the population mean from an individual raw score and then dividing the difference by the population standard deviation. This conversion process is called standardizing or normalizing (however, "normalizing" can refer to many types of ratios; see normalization (statistics) for more).

Standard scores are also called z-values, z-scores, normal scores, and standardized variables; the letter "Z" is used because the normal distribution is also known as the "Z distribution". They are most frequently used to compare a sample to a standard normal deviate, though they can be defined without assumptions of normality.

The z-score is only defined if one knows the population parameters; if one only has a sample set, then the analogous computation with sample mean and sample standard deviation yields the Student's t-statistic.

Calculation from raw score

The standard score of a raw score x ^[1] is

z = {x- \mu \over \sigma}

where:

μ is the mean of the population.

σ is the standard deviation of the population.

The absolute value of z represents the distance between the raw score and the population mean in units of the standard deviation. z is negative when the raw score is below the mean, positive when above.
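As a minimal sketch of the formula above, the following Python function computes z from a raw score. The function name `z_score` and the example values (a population mean of 100 and standard deviation of 15) are illustrative assumptions, not part of the definition.

```python
def z_score(x, mu, sigma):
    """Standard score of raw score x, given the population mean mu
    and the population standard deviation sigma."""
    if sigma <= 0:
        raise ValueError("sigma must be positive")
    return (x - mu) / sigma


# Hypothetical population: mean 100, standard deviation 15.
above = z_score(130, 100, 15)   # raw score above the mean -> positive z
below = z_score(85, 100, 15)    # raw score below the mean -> negative z

print(above)  # 2.0
print(below)  # -1.0
```

The sign of the result shows the side of the mean on which the raw score lies, and its absolute value is the distance from the mean in standard deviations.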

A key point is that calculating z requires the population mean and the population standard deviation, not the sample mean or sample standard deviation; that is, it requires the parameters of the population, not the statistics of a sample drawn from it. Knowing the true standard deviation of a population is often unrealistic, however, except in cases such as standardized testing, where the entire population is measured. Where it is impossible to measure every member of a population, the standard deviation may be estimated from a random sample.
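The distinction between the population and the sample standard deviation can be illustrated with Python's standard `statistics` module, where `pstdev` divides by N and `stdev` divides by N - 1. The list `scores` is made-up data, assumed here to be a complete population.

```python
import statistics

# Hypothetical data, assumed to be the ENTIRE population of interest.
scores = [2, 4, 4, 4, 5, 5, 7, 9]

mu = statistics.fmean(scores)        # population mean
sigma = statistics.pstdev(scores)    # population standard deviation (divides by N)
s = statistics.stdev(scores)         # sample standard deviation (divides by N - 1)

# z-scores use the population parameters mu and sigma.
z = [(x - mu) / sigma for x in scores]

print(mu, sigma)   # population parameters
print(s)           # larger than sigma for the same data
print(z)
```

If `scores` were only a sample, standardizing with the sample mean and `s` instead would give the analogous quantity based on estimates rather than a z-score in the strict sense.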

The z-score thus measures how far an observed value lies from the mean, in units of sigma (standard deviations).

The Z value provides an assessment of how far off-target a process is operating.

Applications

Main article: Z-test

The z-score is often used in the z-test in standardized testing – the analog of the Student's t-test for a population whose parameters are known, rather than estimated. As it is very unusual to know the entire population, the t-test is much more widely used.

The standard score can also be used in the calculation of prediction intervals. A prediction interval [L,U], consisting of a lower endpoint designated L and an upper endpoint designated U, is an interval such that a future observation X will lie in the interval with high probability $\gamma$ , i.e.

P(L<X<U) =\gamma,

For the standard score Z of X, this gives:^[2]

P\left( \frac{L-\mu}{\sigma} < Z < \frac{U-\mu}{\sigma} \right) = \gamma.

By determining the quantile z such that

P\left( -z < Z < z \right) = \gamma

it follows:

L=\mu-z\sigma,\ U=\mu+z\sigma
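The derivation above can be sketched in Python, assuming X is normally distributed with known μ and σ, so that Z follows the standard normal distribution. Under that assumption the quantile z satisfying P(-z < Z < z) = γ is the (1 + γ)/2 quantile of the standard normal. The function name `prediction_interval` and the example numbers are illustrative.

```python
from statistics import NormalDist


def prediction_interval(mu, sigma, gamma):
    """Symmetric interval [L, U] with P(L < X < U) = gamma,
    assuming X ~ Normal(mu, sigma) with known parameters."""
    if not 0 < gamma < 1:
        raise ValueError("gamma must lie strictly between 0 and 1")
    # P(-z < Z < z) = gamma  is equivalent to  P(Z < z) = (1 + gamma) / 2
    z = NormalDist().inv_cdf((1 + gamma) / 2)
    return mu - z * sigma, mu + z * sigma


# Hypothetical population: mean 100, standard deviation 15, gamma = 0.95.
lower, upper = prediction_interval(mu=100, sigma=15, gamma=0.95)
print(round(lower, 2), round(upper, 2))
```

For γ = 0.95 the quantile is approximately 1.96, so the interval spans roughly 1.96 standard deviations on either side of the mean.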

Standardizing in mathematical statistics

Further information: Normalization (statistics)

In mathematical statistics, a random variable X is standardized by subtracting its expected value $\operatorname{E}[X]$ and dividing the difference by its standard deviation $\sigma(X) = \sqrt{\operatorname{Var}(X)}:$

Z = {X - \operatorname{E}[X] \over \sigma(X)}

If the random variable under consideration is the sample mean of a random sample $\ X_1,\dots, X_n$ of X:

\bar{X}={1 \over n} \sum_{i=1}^n X_i

then the standardized version is

Z = \frac{\bar{X}-\operatorname{E}[X]}{\sigma(X)/\sqrt{n}}.
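A short Python sketch of the standardized sample mean follows. It assumes the observations are independent draws from a population whose mean `mu` and standard deviation `sigma` are known; the function name `standardized_mean` and the data are hypothetical.

```python
import math
import statistics


def standardized_mean(sample, mu, sigma):
    """Standardize the sample mean: (x_bar - mu) / (sigma / sqrt(n)),
    where mu and sigma are the known population parameters."""
    n = len(sample)
    if n == 0:
        raise ValueError("sample must not be empty")
    x_bar = statistics.fmean(sample)
    return (x_bar - mu) / (sigma / math.sqrt(n))


# Hypothetical sample of n = 9 observations; its mean is 104.
sample = [104, 98, 110, 102, 106, 100, 108, 96, 112]
z = standardized_mean(sample, mu=100, sigma=15)
print(z)  # (104 - 100) / (15 / 3), approximately 0.8
```

The divisor σ(X)/√n is the standard deviation of the sample mean, which is why the same deviation from the population mean yields a larger |Z| as n grows.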

T-score

"T-score" redirects here. It is not to be confused with t-statistic.

A T-score is a standard score Z shifted and scaled to have a mean of 50 and a standard deviation of 10.^[3]^[4]^[5]
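Shifting and scaling Z to a mean of 50 and a standard deviation of 10 corresponds to the linear map T = 50 + 10Z. A minimal Python sketch, with the illustrative function name `t_score`:

```python
def t_score(z):
    """Convert a standard score z to a T-score (mean 50, standard deviation 10)."""
    return 50 + 10 * z


print(t_score(0))     # 50   (a raw score equal to the mean)
print(t_score(1.5))   # 65.0 (1.5 standard deviations above the mean)
print(t_score(-2))    # 30   (2 standard deviations below the mean)
```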

References

↑ Kreyszig 1979, p880 eq(5)
↑ Kreyszig 1979, p880 eq(6)
↑
↑
↑

Kreyszig, E. (1979). Advanced Engineering Mathematics (Fourth ed.). Wiley. ISBN 0-471-02140-7.

This article is issued from Wikipedia - version of the Thursday, April 14, 2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.
