Logit-normal distribution

Logit-normal
Probability density function
Cumulative distribution function
Notation	$P( \mathcal{N}(\mu,\,\sigma^2) )$
Parameters	σ² > 0 — squared scale (real), μ ∈ R — location
Support	x ∈ (0, 1)
PDF	$\frac{1}{\sigma \sqrt{2 \pi}}\, e^{-\frac{(\operatorname{logit}(x) - \mu)^2}{2\sigma^2}}\frac{1}{x (1-x)}$
CDF	$\frac12\Big[1 + \operatorname{erf}\Big( \frac{\operatorname{logit}(x)-\mu}{\sqrt{2\sigma^2}}\Big)\Big]$
Mean	no analytical solution
Median	$P(\mu)\,$
Mode	no analytical solution
Variance	no analytical solution
MGF	no analytical solution

In probability theory, a logit-normal distribution is a probability distribution of a random variable whose logit has a normal distribution. If Y is a random variable with a normal distribution, and P is the logistic function, then X = P(Y) has a logit-normal distribution; likewise, if X is logit-normally distributed, then Y = logit(X)= log (X/(1-X)) is normally distributed. It is also known as the logistic normal distribution,^[1] which often refers to a multinomial logit version (e.g.^[2]^[3]^[4]^[5]).

A variable might be modeled as logit-normal if it is a proportion, which is bounded by zero and one, and where values of zero and one never occur.

Characterization

Probability density function

The probability density function (PDF) of a logit-normal distribution, for 0 ≤ x ≤ 1, is:

f_X(x;\mu,\sigma) = \frac{1}{\sigma\sqrt{2 \pi}}\,\frac{1}{x (1-x)}\, e^{-\frac{(\operatorname{logit}(x) - \mu)^2}{2\sigma^2}}

where μ and σ are the mean and standard deviation of the variable’s logit (by definition, the variable’s logit is normally distributed).

The density obtained by changing the sign of μ is symmetrical, in that it is equal to f(1-x;-μ,σ), shifting the mode to the other side of 0.5 (the midpoint of the (0,1) interval).

Plot of the Logitnormal PDF for various combinations of μ (facets) and σ (colors)

Moments

The moments of the logit-normal distribution have no analytic solution. However, they can be estimated by numerical integration.

Mode

When the derivative of the density equals 0 then the location of the mode x satisfies the following equation:

\operatorname{logit}(x) = \sigma^2(2x-1)+\mu .

Multivariate generalization

The logistic normal distribution is a generalization of the logit–normal distribution to D-dimensional probability vectors by taking a logistic transformation of a multivariate normal distribution.^[6]^[7]^[8]

Probability density function

The probability density function is:

f_X( \mathbf{x}; \boldsymbol{\mu} , \boldsymbol{\Sigma} ) = \frac{1}{ | 2 \pi \boldsymbol{\Sigma} |^\frac{1}{2} } \, \frac{1}{ \prod\limits_{i=1}^D x_i } \, e^{- \frac{1}{2} \left\{ \log \left( \frac{ \mathbf{x}_{-D} }{ x_D } \right) - \boldsymbol{\mu} \right\}^\top \boldsymbol{\Sigma}^{-1} \left\{ \log \left( \frac{ \mathbf{x}_{-D} }{ x_D } \right) - \boldsymbol{\mu} \right\} } \quad , \quad \mathbf{x} \in \mathcal{S}^D \;\; ,

where $\mathbf{x}_{-D}$ denotes a vector of the first (D-1) components of $\mathbf{x}$ and $\mathcal{S}^D$ denotes the simplex of D-dimensional probability vectors. This follows from applying the additive logistic transformation to map a multivariate normal random variable $\mathbf{y} \sim \mathcal{N} \left( \boldsymbol{\mu} , \boldsymbol{\Sigma} \right) \; , \; \mathbf{y} \in \mathbb{R}^{D-1}$ to the simplex:

\mathbf{x} = \left[ \frac{ e^{ y_1 } }{ 1 + \sum_{i=1}^{D-1} e^{ y_i } } , \dots , \frac{ e^{ y_{D-1} } }{ 1 + \sum_{i=1}^{D-1} e^{ y_i } } , \frac{ 1 }{ 1 + \sum_{i=1}^{D-1} e^{ y_i } } \right]

Gaussian density functions and corresponding logistic normal density functions after logistic transformation.

The unique inverse mapping is given by:

\mathbf{y} = \left[ \log \left( \frac{ x_1 }{ x_D } \right) , \dots , \log \left( \frac{ x_{D-1} }{ x_D } \right) \right]

Use in statistical analysis

The logistic normal distribution is a more flexible alternative to the Dirichlet distribution in that it can capture correlations between components of probability vectors. It also has the potential to simplify statistical analyses of compositional data by allowing one to answer questions about log-ratios of the components of the data vectors. One is often interested in ratios rather than absolute component values.

The probability simplex is a bounded space, making standard techniques that are typically applied to vectors in $\mathbb{R}^n$ less meaningful. Aitchison described the problem of spurious negative correlations when applying such methods directly to simplicial vectors.^[7] However, mapping compositional data in $\mathcal{S}^D$ through the inverse of the additive logistic transformation yields real-valued data in $\mathbb{R}^{D-1}$ . Standard techniques can be applied to this representation of the data. This approach justifies use of the logistic normal distribution, which can thus be regarded as the "Gaussian of the simplex".

Relationship with the Dirichlet distribution

Logistic normal approximation to Dirichlet distribution

The Dirichlet and logistic normal distributions are never exactly equal for any choice of parameters. However, Aitchison described a method for approximating a Dirichlet with a logistic normal such that their Kullback–Leibler divergence (KL) is minimized:

K(p,q) = \int_{\mathcal{S}^D} p \left( \mathbf{x} | \boldsymbol{\alpha} \right) \log \left( \frac{ p \left( \mathbf{x} | \boldsymbol{\alpha} \right) }{ q \left( \mathbf{x} | \boldsymbol{\mu} , \boldsymbol{\Sigma} \right) } \right) \, d \mathbf{x}

This is minimized by:

\boldsymbol{\mu}^* = \mathbf{E}_p \left[ \log \left( \frac{ \mathbf{x}_{-D} }{ x_D } \right) \right] \quad , \quad \boldsymbol{\Sigma}^* = \textbf{Var}_p \left[ \log \left( \frac{ \mathbf{x}_{-D} }{ x_D } \right) \right]

Using moment properties of the Dirichlet distribution, the solution can be written in terms of the digamma $\psi$ and trigamma $\psi'$ functions:

\mu_i^* = \psi \left( \alpha_i \right) - \psi \left( \alpha_D \right) \quad , \quad i = 1 , \cdots , D-1

\Sigma_{ii}^* = \psi' \left( \alpha_i \right) + \psi' \left( \alpha_D \right) \quad , \quad i = 1 , \cdots , D-1

\Sigma_{ij}^* = \psi' \left( \alpha_D \right) \quad , \quad i \neq j

This approximation is particularly accurate for large $\boldsymbol{\alpha}$ . In fact, one can show that for $\alpha_i \rightarrow \infty , i = 1 , \cdots , D$ , we have that $p \left( \mathbf{x} | \boldsymbol{\alpha} \right) \rightarrow q \left( \mathbf{x} | \boldsymbol{\mu}^* , \boldsymbol{\Sigma}^* \right)$ .

External links

logitnorm package for R

Probability distributions

List of probability distributions

Discrete univariate with finite support	Benford Bernoulli Beta-binomial binomial categorical hypergeometric Poisson binomial Rademacher discrete uniform Zipf Zipf–Mandelbrot

Discrete univariate with infinite support	beta negative binomial Borel Conway–Maxwell–Poisson discrete phase-type Delaporte extended negative binomial Gauss–Kuzmin geometric logarithmic negative binomial parabolic fractal Poisson Skellam Yule–Simon zeta

Continuous univariate supported on a bounded interval	Arcsine ARGUS Balding–Nichols Bates Beta Beta rectangular Irwin–Hall Kumaraswamy logit-normal Noncentral beta raised cosine Reciprocal Triangular U-quadratic uniform Wigner semicircle

Continuous univariate supported on a semi-infinite interval	Benini Benktander 1st kind Benktander 2nd kind Beta prime Burr chi-squared chi Dagum Davis exponential-logarithmic Erlang exponential F folded normal Flory-Schulz Fréchet Gamma Gamma/Gompertz generalized inverse Gaussian Gompertz half-logistic half-normal Hotelling's T-squared hyper-Erlang hyperexponential hypoexponential inverse chi-squared scaled inverse chi-squared inverse Gaussian inverse gamma Kolmogorov Lévy log-Cauchy log-Laplace log-logistic log-normal matrix-exponential Maxwell–Boltzmann Maxwell–Jüttner Mittag-Leffler Nakagami noncentral chi-squared Pareto phase-type Poly-Weibull Rayleigh relativistic Breit–Wigner Rice shifted Gompertz truncated normal type-2 Gumbel Weibull Wilks's lambda

Continuous univariate supported on the whole real line	Cauchy exponential power Fisher's z generalized normal generalized hyperbolic geometric stable Gumbel Holtsmark hyperbolic secant Johnson S_U Landau Laplace asymmetric Laplace logistic noncentral t normal (Gaussian) normal-inverse Gaussian skew normal slash stable Student's t type-1 Gumbel Tracy–Widom variance-gamma Voigt

Continuous univariate with support whose type varies	generalized extreme value generalized Pareto Tukey lambda q-Gaussian q-exponential q-Weibull shifted log-logistic

Mixed continuous-discrete univariate	rectified Gaussian

Multivariate (joint)	Discrete Ewens multinomial Dirichlet-multinomial negative multinomial Continuous Dirichlet generalized Dirichlet multivariate normal multivariate stable multivariate t normal-inverse-gamma normal-gamma Matrix-valued inverse matrix gamma inverse-Wishart matrix normal matrix t matrix gamma normal-inverse-Wishart normal-Wishart Wishart

Directional	Univariate (circular) directional Circular uniform univariate von Mises wrapped normal wrapped Cauchy wrapped exponential wrapped asymmetric Laplace wrapped Lévy Bivariate (spherical) Kent Bivariate (toroidal) bivariate von Mises Multivariate von Mises–Fisher Bingham

Degenerate and singular	Degenerate Dirac delta function Singular Cantor

Families	Circular compound Poisson elliptical exponential natural exponential location-scale maximum entropy mixture Pearson Tweedie wrapped

This article is issued from Wikipedia - version of the Sunday, March 06, 2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.

Logit-normal distribution

Characterization

Probability density function

Moments

Mode

Multivariate generalization

Probability density function

Use in statistical analysis

Relationship with the Dirichlet distribution

See also

Further reading

External links