Smooth maximum
In mathematics, a smooth maximum of an indexed family x1, ..., xn of numbers is a differentiable approximation to the maximum function
and the concept of smooth minimum is similarly defined.
For large positive values of the parameter  , the following formulation is one smooth, differentiable approximation of the maximum function.  For negative values of the parameter that are large in absolute value, it approximates the minimum.
, the following formulation is one smooth, differentiable approximation of the maximum function.  For negative values of the parameter that are large in absolute value, it approximates the minimum.
 has the following properties:
 has the following properties:
 as as 
 is the average of its inputs is the average of its inputs
 as as 
The gradient of  is closely related to softmax and is given by
 is closely related to softmax and is given by
This makes the softmax function useful for optimization techniques that use gradient descent.
Another formulation is:
References
This article is issued from Wikipedia - version of the Monday, April 13, 2015. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.


![\nabla_{x_i}\mathcal{S}_\alpha (\{x_i\}_{i=1}^n) = \frac{e^{\alpha x_i}}{\sum_{j=1}^n e^{\alpha x_j}} [1 + \alpha(x_i - \mathcal{S}_\alpha (\{x_i\}_{i=1}^n))].](../I/m/11f218654ae84bdb485fc93b76b0f546.png)
