Nyström method

In numerical analysis, the Nyström method[1] or quadrature method seeks the numerical solution of an integral equation by replacing the integral with a representative weighted sum. The continuous problem is broken into n discrete intervals, quadrature or numerical integration determines the weights and locations of representative points for the integral. The discrete problem to be solved is now a system of linear equations with n equations and n unknowns.

From the n solved points the function value at other points is interpolated consistent with the chosen quadrature method. Depending on the original problem and the choice of quadrature the problem may be ill-conditioned.

Since the linear equations require O(n^3) operations to solve, hence high-order quadrature rules perform better because low-order quadrature rules require large n for a given accuracy. Gaussian quadrature is normally a good choice for smooth, non-singular problems.

Discretization of the integral

\int_a^b h (x) \;\mathrm d x \approx \sum_{k=1}^n w_k h (x_k)
where w_k are the weights of the quadrature rule, and points x_k are the abscissas.

Example

Applying this to the inhomogeneous Fredholm equation of the second kind

f (x) = \lambda u (x) - \int_a^b K (x, x') f (x') \;\mathrm d x',

results in

f (x) \approx \lambda u (x) - \sum_{k=1}^n w_k K (x, x_k) f (x_k).


Sampling for Nyström method

The most important step of the Nyström method is sampling, because different sampled landmark points give different approximations of the original matrix.

Uniform sampling without replacement: is the most used approach for this purpose, where every point has the same probability of being included in the sample.

Minimum Sum of Squared Similarities Sampling (MSSS)

MSSS is an non-uniform sampling approach that considers both the variance and the similarity of the data distribution in its sampling data, which approximately maximizes the determinant of the reduced similarity matrix that represents the mutual similarities between sampled data points. It is shown in[2] that this approach increase the speed of the clustering on large datasets.

References

  1. Nyström, Evert Johannes (1930). "Über die praktische Auflösung von Integralgleichungen mit Anwendungen auf Randwertaufgaben". Acta Mathematica 54 (1): 185–204. doi:10.1007/BF02547521.
  2. Bouneffouf, Djallel; Birol, Inanc (2015), "Sampling with Minimum Sum of Squared Similarities for Nystrom-Based Large Scale Spectral Clustering", Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, {IJCAI} 2015, Buenos Aires, Argentina, July 25-31, 2015 8834, AAAI Press, pp. 2313––2319, ISBN 978-1-57735-738-4
This article is issued from Wikipedia - version of the Thursday, April 28, 2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.