Quantile regression averaging

Quantile Regression Averaging (QRA) is a forecast combination approach to computing prediction intervals. It applies quantile regression to the point forecasts of a small number of individual forecasting models or experts. It was introduced in 2014 by Jakub Nowotarski and Rafał Weron[1] and was originally used for probabilistic forecasting of electricity prices[2][3] and loads.[4][5] Despite its simplicity, it has been found to perform extremely well in practice: the top two performing teams in the price track of the Global Energy Forecasting Competition (GEFCom2014) used variants of QRA.[6][7]

Introduction

The individual point forecasts are used as independent variables and the corresponding observed target variable as the dependent variable in a standard quantile regression setting.[8] The Quantile Regression Averaging method yields an interval forecast of the target variable, but does not use the prediction intervals of the individual methods. One of the reasons for using point forecasts (and not interval forecasts) is their availability. For years, forecasters have focused on obtaining accurate point predictions. Computing probabilistic forecasts, on the other hand, is generally a much more complex task and has been neither discussed in the literature nor developed by practitioners as extensively. QRA may therefore be particularly attractive from a practical point of view, as it allows forecasters to leverage existing developments in point forecasting.

Computation

Visualization of the Quantile Regression Averaging (QRA) probabilistic forecasting technique.

The quantile regression problem can be written as follows:

Q_y(q|X_t) = X_t\beta_q,

where Q_y(q | \cdot) is the conditional q-th quantile of the dependent variable (y_t), X_t=[1,\hat{y}_{1,t},...,\hat{y}_{m,t}] is a vector of point forecasts of m individual models (i.e. the independent variables) and \beta_q is a vector of parameters (for quantile q). The parameters are estimated by minimizing the loss function for a particular q-th quantile:

\min\limits_{\beta_q}
\left[ \sum\limits_{\{t:y_t \geq X_t\beta_q \}} q |y_t - X_t\beta_q | +
\sum\limits_{\{t:y_t < X_t\beta_q \}} (1-q)|y_t - X_t\beta_q |\right] = \min\limits_{\beta_q} \left[ \sum\limits_{t}(q - \mathbf{1}_{y_t < X_t\beta_q }) (y_t - X_t\beta_q ) \right]
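
As a quick numerical check, the two forms of the loss above can be evaluated side by side. The following is a minimal Python sketch using NumPy; the function names and the sample numbers are illustrative assumptions, not part of the original method description.

```python
import numpy as np

def pinball_loss_split(y, fitted, q):
    """Left-hand form: separate sums over the two sets of observations."""
    resid = y - fitted
    above = resid >= 0  # observations with y_t >= X_t beta_q
    return q * np.abs(resid[above]).sum() + (1 - q) * np.abs(resid[~above]).sum()

def pinball_loss_indicator(y, fitted, q):
    """Right-hand form: a single sum using the indicator 1{y_t < X_t beta_q}."""
    resid = y - fitted
    return ((q - (resid < 0)) * resid).sum()

# Made-up observations and made-up fitted values X_t beta_q (illustration only).
y = np.array([10.0, 12.0, 9.0, 15.0])
fitted = np.array([11.0, 11.0, 11.0, 11.0])
q = 0.9

loss_split = pinball_loss_split(y, fitted, q)
loss_indicator = pinball_loss_indicator(y, fitted, q)
print(loss_split, loss_indicator)  # both are 4.8 up to floating-point rounding
```

For a high quantile such as q = 0.9, observations above the fitted value are penalized nine times more heavily than those below it, which pushes the fitted line upward.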

QRA assigns weights to individual forecasting methods and combines them to yield forecasts of chosen quantiles. Although the QRA method is based on quantile regression, not least squares, it still suffers from the same problems: the explanatory variables (here, the individual point forecasts) should not be strongly correlated, and the number of variables included in the model has to be relatively small for the method to be computationally efficient.
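
One way to carry out the estimation is to recast the minimization as a linear program, which is a standard reformulation of quantile regression. The sketch below assumes NumPy and SciPy's linprog. The helper names (fit_quantile_regression, qra_interval), the synthetic data and the 5%/95% quantile levels are hypothetical choices for illustration, and the sketch is not claimed to be the implementation used in the cited papers.

```python
import numpy as np
from scipy.optimize import linprog

def fit_quantile_regression(X, y, q):
    """Estimate beta_q by solving the pinball-loss minimization as a linear program.

    Residuals are split as y - X beta = u - v with u, v >= 0, and the
    objective is q * sum(u) + (1 - q) * sum(v).
    """
    n, p = X.shape
    cost = np.concatenate([np.zeros(p), q * np.ones(n), (1 - q) * np.ones(n)])
    A_eq = np.hstack([X, np.eye(n), -np.eye(n)])
    bounds = [(None, None)] * p + [(0, None)] * (2 * n)  # beta is unrestricted
    res = linprog(cost, A_eq=A_eq, b_eq=y, bounds=bounds, method="highs")
    if not res.success:
        raise RuntimeError(res.message)
    return res.x[:p]

def qra_interval(forecasts, y, new_forecasts, lower_q=0.05, upper_q=0.95):
    """Fit QRA on past point forecasts and return interval bounds for new ones."""
    X = np.column_stack([np.ones(len(y)), forecasts])  # X_t = [1, yhat_1t, ..., yhat_mt]
    X_new = np.column_stack([np.ones(len(new_forecasts)), new_forecasts])
    beta_lo = fit_quantile_regression(X, y, lower_q)
    beta_hi = fit_quantile_regression(X, y, upper_q)
    return X_new @ beta_lo, X_new @ beta_hi

# Synthetic example (assumed data): two noisy point forecasts of one target.
rng = np.random.default_rng(0)
n = 200
signal = 50 + 10 * np.sin(np.arange(n) / 10)
y = signal + rng.normal(0, 3, n)
forecasts = np.column_stack([signal + rng.normal(0, 2, n),
                             signal + rng.normal(0, 4, n)])

# Calibrate on the first 150 observations, build intervals for the last 50.
lower, upper = qra_interval(forecasts[:150], y[:150], forecasts[150:])
```

Each quantile is fitted separately, so the two bounds have their own weight vectors. Nothing in this sketch prevents the fitted quantiles from crossing, which a production implementation may want to check.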

Factor Quantile Regression Averaging (FQRA)

Visualization of the Factor Quantile Regression Averaging (FQRA) probabilistic forecasting technique.

The main difficulty associated with applying QRA comes from the fact that only individual models that perform well and (preferably) are distinct should be used. However, there may be many well-performing models or many different specifications of each model (with or without exogenous variables, with all or only selected lags, etc.), and it may not be optimal to include all of them in Quantile Regression Averaging.

In Factor Quantile Regression Averaging (FQRA),[3] instead of selecting individual models a priori, the relevant information contained in all forecasting models at hand is extracted using principal component analysis (PCA). The prediction intervals are then constructed on the basis of the common factors (f_t) obtained from the panel of point forecasts, as independent variables in a quantile regression. More precisely, in the FQRA method X_t=[1,\hat{f}_{1,t},...,\hat{f}_{k,t}] is a vector of k<m factors extracted from a panel of point forecasts of m individual models, not a vector of point forecasts of the individual models themselves. A similar principal component-type approach was proposed in the context of obtaining point forecasts from the Survey of Professional Forecasters data.[9]

Instead of considering a (large) panel of forecasts of the individual models, FQRA concentrates on a small number of common factors, which, by construction, are orthogonal to each other and hence contemporaneously uncorrelated. FQRA can also be interpreted as a forecast averaging approach. The factors estimated within PCA are linear combinations of individual vectors of the panel, and FQRA can therefore be used to assign weights to the forecasting models directly.
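
The FQRA steps can be sketched as follows, under several assumptions: the factors are taken as principal components of the column-centered panel (computed via SVD; the source does not specify the scaling), the quantile regression is solved as a linear program with SciPy's linprog, and all names and the synthetic panel are hypothetical. The last lines of fqra_fit show how the factor coefficients translate into weights on the individual models.

```python
import numpy as np
from scipy.optimize import linprog

def fit_quantile_regression(X, y, q):
    """Quantile regression as a linear program (residual = u - v, u, v >= 0)."""
    n, p = X.shape
    cost = np.concatenate([np.zeros(p), q * np.ones(n), (1 - q) * np.ones(n)])
    A_eq = np.hstack([X, np.eye(n), -np.eye(n)])
    bounds = [(None, None)] * p + [(0, None)] * (2 * n)
    res = linprog(cost, A_eq=A_eq, b_eq=y, bounds=bounds, method="highs")
    if not res.success:
        raise RuntimeError(res.message)
    return res.x[:p]

def fqra_fit(panel, y, q, k):
    """Fit FQRA with k factors; return (intercept, model weights, factors)."""
    mean = panel.mean(axis=0)
    centered = panel - mean                      # assumption: center, do not rescale
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    loadings = vt[:k].T                          # (m, k) PCA loadings
    factors = centered @ loadings                # (T, k) mutually orthogonal factors
    X = np.column_stack([np.ones(len(y)), factors])  # X_t = [1, f_1t, ..., f_kt]
    beta = fit_quantile_regression(X, y, q)
    # Factors are linear combinations of the forecasts, so the factor
    # coefficients map back to one weight per individual model.
    weights = loadings @ beta[1:]
    intercept = beta[0] - mean @ weights
    return intercept, weights, factors

# Synthetic panel (assumed data): m = 10 noisy point forecasts of one target.
rng = np.random.default_rng(1)
T, m = 200, 10
signal = 50 + 10 * np.sin(np.arange(T) / 10)
y = signal + rng.normal(0, 3, T)
panel = signal[:, None] + rng.normal(0, 1, (T, m)) * rng.uniform(1, 4, m)

intercept, weights, factors = fqra_fit(panel, y, q=0.9, k=2)
fitted_q90 = intercept + panel @ weights         # 90% quantile forecast in-sample
```

Because the regression uses only k = 2 regressors instead of all 10 forecasts, the strong correlation among the individual forecasts no longer enters the quantile regression directly.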

QRA and LAD regression

QRA may be viewed as an extension of combining point forecasts. The well-known ordinary least squares (OLS) averaging[10] uses linear regression to estimate weights of the point forecasts of individual models. Replacing the quadratic loss function with the absolute loss function leads to quantile regression for the median, or in other words, least absolute deviation (LAD) regression.[11]
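
To make the connection explicit, set q = 1/2 in the quantile regression loss from the Computation section. Both sums then carry the same weight, and the problem becomes

\min\limits_{\beta_{0.5}} \left[ \frac{1}{2} \sum\limits_{t} |y_t - X_t\beta_{0.5} | \right],

which has the same minimizer as the LAD problem \min\limits_{\beta} \sum\limits_{t} |y_t - X_t\beta |, since the constant factor 1/2 does not affect the solution. OLS averaging instead solves \min\limits_{\beta} \sum\limits_{t} (y_t - X_t\beta)^2 with the same regressors X_t.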

References

  1. Nowotarski, Jakub; Weron, Rafał (2015). "Computing electricity spot price prediction intervals using quantile regression and forecast averaging". Computational Statistics 30 (3): 791–803. doi:10.1007/s00180-014-0523-0. ISSN 0943-4062.
  2. Weron, Rafał (2014). "Electricity price forecasting: A review of the state-of-the-art with a look into the future". International Journal of Forecasting 30 (4): 1030–1081. doi:10.1016/j.ijforecast.2014.08.008.
  3. Maciejowska, Katarzyna; Nowotarski, Jakub; Weron, Rafał (2016). "Probabilistic forecasting of electricity spot prices using Factor Quantile Regression Averaging". International Journal of Forecasting. doi:10.1016/j.ijforecast.2014.12.004.
  4. Liu, B.; Nowotarski, J.; Hong, T.; Weron, R. (2015). "Probabilistic Load Forecasting via Quantile Regression Averaging on Sister Forecasts". IEEE Transactions on Smart Grid PP (99): 1–1. doi:10.1109/TSG.2015.2437877. ISSN 1949-3053.
  5. Hong, Tao; Fan, Shu. "Probabilistic Electric Load Forecasting: A Tutorial Review". blog.drhongtao.com. Retrieved 2015-11-28.
  6. Gaillard, Pierre; Goude, Yannig; Nedellec, Raphael. "Semi-parametric models and robust aggregation for GEFCom2014 probabilistic electric load and electricity price forecasting". ResearchGate. Retrieved 2015-11-28.
  7. Maciejowska, Katarzyna; Nowotarski, Jakub (2015). "A hybrid model for GEFCom2014 probabilistic electricity price forecasting". HSC Research Report, HSC/15/06.
  8. Koenker, Roger (2005). Quantile Regression. John Wiley & Sons, Ltd. doi:10.1002/9780470057339.vnn091. ISBN 9780470057339.
  9. Poncela, Pilar; Rodríguez, Julio; Sánchez-Mangas, Rocío; Senra, Eva (2011). "Forecast combination through dimension reduction techniques". International Journal of Forecasting 27 (2): 224–237. doi:10.1016/j.ijforecast.2010.01.012.
  10. Granger, Clive W. J.; Ramanathan, Ramu (1984). "Improved methods of combining forecasts". Journal of Forecasting 3 (2): 197–204. doi:10.1002/for.3980030207. ISSN 1099-131X.
  11. Nowotarski, Jakub; Raviv, Eran; Trück, Stefan; Weron, Rafał (2014). "An empirical comparison of alternative schemes for combining electricity spot price forecasts". Energy Economics 46: 395–412. doi:10.1016/j.eneco.2014.07.014.
This article is issued from Wikipedia - version of the Tuesday, December 01, 2015. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.