Channel state information
In wireless communications, channel state information (CSI) refers to known channel properties of a communication link. This information describes how a signal propagates from the transmitter to the receiver and represents the combined effect of, for example, scattering, fading, and power decay with distance. The CSI makes it possible to adapt transmissions to current channel conditions, which is crucial for achieving reliable communication with high data rates in multiantenna systems.
CSI needs to be estimated at the receiver and usually quantized and fed back to the transmitter (although reverse-link estimation is possible in TDD systems). Therefore, the transmitter and receiver can have different CSI. The CSI at the transmitter and the CSI at the receiver are sometimes referred to as CSIT and CSIR, respectively.
Different kinds of channel state information
There are basically two levels of CSI, namely instantaneous CSI and statistical CSI.
Instantaneous CSI (or short-term CSI) means that the current channel conditions are known, which can be viewed as knowing the impulse response of a digital filter. This gives an opportunity to adapt the transmitted signal to the impulse response and thereby optimize the received signal for spatial multiplexing or to achieve low bit error rates.
Statistical CSI (or long-term CSI) means that a statistical characterization of the channel is known. This description can include, for example, the type of fading distribution, the average channel gain, the line-of-sight component, and the spatial correlation. As with instantaneous CSI, this information can be used for transmission optimization.
In practice, CSI acquisition is limited by how fast the channel conditions change. In fast fading systems, where channel conditions vary rapidly during the transmission of a single information symbol, only statistical CSI is reasonable. In slow fading systems, on the other hand, instantaneous CSI can be estimated with reasonable accuracy and used for transmission adaptation for some time before it becomes outdated.
In practical systems, the available CSI often lies in between these two levels; instantaneous CSI with some estimation/quantization error is combined with statistical information.
Mathematical description
In a narrowband flat-fading channel with multiple transmit and receive antennas (MIMO), the system is modeled as[1]
$$\mathbf{y} = \mathbf{H}\mathbf{x} + \mathbf{n}$$
where $\mathbf{y}$ and $\mathbf{x}$ are the receive and transmit vectors, respectively, and $\mathbf{H}$ and $\mathbf{n}$ are the channel matrix and the noise vector, respectively. The noise is often modeled as circular symmetric complex normal with
$$\mathbf{n} \sim \mathcal{CN}(\mathbf{0}, \mathbf{S})$$
where the mean value is zero and the noise covariance matrix $\mathbf{S}$ is known.
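As a rough illustration of the model above, the following NumPy sketch draws one channel realization and one received vector. The antenna counts, the noise variance, the QPSK transmit symbols, and the helper name `complex_normal` are assumptions made for this example, and the noise is assumed white, i.e. $\mathbf{S}$ is a scaled identity matrix.

```python
import numpy as np

def complex_normal(rng, shape, variance=1.0):
    """Draw i.i.d. circularly symmetric complex normal samples CN(0, variance)."""
    scale = np.sqrt(variance / 2.0)
    return scale * (rng.standard_normal(shape) + 1j * rng.standard_normal(shape))

rng = np.random.default_rng(0)
n_rx, n_tx = 4, 2          # hypothetical numbers of receive and transmit antennas
noise_var = 0.1            # assumption: white noise, S = noise_var * I

H = complex_normal(rng, (n_rx, n_tx))            # channel matrix
x = np.array([1 + 1j, 1 - 1j]) / np.sqrt(2)      # transmit vector (two QPSK symbols)
n = complex_normal(rng, n_rx, noise_var)         # noise vector
y = H @ x + n                                    # received vector y = Hx + n
```

Each entry of `y` is a noisy linear combination of the transmitted symbols, weighted by one row of `H`.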
Instantaneous CSI
Ideally, the channel matrix $\mathbf{H}$ is known perfectly. Due to channel estimation errors, the channel information can be represented as[2]
$$\mathrm{vec}(\mathbf{H}_{\text{estimate}}) \sim \mathcal{CN}(\mathrm{vec}(\mathbf{H}), \mathbf{R}_{\text{error}})$$
where $\mathbf{H}_{\text{estimate}}$ is the channel estimate and $\mathbf{R}_{\text{error}}$ is the estimation error covariance matrix. The vectorization $\mathrm{vec}()$ was used to achieve the column stacking of $\mathbf{H}$, as multivariate random variables are usually defined as vectors.
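The column stacking and the error model can be sketched in NumPy as follows. The helper names `vec` and `unvec`, the example matrix, and the error variance are assumptions for illustration, and the estimation errors are assumed uncorrelated so that the error covariance is a scaled identity matrix.

```python
import numpy as np

def vec(A):
    """Stack the columns of A into one vector (column-major order)."""
    return A.reshape(-1, order="F")

def unvec(v, shape):
    """Inverse of vec: rebuild a matrix of the given shape from stacked columns."""
    return v.reshape(shape, order="F")

# Example channel with 3 receive and 2 transmit antennas.
H = np.array([[1 + 0j, 2], [3, 4], [5, 6]])
h = vec(H)                     # entries in the order 1, 3, 5, 2, 4, 6

rng = np.random.default_rng(1)
err_var = 0.01                              # hypothetical error variance
R_error = err_var * np.eye(H.size)          # assumption: uncorrelated errors
L = np.linalg.cholesky(R_error)             # R_error = L @ L^H

# One draw of the imperfect estimate: vec(H) plus an error with covariance R_error.
w = (rng.standard_normal(H.size) + 1j * rng.standard_normal(H.size)) / np.sqrt(2)
H_estimate = unvec(h + L @ w, H.shape)
```

NumPy stores arrays in row-major order by default, so `order="F"` is needed to obtain the column stacking used here.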
Statistical CSI
In this case, the statistics of $\mathbf{H}$ are known. In a Rayleigh fading channel, this corresponds to knowing that[3]
$$\mathrm{vec}(\mathbf{H}) \sim \mathcal{CN}(\mathbf{0}, \mathbf{R})$$
for some known channel covariance matrix $\mathbf{R}$.
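To make the statistical description concrete, the sketch below draws many Rayleigh fading realizations from a given covariance matrix and checks that their sample covariance approaches it. The exponential correlation structure, the factor `rho`, and the sample count are assumptions chosen for this example and are not taken from the cited channel model.

```python
import numpy as np

rng = np.random.default_rng(2)
n_rx, n_tx = 2, 2
dim = n_rx * n_tx                      # length of vec(H)

rho = 0.7                              # hypothetical correlation factor
idx = np.arange(dim)
R = rho ** np.abs(idx[:, None] - idx[None, :])    # illustrative covariance matrix

L = np.linalg.cholesky(R)              # R = L @ L^H
num_samples = 20000
W = (rng.standard_normal((dim, num_samples))
     + 1j * rng.standard_normal((dim, num_samples))) / np.sqrt(2)
h_samples = L @ W                      # each column is one draw of vec(H)

# The sample covariance should be close to R for a large number of draws.
R_sample = h_samples @ h_samples.conj().T / num_samples
max_dev = np.max(np.abs(R_sample - R))

# Undo the column stacking to get the first channel matrix realization.
H_first = h_samples[:, 0].reshape((n_rx, n_tx), order="F")
```

A transmitter with statistical CSI knows only `R`, not the individual columns of `h_samples`.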
Estimation of CSI
Since the channel conditions vary, instantaneous CSI needs to be estimated on a short-term basis. A popular approach is to use a so-called training sequence (or pilot sequence), where a known signal is transmitted and the channel matrix $\mathbf{H}$ is estimated using the combined knowledge of the transmitted and received signal.
Let the training sequence be denoted $\mathbf{p}_1,\ldots,\mathbf{p}_N$, where the vector $\mathbf{p}_i$ is transmitted over the channel as
$$\mathbf{y}_i = \mathbf{H}\mathbf{p}_i + \mathbf{n}_i.$$
By combining the received training signals $\mathbf{y}_i$ for $i=1,\ldots,N$, the total training signalling becomes
$$\mathbf{Y}=[\mathbf{y}_1,\ldots,\mathbf{y}_N] = \mathbf{H}\mathbf{P} + \mathbf{N}$$
with the training matrix $\mathbf{P}=[\mathbf{p}_1,\ldots,\mathbf{p}_N]$ and the noise matrix $\mathbf{N}=[\mathbf{n}_1,\ldots,\mathbf{n}_N]$.
With this notation, channel estimation means that $\mathbf{H}$ should be recovered from the knowledge of $\mathbf{Y}$ and $\mathbf{P}$.
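The training phase can be sketched as sending the pilot vectors one at a time and collecting the received vectors as columns. The matrix sizes, the random pilots, the white noise, and the helper name `cn` are assumptions for this example. The training length is called `N_train` and the noise matrix `Noise` to avoid reusing the letter N for both.

```python
import numpy as np

rng = np.random.default_rng(3)
n_rx, n_tx, N_train = 3, 2, 4     # hypothetical sizes; N_train is the training length
noise_var = 0.05                  # assumption: white noise

def cn(shape, var=1.0):
    """i.i.d. circularly symmetric complex normal samples with the given variance."""
    return np.sqrt(var / 2) * (rng.standard_normal(shape)
                               + 1j * rng.standard_normal(shape))

H = cn((n_rx, n_tx))                   # unknown channel
P = cn((n_tx, N_train))                # columns are the pilot vectors p_1, ..., p_N
Noise = cn((n_rx, N_train), noise_var) # columns are the noise vectors n_1, ..., n_N

# Transmit the pilots one by one: y_i = H p_i + n_i.
Y_columns = [H @ P[:, i] + Noise[:, i] for i in range(N_train)]
Y = np.column_stack(Y_columns)         # total training signal, equal to H P + Noise
```

The receiver observes `Y` and knows `P`; `H` and `Noise` are not available to it.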
Least-square estimation
If the channel and noise distributions are unknown, then the least-square estimator (also known as the minimum-variance unbiased estimator) is[4]
$$\mathbf{H}_{\text{LS-estimate}} = \mathbf{Y}\mathbf{P}^H(\mathbf{P}\mathbf{P}^H)^{-1}$$
where $()^H$ denotes the conjugate transpose. The estimation Mean Square Error (MSE) is proportional to
$$\mathrm{tr}\,(\mathbf{P}\mathbf{P}^H)^{-1}$$
where $\mathrm{tr}$ denotes the trace. The error is minimized when $\mathbf{P}\mathbf{P}^H$ is a scaled identity matrix. This can only be achieved when $N$ is equal to (or larger than) the number of transmit antennas. The simplest example of an optimal training matrix is to select $\mathbf{P}$ as a (scaled) identity matrix whose size equals the number of transmit antennas.
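A minimal NumPy sketch of the least-square estimator and its error factor follows. The function names `ls_estimate` and `ls_error_factor`, the matrix sizes, and the two pilot matrices are assumptions for illustration. Both pilot matrices have the same total power, so the comparison isolates the effect of their structure.

```python
import numpy as np

def ls_estimate(Y, P):
    """Least-square channel estimate Y P^H (P P^H)^{-1}."""
    PH = P.conj().T
    return Y @ PH @ np.linalg.inv(P @ PH)

def ls_error_factor(P):
    """tr((P P^H)^{-1}), the quantity the LS estimation MSE is proportional to."""
    return float(np.trace(np.linalg.inv(P @ P.conj().T)).real)

rng = np.random.default_rng(4)
n_rx, n_tx = 3, 2
H = (rng.standard_normal((n_rx, n_tx))
     + 1j * rng.standard_normal((n_rx, n_tx))) / np.sqrt(2)

# Two pilot matrices with the same total power (squared Frobenius norm 4).
P_identity = np.sqrt(2.0) * np.eye(n_tx)                 # P P^H is a scaled identity
P_skewed = np.array([[np.sqrt(2.0), 1.0], [0.0, 1.0]])   # non-orthogonal pilots

# Without noise the estimator recovers the channel exactly.
H_hat_noise_free = ls_estimate(H @ P_identity, P_identity)

# With noise, the error is the noise passed through P^H (P P^H)^{-1}.
noise_var = 0.01
Noise = np.sqrt(noise_var / 2) * (rng.standard_normal((n_rx, n_tx))
                                  + 1j * rng.standard_normal((n_rx, n_tx)))
H_hat = ls_estimate(H @ P_identity + Noise, P_identity)

factor_identity = ls_error_factor(P_identity)   # 1.0
factor_skewed = ls_error_factor(P_skewed)       # 2.0
```

With equal pilot power, the non-orthogonal pilots double the error factor in this example, which is consistent with the scaled identity being the optimal choice.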
MMSE estimation
If the channel and noise distributions are known, then this a priori information can be exploited to decrease the estimation error. This approach is known as Bayesian estimation and for Rayleigh fading channels it exploits that
$$\mathrm{vec}(\mathbf{H}) \sim \mathcal{CN}(\mathbf{0}, \mathbf{R}), \quad \mathrm{vec}(\mathbf{N}) \sim \mathcal{CN}(\mathbf{0}, \mathbf{S}).$$
The MMSE estimator is the Bayesian counterpart to the least-square estimator and becomes[2]
$$\mathrm{vec}(\mathbf{H}_{\text{MMSE-estimate}}) = \left(\mathbf{R}^{-1} + (\mathbf{P}^T \otimes \mathbf{I})^H \mathbf{S}^{-1} (\mathbf{P}^T \otimes \mathbf{I})\right)^{-1} (\mathbf{P}^T \otimes \mathbf{I})^H \mathbf{S}^{-1} \mathrm{vec}(\mathbf{Y})$$
where $\otimes$ denotes the Kronecker product and the identity matrix $\mathbf{I}$ has the dimension of the number of receive antennas. The estimation Mean Square Error (MSE) is
$$\mathrm{tr}\left(\mathbf{R}^{-1} + (\mathbf{P}^T \otimes \mathbf{I})^H \mathbf{S}^{-1} (\mathbf{P}^T \otimes \mathbf{I})\right)^{-1}$$
and is minimized by a training matrix $\mathbf{P}$ that in general can only be derived through numerical optimization. However, there exist heuristic solutions with good performance based on waterfilling. As opposed to least-square estimation, the estimation error for spatially correlated channels can be minimized even if $N$ is smaller than the number of transmit antennas.[2] Thus, MMSE estimation can both decrease the estimation error and shorten the required training sequence. It does, however, additionally require knowledge of the channel correlation matrix $\mathbf{R}$ and the noise correlation matrix $\mathbf{S}$. In the absence of accurate knowledge of these correlation matrices, robust choices need to be made to avoid MSE degradation.[5][6]
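The MMSE estimator can be sketched in NumPy using the identity vec(HP) = (P^T ⊗ I) vec(H), which turns the training model into a linear model in vec(H). The function name `mmse_estimate`, the matrix sizes, and the covariance matrices are assumptions for this example. Here S is the covariance of the stacked noise vec(N), so its dimension is the number of receive antennas times the training length.

```python
import numpy as np

def vec(A):
    """Stack the columns of A into one vector."""
    return A.reshape(-1, order="F")

def mmse_estimate(Y, P, R, S):
    """Return the MMSE channel estimate and the resulting estimation MSE."""
    n_rx, n_tx = Y.shape[0], P.shape[0]
    P_tilde = np.kron(P.T, np.eye(n_rx))           # P^T (x) I
    A = P_tilde.conj().T @ np.linalg.inv(S)        # (P^T (x) I)^H S^{-1}
    error_cov = np.linalg.inv(np.linalg.inv(R) + A @ P_tilde)
    h_hat = error_cov @ A @ vec(Y)
    return h_hat.reshape((n_rx, n_tx), order="F"), float(np.trace(error_cov).real)

rng = np.random.default_rng(5)
n_rx, n_tx = 2, 2
noise_var = 0.25                                   # assumption: white noise

def cn(shape, var=1.0):
    return np.sqrt(var / 2) * (rng.standard_normal(shape)
                               + 1j * rng.standard_normal(shape))

# Case 1: uncorrelated channel, identity pilots, training length 2.
R = np.eye(n_rx * n_tx)
P = np.eye(n_tx)
S = noise_var * np.eye(n_rx * P.shape[1])
H = cn((n_rx, n_tx))
Y = H @ P + cn((n_rx, P.shape[1]), noise_var)
H_hat, mse = mmse_estimate(Y, P, R, S)             # here H_hat = 0.8 * Y, mse = 0.8

# Case 2: correlated channel with a single pilot vector (fewer than n_tx).
idx = np.arange(n_rx * n_tx)
R_corr = 0.9 ** np.abs(idx[:, None] - idx[None, :])   # illustrative covariance
h_corr = np.linalg.cholesky(R_corr) @ cn(n_rx * n_tx)
H_corr = h_corr.reshape((n_rx, n_tx), order="F")
P_short = np.array([[1.0], [1.0]])
S_short = noise_var * np.eye(n_rx)
Y_short = H_corr @ P_short + cn((n_rx, 1), noise_var)
H_hat_short, mse_short = mmse_estimate(Y_short, P_short, R_corr, S_short)
```

In the first case the estimator simply shrinks the observation toward zero by the factor 1/(1 + noise_var). In the second case the estimate is still well defined with one pilot vector, and `mse_short` is below the prior uncertainty tr(R_corr) = 4, whereas the least-square estimator would need at least two pilot vectors.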
References
- [1] A. Tulino, A. Lozano, S. Verdú, Impact of antenna correlation on the capacity of multiantenna channels, IEEE Transactions on Information Theory, vol. 51, pp. 2491-2509, 2005.
- [2] E. Björnson, B. Ottersten, A Framework for Training-Based Estimation in Arbitrarily Correlated Rician MIMO Channels with Rician Disturbance, IEEE Transactions on Signal Processing, vol. 58, pp. 1807-1820, 2010.
- [3] J. Kermoal, L. Schumacher, K.I. Pedersen, P. Mogensen, F. Frederiksen, A Stochastic MIMO Radio Channel Model With Experimental Validation, IEEE Journal on Selected Areas in Communications, vol. 20, pp. 1211-1226, 2002.
- [4] M. Biguesh, A. Gershman, Training-based MIMO channel estimation: a study of estimator tradeoffs and optimal training signals, IEEE Transactions on Signal Processing, vol. 54, pp. 884-893, 2006.
- [5] Y. Li, L.J. Cimini, N.R. Sollenberger, Robust channel estimation for OFDM systems with rapid dispersive fading channels, IEEE Transactions on Communications, vol. 46, pp. 902-915, July 1998.
- [6] M. D. Nisar, W. Utschick, T. Hindelang, Maximally Robust 2-D Channel Estimation for OFDM Systems, IEEE Transactions on Signal Processing, vol. 58, pp. 3163-3172, June 2010.









