CMA-FSE ERROR SURFACE EXAMPLES
P. Bert Schniter -
Cornell University Blind Equalization Research Group
The behavior of the Constant Modulus Algorithm (CMA) as used to
adapt a Fractionally Spaced Equalizer (FSE) is illustrated
below using simple examples. Some important conclusions are then
drawn from these examples: (1) in the presence of noise or channel
undermodelling, proper initialization may be necessary to sucessfully
equalize the channel, and (2) channels containing nearly reflected
roots exhibit high levels of noise gain.
Background
In digital communications systems,
the path from source to receiver (i.e. the channel)
generally introduces dispersion
and multipath, making the received pulses difficult or impossible
to detect without a high probability of error. The use of equalization
in the receiver attempts to combat these linear distortions imposed
by the channel. As a linear equalizer filtering the received signal,
usually a transversal FIR implementation
is chosen. When sampling the received signal at the system
baud rate, we refer to the equalizer as baud spaced. When
operating at a sample rate higher than baud (a typical choice being
twice the baud rate), we refer to the equalizer as fractionally
spaced. The use of FSEs is nearly ubiquitous in recent
designs due to advantages in performance over baud spaced
equalizers. It can be shown [2] that perfect
equalization is possible with FSEs longer than the channel
delay spread for channels not exhibiting reflected
roots.
In addition to dispersion and multipath, the channel can introduce
noise. In general, the equalizer must make a compromise
between compensating for linear channel distortions and attenuating noise.
In the presence of noise, the ability to perfectly equalize is lost.
In many cases, the receiver does not know the channel characteristics
(possibly because of a newly initiated connection or because of a
rapidly varying channel). To remedy this, it may be possible to
transmit a known training signal
from which the receiver can extract the current channel characteristics.
In some situations, however, the amount of channel capacity
used by this scheme
cannot be tolerated, and the receiver must instead determine the channel
characteristics blindly. CMA is the most widely
established algorithm for accomplishing blind equalization.
Motivation
Similar to LMS, CMA can be considered a stochastic
gradient algorithm. This means that the convergence of the equalizer
parameter vector under CMA can be viewed as a (stochastic) traversal of
the CMA cost surface with average movement in the direction of
steepest descent. Thus, great insights into the dynamical behavior of
CMA can be gained from an understanding of the topology of the CMA cost
surface.
What follows are simple two-dimensional examples that help to illustrate
the salient characteristics of the CMA cost surface in the case of
fractionally spaced equalization.
Restricting the scope to two real-valued equalizer taps makes the surface
possible to visualize as a 3D object and allows us to identify important
features which also exist in the general (i.e. higher dimensional) case.
Illustrations of the CMA-FSE Cost Surface
In one regard, the goal of CMA
is to transfer a set of initial equalizer parameters to
the lowest point on the CMA cost surface. As evident below, there may
be multiple locations on the cost surface attaining the minimum cost.
As such, they represent equally desirable solutions.
All of the contour plots exhibit the following properties: at the
origin there exists a maximum, surrounded by a ring of up to
four minima. Minima reflected across the origin correspond to
solutions differing only in sign, which is inconsequential when
using differential coding. Finally, as we travel further from the
origin, the surface rises steeply in all directions.
It should be noted that, though the following surfaces were created from
particular channels,
they are intended to be prototypical of their
respective channel classes. For simplicity, BPSK was chosen as the
model for the source statistics, though the thrust of the conclusions
below applies to higher-order constellations (e.g. 64-QAM) as well.
|
Perfect Equalizability with Well-Behaved Channel
In the case of perfect equalizability, all four minima correspond to
the same (optimal) CMA cost of zero. Each minima reflects a
particular combination of system delay and system polarity. The
asterisk indicates the location of a particular Wiener solution.
Since there is no noise, it appears coincident with the CMA cost
minimum of corresponding system delay and polarity. |
|
Perfect Equalizability with Nearly-Reflected Channel Roots
As channel roots become more nearly reflected, the surface becomes
more elongated.
Note the scaling on the vertical axis is 100 times that on the
horizontal axis! This characteristic will be
manifested as an extremely slow convergence of the 'f1' equalizer tap
due to the shallowness of the gradient along that dimension.
Theoretically, however, perfect equalizability is still
possible in the noiseless case. In other words, all minima correspond
to zero cost and are aligned with the respective Wiener solutions. |
|
Noisy but Well-Behaved Channel
When the well-behaved channel (i.e. the first plot above) is operated
in a 7dB SNR environment,
note the appearance of local minima distinct from the global
minima. For particular initializations, local minima are capable
of capturing the parameter
trajectory and preventing it from attaining an optimum setting.
Note that all minima correspond to costs greater than zero since perfect
equalization is not possible in the presence of noise. |
|
Noisy Channel with Nearly-Reflected Roots
When the channel with nearly reflected roots (above) is operated in a 7dB
SNR environment, we see a dramatic change in the cost surface.
Note the equivalent scaling of horizontal and vertical axes; the cost surface
has become much less elongated than in the noiseless case (see the second
picture in the series).
Also note the merging of two pairs of
minima into one pair. For this channel, the optimal solution in the
presence of noise is very different from the solution which perfectly
equalizes the channel. |
|
Undermodelled but Well-Behaved Channel, No Noise
When the delay spread of the channel is longer than the delay
spread of the FSE, we lose the ability to perfectly equalize. Thus,
none of the minima correspond to zero cost. More importantly, note the
existence of local minima with greater CMA cost than the global minima.
This implies that the equalizer initialization will determine the
optimality of the resulting CMA solution. |
Conclusions
The examples above emphasize a few facts. First, the FSE-CMA cost surface
is multimodal, by virtue of the existence of multiple minima. In the
case where an FSE is able to perfectly equalize the channel, all minima
correspond to the same optimal CMA cost. Furthermore, it can
be shown that these multiple minima occur very near to the Wiener solutions
corresponding to particular combinations of system polarity and system
delay [3]. By system delay, we mean the delay through the cascaded channel
and equalizer. By sytem polarity, we mean {+,-} in the case of real
signals or {+,-,+j,-j} in the case of complex signals.
Next, channels with nearly-reflected roots will most likely exhibit
very slow FSE parameter convergence rates in the absence of noise. Adding the
effects of noise, the FSE may not even come close to equalizing the channel.
Well behaved channels are not as sensitive to noise as those
with nearly reflected roots.
Any time noise is present, though,
we can expect local minima. This implies the potential for
convergence to non-optimal CMA solutions and places importance on
the choice of initialization. The same holds true for FSEs that
undermodel the channel, even for well-behaved channels in the absence
of noise. Initialization can be extremely important if, in these cases,
there exist bad local minima which do not correspond to open
eye solutions. By open eye, we mean that the channel has been
equalized enough to successfully transfer adaptation to a decision-
directed equalization scheme. If CMA converges to one of these
bad local minima, there is little chance of successfully demodulating the
incoming data. In such cases, the system must be restarted, possibly with
a different initialization, in the hope that CMA will converge to an
open-eye solution.
Questions for Practitioners
The examples presented in this demonstration help to motivate the
search for techniques that avoid the problems associated with CMA's
sensitivity to poor choices of initialization. In practice, it is
typical to use a center-spike initialization, whereby the center
equalizer tap is made unity and all other taps are made zero.
Though originally suggested in [1], we know of no theoretical work
that has rigorously defended this technique.
The question remains as to how well the center-spike initialization
really works. We have received reports of the center-spike method
failing on occasion, but have not received any specific examples.
Can anyone provide us with some?
References
[1] D.N. Godard, "Self-Recovering Equalization and Carrier Tracking in
Two-Dimensional Data Communication Systems," IEEE Trans. Commun.,
Vol. COM-28, No.11, Nov. 1980.
[2] J.R. Treichler, I. Fijalkow, and C.R. Johnson, Jr., "Fractionally
Spaced Equalizers - How long should they really be?," Signal
Processing, pp.65-81, May 1996.
[3] H.H. Zeng, L. Tong, and C.R. Johnson, Jr., "Relationships Between
CMA and Wiener Receivers," Submitted to IEEE Trans. on Inform.
Theory.
Further Reading
The examples and discussion above are drawn from a
technical report written by members of Cornell's
Blind Equalization Research Group (BERG). The report includes
a derivation of the equations respresenting CMA-FSE cost with arbitrary
source kurtosis (i.e. for any constellation) and in the presence
of additive white gaussian channel noise.
Matlab Code
The
BERGulator can be used to generate these surfaces and others.