Maintenance
of acceptable voice quality levels despite inevitable variations
in network performance (such as congestion or link failures)
is achieved using such techniques as compression,
silence suppression,
and QoS-enabled transport networks.
Several developments in the 1990s, most notably advances in
digital signal processor technology, high-powered network
switches, and QoS-based protocols, have combined to enable
and encourage the implementation of voice over data networks.
Low-cost, high-performance DSPs can process the compression
and echo cancellation algorithms efficiently.
Software
pre-processing of voice conversations can also
be used to further optimize voice quality. One technique,
called silence suppression, detects whenever there is a gap
in the speech and suppresses the transfer of things like pauses,
breaths, and other periods of silence. This can amount to
50-60% of the time of a call, resulting in considerable bandwidth
conservation. Since the lack of packets is interpreted as
complete silence at the output, another function is needed
at the receiving end to add "comfort noise" to the
output.
Another
software function that improves speech quality is echo
cancellation. As was noted earlier, echo becomes
a problem whenever the end-to-end delay for a call is greater
than 50 milliseconds. Sources of delay in a packet voice call
include the collection of voice samples (called accumulation
delay), encoding/decoding and packetizing time, jitter buffer
delays, and network transit delay. The ITU recommendation
G.168 defines the performance requirements that are currently
required for echo cancellers.
Engineering
a VoIP network (and the equipment used to build it) involves
trade-offs among the quality of the delivered speech, the
reliability of the system, and the delays inherent in the
system. Minimizing the end-to-end delay budget is one of the
key challenges in VoIP systems. Ensuring reliability in a
"best effort" environment is another. Equipment
producers that offer the flexibility to configure their systems
to fit the environment and thereby optimize the quality of
the voice produced will have a competitive advantage.
|