Troubleshooting Audio Quality Problems in the Field

advertisement
Troubleshooting Audio Quality
Problems in the Field
APP
NOTE
Issue 1.0
February 2006
Abstract
This document provides troubleshooting procedures for
resolving audio quality issues in the field.
avaya.com
Table of Contents
1.
Introduction
2.
System Status and Environment
3.
Problem Definition
4.
Data Gathering
5.
Troubleshooting
6.
Audio Quality Anomalies
7.
Other Information
8.
References
APPENDIX A: Audio Processing Components & Audio
Quality Terminology
APPENDIX B: Basic IP Telephony Solution Diagram
3
4
5
5
6
7
11
12
13
26
avaya.com
3 of 27
1. INTRODUCTION
Audio quality issues experienced by customers have
many and varied causes. The converged voice and data
solutions being deployed today require additional
consideration when troubleshooting these issues as
compared to the legacy voice solutions of the past.
Some problems may be caused by equipment configuration
or administration, others by network or environmental
conditions, some by user misuse, and some are simply a
customer expectation of the solution’s operation as
opposed to the solution’s inherent design intent.
This document provides a guide to troubleshooting a
customer’s audio quality issue(s) and determining what
component(s) in the call topology is causing the
problem. This is accomplished by first analyzing the
customer system and environment, and then defining the
customer’s audio quality issue using an industry
accepted definition of the anomaly and the typical
causes of that anomaly to thereby identify the area of
the customer’s solution (or network) in which to
concentrate the troubleshooting effort.
Appendix A provides a glossary of audio processing
components and audio quality terminology. Appendix B
provides a sample basic IP Telephony solution diagram
as a reference.
avaya.com
4 of 27
2. SYSTEM STATUS AND ENVIRONMENT
As part of the normal Services troubleshooting
routine, it is necessary that system components are up
to date with the correct software/firmware releases,
hardware versions, etc. Once the system components are
established as having the latest updates, it is highly
recommended that Best Practices documentation be
consulted1. Areas not meeting the initiatives of Best
Practices documentation should be corrected and the
audio problem retested before proceeding in this
document.
Environmental conditions may cause or contribute to
audio quality problems. Local noise sources such as
computer fans, PC speakers, noise from air
conditioning (HVAC), ambient speech or noisy office
environments (as in call centers); can have an affect
on audio quality being perceived by the user when
using handset, headset or speakerphone.
For optimum speakerphone performance, make sure the
phone is sufficiently clear of any shelves or clutter
near the phone that could provide an echo path during
speakerphone use. Note that environments with hard and
reflective surfaces (such as metal walls and glass
windows) can promote reverberation when speakerphones
are used and result in echo complaints (from the farend).
1
Best Practices documents are intended for Avaya Services
Installation teams and Business Partners, and can be found on
the Avaya Enterprise Portal Support page via the Anatomy of a
Successful Cut link:
https://enterpriseportal.avaya.com/ptlWeb/appmanager/ptl/internal/?_nfpb=true&_pageLabel=support.
The link provides documents such as the Echo Prevention
Checklist which can be obtained by clicking on Checklists/FAQs
and then CSD – CM.
avaya.com
5 of 27
3. PROBLEM DEFINITION
Assuming the system components are up to date (latest
version HW, SW, FW) and the solution design meets
Avaya’s IP Telephony deployment requirements2, the
audio quality complaint from the customer should be
translated into an industry accepted definition of
audio or voice quality related terminology as
appropriate. Appendix A provides a glossary of these
terms.
4. DATA GATHERING
Beyond the standard set of component model and release
information, gather additional meaningful information
such as call type (station to station or PSTN trunk
call), call traversal and solution topology (within
port network, across port network, LAN, WAN, etc.).
Information gathering on whether the user experiences
the anomaly on handset, headset, or speakerphone; or
whether any operation is being performed (transfer,
conference, etc.) when the anomaly occurs can help in
determining the cause of the anomaly.
2
Avaya solution deployment guidelines and network requirements are contained in document ID
555-245-600 or can be accessed at the following URL:
http://support.avaya.com/elmodocs2/comm_mgr/r3/pdfs/245600_3_4_1.pdf
avaya.com
6 of 27
5. TROUBLESHOOTING
With the information gathered (as described in section
4), it is important that you outline the exact call
flow or the end-to-end route the voice signal takes
from the service provider network (if applicable)
through the system and across the customer network to
the end network device. Appendix B provides a network
diagram with measurement points identified (A – D) for
troubleshooting an incoming PSTN call to an enterprise
user. The following paragraphs are meant to assist in
determining the call flow.
If the audio quality problem involving Legacy
Telephones is intermittent, the support engineer will
be required to be available onsite or remotely at the
time the anomaly occurs (this may require a pager or
other notification process be established between
customer and support engineer so that the trouble can
be addressed in real time).
If the audio quality problem involving IP Telephones
is intermittent, a network analyzer (such as Agilent)
should be set up for 24/7 operation such that the call
information is captured (and buffered) at the time the
anomaly occurs. The key to ensuring the anomaly is
captured is by stopping the trace within a short time
after the problem occurs. This will require the user
to notify the customer’s IT department when the
anomaly occurs so that trace can be stopped ensuring
pertinent trace information is not overwritten. A
pager or other notification should be sent to the
Avaya Services Engineer to alert them that the capture
has been obtained.
If it has been established that the audio quality
problem occurs on trunk calls, it is recommended that
a trunk ID button be programmed on the user’s phone so
that when the audio anomaly occurs, the specific trunk
accessed can be obtained.
avaya.com
7 of 27
If the audio quality problem is reproducible,
establish a call with the phone experiencing the audio
issue and note the call traversal (the path the call
takes through the system). Once the call flow has been
identified, all the network components involved are
now known. At this point, try to determine whether the
problem is limited to specific system components by
initiating new calls that traverse a fewer number of
system components. For instance, test if the audio
issue can be reproduced on a station to station call
between two DCP Telephones on the same port network.
If the audio anomaly occurs when IP Telephones are
used, determine whether the anomaly exists with IP to
IP direct (shuffling) or whether the anomaly only
exists when calls traverse a Media Gateway VoIP
resource3 or only over a trunk call.
6. AUDIO QUALITY ANOMALIES
Having established the path the call traverses when
the customer audio anomaly occurs, combine this
knowledge with the definitions in Appendix A to
determine what system component is causing the problem
or if further measurements are required. The following
sections describe the three major categories of audio
quality anomalies (distortion, noise, and echo) and
the typical causes.
Distorted Speech
Speech distortion(s) are typically caused by low-rate
coders, improperly matched hybrid trunk circuits, and
stutter or speech-level pumping. Audio distortion
(amplitude clipping) coming from outside the
customer’s network is typically cause by the far-end
phone.
3
Note that taking a trace to an IP Phone while an anomaly is occurring is a quick
test to identify if the problem is in the phone or not by listening to the RTP
stream and comparing that to what is heard in the phone.
avaya.com
8 of 27
Distortion from inside the customer network could be
caused by, for example, a headset user with the
microphone too close to their mouth and overdriving
the microphone circuitry.
1. Distorted Speech can be caused by a damaged
microphone in the handset, headset, or speakerphone
circuit. Swap out the equipment to determine if this
is the cause of the distortion. Use of Low-rate
coders can cause muffled speech, synthetic,
mechanical or robotic speech when network
impairments are present. Take network QoS
measurements and note the values of the typical
characteristics (packet loss, delay and jitter).
•
If the user experiences any of theses types of
speech distortions while on a call using the
4610/20/21/22 IP Phones, the user (or Services
engineer) may press the Network Audio Quality
menu item (hit the Options button and select
Network Audio Quality) to get an instant
measurement of packet loss, delay and jitter
being experienced by that phone.
•
If the problem is intermittent, working with
the customer to set up a network analyzer to
monitor the network 24/7 such that when the
user experiences the problem, the trace can be
stopped and RTP analyzed. Use this data in
conjunction with the network topology and call
flow to determine what network element is
causing the anomaly.
2. Amplitude clipping can be determined by running an
Ethereal trace and converting the RTP stream to a
.wav file and analyzing that file with a tool such
as Audacity. The clipped speech signal can be
observed graphically with that tool.
avaya.com
9 of 27
If amplitude clipping is present, identify solution
elements that add gain and verify each element is
applying the expected amount of gain.
3. Clipping can be caused by packet loss which can be
determined using a network analyzer. Also, e-can
settings 1 thru 4 on the TN464GP boards are very
aggressive and can cause clipping. Administering the
DS1 (GP) e-can setting to 8 or 9 can
reduce/eliminate the clipping4. On DS1 HP boards, ecan setting 3 is a less aggressive echo canceller
and can reduce/eliminate clipping.
4. Clipping during double-talk can be caused by the
suppressor function of an echo canceller in the
network. DS1, MedPro/Cruiser, Crossfire, and ATM all
use e-cans with suppressor functions.
5. Stutter can be traced to it’s origin by setting up a
network analyzer to analyze the call topology and
where the bottleneck is occurring.
6. Speech-level pumping can be caused by the DS1 e-can
setting 4 (on GP cards) or by the telephone’s AGC.
Adjusting the GP type DS1 e-can setting to 7 or 8 or
turning off AGC are fixes for this problem. Also,
high transmit gain levels can cause pumping.
Reducing the TX gain in the 46xx Settings file use
by the IP Telephones can also reduce/eliminate
pumping5.
4
Changing the e-can setting affects active calls. For this
reason, it’s beneficial to change he e-can setting while the
audio anomaly is occurring in order to get immediate feedback on
the change.
5
This should only be performed by Tier IV or under Tier IV
supervision.
avaya.com
10 of 27
Noise
Noise occurs in most telephones at low levels. It’s
only when the levels are perceptible that complaints
arise. The following provides noise types and ways in
which to capture information in order to analyze and
pinpoint the cause of the noise.
1. Hiss or white noise is common at low levels.
Requirements allow for up to 20dBrnC of idle-channel
noise to appear in the receiver of a handset or
headset. Typical idle-channel noise should measure
approximately 15dBrnC. This can be found using a
pair of Sage 925VSTs. Or a Sage 925VST with a Sage
Responder.
2. Static causes can be identified by using a network
analyzer to determine if there is packet loss or bit
errors.
3. Motor-boating or Hum can be caused by power-line
interference. Checking grounds and any nearby high
voltage sources can uncover the cause for this
problem.
4. Distorted Music-on-Hold is typically caused by a
low-rate coder. If so, changing the coder will fix
this problem.
avaya.com
11 of 27
Echo
Echo, either acoustic or electric, is a function of
delay in the call path6. The steps below address how to
determine the cause of the delay.
1. Speech back to the far-end. If the far-end no longer
complains of echo after muting the near-end phone,
then this confirms the near-end is the source of the
echo. If the near-end is speakerphone, verify the
environment setup. If near-end headset user, verify
proper cords used. If headset with M12 amplifier,
verify the amplifier settings and proper cords are
used. Taking a trace on an IP Phone while echo is
heard can identify if the echo is in the phone or
from the far-end (if no echo is heard in the trace,
then it’s in the phone).
2. Electrical echo (constant, intermittent, residual,
distorted) is either at the analog trunk or in the
PSTN. Terminating the trunk locally and testing for
echo will identify if the source of echo is in local
or in the PSTN. Verify DS1 e-can is being utilized
and not the MedPro e-can resources.
7. OTHER INFORMATION
It is highly recommended that you do not adjust the
Loss Plan or Terminal Parameter gain settings. This is
not a recommended option for addressing audio
impairments.
For optimum headset performance, use only Avaya
headsets best suited for the environment7.
6
For more in-depth echo discussion, see Echo and Echo Control
document - COMPAS ID 106541.
7
http://www.avayaheadsets.com
avaya.com
12 of 27
8. REFERENCES
[1]
Terminology for voice-related artifacts in
telephony, CID # 114121, September, 2005.
[2]
Echo and Echo Control, CID # 106541, Issue
0.4, June, 2005.
[3]
Avaya 4600™ Series IP Telephones - Modular
Configuration Power Options, CID # 93139, Issue 1.3,
January, 2004
avaya.com
13 of 27
APPENDIX A: Audio Processing
Components & Audio Quality
Terminology
avaya.com
14 of 27
In a typical telephony call, speech from talker to
listener often passes through the following processing
components and in the order identified in Figure 1.
speaker
reverse of below
Echo
controller
+
-
+
Echo
controller
Expander
(noise
reduction)
Speech
encode
network
Speech
decode
Packet-loss
conealment
Automatic
Gain
control
mic
Figure 1.
The talker’s voice enters at the microphone on the
left side of Figure 1, then to the microphone
expander, voice coder, network transport, voice
decoder, packet-loss concealment, echo controller,
automatic gain control and, finally, the listener’s
ear. Brief definitions of these components are given
below.
Echo controller. This is a broad term meaning an
echo canceller, echo suppressor, or a combination of
the two. Speakerphone algorithms are also included. An
echo controller prevents a talker from hearing distant
reflections (echoes) of his/her own voice, reflections
caused by acoustic or electrical reflection points
within the telephone network and end-user equipment.
Echo controllers are often only partially successful,
and this is why echo is sometimes heard even though
the call path is known to include echo controllers.
Often, people use the “echo canceller” when in fact
what is being referred to is an echo controller.
Echo canceller. Software or hardware implementations
of a digital signal processing algorithm designed to
model and subtract-out – or cancel – the reflection,
or echo, of a speech signal. Strictly speaking, an
echo
path
avaya.com
15 of 27
echo canceller does not introduce attenuation or
suppression into the speech paths to reduce the
loudness of echo. The term canceller refers to an
adaptive digital filter that models the physical echo
path and subtracts that (excited) model from the
return speech path.
Echo suppressor. Like echo canceller, above, except
the echo level is reduced or eliminated by applying
suppression or attenuation to the return speech
channel. The use of attenuation causes other audio
artifacts, including chopping or clipping of speech
utterances and/or pumping of the loudness level of a
caller’s speech.
Microphone expander (and or noise reduction). A
microphone expander is a traditional and relatively
simple method of improving the speech-signal-tobackground-noise ratio emanating from the microphone
path. An expander attenuates weak room background
noises while passing unaltered the relatively loud
speech of the talker addressing the handset (or
headset, or speakerphone).
Speech coder (encoder and decoder). The raw
speech signal, once digitized, is often digitally
encoded for transmission into the telephone network.
Encoding has one purpose, namely, to reduce the bitsper-second rate of transmission required to
communicate voice from one end to the other. Highly
compressive codecs, such as a G.729 codec, reduce
speech to a low transmission rate (8000 bits-persecond), but sacrifice voice quality in doing so.
Higher voice quality is experienced in systems using
the traditional G.711 codec (mu-law codec), since
G.711’s higher transmission rate of 64,000 bits-persecond better captures the nuances of speech.
Regardless of coding scheme, at the receiving side,
the speech decoder reconstructs (an approximation to)
the original speech for playback.
avaya.com
16 of 27
Packet-loss concealment. Often combined with
speech decoders. When the network path includes
packet-speech transmission links, like VoIP, speech
packets can be lost because of network failures. In
such cases, a concealment algorithm attempts to fillin missing speech samples. Concealment can work well
when the rate of lost speech is very low, say, less
than 2% of transmissions.
Automatic gain control. Automatic gain control
devices apply signal gain or loss automatically in an
attempt to keep the speech sound level at the
listener’s ear relatively constant. Therefore, AGC
boosts low-level speech while reducing speech levels
that are too loud. Such devices have been used for
decades in audio broadcasting and recording
applications.
The following sections outline the troubleshooting
steps in order to identify audio issues.
avaya.com
17 of 27
Speech Distortions
Distorted Speech
Speech accompanied by an unnatural buzzing or raspy
sound. A classic example of distortion occurs in the
case of a far party who is speaking too loudly or too
close to the handset or headset microphone. The far
party’s speech saturates either the mechanical or
electrical capabilities of the handset, causing
overload distortion or amplitude clipping. Other
sources of speech distortion include: defective or
improperly (impedance) matched hybrid trunk circuits,
damaged microphones, loudspeakers or earpieces (a
rarity), and low-bit-rate speech coders (VoIP and
wireless).
Muffled Speech
Speech that has an unnatural loss of high-frequency
content. Muffled speech may be caused by, for example,
poorly designed microphone assemblies in handsets (in
particular, wireless handsets) and low-bit-rate speech
coders (VoIP or wireless).
Reverberant speech (also hollowness or
speaking-in-a-tunnel effect)
Sounds like the person you are listening to is
speaking in a barrel or large empty room. This can be
the case when the talker is using a speakerphone, but
it can also be the case when there is network echo,
e.g., in a teleconference without echo control.
Also see ECHO section anomalies: hot sidetone, shortpath acoustic echo, and electrical echo.
avaya.com
18 of 27
Synthetic, Mechanical, or Robotic Speech
This can be very subtle or very severe, or very
consistent or intermittent. In the most severe case,
the pitch information has been lost making the speech
sound monotonic and robotic. Recognizing who is
speaking is often difficult. Faulty or bad acting
hardware (e.g., a DSP chip in a cell phone, base
station, or VoIP gateway) can cause this. This may be
port/resource dependent but consistent on the faulty
port or resource. If this is occurring intermittently,
sudden bursts of packet loss (or frame erasure in a
wireless network) may be the cause as the packet-loss
concealment scheme struggles to interpolate the speech
signal. A less extreme case in which you may recognize
the speaker, but yet the speech sounds somewhat
artificial or synthetic has been known to occur as
well. Sometimes the effect includes a swirling sound
accompanying or in place of certain frequencies of the
voice signal. It may be very subtle and is often
linked to the usage of a low-bit-rate speech codec,
such as G.729 in the presence of background noise, or
possibly the occurrence of packet loss or frame
erasures, as occurs in VoIP and wireless networks.
Amplitude Clipping
See definition for distortion above.
Clipping
Portions of the speech signal are not heard. This can
occur in packet-switched networks when, for example,
large numbers of successive speech packets are not
received because of excessive network congestion.
Common in wireless phones, where the RF-signal
strength fades as the user moves within the
environment.
avaya.com
19 of 27
Clipping during Double-talk
Clipping, as defined above, but heard only when both
parties of a telephone call talk at the same time.
When it occurs, this effect is almost always caused by
the excessive use of echo suppression (see definition)
at some point within the network. In this case,
clipping of speech utterances is not caused by lost
speech packets or, in the case of wireless phones, RF
fades, though those artifacts may also be present in
the same call.
Stutter
This is often used to describe an effect caused by
repetition of short bursts of noise or speech, such as
“da-da-da-da…” or “fa-fa-fa-fa…” Stutter distortion
can occur in packet-speech networks when one or more
network elements (e.g., router or switch) become a
bottleneck to the timely transmission of speech
packets.
Speech-level pumping
Pumping is often used to describe a varying speechloudness level, that is, were the speech gets louder,
softer, then louder again, etc., over the course of a
call, often over a period of just several seconds.
Automatic gain control devices can cause audible and
distracting pumping.
NOISE
Hiss or White Noise
Relatively natural-sounding noise containing energy at
all frequencies. Low-level, idle-channel hiss noise
can be perceived on nearly every telephone call when
no person is speaking.
avaya.com
20 of 27
Static
Impulsive, ticking noise, similar to the sound of an
AM radio when tuned to a very weak or nonexistent
radio station. In a packet-speech network, can be
caused by lost speech packets and/or bit errors. May
also be used to describe power-line hum (see
definition below).
Motor-boating
Repetitive noise that is separate and distinct from
the talker’s voice. Motor-boat noise differs from
static in that it is repetitive or non-random. Motorboat noise can be caused by periodic disruptions
(errors) to digital speech circuits and power-line
interference at telephone endpoints.
Hum
Sounds like humming, as in “Hmmmmm…” Hum noise often
occurs when a source of 50 Hz or 60 Hz electrical
power is located near a telephone. The power source
emits an RF (radio frequency) field that induces a
hum-like noise that is heard through the phone’s
handset/headset earpiece or speakerphone loudspeaker.
Distorted Music-on-Hold
Low-bit-rate codecs such as G.729, and G.723, were
created to efficiently encode and transport speech but
not music. Thus the usage of these and other codecs
may distort and ruin the music signal. This can be
subtle or severe depending on the music source. As a
remedy you can try to experiment with various music
sources, but it would be best to use a more
appropriate codec such as G.711, G.726, or G.722.
avaya.com
21 of 27
ECHO
There are only two physical sources of echo in
telephony: electrical echo, or network echo, and
acoustic echo. Electrical echo is caused by a
reflection of the speech signal at 2-to-4-wire hybrid
circuitry. This circuitry is present in our own analog
trunk cards, and it also exists deep in within the
PSTN (at customer premises, for example). Acoustic
echo is caused by the physical coupling (air path,
appliance-body path) between a loudspeaker and a
microphone, for example, in a speakerphone, a handset
and a headset. Whether or not a talker actually
perceives electrical or acoustic echo depends on the
loudness of his/her reflected voice signal and the
roundtrip delay that that reflection suffers. The
loudness of the reflection at the point of reflection
depends upon the electrical impedance mismatch, for
electrical echoes, and the acoustic gain of the
loudspeaker-to-microphone path, for acoustic echoes.
The roundtrip delay is a function of the path the
reflected signal traverses, which in turn is a
function of the call topology
Electrical echo, also called network echo
Reflection of a talker's speech signal at a point of
2-to-4-wire conversion caused by an impedance mismatch
at the point of analog-to-digital conversion.
Acoustic echo
Reflection of a talker's speech signal at an acoustic
endpoint caused by the acoustic coupling between the
loudspeaker and microphone.
avaya.com
22 of 27
Constant echo
When talking, the perception of echo with every
utterance. Such cases occur when there is a physical
electrical or acoustic echo path but no echo
controller in the call topology to control echo.
Additionally, constant echo may result even though an
echo controller is known to be in the call path; this
indicates a complete failure of the echo controller,
usually because the capabilities of the echo
controller are exceeded (e.g., the echo tail length
exceeds the specifications of the echo controller).
Intermittent echo
When talking, the occasional perception of echo.
Intermittent echo often caused by the intermittent
failure of an echo controller in the call path. The
echo suppressor within the echo controller may fail to
engage (to apply echo attenuation) when necessary,
with the result that short bursts of echo become
audible. In acoustic echo control applications
(speakerphone) in which people or objects close to the
speakerphone are moving, the change to the physical
echo path often results in audible intermittent
acoustic echo to listeners at the other end of the
call.
Residual echo
When talking, the perception of very low-level (quiet)
echo. The echo could be either constant or
intermittent. Residual echo can be caused by PSTN
electrical echo that is not entirely removed by the
echo controller in the call path.
Distorted or buzz-like echo
When talking the perception of a distorted echo or
buzz-like sound. This can be caused by a non-linear
echo source. An example of this is saturation
distortion at an analog trunk interface. In this case,
signals low in amplitude are reflected cleanly, but
signals high in amplitude are returned with
significant distortion making it difficult for an echo
canceller to control echo.
avaya.com
23 of 27
Such distorted echo can be perceived constantly or
intermittently, depending on the degree of distortion
and the echo canceller(s) involved.
Slapback or kickback acoustic echo
This is strictly a phenomenon of acoustic echo. With
speakerphones, slapback or kickback echo is the
intermittent echo perceived at the ends of one's
utterances. This can occur with both older-model halfduplex speakerphones and newer-model acoustic-echo
canceling speakerphones. For example, a talker
speaking into a handset utters the phrase “Please send
me the check” and perceives echo primarily at the end
of his/her sentence. This echo is described as hearing
just the sound “eck” or “k” of the word “check,” or as
a slapping sound such as that made by slapping one’s
palm against a desktop. Commonly, slapback/kickback
echo is caused by acoustically reverberant rooms.
Large offices and conference rooms can have long
reverberation times. In such rooms, the speakerphone
senses at its microphone a reverberated version of the
word “check” (our prior example) several tens or even
hundreds of milliseconds after the far talker has
finished saying the word “check.” The speakerphone
algorithm detects this reverberated speech at its
microphone, detects no speech at its receive-path
driving the loudspeaker, and decides to transition to
transmit mode. The reverberated version of "check" is
transmitted back to the far talker, where it is
perceived as echo.
Whether a slapback reflection occurs depends on the
speakerphone acoustics, speakerphone algorithm and
room acoustics. Whether slapback echo is perceived as
echo by the far talker depends on the loudness of the
echo and the roundtrip delay. Some have described
slapback echo as speakerphone false switching, where
it is meant that the speakerphone has switched too
quickly from the receive mode to the transmit mode.
This is not actually a false switch, however; the
avaya.com
24 of 27
speakerphone merely rapidly switches between states
because it cannot distinguish between microphone
energy that is due to a talker who is local to the
speakerphone room and energy due to reverberation of
the received speech. A speakerphone that enforces a
longer hold-time governing the smallest amount of time
for a switch between the receive and transmit states
will in general suffer less from slapback echo. A
common drawback, however, of a longer receive-totransmit hold-time is a higher perception of clipped
utterances during doubletalk; in other words, a
greater perception at both ends of the speakerphone
call that the speech path is half duplex.
Sidetone
In handsets and headsets, a portion of the microphone
energy is fed back to the earpiece so that the user of
the handset/headset does has a psychoacoustic
experience that simulates the case in which the user's
ear is not occluded by an object (the handset
earpiece). Without sidetone injection, the user
experiences the psychoacoustically bothersome
condition that can be demonstrated to oneself by
pressing a finger into one ear while speaking. With
one ear occluded, the sound of one’s own voice is
dominated by the path through the interior of the head
(skull, etc.) instead of around the head, an effect
that most people find objectionable.
Hot sidetone
In a handset or headset, microphone-to-earpiece
sidetone injection is not normally noticed. Some
digital phones, in particular, IP phones in which the
internal audio processing frame rate is 5 ms or
greater, inject sidetone with an appreciable delay
(e.g., 5 ms) in the microphone-to-earpiece signal
path. This delay causes the sidetone to sound
reverberant and/or louder than normal, or hot. Though
hot sidetone is a type of echo source – because some
people may use the term “echo” to describe hot
sidetone – it is generated local to the telephone, not
at some point within the telephone network.
avaya.com
25 of 27
Short-path acoustic echo, short-path electrical
echo
Acoustic or electrical echo that occurs in a very
short roundtrip call topology. This type of echo is
commonly described as a hollow sound or sound of
speaking in a barrel (see 2.1.3). In a digital-todigital phone call (think DCP-to-DCP), station-tostation, the roundtrip delay is usually very small,
less than 10 ms. Some digital speakerphones produce
significant acoustic echo, which is not canceled,
suppressed, or otherwise controlled in this simple
call topology. In these cases, and depending on the
volume setting of the far-party’s speakerphone and
near-party’s listening handset, the near party may
perceive echo and refer to this as hot sidetone.
Again, this is truly acoustic echo from the
speakerphone but is returned to the talker with such a
short roundtrip delay that it is perceived as
hollowness or reverberance rather than as classic
echo. Because of the short roundtrip delay in this
case, it can be difficult to distinguish between hot
sidetone (see definition) and short-path echo.
Short-path electrical echo refers to an electrical
echo source in which the roundtrip echo path is also
short, say less than 10 ms. Again, in such cases, it
may be very difficult to distinguish between shortpath echo and hot sidetone (see definition).
avaya.com
26 of 27
APPENDIX B: Basic IP Telephony
Solution Diagram
avaya.com
27 of 27
Figure 2.
© 2005 Avaya Inc.
All Rights Reserved. Avaya and the Avaya Logo are trademarks of Avaya Inc. and may
be registered in certain jurisdictions. All trademarks identified by the ®, SM or TM are
registered trademarks, service marks or trademarks, respectively, of Avaya Inc. All other
trademarks are the property of their respective owners.
Download