2006 IMTC Forum ITU - T Workshop

advertisement
Presents
2006 IMTC Forum
ITU-T Workshop
G.729EV: An 8-32 kbit/s scalable wideband
speech and audio coder bitstream
interoperable with G.729
Presented by Christophe Beaugeant
On behalf of ETRI, France Telecom,
Matsushita, Mindspeed, Siemens
G.729EV Standardization history
•
•
•
Standardization work conducted in SG 16, WP 3, Question 10
(Rapporteur: Claude Lamblin)
T0 (approval of Term of References) declared in November 2004
Qualification test results in July 2005
– ETRI, France Telecom, Matsushita, Mindspeed, Siemens and VoiceAge
qualified
– These companies collaborated together to provide a single candidate
•
•
•
Optimization phase 08/05-01/06
Quality assessment tests (02-03/06): G.729EV passed all the
requirements.
Current status: under AAP procedure at the ITU-T
Joint IMTC Forum and ITU-T Workshop – “H.323, SIP: is H.325 next?” – 9-11 May 2006 – San Diego, CA, USA
High level overview
•
8-32 kbit/s scalable wideband (50-7000 Hz) extension of G.729
– Wideband: More natural voice, better intelligibility, enhance
communication comfort
– Scalability : flexibility; dynamic, simple and low cost bit rate
adaptation
– Robustness : packet loss protection and concealment integrated
•
Build upon a three-stage structure, with 12 embedded layers:
– layers 1-2 : CELP, 50-4000 Hz, 8 -12 kbit/s
– Layer 3 : TDBWE, 50-7000 Hz, 14 kbit/s
– Layers 4-12: TDAC, 50-7000 Hz, 14 to 32 kbit/s
8
G.729a+
12
EXT.
32
14
TDBWE
TDAC
TDAC
Joint IMTC Forum and ITU-T Workshop – “H.323, SIP: is H.325 next?” – 9-11 May 2006 – San Diego, CA, USA
G.729EV bitrate flexibility
8
12
G.729a+
EXT.
Original spectrum
Encoded spectrum
2
1
Narrow band
•
•
3
4
5
6
7
Wide band
Embedded CELP stage generates layers 1 and 2 : narrowband
synthesis (50-4000 Hz) at 8 and 12 kbit/s.
Layer 1 compliant with G.729 bitstream
Joint IMTC Forum and ITU-T Workshop – “H.323, SIP: is H.325 next?” – 9-11 May 2006 – San Diego, CA, USA
G.729EV bitrate flexibility
12
G.729a+
14
TDBWE
EXT.
Original spectrum
Encoded spectrum
2
1
Narrow band
•
•
3
4
5
6
7
Wide band
Layer 3: Wideband Extension (TDBWE)
Wideband output (50-7000 Hz) at 14 kbit/s
Joint IMTC Forum and ITU-T Workshop – “H.323, SIP: is H.325 next?” – 9-11 May 2006 – San Diego, CA, USA
G.729EV bitrate flexibility
12
G.729a+
14
24
TDAC
TDBWE
EXT.
32
TDAC
Original spectrum
Encoded spectrum
2
1
Narrow band
•
•
•
•
3
4
5
6
7
Wide band
Layers 4 to 12: Predictive transform coding referred as TDAC
Quality improvement from 14 to 32 kbit/s.
Encoding of the weighted CELP coding error in 50-4000 Hz band
Encoding input signal in 4000-7000 Hz band.
Joint IMTC Forum and ITU-T Workshop – “H.323, SIP: is H.325 next?” – 9-11 May 2006 – San Diego, CA, USA
Encoder:
High level overview
FEC encoder
S LB (n)
H1(z)
2
Hh1(z)
+
+ -
embedded
CELP encoder
(8-12 kbit/s)
S enh (n)
Aˆ ( z )
M
WLB(z)
S WB (n)
U
X
MDCT
TDAC
encoder
(16-32 kbit/s)
MDCT
H2(z)
2
(-1)n
Hh2(z)
S HB (n)
TDBWE
encoder
(14 kbit/s)
Joint IMTC Forum and ITU-T Workshop – “H.323, SIP: is H.325 next?” – 9-11 May 2006 – San Diego, CA, USA
Decoder:
High level overview
HPF
embedded
CELP decoder
(8-12 kbit/s)
+ +
+
adaptive
postfiltering
echo
Sˆ HB
( n)
2
G1(z)
echo
dˆ LB
( n)
pre/post-echo
reduction
D
WLB(z)-1
E
M
U
TDAC
decoder
(14-32 kbit/s)
Sˆ WB (n)
+
+
MDCT-1
+
MDCT-1
Sˆ HB (n)
X
MDCT
TDBWE
decoder
(14 kbit/s)
pre/post-echo
reduction
Sˆ
echo
LB
(-1)n
( n)
2
G2(z)
Joint IMTC Forum and ITU-T Workshop – “H.323, SIP: is H.325 next?” – 9-11 May 2006 – San Diego, CA, USA
Quality of G.729EV
•
Listening tests (02-03/06): G.729EV passed all the requirements.
•
Advanced embedment of three technologies:
– CELP : Good performance for Speech
– Bandwidth Extension: reduced bit rate to switch to wideband
– Transform Coding: high wideband quality with music and non
speech signals
•
Quality equivalent or better than other speech coding standard
– Optimum "state of the art" wideband quality for all types of signals
(including music) at 32 kbps
– High wideband quality of speech maintained at 14 kbps
– Narrow band "PSTN quality" at 12 kbps (equivalent to G711)
Joint IMTC Forum and ITU-T Workshop – “H.323, SIP: is H.325 next?” – 9-11 May 2006 – San Diego, CA, USA
Subjective tests
Clean speech in Narrowband in French
5
4,5
4
3,5
3
2,5
Direct
G.729A
CuT-8 k
G.729E
CuT-12 k
Joint IMTC Forum and ITU-T Workshop – “H.323, SIP: is H.325 next?” – 9-11 May 2006 – San Diego, CA, USA
Subjective tests
Wideband clean speech in French
5
4,5
4
3,5
3
Direct
G.722.2–
8,85k
G.722 –
48k
CuT-14 k
G.722 –
56k
CuT-24 k
CuT-32 k
Joint IMTC Forum and ITU-T Workshop – “H.323, SIP: is H.325 next?” – 9-11 May 2006 – San Diego, CA, USA
Objective Measurement
PESQ SCORES FOR CUT (MOS-LQO)
4.10
4.00
3.90
3.80
All Languages
English
French
German
Japanese
Korean
3.70
3.60
3.50
3.40
3.30
3.20
14k
16k
18k
20k
22k
24k
26k
28k
30k
32k
Bitrates
Joint IMTC Forum and ITU-T Workshop – “H.323, SIP: is H.325 next?” – 9-11 May 2006 – San Diego, CA, USA
Subjective tests
Wideband music at 32 kbit/s
5
4,5
4
3,5
3
Direct
G.722 @ 56 k
CuT-32 k
Joint IMTC Forum and ITU-T Workshop – “H.323, SIP: is H.325 next?” – 9-11 May 2006 – San Diego, CA, USA
Complexity of G.729EV
•
Complexity measured using ITU-T Software Tool Library
STL2005 v2.1 .
•
Complexity figures of the G.729EV coder (encoder/decoder).
Computational complexity
Static RAM
35.79 WMOPS
5 kwords
Scratch RAM
3.7 kwords
Data ROM
8.5 kwords
Program ROM
32 kwords
Delay
48.9375 ms
Low delay mode (narrow-band)
25 ms
Joint IMTC Forum and ITU-T Workshop – “H.323, SIP: is H.325 next?” – 9-11 May 2006 – San Diego, CA, USA
Foreseen application
•
Designed for packetized wideband voice (VoIP, VoATM, IP phone,
private networks)
– Applications requiring scalable wideband on top of G.729,
– Easy integration with existing VoIP infrastructure and services, fast
deployment
•
Extension to Multimedia services (Videoconference, VOD…)
– High quality with graceful degradation from WB (face-to-face) quality to
NB (telephone) quality
•
Advantage of Scalability:
–
–
–
–
–
Simple and low cost bit rate adaptation (very limited CPU)
Design for devices (gateways) that multiplex data/audio streams
Handling heterogeneous accesses/terminals
Optimization of bitrate to network capabilities (Network congestion)
Easy modification of bitrate by intelligent router (adjustment to network
capabilities)
– Voice messaging: capacity vs quality trade-off, access adaptation
Joint IMTC Forum and ITU-T Workshop – “H.323, SIP: is H.325 next?” – 9-11 May 2006 – San Diego, CA, USA
WB VoIP & Conversational
e-Learning service
•
•
•
Excellent voice quality to fixed users
Fittest quality and service connectivity to mobile users
Trial service will be deployed by KT in Q3/2006
Service & QoS
Manager
Conversational
e-Learning
Contents
G.729EV
32kbits/s
IP
Backbone
Wireline(xDSL, FTTH, Cable)
Fixed User
G.729EV
32kbits/s
Public Wireless LAN
WiBro (Portable Internet)
Mobile Users
G.729EV
8~32kbits/s
For fittest
quality
Joint IMTC Forum and ITU-T Workshop – “H.323, SIP: is H.325 next?” – 9-11 May 2006 – San Diego, CA, USA
Enterprise Wideband VoIP
•
•
•
Excellent voice quality among G.729EV terminals
Seamless interoperability of legacy G.729 VoIP users and facilities
Anticipated companies: Dacom, Netcodec, Essetel,Onnet,,
Enterprise – Headquarter (Seoul)
IEEE802.11 b/g/e
G.729EV WiFi Phone or
G.729EV WiFI + Mobile
G.729EV IP phone
IEEE802.11 b/g/e
AAA, Call Agent
QoS Manager
AP
IEEE
802.1
Enterprise – Branch A (LA)
AP
Flow-based
QoS IP
Network
p/q
G.729 WiFi Phone
80 2. 1
E
E
E
I
p/q
G.729EV Soft-phone
Existing VoIP Facilities
G.729-G.711 Gateways
T1/E1
PSTN
Analog phones (Seoul)
T1/E1
PSTN
Analog phones (LA)
Joint IMTC Forum and ITU-T Workshop – “H.323, SIP: is H.325 next?” – 9-11 May 2006 – San Diego, CA, USA
Intelligent service
•
•
•
Conference call with heterogeneous terminals
Adaptability to terminal capabilities
Adaptability to Up-link / Downlink voice Quality requirement
Service
Manager
AP
G.729 EV Wifi phone
G729 EV Wifi + Mobile
Fixed User
24 kbit/s
14 kbit/s
14 kbit/s
32 kbit/s
14 kbit/s
12 kbit/s
12 kbit/s
32 kbit/s
G.729 EV
IP phone
G.729-G.711 Gateways
PSTN
Analog
Phone
Joint IMTC Forum and ITU-T Workshop – “H.323, SIP: is H.325 next?” – 9-11 May 2006 – San Diego, CA, USA
Conclusion
• G729 EV : 8-32 kbit/s scalable codec for narrowband
speech (8-12 kbit/s), wideband speech (12-32 kbit/s)
and audio (32 kbit/s)
• Good quality compared to existing codecs
• Applications foreseen used the advantage of scalability
• Codecs obtained through collaboration of ETRI,
France Telecom, Matsushita, Mindspeed, Siemens and
VoiceAge.
• Quick development to fit to operators’ needs in the
field applications over IP
• Extension: Stereo coding, Super Wideband (32 kHz)
Joint IMTC Forum and ITU-T Workshop – “H.323, SIP: is H.325 next?” – 9-11 May 2006 – San Diego, CA, USA
Back-up
Joint IMTC Forum and ITU-T Workshop – “H.323, SIP: is H.325 next?” – 9-11 May 2006 – San Diego, CA, USA
Subjective tests
Interfering Talker Background noise at 15 dB in NB in German
5
4,5
4
3,5
3
Direct
G.729A
CuT-8 k
CuT-12 k
Joint IMTC Forum and ITU-T Workshop – “H.323, SIP: is H.325 next?” – 9-11 May 2006 – San Diego, CA, USA
Subjective tests
Interfering talker background noise at 15 dB in WB in English
5
4,5
4
3,5
3
G.722 – 48k
G.722 – 56k
CuT-24 k
CuT-32 k
Direct
Joint IMTC Forum and ITU-T Workshop – “H.323, SIP: is H.325 next?” – 9-11 May 2006 – San Diego, CA, USA
Download