Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG
(ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6)
2nd Meeting: Geneva, CH, Jan. 29 - Feb. 1, 2002
Title: Short tap filter for high resolution sequences
Status: Input Document to JVT
Purpose: Proposal
Document: JVT-B077
Filename: JVT-B077.doc
Generated: 2002-01-23
Author(s) or Contact(s): Kei-ichi Chono and Yoshihiro Miyamoto
NEC Multimedia Laboratories,
4-1-1 Miyazaki Miyamae Kawasaki,
216-8555 JAPAN
Tel: +81 44 856 8141
Fax: +81 44 856 2232
Email: chono@dsp.cl.nec.co.jp
miyamoto@dsp.cl.nec.co.jp
Source: NEC Corporation
Kawasaki, 216-8555 JAPAN
_____________________________
1. Background
In VCEG-O16 at the Pattaya meeting, an adaptive interpolation scheme that estimates and transmits
6-tap filter coefficients per picture was proposed [1]. Because it takes the non-stationary statistical
properties of video signals into account, this adaptive interpolation scheme seems to be beneficial [1][2].
However, some issues remain, complexity among them. An AHG was formed to study these issues
with a view to incorporating this feature into TML.
We agree with the adaptive interpolation scheme, and have compared the coding quality gains of
some interpolation filters for a given selection of filter coefficients. The experimental results show
two facts. First, the coding gain of the TML9 6-tap filter over the Telenor 4-tap filter depends on
the sequence. Second, a 4-tap filter implementation reduces the amount of fractional-pel
interpolation calculation at the decoder to 50%. We propose that the encoder/decoder may use a
shorter tap filter for high-resolution sequences in order to reduce complexity.
2. Experiments
2.1 Interpolation
To produce 2:1 resolution, the Telenor 4-tap filter with impulse response (-4, 20, 20, -4)/32
[3] is used. For 4:1 resolution, a bilinear filter is used as the second filter. Hence, the only
difference between the 4-tap filter interpolation and TML9 is the first filter's coefficients.
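As a rough illustration of the two-stage interpolation described above, the following Python sketch computes half-pel samples of a one-dimensional row with the Telenor 4-tap filter (-4, 20, 20, -4)/32 and then derives quarter-pel samples bilinearly. The 6-tap kernel (1, -5, 20, 20, -5, 1)/32 is included for comparison on the assumption that it matches the TML9 first filter. The function names, the edge clamping, and the rounding are illustrative choices and are not taken from the TML9 software.

```python
# Minimal 1-D sketch of two-stage fractional-pel interpolation.
# Assumptions (not from the source): edge samples are repeated,
# results are rounded to nearest and clipped to 8 bits.

TELENOR_4TAP = (-4, 20, 20, -4)       # divided by 32, from Section 2.1
TML9_6TAP = (1, -5, 20, 20, -5, 1)    # divided by 32, assumed TML9 kernel


def clip255(value):
    """Clip a sample to the 8-bit range."""
    return max(0, min(255, value))


def half_pel(row, i, taps):
    """Half-pel sample between row[i] and row[i + 1] (first filter)."""
    start = i - len(taps) // 2 + 1    # leftmost integer sample used
    acc = 0
    for k, coeff in enumerate(taps):
        j = min(max(start + k, 0), len(row) - 1)   # repeat edge samples
        acc += coeff * row[j]
    return clip255((acc + 16) >> 5)   # divide by 32 with rounding


def interpolate_row(row, taps):
    """Upsample a row 4:1; the bilinear second filter gives quarter-pels."""
    out = []
    for i in range(len(row) - 1):
        a, b = row[i], row[i + 1]
        h = half_pel(row, i, taps)
        out += [a, (a + h + 1) >> 1, h, (h + b + 1) >> 1]
    out.append(row[-1])
    return out


if __name__ == "__main__":
    samples = [0, 32, 64, 96]
    print(interpolate_row(samples, TELENOR_4TAP))
    print(interpolate_row(samples, TML9_6TAP))
```

Only the `taps` argument differs between the two calls, which mirrors the statement that the two schemes differ only in the first filter. The 4-tap loop performs four multiplications per half-pel sample where the 6-tap loop performs six.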
2.2 Test sequences and coding conditions
Test sequences are shown in Table 1.
File:231199546
Page: 1
Date Printed: 9/29/2021
Table 1: Test sequences
Test sequence              Resolution  Frame rate (fps)  Frames
Coastguard                 QCIF        10                300
Container                  QCIF        10                300
Foreman                    QCIF        10                400
News                       QCIF        10                300
Silent                     QCIF        15                300
Coastguard                 CIF         10                300
Container                  CIF         10                300
Foreman (Siemens version)  CIF         10                300
News                       CIF         10                300
Silent                     CIF         15                300
Paris                      CIF         15                300
Mobile                     CIF         30                300
Tempete                    CIF         30                260
Flower                     SIF         30                150
Football                   SIF         30                150
Flower                     CCIR        30                150
Football                   CCIR        30                150
Mobile                     CCIR        30                150
The coding conditions are: quantization parameter for I-pictures (QI) set to 24, 16-pixel search
range (32 pixels used only for Football CCIR), RestrictSearchRange set to two, UseHadamard
set to on, quarter-pel motion, one reference frame, UVLC entropy coding, RD optimization set
to on, and no B-pictures.
The CCIR sequences are in interlaced format, which TML9 does not support yet. The encoder
treated two fields as one frame, as shown in Fig 1.
[Fig 1 shows frames 0, 1 and 2 along the time axis, each consisting of a top field and a bottom field.]
Fig 1: Input YUV format at CCIR encoding
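The text states only that two fields were treated as one frame. One plausible reading, sketched below in Python, is that the lines of the top and bottom fields are interleaved into a single frame before encoding. The line interleaving, the top-field-first order, and the function name `weave_fields` are assumptions made for illustration; the source does not specify them.

```python
# Hypothetical sketch: merge two fields into one frame by interleaving lines.
# Assumes top-field-first order; the source does not state the field order.

def weave_fields(top_field, bottom_field):
    """Return a frame whose even lines come from the top field and
    whose odd lines come from the bottom field."""
    if len(top_field) != len(bottom_field):
        raise ValueError("fields must have the same number of lines")
    frame = []
    for top_line, bottom_line in zip(top_field, bottom_field):
        frame.append(top_line)
        frame.append(bottom_line)
    return frame


if __name__ == "__main__":
    top = [[10, 10], [30, 30]]       # lines 0 and 2 of the frame
    bottom = [[20, 20], [40, 40]]    # lines 1 and 3 of the frame
    print(weave_fields(top, bottom))
```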
2.3 Results
The coding gain of the TML9 filter over the Telenor 4-tap filter is as follows:
QCIF results:
The TML9 filter leads to an average gain of 0.19 dB (avsnr4), up to 0.48 dB.
CIF and SIF results:
The TML9 filter leads to an average gain of 0.17 dB (avsnr4), up to 0.60 dB.
CCIR results:
The TML9 filter leads to an average gain of 0.24 dB (avsnr4), up to 0.54 dB.
Avsnr4 is used to calculate the objective differences, following the common measurement method [4]. The
avsnr4 results are shown in Tables 1-3, and the rate-distortion curves are attached.
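For readers unfamiliar with this kind of measurement, the sketch below shows one way an average PSNR gain between two rate-distortion curves can be computed. It assumes that avsnr4 follows the usual average-PSNR-difference approach: a cubic is passed through four (bitrate, PSNR) points per curve in the log-bitrate domain, and the difference is averaged over the overlapping bitrate range. This is an illustration of the idea, not the avsnr4 tool itself; the function names and the sample data are hypothetical and are not results from this contribution.

```python
# Sketch of an average PSNR difference between two RD curves.
# Assumption: four points per curve, cubic fit in log10(bitrate).
import math


def lagrange(xs, ys, x):
    """Evaluate the polynomial through the points (xs, ys) at x."""
    total = 0.0
    for i, (xi, yi) in enumerate(zip(xs, ys)):
        term = yi
        for j, xj in enumerate(xs):
            if j != i:
                term *= (x - xj) / (xi - xj)
        total += term
    return total


def average_psnr_gain(rates_ref, psnr_ref, rates_test, psnr_test):
    """Mean of (test - reference) PSNR over the common log-bitrate range."""
    log_ref = [math.log10(r) for r in rates_ref]
    log_test = [math.log10(r) for r in rates_test]
    lo = max(min(log_ref), min(log_test))
    hi = min(max(log_ref), max(log_test))
    if hi <= lo:
        raise ValueError("the two curves do not overlap in bitrate")

    def diff(x):
        return lagrange(log_test, psnr_test, x) - lagrange(log_ref, psnr_ref, x)

    # Simpson's rule is exact for cubics, so three evaluations give the
    # integral of the difference; dividing by (hi - lo) yields the mean.
    mid = 0.5 * (lo + hi)
    return (diff(lo) + 4.0 * diff(mid) + diff(hi)) / 6.0


if __name__ == "__main__":
    # Made-up example data (kbps, dB), for illustration only.
    rates = [20.0, 40.0, 80.0, 160.0]
    psnr_4tap = [28.0, 31.0, 34.0, 36.5]
    psnr_6tap = [p + 0.3 for p in psnr_4tap]
    print(round(average_psnr_gain(rates, psnr_4tap, rates, psnr_6tap), 3))
```

Working in the log-bitrate domain keeps the low-bitrate points from being swamped by the high-bitrate ones, which matters for test sets that span QCIF to CCIR resolutions.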
3. Proposal: adaptive filter taps for reducing coding calculation
The experimental results show that the coding gain of TML9 over the Telenor 4-tap filter depends on
the sequence. The total amount of fractional-pel calculation increases in proportion to the spatial
resolution. The current TML9 uses direct interpolation to reduce multiplications and memory requirements.
Telenor 4-tap direct interpolation can reduce the number of multiplications to half that of the 6-tap
filter [5]. We have evaluated the decoding-time improvement obtained from this reduction. The evaluation
conditions are: Pentium 4 at 1.7 GHz, 512 MB memory, Microsoft Windows 2000, and Intel VTune 5 with a
1 ms sampling rate. The reduction in multiplications cuts the interpolation calculation in half and the
total decoding calculation by 10 percent (see Charts 1-3).
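The two figures quoted above can be cross-checked with simple arithmetic: if halving the interpolation work removes about 10 percent of the total decoding work, interpolation must have accounted for roughly 20 percent of the total to begin with. The short Python sketch below makes that relation explicit. It assumes that every other decoder stage is unchanged, and the function names are hypothetical.

```python
# Relation between a per-stage saving and the overall saving,
# assuming all other decoder stages cost the same as before.

def total_saving(interp_share, interp_reduction):
    """Fraction of total decoding work saved when interpolation,
    which takes `interp_share` of the total, shrinks by `interp_reduction`."""
    return interp_share * interp_reduction


def implied_interp_share(total_saved, interp_reduction):
    """Share of total work that interpolation must have had."""
    return total_saved / interp_reduction


if __name__ == "__main__":
    share = implied_interp_share(total_saved=0.10, interp_reduction=0.50)
    print(f"implied interpolation share of decoding work: {share:.0%}")
```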
From the viewpoint of complexity reduction, we think that the encoder/decoder may use a shorter tap
filter for high-resolution sequences, because the coding gain of TML9 depends on the sequence.
4. Conclusion
In this report, two facts are shown:
(1) The coding gain of the TML 6-tap filter over the Telenor 4-tap filter depends on the sequence.
(2) 4-tap direct interpolation cuts the interpolation calculation in half and the total
decoding calculation by 10 percent.
We propose that the encoder/decoder may use a shorter tap filter for high-resolution sequences in
order to reduce complexity. Additional study on higher-resolution sequences, such as HDTV,
may be valuable for confirming this interpolation filter issue.
References
[1] Thomas Wedi, “H.26L Interpolation Filtering Ad Hoc Report”, ITU-T SG16, doc. VCEG-O11,
Pattaya, Thailand, 4-6 Dec. 2001.
[2] Thomas Wedi, “Adaptive Interpolation Filter for H.26L”, ITU-T SG16, doc. VCEG-N28, Santa
Barbara, CA, USA, Sep. 2001.
[3] Thomas Wedi, “Interpolation Filter for motion compensated prediction with 1/4 and 1/8-pel
accuracy”, ITU-T SG16, doc. Q15-J-14, Osaka, Japan, May 2000.
[4] Gary Sullivan and Gisle Bjontegaard, “Recommended Simulation Conditions for H.26L
Coding Efficiency Experiment on Low-Resolution Progressive-Scan Source Material”, ITU-T
SG16, doc. VCEG-N81, Santa Barbara, CA, USA, Sep. 2001.
[5] Thomas Wedi, “Results on complexity and coding performance investigations: Displacement
vector resolution and interpolation filter tap size”, ITU-T SG16, doc. VCEG-M46, Austin,
Texas, USA, April 2001.
Avsnr4-results
Table1: QCIF results
Sequence (QCIF)  TML9 average PSNR gain [dB] (avsnr4)  TML9 average bit savings [%] (avsnr4)
Coastguard       0.11                                  3.11
Container        0.48                                  11.20
Foreman          0.26                                  5.43
News             0.09                                  2.01
Silent           0.00                                  0.26
Average          0.19                                  4.40
Max              0.48                                  11.20
Min              0.00                                  0.26
Table2: CIF and SIF results
Sequence          TML9 average PSNR gain [dB] (avsnr4)  TML9 average bit savings [%] (avsnr4)
Coastguard (CIF)  0.06                                  1.65
Container (CIF)   0.27                                  8.04
Foreman (CIF)     0.13                                  2.95
News (CIF)        0.03                                  0.79
Silent (CIF)      0.00                                  0.12
Paris (CIF)       0.09                                  2.06
Mobile (CIF)      0.60                                  12.17
Tempete (CIF)     0.33                                  7.84
Flower (SIF)      0.16                                  3.15
Football (SIF)    0.02                                  0.44
Average           0.17                                  3.92
Max               0.60                                  12.17
Min               0.00                                  0.12
Table3: CCIR results
Sequence  TML9 average PSNR gain [dB] (avsnr4)  TML9 average bit savings [%] (avsnr4)
Flower    0.13                                  2.49
Football  0.06                                  0.55
Mobile    0.54                                  10.15
Average   0.24                                  4.40
Max       0.54                                  10.15
Min       0.06                                  0.55
Decoding samples
[Charts 1-3 show the number of profiler samples per decoder function (get_quarterpel_pixel, decode_one_macroblock, in-loop filter, itrans, UVLC, else) for TML9 and for the proposed 4-tap filter; the axis is labelled Samples.]
Chart 1 Decoding samples (Sequence: Foreman with QCIF, 10fps, QI=24, QP=20, and 133 frames); TML9 (77 kbps) vs. Proposed (80 kbps)
Chart 2 Decoding samples (Sequence: Paris with CIF, 15fps, QI=24, QP=20, and 150 frames); TML9 (251 kbps) vs. Proposed (256 kbps)
Chart 3 Decoding samples (Sequence: Flower with CCIR, 30fps, QI=24, QP=20, and 150 frames); TML9 (4.1 Mbps) vs. 4-tap filter (4.2 Mbps)
Rate-distortion curves
[Rate-distortion plots, one per test sequence, each showing PSNR [dB] against bitrate [kbps] for TML9 and for the 4-tap Telenor filter.]
QCIF: Coastguard 10fps, Container 10fps, Foreman 10fps, News 10fps, Silent 15fps
CIF: Coastguard 10fps, Container 10fps, Foreman 10fps, News 10fps, Silent 15fps, Paris 15fps, Mobile 30fps, Tempete 30fps
SIF: Flower 30fps, Football 30fps
CCIR: Flower 30fps, Football 30fps, Mobile 30fps
JVT Patent Disclosure Form
International Telecommunication Union, Telecommunication Standardization Sector
International Organization for Standardization
International Electrotechnical Commission
Joint Video Coding Experts Group - Patent Disclosure Form
(Typically one per contribution and one per Standard | Recommendation)
Please send to:
JVT Rapporteur Gary Sullivan, Microsoft Corp., One Microsoft Way, Bldg. 9, Redmond WA
98052-6399, USA
Email (preferred): Gary.Sullivan@itu.int Fax: +1 425 706 7329 (+1 425 70MSFAX)
This form provides the ITU-T | ISO/IEC Joint Video Coding Experts Group (JVT) with
information about the patent status of techniques used in or proposed for incorporation in a
Recommendation | Standard. JVT requires that all technical contributions be accompanied
with this form. Anyone with knowledge of any patent affecting the use of JVT work, of their own
or of any other entity (“third parties”), is strongly encouraged to submit this form as well.
This information will be maintained in a “living list” by JVT during the progress of their work, on
a best effort basis. If a given technical proposal is not incorporated in a Recommendation |
Standard, the relevant patent information will be removed from the “living list”. The intent is
that the JVT experts should know in advance of any patent issues with particular proposals or
techniques, so that these may be addressed well before final approval.
This is not a binding legal document; it is provided to JVT for information only, on a best effort,
good faith basis. Please submit corrected or updated forms if your knowledge or situation
changes.
This form is not a substitute for the ITU ISO IEC Patent Statement and Licensing Declaration, which should be
submitted by Patent Holders to the ITU TSB Director and ISO Secretary General before final approval.
Submitting Organization or Person:
Organization name: NEC CORPORATION
Mailing address: 4-1-1 Miyazaki, Miyamae-Ku, Kawasaki 216-8555, Japan
Country: Japan
Contact person: Yoshihiro Miyamoto
Telephone: +81-44-856-8141
Fax: +81-44-856-2232
Email: miyamoto@dsp.cl.nec.co.jp
Place and date of submission: 2nd JVT Meeting Jan.29-Feb.1, Geneva
Relevant Recommendation | Standard and, if applicable, Contribution:
Name (ex: “JVT”): JVT
Title: Short tap filter for high res sequences
Contribution number: JVT-B077
(Form continues on next page)
Disclosure information – Submitting Organization/Person (choose one box)
[X] 2.0 The submitter is not aware of having any granted, pending, or planned patents
associated with the technical content of the Recommendation | Standard or Contribution.
or,
The submitter (Patent Holder) has granted, pending, or planned patents associated with the technical
content of the Recommendation | Standard or Contribution. In which case,
[ ] 2.1 The Patent Holder is prepared to grant – on the basis of reciprocity for the above
Recommendation | Standard – a free license to an unrestricted number of applicants
on a worldwide, non-discriminatory basis to manufacture, use and/or sell
implementations of the above Recommendation | Standard.
[ ] 2.2 The Patent Holder is prepared to grant – on the basis of reciprocity for the above
Recommendation | Standard – a license to an unrestricted number of applicants on a
worldwide, non-discriminatory basis and on reasonable terms and conditions to
manufacture, use and/or sell implementations of the above Recommendation |
Standard.
Such negotiations are left to the parties concerned and are performed outside the ITU |
ISO/IEC.
[ ] 2.2.1 The same as box 2.2 above, but in addition the Patent Holder is prepared to grant a
“royalty-free” license to anyone on condition that all other patent holders do the same.
[ ] 2.3 The Patent Holder is unwilling to grant licenses according to the provisions of either
2.1, 2.2, or 2.2.1 above. In this case, the following information must be provided as
part of this declaration:
• patent registration/application number;
• an indication of which portions of the Recommendation | Standard are affected.
• a description of the patent claims covering the Recommendation | Standard;
In the case of any box other than 2.0 above, please provide the following:
Patent number(s)/status:
Inventor(s)/Assignee(s):
Relevance to JVT:
Any other remarks:
(please provide attachments if more space is needed)
(form continues on next page)
Third party patent information – fill in based on your best knowledge of relevant patents granted,
pending, or planned by other people or by organizations other than your own.
Disclosure information – Third Party Patents (choose one box)
[X] 3.1 The submitter is not aware of any granted, pending, or planned patents held by third
parties associated with the technical content of the Recommendation | Standard or
Contribution.
[ ] 3.2 The submitter believes third parties may have granted, pending, or planned patents
associated with the technical content of the Recommendation | Standard or Contribution.
For box 3.2, please provide as much information as is known (provide attachments if more space
needed) - JVT will attempt to contact third parties to obtain more information:
3rd party name(s)
Mailing address
Country
Contact person
Telephone
Fax
Email
Patent number/status
Inventor/Assignee
Relevance to JVT
Any other comments or remarks: