Slides - The University of Texas at Austin

advertisement
NO-REFERENCE SYNTHETIC
IMAGE QUALITY ASSESSMENT
USING SCENE STATISTICS
Debarati Kundu and Brian L. Evans
Embedded Signal Processing Laboratory (ESPL)
Department of Electrical and Computer Engineering
The University of Texas at Austin
49th Asilomar Conference on Signals, Systems and Computers
November 11, 2015
2
Outline
• Introduction
• Subjective Quality Evaluation of Synthetic Scenes
• No-reference Objective Quality Evaluation of Synthetic Scenes
• Classification of Distortion Type in Synthetic Scenes
• Conclusion
3
Motivation
• Natural images from optical cameras
~350M photos/day uploaded to Facebook
• Computer graphics in video games
Massively multiplayer online (23.4M subscribers)
Cloud-based (e.g. Nvidia Grid servers)
Livestreaming (e.g. YouTube Gaming)
• Animated movies and synthetic images
• Visual quality assessment
Give immediate feedback to content creators
Maximize visual quality during compression
Reduce rendering complexity
Source: Statista.com
Introduction |
ESPL Database|
ESPL Objective Image Quality| Distortion Classification| Conclusions
4
Image Quality Assessment (IQA) Algorithms
“Full-Reference”
(FR) IQA require
an “original”high
quality image for
comparison.
Reference
Test
“No-Reference”
(NR) IQA
assume that no
“original image”
is available for
comparison.
Supplemental
Information
Full Reference
“Reduced- IQA
Reference” (NR)
IQA require
supplementary
information about
the original image
Introduction |
ESPL Database|
Reduced
Reference IQA
Subjective
Opinion Scores
No-Reference
IQA
Correlation
Performance
[Dr. Alan C. Bovik,
EE381V Digital Video,
Spring 2015]
ESPL Objective Image Quality| Distortion Classification| Conclusions
5
ESPL Synthetic Image Database
• 25 reference images + 500 distorted images
• 5 distortion categories
• Annotated with differential mean opinion scores (DMOS)
• 52 observers: 12 among 64 subjects removed as outliers
• Single Stimulus Continuous Quality Evaluation with hidden reference
• Website: http://signal.ece.utexas.edu/~bevans/synthetic/
Reference (DMOS = 0)
Introduction |
ESPL Database|
Distorted (DMOS = 74.68)
ESPL Objective Image Quality| Distortion Classification| Conclusions
6
ESPL Database: Distortions
Interpolation
(DMOS = 63.23)
Additive Noise
(DMOS = 60.33)
Introduction |
ESPL Database|
Original
(DMOS = 0)
Blur
(DMOS = 50.89)
JPEG Compression
(DMOS = 74.68)
Fast Fading
(DMOS = 60.26)
ESPL Objective Image Quality| Distortion Classification| Conclusions
7
No-Reference IQA Algorithms
• Many use Natural Scene Statistics (NSS)
Distribution of pixel values, transform coefficients or MSCN pixels
Natural pristine images have same distribution regardless of content
Distortion causes deviation from statistical distribution
• Mean Subtracted Contrast Normalized (MSCN) pixels [Ruderman1993]
Models normalization in primary visual cortex
MSCN uses K x L neighborhood around pixel I(i, j) and Gaussian weights wk,l
I(i, j) - m (i, j)
Iˆ(i, j) =
s (i, j) +1
m (i, j) =
K
L
å åw
k.l
I(i + k, j + l)
Mean
k=-K l=-L
s (i, j) =
K
L
å å w [ I(i + k, j + l) - m (i, j)]
k.l
2
Standard
deviation
k=-K l=-L
Introduction |
ESPL Database|
ESPL Objective Image Quality| Distortion Classification| Conclusions
8
Mean Subtracted Contrast Normalization
Original Image
MSCN Image
Introduction |
ESPL Database|
Standard Deviation Image
ESPL Objective Image Quality| Distortion Classification| Conclusions
9
Distorted Image Statistics in Different Domains
• Different distortions affect scene statistics characteristically
• Can be used for distortion classification and blind quality prediction
MSCN Coefficients
Steerable Pyramid Wavelet
Coefficients
Curvelet Coefficients
Introduction |
ESPL Database|
ESPL Objective Image Quality| Distortion Classification| Conclusions
10
Learning-Based NR-IQA on ESPL Database
Model based feature
extraction
Support Vector
Machine Regressor
Input Image
Correlation
Algorithms
Features
BRISQUE
MSCN coefficients
BLIINDS-II
DCT coefficients
DIIVINE
Steerable Pyramid
GM-LOG
Gradient Magnitude +
Laplacian of Gaussian
CORNIA
Dictionary learning
Introduction |
Predicted
Quality Score
ESPL Database|
Pretrained model
Human
Scores
Distorted images split into training & test
sets in the ratio of 4:1
Split randomized over multiple trials
Median of correlation values over 1000
trials considered here.
ESPL Objective Image Quality| Distortion Classification| Conclusions
11
Learning based
methods
Correlation of NR-IQA and Human Scores
Spearman’s Rank Ordered Correlation Coefficient for different distortions
Algorithms
Interpolatio
n
Blur
Additive
Noise
JPEG
Blocking
Fast
Fading
Overall
DESIQUE
0.701
0.872
0.917
0.932
0.819
0.888
GM-LOG
0.798
0.800
0.881
0.902
0.796
0.881
CORNIA
0.800
0.870
0.858
0.884
0.777
0.847
BLIINDS-II
0.806
0.838
0.879
0.885
0.726
0.837
NIQE
0.364
0.354
0.835
0.385
0.392
0.470
LPCM
0.415
0.834
0.623
0.211
0.108
-
CPBDM
0.676
0.757
0.746
0.765
0.347
-
FNVE
0.320
0.463
0.863
0.517
0.461
-
JPEG-NR
0.540
0.593
0.748
0.928
0.464
-
Introduction |
Noise
evaluation
ESPL
Database|
metrics
Blockiness
evaluation
ESPL Objective Image Quality|
metric Distortion Classification|
Blur
evaluation
metrics
Conclusions
12
SROCC
Training Robustness in Learning-Based Methods
• For each box, red central line is median of SROCC over 1000 trials
• Bottom/top of each box represents 25th/75th percentiles
• Whiskers span most extreme non-outlier data points
• Outliers plotted individually
Introduction |
ESPL Database|
ESPL Objective Image Quality| Distortion Classification| Conclusions
13
Classification of Distortions in ESPL Database
Algorithms
Interpolation
Blur
Additive
Noise
JPEG
Blocking
Fast
Fading
Overall
GM-LOG
100.0
96.0
100.0
96.5
96.6
97.8
C-DIIVINE
94.5
96.2
100.0
94.5
93.8
95.8
BRISQUE
94.4
96.6
100.0
91.8
89.8
94.4
BLIINDS-II
91.6
87.7
100.0
81.3
74.6
85.9
CurveletQA
88.4
85.8
100.0
81.3
74.6
85.9
Singular value decomposition of NR-IQA features in 2-D
GM-LOG features
Introduction |
ESPL Database|
BLIINDS-II features
ESPL Objective Image Quality| Distortion Classification| Conclusions
14
Conclusions
• Distortions change scene statistics of synthetic scenes similar to the
case for natural scenes
• Learning based generic models perform better than models meant for
individual distortions
• Multiple domain feature combining models perform best
Future Work
• Create an image database with shading and color artifacts
• Design NR-IQA to correlate better with interpolation artifacts
• Use NSS models to evaluate photorealism of rendered scenes
ESPL Database: http://signal.ece.utexas.edu/~bevans/synthetic/
Introduction |
ESPL Database|
ESPL Objective Image Quality| Distortion Classification| Conclusions
15
Questions?
16
Algorithm Acronyms
• BLIINDS-II: Blind Image Quality Assessment DCT domain (BLIINDS-II)
•
•
•
•
•
•
•
•
[Saad2012]
CORNIA: Codebook Rep for No-Ref Image Assessment (CORNIA) [Ye2012]
DESIQUE: DErivative Statistics-based Quality Evalulator [Zhang2013]
DIIVINE: Distortion Identification-based Image Verity and INtegrity
Evaluation [Moorthy2011]
GM-LOG: Gradient Mag-Laplacian of Gaussian Index [Xue2014]
NIQE: Natural Image Quality Evaluator [Mittal2013]
LPCM: Local Phase Coherence Measurement [Hassen2013]
CPBDM: Cumulative Probability of Blur Detection Metric [Narvekar2011]
FNVE: Fast Noise Variance Estimation [Immerkar96]
Introduction |
ESPL Database|
ESPL Objective Image Quality| Distortion Classification| Conclusions
References
17
• [Bremond2010] J. Petit, R. Bremond, and J.-P. Tarel, “Saliency maps of high dynamic range images," in Proceedings of the
•
•
•
•
•
•
•
•
•
•
6th Symposium on Applied Perception in Graphics and Visualization , ser. APGV '09. New York, NY, USA: ACM, 2009.
[Ferwerda2007] G. Ramanarayanan, J. Ferwerda, B. Walter, and K. Bala, “Visual equivalence: Towards a new standard for
image fidelity,” in SIGGRAPH ACM, 2007
[Kundu 2014] D. Kundu and B. L. Evans, ``Spatial Domain Synthetic Scene Statistics'', Pro c. Asilomar Conf. on Signals,
Systems, and Computers, Nov. 2-5, 2014, Pacifi c Grove, CA USA.
[Larson1997] G. W. Larson, H. Rushmeier, and C. Piatko. 1997. A Visibility Matching Tone Reproduction Operator for High
Dynamic Range Scenes. IEEE Transactions on Visualization and Computer Graphics 3, 4 (October 1997), 291-306.
[Larson2010] E. C. Larson and D. M. Chandler, “Most apparent distortion: fullreference image quality assessment and the
role of strategy,” J Electronic Imaging, vol. 1 9, no. 1, p. 011006, 2010.
[Ma2015] Kede Ma; Yeganeh, H.; Kai Zeng; Zhou Wang, "High Dynamic Range Image Compression by Optimizing Tone
Mapped Image Quality Index," in Image Processing, IEEE Transactions on , vol.24, no.10, pp.3086-3097, Oct. 2015
[McGuire2012] M. McGuire, P. Hennessy, M. Bukowski, and B. Osman. 2012. A reconstruction filter for plausible motion
blur. In Proceedings of the ACM SIGGRAPH Symposium on Interactive 3D Graphics and Games (I3D '12), ACM, New
York, NY, USA, 135-142.
[Moorthy2011] A. K. Moorthy and A. C. Bovik, “Blind image quality assessment: From natural scene statistics to perceptual
quality,” IEEE Transactions on Image Processing, vol. 20, no. 12, pp. 3350– 3364, Dec 2011.
[Nafchi2014] H. Ziaei Nafchi, A. Shahkolaei, R. Farrahi Moghaddam, and M. Cheriet, “Fsitm: A feature similarity index for
tone-mapped images," Signal Processing Letters, IEEE , vol. 22, no. 8, pp. 1026-1029, Aug2015.
[Narwaria2013] M. Narwaria, M. Perreira Da Silva, P. Le Callet, and R. Ppion, “Tone mapping-based high-dynamic-range
image compression: study of optimization criterion and perceptual quality," Optical Engineering , vol. 52, no. 10, pp. pp. 102
008-1 102 008-15, Oct 2013.
[Ruderman1993] D. L. Ruderman and W. Bialek, “Statistics of natural images: Scaling in the woods,” in Neural
Information Processing Systems Conference and Workshops, 1 993, pp. 551–558.
18
References (cont’d)
• [Saad2012] M. A. Saad, A. C. Bovik, and C. Charrier, “Blind image quality assessment: A natural scene statistics
•
•
•
•
•
•
•
•
•
approach in the DCT domain.” IEEE Transactions on Image Processing, vol. 21, no. 8, pp. 3339–3352, 2012
[Wang2003] Z.Wang, E. P. Simoncelli, and A. C. Bovik, “Multiscale structural similarity for image quality
assessment,” in Proc. Asilomar Conference on Signals, Systems and Computers, vol. 2, Nov 2003, pp. 1398–1402
Vol.2.
[Winkler2012] Winkler, S., "Analysis of Public Image and Video Databases for Quality Asses sment," in Selected
Topics in Signal Processing, IEEE Journal of , vol.6, no .6, pp.616-625, Oct. 2012
[Xue2014] W. Xue, X. Mou, L. Zhang, A. C. Bovik, and X. Feng, “Blind image quality assessment using joint
statistics of gradient magnitude and laplacian features,” IEEE Transactions on Image Processing, vol. 23, no. 11,
pp. 4850–4862, Nov 2014
[Ye2012] Peng Ye, Jayant Kumar, Le Kang, and David Doermann, “Unsupervised Feature Learning Framework for
No-reference Image Quality Assessment,” in Proc. Intl. Conf. on Computer Vision and Pattern Recognition, June
2012, pp. 1098–1105
[Yeganeh2013] H. Yeganeh and Z. Wang, “Objective quality assessment of tone-mapped images," Image
Processing, IEEE Transactions on , vol. 22, no. 2, pp. 657-667, Feb 2013.
[Zhang2011] L. Zhang, L. Zhang, X. Mou, and D. Zhang, “FSIM: A feature similarity index for image quality
assessment,” IEEE Transactions on Image Processing, vol. 20, no. 8, pp. 2378–2386, Aug 2011
[Zhang2012] L. Zhang and H. Li, “SR-SIM: A fast and high performance IQA index based on spectral residual,” in
Proc. IEEE International Conference on Image Processing, Sept 2012, pp. 1473–1476
[Zhang2013] Y. Zhang and D. M. Chandler, “No-reference image quality assessment based on log-derivative
statistics of natural scenes,”Journal of Electronic Imaging, vol. 22, no. 4, 2013
[Zhang2014] L. Zhang, Y. Shen, and H. Li, “VSI: A visual saliency-induced index for perceptual image quality
assessment,” IEEE Transactions on Image Processing, vol. 23, no. 10, pp. 4270–4281, Oct 2014
19
Backup
20
ESPL Database: Distortion Parameters
• Interpolation
• Images downsampled by factors ranging from 3 to 6
• Upsampled back using nearest neighbor interpolation
• Blur
• RGB color channels filtered with circularly symmetric 2D Gaussian kernel
• Standard deviation ranging from 1.25 to 3.5 pixels
• Same kernel employed for each color channels
• Additive Noise:
• Same noise variance used for all color channels
• Noise standard deviation ranged from 0.071 to 0.316 pixels
Back
Introduction |
ESPL Database|
ESPL Objective Image Quality| Distortion Classification| Conclusions
21
ESPL Database: Distortion Parameters (cont’d)
• JPEG compression
• MATLAB imwrite function was used
• Bits per-pixel (bpp) ranged from 0.0445 to 0.1843.
• Higher bpp images were not considered to better simulate playing a cloud
video game under restricted bandwidth conditions.
• Simulated Fast Fading Channel
• Original images compressed into JPEG2000 bitstreams
• Wireless error resilience features enabled and 64 x 64 tiles
• Transmitted over a simulated Rayleigh-fading channel
• Signal-to-noise ratio (SNR) was varied at the receiver from 14 to 17 dB
Back
• SNRs greater than 17 dB did not introduce perceptible distortions due to
the error resilience feature of the JPEG2000 codec.
Introduction |
ESPL Database|
ESPL Objective Image Quality| Distortion Classification| Conclusions
22
ESPL Database: Methodology
• Single Stimulus Continuous Quality Evaluation (SSCQE) method
• Each subject evaluated each image
• Three sessions, of one hour each, separated by at least 24 hours.
• Each session divided into two sub-sessions of 25 minutes
• Separated by a break of five minutes.
• 64 subjects
• Age range : 18 – 30 years
• Mostly without prior experience in participation of subjective tests
• Verbal confirmation of 20/20 (corrected) vision was obtained
• Viewed roughly 175 test images during each session
• Randomly ordered using a random number generator
• Testing sessions were preceded by training session of 10 images
Introduction |
ESPL Database|
Back
ESPL Objective Image Quality| Distortion Classification| Conclusions
23
ESPL Database : Methodology (cont’d)
• User interface programmed on MATLAB using Psychology Toolbox
• NVIDIA Quadro NVS 285
• Dell 24 inches U2412M display
• Normal office illumination
• Each image displayed for 12 seconds
• Viewing distance: 2-2.25 times display height
• Scores between 0-100 was entered
Back
Introduction |
ESPL Database|
ESPL Objective Image Quality| Distortion Classification| Conclusions
24
ESPL Database: Processing of raw scores
• Raw scores were first converted to raw quality difference scores:
dij = riref ( j ) - rij
• rij: Score assigned to j-th image by the i-th subject
• riref(j): Score assigned by same subject to corresponding reference
• Difference scores normalized for each subject and averaged
Histogram of DMOS scores
Introduction |
ESPL Database|
Scatter plot of DMOS scores
Back
ESPL Objective Image Quality| Distortion Classification| Conclusions
25
ESPL Database: Processing of raw scores
• Raw scores were first converted to raw quality difference scores:
dijk = riref ( j)k - rijk
• rijk: Score assigned to j-th image by the i-th subject in k-th session
• riref(j)k: Score assigned by same subject to corresponding reference
• DMOS score is zero for reference images
• DMOS scores converted to Z-scores per session
N
1 ik
mik =
å dijk
N ik j=1
zij' =
N
ik
1
s ik =
(dijk - mik )2
å
N ik -1 j=1
zijk =
dijk - mik
100(zij + 3)
6
1 M '
DMOS j = å zij
M i=1
s ik
Nik: Number of videos seen by i-th subject in k-th session
Back
Introduction |
ESPL Database|
ESPL Objective Image Quality| Distortion Classification| Conclusions
26
ESPL Database: Outlier rejection
• Done as per ITU-R BT 500.11 recommendation
• Compute kurtosis of scores per subject to check Gaussianity
• If kurtosis falls between the values of 2 and 4 (Gaussian)
• Subject rejected if more than 5% of his scores falls outside ±2σ from mean.
• For non Gaussian distributions
• Subject rejected if more than 5% of his scores falls outside ±4.47σ from mean.
• 12 out of 64 subjects rejected
• Testing degree of consensus among subjects:
• Subjects divided into two groups randomly
• DMOS scores for all the image calculated individually from each group
• Pearson’s linear correlation coefficient was 0.9813 between the groups
• Shows a high level of consensus among the subjects
Back
Introduction |
ESPL Database|
ESPL Objective Image Quality| Distortion Classification| Conclusions
27
ESPL Database: Calculating correlations
• Let Qj be the quality predicted by the IQA algorithm for the j-th image.
• Four parameter monotonic logistic function fit IQA predictions to quality
scores:
b1 - b2
Q'j = b2 +
1+ exp[-
Q j - b3
]
| b4 |
N
• Spearman’s Rank-order correlation coefficient: SRCC = 1-
6å di2
i=1
2
N(N -1)
• where di is the difference between the i-th image’s ranks is subjective and
objective evaluations.
• Kendall’s correlation coefficient: KCC = N c - N d
0.5N(N -1)
• Nc and Nd are the number of concordant (of consistent rank order) and
discordant (of inconsistent rank order) pairs in the data set respectively.
Back
Introduction |
ESPL Database|
ESPL Objective Image Quality| Distortion Classification| Conclusions
28
ESPL IQA: Modeling of MSCN Gradients
Distribution of the gradient magnitude of MSCN coefficients
•
•
Modeled using Rayleigh, Weibull and Nakagami distributions
Mean square error, J-Divergence, and Pearson’s Chi-squared values for distributions fitted to
histograms of MSCN coefficients, averaged over 221 pristine synthetic images [Kundu 2014]
MSE
J
Chi Square
Rayleigh
0.00891
4.730
0.769
Weibill
0.0251
5.00432
0.663
Nakagami
0.00916
5.304
0.892
Back
Introduction |
ESPL Database|
ESPL Objective Image Quality| Distortion Classification| Conclusions
29
ESPL IQA: Comparison with Natural Scenes
Histogram of scale parameter
(JS divergence = 1.5655)
Kurtosis
Histogram of shape parameter
(JS divergence = 1.0503)
Skewness
Skewness-kurtosis scatter plot
of MSCN coefficients of synthetic images[Kundu2014] (221 images) and natural images
Back
[Martin2001] (500 images from Berkeley Segmentation Dataset)
Introduction |
ESPL Database|
ESPL Objective Image Quality| Distortion Classification| Conclusions
30
Generalized Gaussian Density
• The GGD
pg (r)=
b
( )
2sG b
-1
(
exp r / s
)
b
r ÎÂ, s,b>0
includes the special cases
b = 1 (laplacian density)
b = 2 (gaussian density)
b = ¥ (uniform density)
• Many authors have observed the GGD behavior of BP image
signals.
[Ref: Dr. Alan C. Bovik,
EE381V Digital Video,
Spring 2015]
• Wavelet coefficients
• DCT coefficients
• Usually reported that b » 1 but varies (0.8 < b < 1.4).
Back
Introduction |
ESPL Database|
ESPL Objective Image Quality| Distortion Classification| Conclusions
31
Symmetric Alpha Stable
• A random variable X is called stable if its characteristic function can be
expressed as:
• sgn(t): sign of t
•
Back
Introduction |
ESPL Database|
ESPL Objective Image Quality| Distortion Classification| Conclusions
32
Steerable Filter
• Create directional derivative of gaussian in arbitary direction
by a coordinate rotation by q:


  x 2  y2 
1

h θ (x) =
 cos θ  x   sin θ  y  exp 
4 
2


2πσ
2σ


2
 j   cos θ  U   sin θ  V   exp  2  πσ  U 2  V 2 




• In particular:
h θ (x) =  cos θ  h 0 (x)   sin θ  h π (x) 
 cos θ  H 0 (u)  sin θ  H π (u) = Hθ (u)
2
2
• A derivative-of-gaussian filter of any orientation (derivative
direction ) is exactly a linear combination of two orthogonal
derivative-of-gaussian filters
[Ref: Dr. Alan C. Bovik, EE381V Digital Video, Spring 2015]
Introduction |
ESPL Database|
Back
ESPL Objective Image Quality| Distortion Classification| Conclusions
33
ESPL IQA: Steerable Pyramid comparison
(a) Orientation = 0 degree
(b) Orientation = 60 degrees
(c) Orientation = 0 degree
(d) Orientation = 60 degrees
Histogram of scale (a)(b) and shape parameters(c)(d) of the steerable pyramid decomposition
of synthetic images (221 images) and natural images
Back
[Martin2001] (500 images from Berkeley Segmentation Dataset)
Introduction |
ESPL Database|
ESPL Objective Image Quality| Distortion Classification| Conclusions
34
ESPL IQA: Performance-Time Complexity
Tradeoff
• Some NR-IQA metrics comparable with the best FR-IQA
algorithms.
Introduction |
ESPL Database|
Back
ESPL Objective Image Quality| Distortion Classification| Conclusions
Download