FACULTY OF SCIENCE AND ENGINEERING Linköping Studies in Science and Technology Dissertation No. 2272 Linköping Studies in Science and Technology, Dissertation No. 2272, 2023 Department of Science and Technology Linköping University SE-581 83 Linköping, Sweden www.liu.se Tanaboon Tongbuasirilai Data-Driven Approaches for Sparse Reflectance Modeling and Acquisition Tanaboon Tongbuasirilai Data-Driven Approaches for Sparse Reflectance Modeling and Acquisition 2023 Linköping studies in science and technology. Dissertation, No. 2272 Data-driven approaches for sparse reflectance modeling and acquisition Tanaboon Tongbuasirilai Division of Media and Information Technology Department of Science and Technology Linköping University, SE-601 74, Norrköping, Sweden February 2023 Data-driven approaches for sparse reflectance modeling and acquisition Copyright © 2023 Tanaboon Tongbuasirilai (unless otherwise noted) Division of Media and Information Technology Department of Science and Technology Linköping University, Campus Norrköping SE-601 74 Norrköping, Sweden ISBN ISBN ISSN DOI : : : : 978-91-7929-557-8 (Print) 978-91-7929-558-5 (PDF) 0345-7524 10.3384/9789179295585 Printed in Sweden by LiU-Tryck, Linköping, 2023 This work is licensed under a Creative Commons AttributionNonCommercial 4.0 International License. https://creativecommons.org/licenses/by-nc/4.0/ Description of the cover image The front cover of this thesis illustrates BRDFs in the form of multiple rendered spheres along BRDF curves (RGB signals) where the glossy BRDF curves are placed above the diffuse BRDF curves. The front illustration portrays the results of the thesis contents. The blue wave signals in the back cover represent sparse signals which serve as the BRDF encoding process (thesis’s contributions) to reconstruct the BRDF signals in the front cover. The orange abstract waves in the background indicate that the BRDF is a type of signals around us. The materials used to render all the spheres are measured isotropic BRDFs from Mitsubishi Electric Research Laboratories (MERL). Abstract Photo-realistic rendering and predictive image synthesis are becoming increasingly important and utilized in many application areas ranging from production of visual effects and product visualization to digital design and the generation of synthetic data for visual machine learning applications. Many essential components of the realistic image synthesis pipelines have been developed tremendously over the last decades. One key component is accurate measurement, modeling, and simulation of how a surface material scatters light. The scattering of light at a point on a surface (reflectance and color) is described by the Bidirectional Reflectance Distribution Function (BRDF); which is the main research topic of this thesis. The BRDF describes how radiance, light, incident at a point on a surface is scattered towards any view-point from which the surface is observed. Accurate acquisition and representation of material properties play a fundamental role in photo-realistic image synthesis, and form a highly interesting research topic with many applications. The thesis has explored and studied appearance modeling, sparse representation and sparse acquisition of BRDFs. The topics of this thesis cover two main areas. Within the first area, BRDF modeling, we propose several new BRDF models for accurate representation of material scattering behaviour using simple but efficient methods. The research challenges in BRDF modeling include tensor decomposition methods and sparse approximations based on measured BRDF data. The second part of the contributions focuses on sparse BRDF sampling and novel highly efficient BRDF acquisition. The sparse BRDF sampling is to tackle tedious and timeconsuming processes for acquiring BRDFs. This challenging problem is addressed using sparse modeling and compressed sensing techniques and enables a BRDF to be measured and accurately reconstructed using only a small number of samples. Additionally, the thesis provides example applications based on the research, as well as a techniques for BRDF editing and interpolation. Publicly available BRDF databases are a vital part of the data-driven methods proposed in this thesis. The measured BRDF data used has revealed insights to facilitate further development of the proposed methods. The results, algorithms, and techniques presented in this thesis demonstrate that there is a close connection between BRDF modeling and BRDF acquisition; efficient and accurate BRDF modeling is a by-product of sparse BRDF sampling. v Populärvetenskaplig Sammanfattning Fotorealistisk rendering och prediktiv bildsyntes har blivit allt viktigare och används i flera olika tillämpningsområden, allt ifrån produktion av visuella effekter och produktvisualisering till digital design och generering av syntetiska data för tillämpningar inom visuell maskininlärning. Utvecklingen har tagit en ordentlig fart under de senaste decennierna för många av de väsentliga komponenterna i det fotorealistiska bildsyntes-området. En nyckelkomponent inom området är att noggrant kunna mäta, modellera och simulera ljusets spridning från ytan hos ett material. Ljusspridningen från en punkt på en yta med reflektion och färg, beskrivs av en funktion (eng. BRDF); vilket är det huvudsakliga forskningsområdet i den här avhandlingen. BRDF beskriver hur strålning, ljus i detta fall, träffar en punkt på en yta och sprids mot varje observerad synvinkel. En noggrann uppmätning och representation av materialegenskaper är en fundamental del i fotorealistisk bildsyntes och omger ett väldigt intressant forskningsområde med många tillämpningar. Den här avhandlingen har utforskat och studerat modellering för materialytors utseende, glesa representationer och glesa mätningar av ljusspridningsfunktioner. Avhandlingen täcker två huvudområden. Inom det första området, BRDF-modellering, så presenterar vi ett flertal nya BRDF-modeller för noggrann representation av ljusspridningens beteende från materialets yta genom att använda simpla men effektiva metoder. Forskningsutmaningarna inom BRDF-modellering inkluderar både metoder för tensoruppdelning och glesa approximationer baserat på uppmätt BRDF-data. Den andra delen fokuserar på gles BRDF-sampling och en ny och effektiv mätningsmetod för att mäta BRDF. Syftet med den glesa BRDF-samplingen är för att förenkla och snabba upp de enormt tidskrävande processerna som krävs för att mäta BRDF. Detta utmanande problem löses genom att använda glesa modeller och tekniker från compressed sensing som möjliggör att BRDF kan be uppmätt och noggrant rekonstruerad genom att endast använda ett fåtal uppmätta sampel. Slutligen så visar avhandlingen ett flertal exempel på tillämpningsområden från forskningen, så väl som tekniker för BRDF-editering och interpolation. De BRDF-databaser som är öppna och tillgängliga för allmänheten är en vital del av de datadrivna metoderna som presenteras I denna avhandling. De uppmätta BRDF-data som använts har öppnat nya insikter för vidare utveckling av de framtagna metoderna. Resultaten, algoritmerna och teknikerna presenterade i den här avhandlingen visar på att det finns en nära koppling mellan BRDF modellering och BRDF-mätning; effektiv och noggrann BRDF-modellering är en biprodukt av gles BRDF-sampling. vii Acknowledgments Life is a journey and my thesis is analogous to a record of one moment of my life. It contains loads of wonderful memories during my PhD studies. I set sail and carried on this expedition with the supports of many amazing and great people. I could not have done it without their guidance, advice and encouragement. This part of this thesis is dedicated to the expression of my sincere gratitude towards my supervisor, co-supervisors, colleagues, friends and family. First of all, I would like to thank my supervisor, Jonas Unger, for his support, advice, guidance and patience throughout my PhD expedition. Your guidance assisted in steering this journey in the right direction. You have had positive attitudes and confidence to encourage me. All of the manuscripts could not be perfect without your touching. It has been a privilege to work with such a great and talented researcher like you. I would like to thank my co-supervisor, Ehsan Miandji. You introduced me to the sparseland and this inspired me and my future direction. Your enthusiasm motivated and influenced my research. I am grateful for all of our discussions and the joyful moments we had outside of work. Your food preference makes me so happy about Thai food and chilis. Additionally, I appreciate the research facilities and environment established by Anders Ynnerman. Your efforts and contributions to the division of Media and Information Technology (MIT) are invaluable. Sailing to the sea of academic and research work, I have met fantastic people. They are amazing and prominent crews on this journey. I would like to thank my friends and colleagues in the Computer Graphics and Image Processing (CGIP) group. Apostolia Tsirikoglou is the first crew whom I met before the beginning of this expedition. Thank you for the enjoyment you brought to our group. Thank you to Saghi Hajisharif for all suggestions you gave me. Whenever I need information, you are much better than Wikipedia. When I want to discuss deep-learning topics, Gabriel Eilertsen is the expert whom I can rely on. Your unintentional humor makes our chats more fun. Thank you to Gabriel Baravdish for the small talks we had in the morning at the office. The chilis on your food made me excited. Thank you to Behnaz Kavoosi for the collaborations, discussions and small chats we had in our office. Thank you to Fereshteh Abedini for all favors I have asked for. Thank you to chinese PhD students, Wen Cao and Yifan Ding, for insightful information about China. Thank you to Igor Fedorov for setting up servers in the lab. I would like to extend thanks to the old crews, Joel Kronander, Per Larsson and Andrew Gardner, who strengthened this group with their inventions, ideas and discussions. Thank to people from Linköping, Joel Hedlund, Karin Stacke, Milda Poceviciute, Shreyas Shivakumara and Betül Eren Keskin for bringing up the meetings lively. ix x During the transition step of the group, I would like to thank Daniel Jönsson for taking this responsibility. Moreover, all the paper and administrative works could not be done smoothly without the help from Eva Skärblom and Carina Enemård. During my journey in Sweden, I was fortunate to meet a lovely family, who were incredibly kind and helpful to me. I would like to express my deep gratitude to Pom Jonsson and her sons, Martin and Robin, for all their assistance and support. They were like a Swedish version of my own family. I cannot thank you enough for all Thai friends whom I had a great time with, including Kittichate Visuttijai, Auttapol Chaisomboonpun, and Pattharaporn Thongnim. Furthermore, my deepest gratitude is granted to my family in Thailand. You are supportive of whatever I needed during my studies. I could not have done it without them. Last but not least, the Buddha’s teachings have been a source of great comfort and guidance for me, especially during difficult and stressful times. His wisdom has helped me to prepare my mind and to find peace in the face of uncertainty. I am grateful for the Buddha’s teachings and for the ways in which they have helped me to grow and to better understand the world around me. I hope that the following quote, which has been a source of inspiration for me, will also be helpful and uplifting for others. "A disciplined mind brings happiness." The Gautama Buddha Tanaboon Tongbuasirilai Norrköping, February 2023 List of Publications This thesis includes research that is based the publications listed below: • A. Tsirikoglou, J. Kronander, P. Larsson, T. Tongbuasirilai, A. Gardner, and J. Unger, “Differential appearance editing for measured brdfs,” in ACM SIGGRAPH 2016 Talks, ser. SIGGRAPH ’16. New York, NY, USA: Association for Computing Machinery, 2016 • T. Tongbuasirilai, J. Unger, and M. Kurt, “Efficient brdf sampling using projected deviation vector parameterization,” in 2017 IEEE International Conference on Computer Vision Workshop (ICCVW). Los Alamitos, CA, USA: IEEE Computer Society, oct 2017, pp. 153–158 • T. Tongbuasirilai, J. Unger, J. Kronander, and M. Kurt, “Compact and intuitive data-driven brdf models,” The Visual Computer, vol. 36, no. 4, pp. 855–872, 2020 • T. Tongbuasirilai, J. Unger, C. Guillemot, and E. Miandji, “A sparse nonparametric brdf model,” ACM Trans. Graph., apr 2022 • E. Miandji, T. Tongbuasirilai, S. Hajisharif, B. Kavoosighafi, and J. Unger, “Frostbrdf : A fast and robust optimal sampling technique for brdf acquisition,” To be sumitted to ACM Trans. Graph., 2023 Other publication by the author that is not included is a journal paper. The topic is about person identification using Electroencephalography (EEG) signals. • T. Wilaiprasitporn, A. Ditthapron, K. Matchaparn, T. Tongbuasirilai, N. Banluesombatkul, and E. Chuangsuwanich, “Affective eeg-based person identification using the deep learning approach,” IEEE Transactions on Cognitive and Developmental Systems, vol. 12, no. 3, pp. 486–496, 2020 xi Contributions This thesis provides two major sets of contributions to Bidirectional Reflectance Distribution Function (BRDF). The first set of contributions focuses on datadriven BRDF modeling, representations and its applications (BRDF editing and interpolation) as presented in Paper A, Paper C and Paper D. The second set of contributions develops algorithms for sparse BRDF measurements as presented in Paper B and Paper E. In what follows, the publications included in this thesis are listed with a short description below. Paper A : Differential appearance editing for measured brdfs A. Tsirikoglou, J. Kronander, P. Larsson, T. Tongbuasirilai, A. Gardner, and J. Unger, “Differential appearance editing for measured brdfs,” in ACM SIGGRAPH 2016 Talks, ser. SIGGRAPH ’16. New York, NY, USA: Association for Computing Machinery, 2016 This paper presents a BRDF editing method as well as a BRDF interpolation technique on measured BRDF data. Our proposed method utilizes a parametric model to facilitate users in the editing and interpolation as the first step. The edited BRDF will be further computed as the differential appearance, a residual vector. The final step modifies the measured BRDF using the residual vector. Our method can preserve some effects which cannot be captured by the parametric model since the final edits are performed on the measured BRDF domain yet users is able to achieve desirable effects with few parameters. The author of this thesis was responsible for providing tools and part of implementations for the project. Paper B : Efficient BRDF Sampling Using Projected Deviation Vector Parameterization T. Tongbuasirilai, J. Unger, and M. Kurt, “Efficient brdf sampling using projected deviation vector parameterization,” in 2017 IEEE International Conference on Computer Vision Workshop (ICCVW). Los Alamitos, CA, USA: IEEE Computer Society, oct 2017, pp. 153–158 This work was inspired by a parametric BRDF model. Parameterizing BRDFs under Projected Deviation Vector (PDV) domain, isotropic BRDFs, specifically glossy materials, can be represented by only two 1D functions. The two 1D basis functions can be measured using a simple gonioreflectometer with two degree-of-freedom configuration as opposed to typical four degree-of-freedom configuration. This results in an efficient sampling method where a densely-measured isotropic BRDF can be recovered using two measured bases. The evaluation was performed on xiii xiv simulated measurements to demonstrate small visual errors introduced by our factor measurement. The project had an international collaboration with a researcher in the related field. This work was presented in a workshop of ICCV 2017. The author of this thesis was responsible for the project initiation, implementation of the algorithm, and evaluations of the method. The author also contributed to the majority of the written manuscript and the oral presentation. Paper C : Compact and intuitive data-driven BRDF models T. Tongbuasirilai, J. Unger, J. Kronander, and M. Kurt, “Compact and intuitive data-driven brdf models,” The Visual Computer, vol. 36, no. 4, pp. 855–872, 2020 This paper is a continuation of Paper B and was published in a journal. Two iterative-factorized BRDF models were proposed with importance sampling capability. The BRDF models utilize the characteristics found in the PDV parameterization and the CP Decomposition method (CPD). The importance sampling strategy is newly derived for the proposed models. Besides, analyses of parameterizations are provided to demonstrate some insights of relationship between BRDF values and several parameterizations. Our models have shown to outperform previous factorized BRDF models in both quantitative and qualitative evaluations. The project had a continuation of the international collaboration as in Paper B. The author of this thesis contributed to the ideas, design and implementations of the project as well as the written manuscript. Paper D : A Sparse Non-parametric BRDF Model T. Tongbuasirilai, J. Unger, C. Guillemot, and E. Miandji, “A sparse non-parametric brdf model,” ACM Trans. Graph., apr 2022 This paper presents a non-parametric BRDF model with a unique interpolation technique. Our proposed model represents isotropic BRDFs as multiple dictionaries, basis functions. A BRDF is then reconstructed using a linear combination of sparse coefficients and dictionaries. To obtain the dictionaries, a modified Aggregate Multi Dictionary Ensemble (AMDE) algorithm is developed. The measured BRDF datasets were used to train and evaluate our proposed model. Additionally, a model selection method is presented to pick best BRDF representation among resulting candidates without a visual comparison. Moreover, BRDF interpolations on the dictionary domain can be achieved through our derived technique. The project had an international collaboration with a researcher from INRIA. A part of this work was presented in CLIM 2021 workshop. The final version was published in ACM Transactions on Graphics 2022 and nominated to present in Siggraph 2023. The author of this thesis was responsible for the project initiation, design and implementation of the algorithm as well as the written manuscript. xv Paper E : FROST-BRDF : A Fast and Robust Optimal Sampling Technique for BRDF Acquisition E. Miandji, T. Tongbuasirilai, S. Hajisharif, B. Kavoosighafi, and J. Unger, “Frostbrdf : A fast and robust optimal sampling technique for brdf acquisition,” To be sumitted to ACM Trans. Graph., 2023 In this paper, a sparse BRDF acquisition framework was proposed. Sparse representation and compressed sensing are foundations of the problem reformulation such that our method produces an optimal set of sampling regions for a sparse BRDF acquisition. The technique is simple but efficient as the problem is reformulated under a system of linear equations. As a result, our method is faster and more robust as well as outperformed compared to current state-of-the-art which uses a heuristic search on a probabilistic method. The evaluation was performed through 10-fold cross-validations on publicly available BRDF databases. The author of this thesis was responsible for the implementation of the method, the experiments as well as the written manuscript. The manuscript will be submitted to CVPR 2023. Contents Abstract v Populärvetenskaplig Sammanfattning vii Acknowledgments ix List of Publications xi Contributions xiii 1 Introduction 1.1 Towards the Virtual World 1.2 Object Representation 1.2.1 Geometric Shape Modeling 1.2.2 Surface Reflectance Modeling 1.3 Surface Reflectance Measurement 1.4 Objectives and Contributions 1.5 Thesis Outline 1 2 4 5 6 8 9 11 2 Fundamentals of Rendering and Material Apperance 2.1 Fundamentals of Light Transport 2.1.1 Light Transport Problem 2.1.1.1 Radiometry 2.1.1.2 Path Integral Equation 2.1.2 Light Transport Algorithm 2.1.2.1 Monte Carlo Integration 2.1.2.2 Importance sampling 2.1.2.3 Transforming between PDFs 2.1.2.4 The cosine-weighted hemisphere sampling 2.2 Surface Reflection 2.2.1 The Reflectance Functions 2.2.1.1 Physically Plausible Properties 2.2.1.2 Hierarchical chart of reflectance functions 2.3 Parameterization 2.3.1 Standard parameterization 2.3.2 Half-Diff parameterization 2.3.3 Half-Out parameterization 2.3.4 Projected Deviation Vector (PDV) parameterization 2.3.5 Other parameterizations 13 14 14 15 17 20 20 21 21 22 22 23 23 26 27 28 28 29 29 30 xvii Contents 2.4 Data Transformation 2.4.1 Log-based Transformation 2.4.2 Gamma Transformation Material Databases and Acquisition Systems 2.5.1 MERL 2.5.2 RGL-EPFL 2.5.3 Other databases 2.5.4 The catadioptric measurement 30 31 33 34 34 35 36 36 3 Tensor Decompositions and Sparse Representation 3.1 Tensor Decompositions 3.1.1 Tensor Operations 3.1.2 Tensor Decomposition Methods 3.1.2.1 CP Decomposition (CPD) 3.1.2.2 Higher-Order SVD (HOSVD) 3.2 Sparse Signal Representation 3.2.1 Sparse Representation 3.2.2 Dictionary Learning 37 38 38 40 40 41 42 42 44 4 Surface Reflectance Modeling and its applications 4.1 Outline and Contributions 4.2 BRDF Models 4.2.1 The Empirical BRDF Model 4.2.2 The Analytical BRDF Model 4.2.3 The Data-Driven BRDF Model 4.3 Our Data-driven BRDF Models 4.3.1 The Factorized BRDF Model 4.3.1.1 Motivation 4.3.1.2 Our BRDF model 4.3.1.3 Importance sampling on PDV 4.3.2 The Sparse Non-parametric BRDF Model 4.3.2.1 Motivation 4.3.2.2 An Aggregated Multi Dictionary Ensemble for BRDFs 4.3.2.3 BRDF Encoding and Decoding 4.3.2.4 BRDF model Selection 4.4 BRDF Applications 4.4.1 Differential apperance editing 4.4.2 BRDF interpolation in sparseland 4.5 Summary and Future Work 47 47 48 48 51 53 55 56 56 58 59 61 61 5 Sparse Sampling for BRDF Acquisition 73 2.5 xviii 62 65 66 68 68 69 71 Contents 5.1 Outline and Contributions 5.2 The sparse BRDF measurement 5.2.1 The BRDF factor measurement using PDV 5.2.1.1 Motivation 5.2.1.2 The separable BRDF models 5.2.1.3 Results 5.2.2 A Fast and Robust Optimal Sampling Technique : FROST 5.2.2.1 Motivation 5.2.2.2 The Dictionary 5.2.2.3 Compressed sensing 5.2.2.4 FROST 5.2.2.5 BRDF Reconstruction 5.2.2.6 Results 5.3 Summary and Future Work 73 74 74 74 74 76 78 78 80 80 82 84 85 88 6 Summary and Outlook 91 Bibliography 95 Publications 121 Paper A 123 Paper B 129 Paper C 139 Paper D 161 Paper E 183 xix xx Chapter 1 Introduction Computer graphics is currently being employed in a wide range of important domains. Photo-realistic and predictive rendering is a core element and research area of central importance to many industries as it enables applications including digital design, visual effects in movies, computer games, product visualization, computer-aided tools for education, virtual and augmented reality, and even sensor simulation and generation of input to machine learning and artificial intelligence systems, to name a few. Figure 1.1 illustrates the layout and steps of a photorealistic rendering pipeline. As can be seen, material appearance is at the heart of the modeling, representation, and simulation of virtual objects in photo-realistic image synthesis. The appearance of different materials is diverse and originates from different phenomena governed by the physics of light. An object’s appearance is defined by the underlying scattering properties of the material such as color, reflection, and transparency. To enhance the accuracy and realism in computer-generated renderings, it is becoming increasingly common to measure and model real-world appearance properties, i.e., creating accurate digital copies/twins from physical material samples. A key challenge, however, of the reflectance measurement process is that it is very time-consuming. Depending on the measurement device and data resolution, the measurement process can take hours or even days for obtaining the reflectance data of a single point on a single material. Moreover, once we have the measured data, the next challenge is that realistic rendering and simulation require advanced reflectance models capable of representing the acquired appearance characteristics accurately while supporting efficient computations. The goal of this 2 Chapter 1 • Introduction thesis is to contribute to solving these challenges. In this introductory chapter, we give an overview of techniques for simulating virtual objects and environments using photo-realistic image synthesis. We also explain briefly the different components of the representations used to produce accurate visual results at high fidelity, and finally, we summarize the chapter by presenting the thesis’s objectives and its outline. Integrator Scene Object representation Light Material Camera parameter Light parameter Figure 1.1 — An overview of a photo-realistic rendering pipeline. The main components, highlighted in orange, are a rendering algorithm, and the scene description. The photo-realistic renderer is the rendering algorithm consisting of Camera and Integrator modules. Each scene contains object representations, camera parameters and light parameters. A rendered image at top-right is then reproduced using the renderer. 1.1 Towards the Virtual World There are countless benefits of using computer-generated images. It can help companies save production costs before delivering products to their customers. For example, automotive manufacturers can develop virtual prototypes for seamless testing of cars. Movies and advertisements can be produced creatively and uniquely without the need for the real-world locations. Furniture companies can design their own catalogs virtually to save costs and efforts in setting up real photo-sets and environments. Moreover, the advancement of artificial intelligence and autonomous systems require an abundance of training data. Synthetic data generation [7] is, thus, a key to develop the next generation autonomous systems. These examples 1.1 • Towards the Virtual World 3 indicate the importance of using tools for generating realistic images and illustrates some of the motivations as to why we see such growth in tools based on visualization and image synthesis across industries. Generating photo-realistic images is a complex and very computationally demanding problem. Broadly, it requires accurate and detailed representations of all visual aspects of the objects and light sources making up the virtual scene, as well as complex rendering algorithms accurately simulating the physics of light and matter interactions. Indeed, both components have been researched for decades to enhance the achievable realism and to accelerate the computations. To simulate the visual aspects of a virtual world realistically, we seek to solve the light transport problem. Most photo-realistic rendering systems are based on the ray tracing algorithm [8]. In ray tracing, we cast sampled rays from a virtual camera into the scene, tracking light paths and their interactions with the virtual objects. Ray tracing approaches have been developed since 1980s to solve more and more complex light transport problems, and more and more physically plausible effects in virtual environments. (a) dragon (b) ball (c) classroom Figure 1.2 — Three rendered images are generated using PBRT [9]. Each scene was rendered with 4096 samples per pixel. As illustrated in Figure 1.1, the rendering engine, or pipeline, contains different modules working in concert to solve the light transport problem and to generate the final image. The scene module contains descriptions of the virtual camera, the objects, lights, and surface materials in the scene and facilitates the simulation of scattering, i.e., the interaction between light and materials. The rendering module generates and gathers ray samples, which are traced from the virtual camera into the scene. When a ray hits an object the light-object interaction is simulated based on properties such as object surface normals, material properties, and lighting information. The rays are then processed in the integrator module which is responsible for the core radiance calculations. The integrator collects information from the lights (e.g., color and intensity) and from the material properties (e.g., color and reflection) according to the scene descriptions, and utilizes a physicallybased solver to compute the radiance information associated to each ray. Finally, 4 Chapter 1 • Introduction the renderer generates an image in which each pixel is computed by filtering the radiance contributions from each sampled ray into the final output. Figures 1.2(a), (b) and (c) are examples of rendered images generated using a photo-realistic rendering algorithm. The images from left to right have distinct environments in which different phenomena are demonstrated. The dragon shown in Figure 1.2(a) was rendered using an infinite light source placed infinitely far away from the object. Its appearance is painted by a matte material producing a dull look. The shiny ball, placed on a checkerboard pattern floor, in Figure 1.2(b), appears to be placed into a real scene as it was illuminated by radiance captured from a real-world location. Lastly, a Japanese classroom in Figure 1.2(c) portrays shadow effects, from both direct and indirect lighting, obtained automatically from the photo-realistic rendering algorithm. We used PBRT [9] as the renderer to reproduce images from publicly available PBRT scenes provided by Benedikt Bitterli [10]. PBRT is a photo-realistic rendering engine developed using a physically-based approach, and path tracing. Path tracing is a fundamental concept employed by physically-based methods for solving the rendering equation proposed by Kajiya [11]. It can be thought of as an enhanced version of Whitted ray tracing [8]. More details about the light transport problem and the computational solutions will be further discussed in Chapter 2. The scene description is another crucial component where artists can take part in designing the look and feel of rendered images. Each rendered scene, in Figure 1.2, has a unique setting for light, objects and camera parameters. Scene illumination can be controlled by light parameters. Camera parameters characterize image parameters such as view location, image size and resolution. Each virtual object requires a geometric description, to represent the object’s shape, and reflection properties, to represent the object’s color. This thesis primarily focuses on surface reflection modeling for physically-based rendering. In the next section, we will give a brief description of virtual object representation. 1.2 Object Representation The human visual system (HVS ) characterizes real-world objects by surface profiles, reflectance properties, and environmental light. When a light ray hits an object, some photons are absorbed by the material surface and some are scattered depending on the material properties. Once the scattered light reaches human eyes, the perceived light is translated into chemical and electrical signals supporting the object recognition tasks in the brain. The intrinsic properties of objects are defined by geometric shape and surface reflectance. The process of creating digital 3D contents can be divided into two tasks, i.e., geometric shape modeling and surface reflectance modeling. Modeling of 3D objects assembles primitive surfaces, so called 1.2 • Object Representation 5 polygons, such as rectangles and triangles, to carve an object into a complex shape. The latter process is to model surface reflectance in which it describes the portion of light scattering from a surface. 1.2.1 Geometric Shape Modeling 3D geometric shape modeling, in short 3D modeling, is a process in which artists build up 3D objects from primitive structures. There exist a variety of ComputerAided Design (CAD) software systems and computer graphics tools supporting these 3D modeling tasks. A 3D model consists of vertices and edges. A vertex is usually defined by Cartesian coordinates, (x, y, z). Two vertices are interconnected by an edge. Vertices and edges can be shared among neighboring primitives. Typically, triangles are used as primary primitives for 3D modeling. One can also use implicit functions to represent 3D objects such as spheres, cones and cylinders. (a) an implicit ball (b) a bunny Figure 1.3 — Two examples of 3D models defined by (a) an implicit function and (b) compound triangles of a bunny shape. Figure 1.3(a) shows an example of an implicit mathematical function representing a sphere, rendered in an indoor environmental light. The implicit sphere requires a radius as an input parameter to construct the virtual sphere in a scene. We can also construct a complex implicit shape using methods such as arithmetic, blobby and functional methods [12]. One of the advantages for constructing a complex shape from multiple implicit surfaces is that it guarantees smoothness and continuity of the object surface [13]. Even though the implicit surfaces in some cases are computationally expensive, they are still attractive for applications such as medical data visualization, deformations and collision detection [14, 15]. 6 Chapter 1 • Introduction Instead of representing a 3D object with implicit surfaces, a polygonal mesh can be used. A polygonal mesh can be formed by connecting tiny triangles making surfaces smooth. There are many 3D modeling approaches for creating triangular meshes. One can, for example, create a complex shape from a ball by subdividing it into small triangles and sculpt it. This task is called digital sculpting. Another 3D modeling technique is using image-based techniques. For instance, Figure 1.3(b) is the Standford Bunny containing about 725, 500 triangles. The bunny’s surface was reconstructed from range images using zippered polygon meshing method [16]. The scanner was a Cyberware Model3030 MS producing range points for single polygonal mesh recovery. There is also a well-known technique, so called photogrammetry, which recovers 3D surface points of a scene from multiple images. The method recovers not only 3D points of objects in the scene, but also surface reflectance and/or textures [17, 18]. This was primarily adopted in applications such as 3D cultural heritage preservation and architectural scene recovery, and later used in 3D forensics recreation [19]. For an overview and more details on algorithms and representations of 3D modeling, please refer to the following surveys and review reports [20, 21, 22]. 1.2.2 Surface Reflectance Modeling The characteristics of real-world materials enable various visual phenomena. In practice, understanding optical effects gives us insight about material surface modeling. During rendering, optical effects are simulated using light transport models such as geometric optics, wave optics and quantum optics [23, 24, 25]. The most simple type of light transport, explaining a wide range of optical phenomena such as reflection and refraction, can be modeled by geometric optics. When light reaches a material surface, optical effects, created by scattered light from a material, are described by the surface reflectance. Scattering of light of different wavelengths determines the color of an object as perceived by the human visual system. In general, materials can be classified by light scattering behaviors. For instance, diffuse materials scatter light almost equally in all directions. The ideal case is called Lambertian reflection where light scatters equally in all directions. Some examples of this material class are a matte paint and an acrylic felt. Another class of material scattering behavior is shiny materials exhibiting light in a lobe, so called a specular lobe, or perfect reflection generating a mirror-like effect. Mirrors and some metals, such as gold, are examples of shiny objects. Regarding various types of materials exhibiting a wide range of phenomenon, we need efficient ways to model surface reflectance covering a large spectrum of scattering effects. There is a large number of surface reflectance models representing a range of material classes; from simple diffuse materials to complex layered materials. In fact, the models can be broadly classified into three major classes which are 1.2 • Object Representation 7 empirical models, analytical models and data-driven models. In early stages of development in computer graphics, empirical models were proposed to represent surface reflectance based on empirical observations. The Lambertian reflection is an essential component in many empirical reflectance models as it accounts for approximating diffuse reflection. Phong reflection model [26] is a classical example of an empirical reflectance model which combines ambient color, a diffuse reflection and a specular reflection. The model computes a specular term using the relation between the surface normal, the perfect reflection vector and the viewing vector. The improved version of the Phong model is Blinn-Phong model [27] which replaces the perfect reflection vector with the half vector, computed from the normalized sum of light-view directions. Even though most empirical models are computationally inexpensive, one major drawback is that they often lack physical interpretations. Most empirical models lack physical accuracy, thus some parameters are not intuitive and difficult to configure. Owing to the limitations of the empirical models, the adoption of physically plausible theories has been, thus, employed in the analytical modeling. A classical example of analytical models is Cook-Torrance model [28], which accounts for rough surfaces describing a broad range of materials such as metals and plastics. Microfacet theory is the foundation of this class, which includes several terms describing different microscopic effects, i.e., directional reflected distribution term, shadowing term, and the Fresnel effect. Considering the physically-based formulation in analytical models, most of them support fundamental properties of the surface reflectance, i.e., reciprocity and energy conservation. Analytical models are often limited when it comes to describing measured real-world materials, and are often more computationally expensive as they tend to be more complex. Using measured data of real-world materials has been proven to be a great choice which enhances realism in 3D digital content. For decades, reflectance acquisition systems have been developed to obtain real-world reflectance data. The acquired measured data is normally high-dimensional, hence, for dense sampling, it is a time-consuming task which requires high memory footprints. CuRet [29], Cornell reflectance data [30], MERL [31], UTIA [32, 33], Bonn [34] and RGL-EPFL [35] databases are publicly provided measured reflectance data sets. These databases offer an extensive range of measured materials in different formats and resolutions. Indeed, we can fine-tune parameters of an analytical model, or an empirical model, using the measured data and optimization algorithms. Since the introduction of measured reflectance data, the parameter fitting approach has, then, been a typical form of evaluation method for reflectance modeling research. Therefore, the use of data-driven models to represent the measured reflectance data becomes important. In this thesis, we focus primarily on data-driven modeling. Moreover, we will discuss more details about surface reflectance models later in Chapter 2 8 Chapter 1 • Introduction and Chapter 4. As mentioned, data acquisition has become an important element for increasing realism in photo-realistic rendering. Key factors considered when developing a reflectance acquisition system are measurement speed and data resolution. If it takes hours or days to measure a single material, it is not practical for generating databases with a large collection of materials. Moreover, if the measured data is not dense enough, its representation will be less accurate for recreating realistic 3D digital content. One of the contributions in this thesis is to bridge the gap between the speed and data density. The developed algorithms can speed up the measurement process and reconstruct high resolution reflectance data faithfully. In the next section, the advancement of surface reflectance acquisition systems will be briefly discussed. 1.3 Surface Reflectance Measurement The digital transformation trend has sparked increasing interest in material appearance acquisition. To accurately simulate the real world digitally, specialized acquisition systems are needed to capture the material appearance for rendering pipelines. Three main components are found in all acquisition systems, i.e., a light source, a sensor and a material sample. The configuration of these components varies depending on measurement formats. As shown below in Figure 1, the overall acquisition process involves data capture, processing, and storage, culminating in its use for rendering. Figure 1.4 — A pipeline illustrating the processes from material reflectance measurement to rendering. In the past decades, a number of acquisition systems have been proposed [36, 37]. The design choices of the acquisition systems have to consider factors such as type of measured materials, sample shapes, speed of measurement, setup cost, ease 1.4 • Objectives and Contributions 9 of use, etc. We can classify the acquisition systems broadly into four types, i.e., gonioreflectometers, image-based acquisition systems, catadioptric systems, and texture-based acquisition systems. Each of these acquisition systems has different pros and cons. The gonioreflectometer, for example, measures a single angle at a time making it slow for capturing the data but it is reliable and accurate. The image-based acquisition and catadioptric systems perform faster measurements but the data is often less reliable. Furthermore, measuring a texture - a type of material with varying properties across its surface (heterogeneous) - presents a particularly difficult problem. Accurately capturing this spatial information requires a sophisticated light setup. This added complexity only serves to underscore the importance of carefully designing and implementing material appearance acquisition systems. Regarding the mentioned factors, there is no ultimate solution for the acquisition system, but it is rather dependent on the application for which it is intended to be used. In Chapter 2, publicly available material databases and their setups will be explained in details. 1.4 Objectives and Contributions The research objectives of this thesis are to leverage measured reflectance data by developing data-driven reflectance models and develop minimal sampling strategies enabling accurate reconstruction of measured reflectance data. In particular, we focus on isotropic materials whose surfaces are homogeneous. Isotropic materials are invariant to rotation around surface normal. As a result, the isotropic reflectance data can be defined by three parameters. In addition, we also address the following problems. • Applications for editing and interpolating surface reflectance between materials from either measured data or data-driven reflectance models. • Design of sparse measurement schemes for single-layered isotropic materials. • A reflectance model capable of a variance reduction procedure reducing the noise of final rendered images. • A model selection method providing insight of the model quality without visual evaluations. Contributions of the research areas, addressed in this thesis, led by the author are presented below. 10 Chapter 1 • Introduction Data-Driven Reflectance Modeling This thesis presents new data-driven reflectance models which utilize publicly available databases, especially the MERL database. The first model is based on empirical observations that single-layered isotropic materials can be factorized into three factors (Paper B, C ). This observation was found in Projected-DeviationVector parameterization (PDV ), defining how the reflectance data is parameterized. Another proposed model employs a machine learning technique, called dictionary learning (Paper D). This approach learns multiple spaces in which reflectance data can be accurately described, and selects the best candidate model from the learned spaces. Our technical contributions to this area are as follows: • Non-parametric models capable of editing and interpolating between materials. (Paper A, C, D) • A study of different parameterizations, on hemisphere and projected plane, including Half-Diff parameterization [38], Half-Out parameterization [39, 40] and PDV parameterization. (Paper C ) • A variance reduction technique for the factorized model based on the PDV parameterization. (Paper C ) • A model selection, without the need for visual comparisons, based on a transformation of approximated reflectance error. (Paper D) • Mathematical formulations for material interpolations on multiple dictionary spaces. (Paper D) Sparse Sampling for Optimal Reflectance Reconstruction Sampling strategies for optimal reconstruction of reflectance data are presented in this thesis. We propose sparse sampling methods based on our factorized model and the compressed sensing technique. The sampling based on the factored model was derived from an empirical observation that the reflectance data can be approximated as the linear combinations between two individual components (Paper B). The characteristics of the PDV parameterization facilitates the design of factor measurement. The radially circular BRDFs in the PDV enable to model 2D BRDFs, hence, resulting in acquiring BRDFs by moving mechanical arms, a light source and a sensor, only on a plane. As a result, our factor measurement can reconstruct single-layered materials accurately, especially shiny and glossy materials. Additionally, the measured factors can be used for an importance sampling technique for variance reduction in rendering. Another contribution is the new formulation of finding a sensing matrix for material capture using compressed sensing. We formulate a new sampling framework in the form of coefficient recovery 1.5 • Thesis Outline 11 using Multiple Measurement Vector (MMV ) algorithms (Paper E). The formulation was derived from a dictionary learning perspective. While the state-of-the-art [41] employs a heuristic-based optimization seeking for sample locations, our sparse sampling formulation can be solved with deterministic methods that support theoretical guarantees. This results in faster optimization, robust optimal sampling locations, and better signal recovery. 1.5 Thesis Outline This thesis is divided into two major parts. The first part introduces preliminary concepts, background and modeling of surface reflectance data and efficient acquisition approaches. The second part is the collection of publications providing details of the research conducted by the author. The selected publications are consolidated into contents of this thesis. The first part contains five chapters. Chapter 2 gives overviews of the fundamental theories for solving the rendering equation as well as more details about surface reflectance data and acquisition setups. This chapter is suitable for audiences who are not familiar with rendering theories and the surface reflectance. Additionally, background and necessary elements of the surface reflectance and publicly available material databases are provided in the chapter. Chapter 3 is another preliminary part for tensor decomposition and sparse representation. The fundamental concepts and theories of tensor decomposition, used in Paper B and Paper C, and sparse representation, used in Paper D and Paper E, are provided. Then, we describe the contributions developed during the work on this thesis. When the surface reflectance data is available, from observations and/or acquisition, modeling comes into play as the representation method, e.g., for estimating unknown data points or encoding dense data. Chapter 4 then provides the background of the reflectance modeling and our data-driven reflectance models (Paper C and Paper D). We discuss the possibilities of factorized forms of the surface reflectance (Paper B and Paper C ) which also leads to an efficient sparse sampling for specific types of materials such as metals or glossy materials. Moreover, a dictionary-learning method (Paper D) is presented. Additionally, some useful tools (Paper A, C, D) are presented for working with the surface reflectance data. Chapter 5 introduces powerful tools for surface reflectance acquisition to efficiently recover a BRDF from sparsely sampled measurements (Paper B, C, E). We introduce a factor measurement based on PDV parameterization, and a novel method for the sparse measurements of reflectance solved by an MMV algorithm. Finally, we conclude this thesis in Chapter 6. Chapter 2 Fundamentals of Rendering and Material Apperance Throughout this thesis, we review problems and challenges in the area of the surface reflectance modeling and acquisition for photo-realistic image synthesis. Similar problems have long been studied as theoretical concepts, practical algorithms, and computational problems in mathematics, physics, and computer science. In this chapter, we give an overview of fundamental concepts in physics of light, radiometric quantities, and the light transport equation (LTE). These areas of knowledge pave the way for understanding the problems related to material capture and modeling in more details. The intersection between physics of light and computer graphics algorithms has led to the invention of a rendering technology, called Global illumination, which aims to simulate detailed scattering phenomena. Light can be emitted, scattered, transmitted and absorbed at surfaces. Based on wave optics, light can also bend through different mediums through light refraction. We distinguish between different global illumination algorithms, e.g., Path tracing and Radiosity, by the mathematical frameworks used in their formulations. In this thesis, we focus on path tracing since it is a general rendering framework and currently the primary technology used for rendering using Graphics Processing Units (GPUs). An advantage of path tracing is that it simulates geometric optics in a way such that it enables the simulation of scattering effects adequate for a wide range of illumination configurations. Thus, wave optics models and quantum mechanics, contributing to more advanced natural light phenomena, are not included. 14 Chapter 2 • Fundamentals of Rendering and Material Apperance Surface reflectance is a key component in realistic image synthesis. Modeling and representation of light-material interactions are important tools to generate enticing virtual scenes. Indeed, each material scatters light differently. Categorizing materials facilitates the complication of material appearance. Material types also enable an effective way of handling the variety of appearance complexity. Therefore, understanding material classes, data representation and acquisition systems is an elementary step to construct the bigger picture of the addressed problems in this thesis. This chapter will be divided into two main parts. The first part presents the fundamentals of light simulation through the rendering equation. The rendering equation can currently be described by formulating the light transport problem with radiometric quantities. The fundamentals of light transport are the foundation of the rendering engine used in the entire thesis. The second part provides the basic concepts of surface reflectance named Bidirectional Reflectance Distribution Function (BRDF ). The physically-based rendering employs the BRDF as a key element for rendering realistic materials in virtual scenes. We also give a brief overview of material databases and acquisition systems to connect the theories to real-world use cases. 2.1 2.1.1 Fundamentals of Light Transport Light Transport Problem Light is essential for our perception to see and understand the world around us. When light reaches the eyes, humans see a scene and observe light phenomena. Light allows us to perceive the colors, shapes, and details of the objects and scenes in our environment. To create a realistic virtual world, we need to simulate the way that light behaves and interacts with objects. That means light transport problems must be solved for radiance accumulation in a scene. This process, known as light transport simulation, helps us to generate images that look natural. Indeed, one challenge of the light transport model is to efficiently solve indirect illumination. This phenomena is inherent to any multiple-scattered light energy in a scene. By solving this problem, we can create more convincing virtual environments. Thus, robust light transport algorithms could help simulate indirect lighting correctly. One necessary condition of light transport algorithms is to find the equilibrium point of light energy deposit or the steady-state of light at a given time. Here, we simulate light transport problem using geometric optics. This means that light travels along straight paths and scatters according to interactions with material surfaces in the scene. For convenience, participating media, which account for volume scattering, e.g., fog, are ignored. However, the light transport problem can be simply extended to include effects of participating media by adding volume 2.1 • Fundamentals of Light Transport 15 scattering and light absorption. The rendering equation is a light transport model and can be simplified as the following equation Lo (p, ω) = Le (p, ω) + Lr (p, ω), (2.1) where Lo (p, ω) is the radiance leaving point p in direction ω, Le (p, ω) is the emitted radiance leaving point p in direction ω, and Lr (p, ω) is the reflected radiance leaving point p in direction ω. The simplified rendering equation above is at the heart of light transport algorithms. To further clarify the above radiance terms in Equation (2.1), the basic terminology must be defined to measure light quantity. We introduce radiometry as a tool for quantifying optical radiation [42]. 2.1.1.1 Radiometry A photon, emitted from illumination sources, carries energy depending on its wavelength. To measure the amount of photon energy, radiant energy Qλ with a wavelength λ is defined by Qλ = hc , λ (2.2) where h ≈ 6.63×10−34 joules·seconds is Planck’s constant, and c = 299, 792, 458 m/s is the speed of light. As the following quantities are wavelength dependent, we will omit λ to facilitate the notations below. In rendering, we assume light under the steady state. Radiant flux Φ defines the differential amount of energy per differential time and its units are joules per second (J/s) or watts (W ). The radiant flux is expressed as Φ= dQ . dt (2.3) The quantities measuring density of energy over an area are defined by irradiance E, flux arriving at a surface, and radiant exitance M , flux leaving a surface. Their units are W · m−2 . Given an area A at a point p we can define irradiance and radiant exitance in a similar way as follows E(p) = dΦ(p) . dA (2.4) Note that, although the difference between E and M is the directional power, they lack a unit expressing its direction. The directional distribution of light can be quantified by the solid angle representing the angular size and direction of a ray. Indeed, the solid angle measures the size subtended by an observed object from a given point. Solid angles are measured in steradians (sr) indicated by a directional 16 Chapter 2 • Fundamentals of Rendering and Material Apperance variable ω. A hemisphere subtends a solid angle of 2π steradians. Now, we can define the directional quantities which are intensity I and radiance L. Intensity is defined as I= dΦ , dω (2.5) which has units W · sr−1 and describes the directional power of light centered at the unit sphere. Here, the final and most important radiometric quantity, radiance, is defined as L(p, ω) = dEω (p) , dω (2.6) where Eω (p) denotes irradiance perpendicular to the direction ω at surface point p. Radiance is defined by the flux density over directional area as follows L= d2 Φ . dω dA cos(θ) (2.7) Radiance has units W · m−2 · sr−1 . The cosine term compensates for the projected area perpendicular to direction ω. The radiance term here is the most used radiometric quantity in deriving a light transport framework. Indeed, radiance is constant and invariant along a ray in vacuum. This property is important enabling the emerging of ray tracing algorithms which assume invariance of light energy along a ray. The definition of radiance, then, lets us focus on rays, or light paths, carrying light energy throughout a given scene. Figure 2.1 — A geometric illustration of radiometric quantities employed to describe measured radiance. A visual explanation of the radiometric quantities introduced above is illustrated in Figure 2.1. When light is observed at point p with respect to direction ω, radiance measures the energy Φ over infinitesimal area perpendicular to direction ω which is 2.1 • Fundamentals of Light Transport 17 dA cos(θ). For convenience, we provide a summary table of the above radiometric quantities in Table 2.1. Radiometric quantity Definition Unit Radiant energy, Q h c λ Joule (J) Radiant flux, Φ dQ dt J · s−1 or Watt (W ) Irradiance, E dΦ dA W · m−2 Intensity, I dΦ dω W · sr−1 2 d Φ dω dA cos(θ) Radiance, L W · m−2 · sr−1 Table 2.1 — Summary of radiometric quantities and units of measurement. 2.1.1.2 Path Integral Equation Equation (2.1) can be expanded as Lo (p, ωo ) = Le (p, ω) + Z ρ(p, ωo , ωi ) Li (p, ωi ) | cos(θi ) {z } | {z Ω| ref lectance irradiance | dωi . (2.8) } Intuitively, the reflected radiance Lr can be thought of as an integral of a surface reflectance function (BRDF) denoted by ρ(p, ωo , ωi ), and receiving flux at point p with respect to all possible incoming angles Ω. The BRDF describes how much light is reflected from the incident direction ωi into the outgoing direction ωo . We leave the mathematical expressions and properties of BRDF to Section 2.2.1. Here, we continue on deriving the solution of the integral in Equation (2.8) which is typically impossible to solve analytically. The reflected radiance term accounts for simulating both direct and indirect illuminations. The direct illumination can be found explicitly when irradiance is contributed directly from light sources in the scene. However, to simulate indirect lighting, irradiance contributed to the integral will be found through the light reflected from objects in the scene, which may have undergone multiple scattering events. Such complexity requires reformulation of Equation (2.8) into the path integral formulation so that we express the problem on a path space to avoid the curse of recursiveness. Using the path space makes it more convenient in exploiting any random path sampling technique as a rendering algorithm. Let us reformulate Equation (2.8) into the three-point form of area integral. First, we define the radiance and the BRDF from a point p1 to p0 as follows 18 Chapter 2 • Fundamentals of Rendering and Material Apperance L(p1 → p0 ) = L(p1 , ω), (2.9) ρ(p2 → p1 → p0 ) = ρ(p1 , ωo , ωi ), (2.10) where ωo is the direction from point p1 → p0 and ωi is the direction from point p1 → p2 . We introduce a geometric term G(x) describing visibility V (x) and surface orientation between two points as follows G(p0 → p1 ) = V (p0 → p1 ) | cosθ0 | | cosθ1 | ∥p0 − p1 ∥2 . (2.11) By substituting the above terms into Equation (2.8), we obtain the area integral as follows L(p1 → p0 ) = Le (p1 → p0 ) + Z A ρ(p2 → p1 → p0 )L(p0 → p1 )G(p0 → p1 )dA(p2 ), (2.12) where A is all surfaces of the scene. Figure 2.2 — An illustration of a four-vertex path starting from point p0 where a sensor, or a camera casts a ray. A sub path contains two vertices, the geometric term G and the BRDF ρ. Here, the path ends at p3 where a light source is placed and emits energy. Now, we repeatedly substitute the integral on the right hand side to obtain a sum over path integrals. The integrals of the light path will appear continuously so that the length of the path keeps increasing. An example of four-vertex path is defined below L(p1 → p0 ) =Le (p1 → p0 )+ Z Le (p1 → p0 )ρ(p2 → p1 → p0 )G(p0 → p1 )dA(p2 )+ ZA Z A A Le (p3 → p2 )ρ(p3 → p2 → p1 )G(p3 → p2 ) × ρ(p3 → p2 → p1 )G(p2 → p1 )dA(p3 )dA(p2 ). (2.13) 2.1 • Fundamentals of Light Transport 19 An illustration of the path integral in Equation (2.13) is shown in Figure 2.2. The points p0 and p1 are predetermined depending on the camera position, however, the remaining points are randomly chosen over all points in the scene. In fact, a light path is generated by limitless subpaths. Thus, the integral form is expanded infinitely. Fortunately, the path integral equation looks similar to a Fredholm integral equation of the second kind [43]. Such equation can be solved using the Neumann series expansion. We can then rewrite the path integral equation above into an infinite sum by reorganizing the BRDF and geometric terms. L(p1 → p0 ) = ∞ X PL (p¯n ), (2.14) Le (pn → pn−1 )T (p¯n )dA(p2 ) . . . dA(pn ), (2.15) n=1 where PL (p¯n ) = T (p¯n ) = Z ··· A n−1 Y Z A ρ(pi+1 → pi → pi−1 )G(pi+1 → pi ). (2.16) i=1 Equation (2.14) expresses the path integral as a sum of radiance contributions over path p¯n bouncing around in the scene. We also define an operator, T , for handling visibility and scattering along a path. Finally, PL defines integrals over the chosen path. These mathematical expressions used in deriving Equation (2.14) construct a venue for solving the sum of the path by using a Monte Carlo estimator. The chosen path can be randomly sampled to compute radiance estimates contributing to point p0 . We can write Equation (2.14) in a more concise way as follows L(p1 → p0 ) = PL1 (p¯1 ) + PL2 (p¯2 ) + ∞ X PL (p¯i ), (2.17) i=3 where PL1 (p¯1 ) is the contribution from a surface emission at point p1 , PL2 (p¯2 ) is the direct lighting contribution from a light source. One interpretation of Equation (2.17) is that we determine each term by the path length. For example, PL1 (p¯1 ) is a path between the camera and a surface point in the scene. PL2 (p¯2 ) is a path of length 2 where only one bounce occurs. This term is usually estimated by sampling all light sources in the scene. PL3 (p¯3 ) is a path of length 3 where two bounces have contributed and so on. The length of the path can be predefined with the maximum number of bounces, otherwise we can estimate by using Russian roulette to terminate the low contribution of the subpath. Solving Equation (2.17) requires the estimation of the sum of the integrals from the last term, where multiple bounces are inherent to indirect illuminations. The estimator used in evaluating such integral is the Monte Carlo technique. Note 20 Chapter 2 • Fundamentals of Rendering and Material Apperance that, in practice, we sample the path based on a probability function and we evaluate radiance contribution from light source to the camera with the Monte Carlo estimator. 2.1.2 Light Transport Algorithm In this section, we give details about a light transport algorithm to solve Equation (2.8), or the generalized path integral equation in Equation (2.17). Kajiya [11] proposed a path tracing algorithm which employs a random path sampling technique to estimate the contributed radiance using a Monte Carlo estimator. The path tracing suggests only one reflected ray to be generated as a subpath, similar to an example path illustrated in Figure 2.2, instead of computing multiple reflected rays at a surface point. The result is computed as the average over the samples created over the area of each pixel. However, a problem of the Monte Carlo path tracing is the variance of estimates which appears as noise in the rendered image. When a sufficient number of rays are traced in the scene, the noise will be reduced. In a complex environment, even when the rays are traced densely, e.g., 10, 000 samples per pixel, the noise may still be visible. One solution to cope with variance reduction problem is, thus, reviewed at the end of this section. 2.1.2.1 Monte Carlo Integration Monte Carlo integration is a point-sampling technique for solving integral equations by estimating an integrand g(x) from random samples. Suppose that we want to evaluate a 1D integral as follows G= Z b g(x)dx, (2.18) a where G is the exact solution of the integral in the domain [a, b]. We can estimate G using a sequence of random variables Xi ∼ p(x) by GM = M 1 X g(Xi ) , M i=1 p(Xi ) (2.19) where each Xi is drawn from the probability distribution p(x) and M is the number of samples. Due to the fact that the expected value of an arbitrary function g(x) is defined as Ep [g(x)] = Z g(x)p(x)dx, (2.20) D where D is the domain of interest. The expected value of GM is the desired integral of g(x) as shown in Equation (2.18). This confirms the convergence of Monte Carlo algorithms. One major drawback as mentioned previously is the standard 2.1 • Fundamentals of Light Transport 21 deviation, or variance, of the Monte Carlo estimator, that is inversely proportional to square root of the number of samples, i.e., σ ∝ √1M . This means that to reduce the error by half, the number of samples must be quadrupled. To remedy this, we will introduce a variance reduction technique called importance sampling in the next subsection. 2.1.2.2 Importance sampling The idea of using importance sampling is rather straightforward. Suppose that p(x) = cg(x) where c is a constant. The normalization results in 1 c= R . (2.21) g(x)dx This indicates that if we could sample from the p(x), the estimate would be Z g(Xi ) g(Xi ) = = g(x)d(x). p(Xi ) cg(Xi ) (2.22) The expressions above demonstrate that if samples Xi are drawn proportional to the integrand g(x), the variance is zero when the integrand is known, because the estimate in Equation (2.22) is the same constant. However, in rendering, the integrand is unknown but the probability distribution can be constructed to approximate the true distribution hence the variance can be decreased. In fact, one simple way to choose the probability distribution is to construct it from the BRDF. The reflected radiance in the rendering equation consists of BRDF and irradiance which identify the shape of the integrand. For data-driven BRDFs, Lawrance et al. [39] showed that the probability density function (PDF ) for importance sampling can be constructed by an inversion method using such BRDFs. In Paper C, we derive an importance sampling strategy based on our data-driven BRDF model. We adopted the inversion method, see [9], when two independently uniform samples are drawn to determine the samples based on an arbitrary PDF. 2.1.2.3 Transforming between PDFs When working with different parameterizations, the corresponding PDF must be transformed accordingly. Up until now, the PDF has primarily been defined in spherical coordinates. However, in many cases, reparameterization reveals many interesting patterns underlying BRDFs and their distributions. For example, our BRDF models (Paper B, C, D) have been reparemeterized differently from the standard spherical coordinates which defines incoming and outgoing directions (ωi , ωo ). To draw sample according to the corresponding PDFs, it requires a Jacobian transformation. 22 Chapter 2 • Fundamentals of Rendering and Material Apperance Let Y = T (X) be a transformation between two distributions, Y and X, where T is a transformation function. The Jacobian is defined as ∂T 1 ∂x1 ∂T2 ∂x1 | JT | = det . .. ∂Tn ∂x1 ∂T1 ∂x2 ∂T2 ∂x2 .. . ∂Tn ∂x2 ... ... .. . ... ∂T1 ∂xn ∂T2 ∂xn , .. . ∂Tn ∂xn (2.23) The Jacobian is the absolute value of the determinant of the Jacobian matrix of T . One important property of the transformation function T is bijection which means that the mapping between distributions is 1-1 (injective) and onto (surjective). 2.1.2.4 The cosine-weighted hemisphere sampling A common solution when a desired PDF is difficult to draw a sample from is to perform uniform sampling on hemispherical directions. Malley method [9] uniformly chooses directions on a disc where polar coordinates are defined by (r, ϕ). The √ method randomly chooses ( ξ1 , 2πξ2 ) = (r, ϕ) and projects the point on the unit hemisphere. The resulting directions are cosine-weighted and the corresponding pdf is p(θ, ϕ) = | JT | p(r, ϕ) = cos(θ) . π (2.24) In summary, the Monte Carlo estimation for solving the light transport problem requires many tools involving statistical sampling strategies. In typical situations, the cosine-weighted sampling ensures that the solution is obtained but with a slow rate of convergence. To speed up the convergence, the importance sampling strategically chooses paths according to the integrand consisting of both BRDF and irradiance terms. In case of the specially designed sampling strategy, the inversion method and the Jacobian term are employed to make sure that the solution is correctly estimated. 2.2 Surface Reflection To quantify the surface reflection, reflectance is used to express the amount of light incident at a surface point to the amount reflected out from the point. This property is important because it allows us to visually identify and categorize materials based on how they reflect light. In realistic image synthesis, reflectance is essential for solving the light transport equation as mentioned in Section 2.1. In general, hemispherical reflectance is defined as follows 2.2 • Surface Reflection 23 ρΩ = Φr , Φi (2.25) where ρΩ is the reflectance over hemisphere, Φr and Φi are reflected and incident radiant flux, respectively. Clearly, the reflected energy must not exceed the incident energy at the surface, hence, ρΩ ∈ [0, 1]. The reflection is an event occurring when light hits a surface but there are other events such as transmission and subsurface scattering which occur depending on material properties. Light transmission can be found in transparent objects such as glasses and lenses. The transmission occurs when light passes through materials at one point and exits at the other side of the material. This let us see objects sitting behind transparent materials. Another type of light behavior is subsurface scattering, which occurs when light penetrates a material and scatters inside before exiting at a different point. This is often seen in materials like human skin or jade, where the light creates a soft and glowing effect. These combinations of light transport mechanisms can create more complex phenomena in the scene. Each of light events has a specific function describing the directional and spatial quantities leading to more accurate and detailed representations of complex surfaces. 2.2.1 The Reflectance Functions Defining the reflection quantity on a hemisphere as shown in Equation (2.25) ignores the directional light energy. In rendering, we are interested in directional quantities in order to solve the rendering equation. The Bidirectional Reflectance Distribution Function (BRDF) [44] expresses the directionally scattering quantity for materials. The BRDF is expressed by ρ(ωo , ωi ) = dLo (ωo ) dLo (ωo ) = . dE(ωi ) Li (ωi )cos(θi )dωi (2.26) BRDF is typically a 4D function parameterized by (θo , ϕo , θi , ϕi ) = (ωo , ωi ). Each material has a unique BRDF describing the visual appearance of an object. For example, a metal scatters light strongly in some directions where we see highlights or near mirror reflections while a matte plastic reflects light almost equally in all directions so its appearance is invariant to viewing angles. 2.2.1.1 Physically Plausible Properties There are two important BRDF properties to consider in physically plausible rendering: 24 Chapter 2 • Fundamentals of Rendering and Material Apperance 1. Reciprocity which states that given any pair of directions (ωo , ωi ), the BRDF is reversible and invariant to exchanging the directions as expressed by ρ(ωo , ωi ) = ρ(ωi , ωo ). 2. Energy conservation states that the reflected energy should not exceed the total received energy as denoted by Z Ω ρ(ωo , ωi )cos(θi )dωi ≤ 1.0. The reciprocity property refers to Helmholtz reciprocity [45] and it is useful in many situations. For example, when doing measurements or representing BRDFs, only half of the hemisphere can be measured or stored because the remaining part can be filled in by the reciprocity property. Another example is in the design choice of BRDF parameterizations for importance sampling. The path tracing provides a camera direction for the radiance estimation. The reciprocity enables us to select the light directions as an auxiliary parameter. We will provide more details in Chapter 4, where we utilize this property in the design of our BRDF parameterization. (a) BRDF (b) BTDF (c) BSSRDF Figure 2.3 — Illustration of different light scattering phenomena. (a) BRDF scatters light above the surface. (b) BTDF refracts light passing through the surface. (c) BSSRDF scatters light inside and light leaves at other points of the surface. BRDFs can be classified into two subcategories which are isotropic and anisotropic BRDFs. Isotropic BRDFs are invariant to azimuthal rotation of materials. This means that given an incoming light direction and a viewing direction, rotating the material around its normal will result in no change for the reflected radiance. On the other hand, anisotropic BRDFs give different reflected radiance when the rotation occurs. Anisotropic materials can be found in wood, fabric and polished metals which often have fibers or threads that cause directionally-dependent reflectance,i.e., anisotropy. Nevertheless, most of the BRDFs are isotropic, e.g., glass, plastics and some metals. More importantly, isotropic BRDFs can be parameterized by only 3D 2.2 • Surface Reflection 25 parameters as ρ(θo , ϕo , θi , ϕi ) = ρ(θo , θi , | ϕo − ϕi |). This is a property we utilize in this thesis. The BRDF is used to describe radiance scattering above surfaces. For materials which have light transmission and subsurface scattering, there are specific functions, serving such material properties, called Bidirectional Transmission Distribution Function (BTDF ) and Bidirectional Scattering Surface Reflectance Distribution Function (BSSRDF ), respectively. Illustrations of each function are shown in Figure 2.3. Although the BTDF and the BSSRDF are not in the scope of this thesis, they are worth mentioning as these three functions can cover a wide range of material properties. The BTDF does not obey the reciprocity due to Snell’s law, which states that the index of refraction η between two mediums is proportional to the sine of the angles as expressed below sin(θ1 ) η2 = . sin(θ2 ) η1 (2.27) Hence, the relationship between incoming radiance and transmitted radiance is dependent on the index of refraction as follows Lo (ωo ) = eT r Li (ωi ) ηo2 , ηi2 (2.28) where eT r is the transmitted fraction of light, ηo and ηi are indices of refraction of outgoing and incoming directions, respectively. The BTDF is a 4D function similar to the BRDF but the transmitted directions are on the opposite side of the surface. In some context, the BRDF and the BTDF are blended and represented in a general term as Bidirectional Scattering Distribution Function (BSDF ). This general function allows us to consider both reflection and transmission in a single mathematical framework. When using the BSDF, the domain of integration is expanded to include the entire unit sphere, rather than just the hemisphere around the surface normal. The BSSRDF is a more generalized form of material properties where spatial and directional domains are considered. The function is parameterized by incident location and direction of light and exitant location and direction of scattered light. The BSSRDF is a highly-complex 8D function, ρSS (xi , yi , θi , ϕi , xo , yo , θo , ϕo ), which poses a lot of challenging problems in research. We often find the BSSRDF in translucent materials. For modeling and acquisition of BSSRDFs, refer [25, 46, 47, 48, 49, 50, 51]. 26 Chapter 2 • Fundamentals of Rendering and Material Apperance Figure 2.4 — A taxonomy of reflectance functions defined by complexity of parameters, the bar on the right. 2.2.1.2 Hierarchical chart of reflectance functions In summary, we can classify the reflectance functions using the number of parameters as depicted in Figure 2.4. The right bar in the figure tells the number of parameters used in each reflectance function, ranging from 3D to 8D. Apparently, the simplest reflectance function is the isotropic BRDF, which is parameterized by three parameters, and the most complex reflectance function is the BSSRDF which has two additional spatial locations. The BSDF is a general form of the 4D reflectance function which describes both reflection and transmission. Indeed, there are two important functions which are BTF and SVBRDF where their complexity is inherent from the BRDF. These functions represent spatial reflection properties where each point is represented by a BRDF. The difference between the BTF and the SVBRDF is that SVBRDF surfaces do not have occlusions between each point while the macro structure of BTF [52] surfaces is non-flat, resulting in non-local scattering effects such as shadowing and interreflection on each surface point. For more details about the BTF, please refer to the following review reports [52, 53]. 2.3 • Parameterization 27 2.3 Parameterization Parameterization is a mathematical tool used in expressing data with a coordinate system. When data is aligned on different coordinate systems, its representation and modeling can also be changed. The benefit of defining data on different parameterizations is that the data can be interpreted more easily and the patterns in the data can be revealed. For example, representing a 2D circle is preferred on the polar coordinate system while drawing a straight line is easier on the Cartesian coordinate system. This also applies to the surface reflectance data. In this section, we describe parameterizations used to represent BRDFs. Each parameterization has unique properties and structure, which can be used for different purposes. So one of the main goals for BRDF parameterization is to efficiently model and represent BRDFs, as well as improving the process of BRDF acquisition [35]. (a) Standard Parameterization (b) Half-Diff Parameterization (c) Half-Out Parameterization (d) Projected Deviation Vector (PDV) Figure 2.5 — BRDF parameterizations illustrated on directional vectors on hemisphere. Normal vectors denoted by N are perpendicular to the plane. Each vector represents by spherical coordinates using θ and ϕ denoting elevation and azimuthal angles, respectively. 28 Chapter 2 • Fundamentals of Rendering and Material Apperance 2.3.1 Standard parameterization Typically, a BRDF is defined by two vectors, the light and the view vectors, on a unit hemisphere. These vectors are commonly defined in spherical coordinates (r, θ, ϕ) where r = 1, θ ∈ [0, 90) and ϕ ∈ [0, 360). A pair of light and viewing directions is denoted by (ωi , ωo ) which is (θi , ϕi , θo , ϕo ). The elevation angle θ is measured from the normal vector N of the hemisphere. The azimuthal angle is measured on the surface plane. We use the standard parameterization in a number of formulations since it is important in deriving the rendering equation, and intuitive for the design of BRDF measurement devices. However, a number of disadvantages of using this parameterization are, for examples, inefficiency of BRDF modeling and BRDF representation. Kautz and McCool [54] proposed a BRDF decomposition indicating that the standard parameterization is not a suitable form for a separable BRDF model. Figure 2.5(a), depicts directions on a hemisphere using the standard parameterization. 2.3.2 Half-Diff parameterization In 1998, Rusinkiewicz proposed the Half-Diff parameterization [38]. It is constructed by changing (ωi , ωo ) to become a half vector ωh and a difference vector ωd . As the +ωo name suggests, the half vector, denoted by (θh , ϕh ), is formulated by ωh = ∥ωωi +ω . i o∥ Once the half vector is derived, the surface plane is rotated such that ωh is the normal of the new coordinate frame. The difference vector is thus obtained by rotating either ωi or ωo with the same rotation as the half vector. As shown in Figure 2.5(b), the half vector is perpendicular to the plane defined by the difference vector. Note that θh is measured relative to the normal vector N while θd is measured relative to the half vector. Parameterizing the directions in this way assists in explaining the specular lobe with fewer number of parameters. Moreover, many properties have been observed, e.g., the separable BRDFs [3, 38, 54, 55] both in isotropic and anisotropic BRDFs, reciprocity and retroreflective lobes. Another practical example is the MERL [31] BRDF database which is widely utilized by the computer graphics community for data-driven modeling. The BRDF database provided is stored in a modified Half-Diff parameterization which encodes the specular lobe densely. We will cover this database in Section 2.5.1. For isotropic BRDFs, this parameterization uses three parameters (θh , θd , ϕd ). 2.3 • Parameterization 29 2.3.3 Half-Out parameterization The half vector is widely used in describing specular lobes as shown in many BRDF models [27, 28, 56, 57]. Moroever, the relationship between the half vector and the outgoing direction, which indicate a similar contour on the hemisphere, was presented by Ngan et al. [58]. Reparameterizing with the half vector as a principal direction has given improved results for BRDF modeling. The other auxiliary direction is the viewing direction ωo as illustrated in Figure 2.5 (c). Lawrence et al. [39] proposed the use of Half-Out parameterization for a separable BRDF. One important feature for this parameterization is that the noise reduction technique, called importance sampling, can be defined more efficiently as compared to the Half-Diff parameterization. The Jacobian term and the sampling strategy are easily derived in connection to the viewing direction. It facilitates the integration to Monte Carlo ray tracing algorithms. Separable BRDF models [39, 40] utilizing this parameterization have shown improvements in both compact representation and fast rendering. Isotropic BRDFs using this parameterization require three parameters (θo , θh , ϕh ). 2.3.4 Projected Deviation Vector (PDV) parameterization Parameterization is not limited to be defined on a hemisphere. Reprojection hemispherical directions on a unit circle plane is an alternative method. The PDV parameterization begins with the perfect reflection direction ωR from a viewing direction ωo as a principal vector parameter. The auxiliary parameters describing ωi is defined by a projected deviation vector Dp and its projected azimuthal angle, denoted by ϕdp , as depicted in Figure 2.5. Originally, it was defined by Neumann et al. [59] as a mathematical tool. They reformulated the directions with orthogonal projections and there were multiple scenarios for deriving BRDF models. The derived BRDF models covered a variety of material types such as plastics and metals. The importance sampling strategy of the BRDF models was also proposed. It was later studied that the measured BRDF isocontours have a similar shape as the deviation vector Dp . As a result, ABC BRDF models [60] were empirically observed and developed, inspired by the Rayleigh-Rice theory. Our contribution to this parameterization is to introduce a separable capability for the PDV parameterization found in isotropic BRDFs (Paper B, C ). Our proposed BRDF models offer separable BRDF representations and sparse BRDF measurement, which are effective for glossy materials. We also derived a Jacobian term corresponding directly to the PDV parameterization and our data-driven BRDF model. Note that the ABC smooth surface model makes use of the polar coordinates to enable the importance sampling strategy. Our sparse acquisition 30 Chapter 2 • Fundamentals of Rendering and Material Apperance can be done on a gonioreflectometer with two degrees-of-freedom. Moreover, we also demonstrate an application of BRDF editing with our separable BRDF model. In Chapter 4, we will explain in detail our BRDF models and sparse measurements. We define this parameterization for isotropic BRDFs by (θR , dp , ϕdp ). 2.3.5 Other parameterizations Redefining BRDF parameters is a way to achieve more efficient BRDF models enabling separability and lower BRDF dimensionality. McCool et al. [61] proposed a BRDF decomposition technique on texture-mapping coordinates for a real-time rendering application. Edwards et al. [62] created a mathematical framework for the half vector. The framework enforces energy conservation to ensure physically plausible results. Stark et al. [63] utilized the barycentric coordinates to fit BRDF models on triangular coordinates. They also explain the geometric interpretations of BRDF reflections (diffuse, specular, grazing and retro-reflection) over Barycentric parameterizations. A hybrid parameterization, between the PDV and the Barycentric parameterization, was proposed by Barla et al. [64] showing an improved distortion appearing on the orthographic projection. Similar to the Barycentric parameterization, Mallett and Yuksel [65] presented a mathematical framework for defining 2D parameterization supporting both uniform and nonuniform quantization. An example of the nonuniform quantization is Bezier parameterization derived from their framework. Using frequency domain as BRDF parameterizations is a classical approach. Spherical harmonics are simple spherical BRDF representations on the frequency domain as proposed by a number of works [66, 67, 68, 69]. Fourier approaches [70, 71] can be used to represent BRDFs. Note that a tabular form of coefficients is required for storing the encoded BRDFs in the frequency domain. The examples of parameterizations discussed in this section reveal insights of BRDF behaviors in many aspects such as reflection characteristics on the half vector, separability, and irregular quantizations. However, one common key idea found in many parameterizations is to efficiently represent the specular lobes, as these regions have strong contributions in rendering. Moreover, when looking at importance sampling, the specular lobes dominate the corresponding PDFs matched to the BRDFs. This means that the samples tend to fall on the specular regions. Thus, the choice of parameterization is an essential part in both modeling and rendering. 2.4 Data Transformation The utilization of measured BRDFs for modeling has increased due to the development of BRDF acquisition systems. The acquired data has shown that the BRDF data has a high dynamic range meaning that there is a large range between highest and lowest values. This creates a difficulty when using BRDF models to obtain 2.4 • Data Transformation 31 parameters by fitting parameters to measured BRDFs. Thus, transformation of the linear BRDFs are desirable for BRDF modeling. Indeed, the choice of the transformations is unclear as it has been discussed previously [4, 58, 60]. In this section, we will briefly explain transformation functions used when working with BRDFs. 2.4.1 Log-based Transformation The human visual system perceives light intensity propotional to a logarithmic function. The nonlinearity of the HVS was characterized by Weber’s law and Fechner’s scaling [72]. The large dynamic range which can be observed by the HVS is about 14 log10 units [73]. One simple operator in dealing with the high dynamic range data is to use logarithm functions as they have been adopted in the High Dynamic Range Imaging (HDRI) resaerch. Reference Sparse measurement Absolute error Figure 2.6 — The rendered image (middle) shows gold-metallic-paint3 from MERL which was reconstructed using our sparse BRDF model from Paper B. The reference is placed on the left and the normalized absolute error image is on the right with the range [0, 1]. Due to the high dynamic range of BRDFs, a number of works has observed the advantages of using the logarithmic functions, e.g., specular lobe interpolations [55], factorization [61] and BRDF fitting metrics [2, 3, 4, 40, 58, 60, 74]. Since the logarithmic function behaves poorly when it is near 0, addition of an ϵ < 0.001 or ϵ = 1.0 is preferable to avoid near 0 values. The following functions are two common logarithmic functions used for this purpose: ρt1 = log(ρ + ϵ), (2.29) ρt2 = log(ρ · cosM ap + ϵ), (2.30) 32 Chapter 2 • Fundamentals of Rendering and Material Apperance where ρ is a BRDF value, ρt1 and ρt2 are transformed BRDF values, ϵ is a constant to avoid near 0 value, and cosM ap is a cosine mapping function. The cosM ap function can be either cos(θi ) or max(cos(θi ) · cos(θo ), ϵ). (a) The logarithmic fitting (b) The linear fitting red-metallic-paint acrylic-felt-green Figure 2.7 — BRDF fitting with different error metrics. The logarithmic fitting metric (a) and the linear fitting metric (b) were performed on the measured alumbronze material from MERL database with the Cook-Torrance BRDF model. The y-axis indicates BRDF values and the x-axis is scattering angles. The incident angles are θi = 0◦ and 67◦ . The plots are from Löw et al. [60]. Reference ρt1 Error (ρt1 ) ρt2 Error (ρt2 ) Figure 2.8 — The rendered images in second and forth columns are reconstructed BRDFs from ρt1 and ρt2 , respectively. The reference is placed on the left and the corresponding normalized absolute error images are on the right of each reconstruction. The error scale is [0, 1] shown on the right most column. In Paper B, C, we employed Equation (2.29) as our mapping function for BRDF factorization and acquisition, as it performs well with glossy materials or highlyspecular reflections. In Figures 2.6, we present an example of our sparse BRDF measurement from Paper B. The data transformation plays a key role in aligning 2.4 • Data Transformation 33 the measured BRDFs so that the aligned data facilitates factorization and brings up the sparse measurement configuration with only two degrees-of-freedom. Löw et al. [60] demonstrated that the fitting metric in Equation (2.30) can improve the fitting curve of wide-angle scattering compared to a cosine-weighted function, using cos(θi ). Figure 2.7 (a) and (b) show the results of using a logarithmic error metric and a linear error metric. As can be seen, the linear fitting focuses on both specular lobes but it neglects the wide-angle scattering. Indeed, the shape of BRDF curves, performing both high specular lobe and wider-angle scattering, appears in glossy materials. Thus, the BRDF fitting procedure with logarithmic error metrics serves favorably on glossy materials. Another example of the logarithmic transformations is presented in Paper D. We observed that the two transformations, (Equation (2.29) and Equation (2.30)), have produced efficient reconstructions depending on material classes. We see that Equation (2.29) performs superior on high-specular reflection and some glossy materials while Equation (2.30) performs superior on diffuse materials. Figure 2.8 shows two examples of reconstructed BRDFs. The diffuse material on the top row is acrylic-felt-green, and the glossy material on the bottom row is red-metallic-paint. Obviously, this reveals an observation from Ngan et al. [58], showing that different material classes require different fitting error metrics. The cosM ap is, thus, one important factor for the diffuse BRDFs. 2.4.2 Gamma Transformation The gamma transformations are often found in imaging techniques [75, 76], e.g., CRT display, image quantization, camera response functions, and HDR tone mapping. One purpose of such functions is to apply nonlinearity to linear data. This non-linear behavior has a connection found in the HVS, which captures a certain range of high dynamic range data. The function can be named differently such as gamma correction, gamma encoding, and gamma compression. The general form of the gamma transformation is the power-law function as follows ρt3 = cg ρ1/γ , (2.31) where cg is either a constant or a weight function, and γ ∈ R. Bieron and Peers [77] proposed the gamma transformation for the BRDF fitting problem, demonstrating some improvements over the log-based transformation. Using the power function, controlled by γ variations, has an advantage when there exists no single optimal solution. The multi-class BRDF’s characteristics thus enable the strong argument in utilizing adaptive γ values for different BRDF types. In the study, the range of γ values is fixed at [1, 3]. Low γ values, i.e., close to γ = 1, are almost similar to the linear mapping, while higher γ values suppress the 34 Chapter 2 • Fundamentals of Rendering and Material Apperance specular lobe components. This implies that highly specular materials could get better fits from low γ values. On the other hand, materials exhibiting glossy and more diffuse characteristics are able to perform well with higher γ values. Our contribution to the gamma transformation is introduced in Paper D. When having multiple BRDF parameter sets as mentioned in previous works [58, 77], one requires rendering of images for selecting the best solution. We thus proposed a selection method based on the gamma function. Our method operates on the transformed BRDF domain and can perform suitable selections without rendering images. Developing a perceptual error metric for appearance modeling is still an open problem. In fact, there is still no clear solution for the BRDF fitting procedure as shown in previous works [77, 78]. The recent perceptual metric study [79] indicates that the log-based transformations have a small dominance over other BRDF metrics. Indeed, there are other factors which have impact on the perception of appearance such as object’s shapes and environment [80]. These studies have revealed that a BRDF perceptual metric is a complex issue which has not been solved yet. 2.5 Material Databases and Acquisition Systems Up to this point, we have provided the theoretical concepts of the material appearance from the physics level to mathematical tools in modeling. In this section, we give overviews of the publicly available BRDF databases used in BRDF research. We begin with MERL [31] and RGL-EPFL [35] databases. Both BRDF databases were used in our work for developing and evaluating the efficiency of our data-driven BRDF models. The latter part of this section provides details of some additional databases. 2.5.1 MERL Matusik et al. [31] proposed an image-based acquisition inspired by the setup presented by Marschner et al. [81, 82]. The proposed image-based acquisition method requires a camera, a light source and a spherical material sample as illustrated below, Figure 2.9(a). Both the camera and the sample are stationary while the light moves during the capturing process. It takes about 3 hours to captures 330 HDR images. The acquired BRDFs were then stored in the Half-Diff parameterization, (θh , θd , ϕd ), with non-linear encoding of the specular lobes. A measured BRDF in the MERL database has 90 × 90 × 180 = 1, 458, 000 samples and its storage cost is approximately 33 MB per material. 2.5 • Material Databases and Acquisition Systems 35 The MERL database is the first database of densely sampled BRDFs containing 100 isotropic materials. It contains diverse material classes from diffuse to highly specular BRDFs. The material samples used in the capturing process are shown below, Figure 2.9(b). The availability of this database and the BRDF fitting procedure have accelerated the development of BRDF models and standardized the evaluation process using real measurements. (a) The image-based acquisition setup (b) Material samples Figure 2.9 — The BRDF acquisition setup (a) proposed by Marschner et al. [81] was adopted to acquire 100 isotropic BRDFs [31], shown in (b). 2.5.2 RGL-EPFL Dupuy and Jakob [35] found a relationship between BRDF parameterization and importance sampling. Assuming that materials follow the microfacet theory, the microfacet normal distribution (NDF) can be extracted by the retro-reflective measurements. The extracted NDF is then used to adaptively sample BRDFs according to retro-reflection behaviors. The device used in the capturing process is a gonio-photometer with a special lighting setup for retro-reflective measurements. The measurement time takes about 2-3 hours and the memory footprint is roughly 600 KB for a single material. The RGL-EPFL database provides, at the time of writing, 62 materials which are 51 isotropic and 11 anisotropic materials. There exists, in this database, some layered materials which exhibit multi-lobe scattering. 36 Chapter 2 • Fundamentals of Rendering and Material Apperance 2.5.3 Other databases There are some publicly available databases, however they are limited in terms of the number of BRDFs or their corresponding resolution. Cornell [30] and CuRET [29] are the earliest BRDF databases provided for computer graphics community. CuRET contains 61 materials with 205 sample points. The reflectance data from Cornell [30] were measured using a custom-built gonioreflectometer. The database contains 2 categories, i.e., paint material and human skin, however, they are stored in different formats. Another BRDF database which has many anisotropic BRDFs is from UTIA. They provided 150 coarsely sampled BRDFs [83] and several dense anisotropic measurements [33, 84, 85]. A learning-based BRDF sampling proposed by Nielsen et al. [41] provided 8 reconstructed BRDFs using 20 samples. A learning-based method for sparse BRDF acquisition is proposed in Paper E, where we show that our deterministic method, entitled FROST, can produce more robust and accurate results when compared to the state-of-the-art method. 2.5.4 The catadioptric measurement The above BRDF databases were acquired using either image-based acquisition or a gonioreflectometer. There is, however, another type of acquisition setup which utilizes a catadioptric measurement. The examples are the acquisition devices proposed by Ward [56], Ghosh et al. [86], and Mukaigawa et al. [87]. They exploited the parabolic mirrors to reflect light so that less moving parts are needed. This capturing process was designed to speed up the measurement process at the cost of accuracy and angular resolution. Chapter 3 Tensor Decompositions and Sparse Representation The foundation of this thesis is built upon tensor decompositions and the sparse representation. Since we are addressing the BRDF modeling and the sparse BRDF sampling problems, the two mathematical frameworks, i.e., tensor operation and sparse representation, are utilized to reformulate and solve the problems. In this chapter, we present an overview of the tensor and its decomposition methods used in Paper B and Paper C. The tensor decomposition assumes that the structural data in a high-dimensional tensor, for instance, a 3D matrix, can be decomposed/factorized into a more compact representation, e.g., three vectorized forms. The factorization permits the high-dimensional data to be encoded into lower-dimensional data. Moreover, some fundamental concepts of the tensor decompositions are then incorporated in the latter part to develop richer representation models. We provide some preliminaries of the sparse representation where it defines a mathematical structure to tackle sparse signal estimation problems. The sparse representation allows us to search for a basis function by learning it from a collection of signals. The learning process is merely an optimization problem of the underdetermined system of equations. This sparse representation framework is used to solve the problems in Paper D and Paper E. This chapter is organized as follows. In Section 3.1, we provide an overview of tensor operations which introduces how high-dimensional data is stored and processed. 38 Chapter 3 • Tensor Decompositions and Sparse Representation We then present tensor decomposition methods which will appear in Chapter 4 and Chapter 5. Lastly, the sparse representation and dictionary learning will be introduced in Section 3.2. 3.1 Tensor Decompositions As we are moving forward to our main contributions of this thesis, here we provide preliminaries of tensor and its operations. In this section, some relevant tensor operations and methods are introduced. Only partial and necessary tensor fundamentals are reviewed and will further guide to methods used in later chapters. For more comprehensive reviews of tensor linear algebra, see the literature [88, 89, 90]. 3.1.1 Tensor Operations The term tensor is defined differently across disciplines [91, 92], but for simplicity, we define it as a multi-dimensional array. To determine the dimensionality of a tensor, there are several formal naming conventions, e.g., mode, way, order, which denotes the number of dimensions. Let X ∈ Rm1 ×m2 ×···×mn denote an n-mode tensor whose entries are real values. A tensor entry is addressed by a set of indices. An entry of a 3-mode tensor, for instance, is xijk where i = {1, · · · , m1 }, j = {1, · · · , m2 } and k = {1, · · · , m3 }. Each tensor contains fibers representing values of a fixed mode. For example, matrix rows are mode-1 fibers and matrix columns are mode-2 fibers. For a 3D matrix, or a 3-mode tensor, x:jk , xi:k , xij: denotes column, row and tube, respectively. Here, we also introduce tensor operations that are frequently used in the literature. The Frobenius norm of an n-mode tensor X is expressed as follows. v um m mn 1 X 2 uX X ∥X ∥ = u ··· x2i1 ,i2 ,··· ,in , t i1 =1 i2 =1 (3.1) in =1 where xi1 ,i2 ,··· ,in is an entry of the tensor X . The inner product of the same-sized tensors X and Y is the sum of the products of all entries as follows ⟨X , Y⟩ = m1 X m2 X i1 =1 i2 =1 ··· mn X xi1 ,i2 ,··· ,in · yi1 ,i2 ,··· ,in , in =1 where yi1 ,i2 ,··· ,in is an entry of the tensor Y. (3.2) 3.1 • Tensor Decompositions 39 An n-mode tensor is rank-one if it can be expressed as the outer product of n vectors X = a(1) ◦ a(2) ◦ · · · ◦ a(n) , where a(i) is ith vector corresponding to the ith mode of the tensor. This means that each entry of the tensor can be calculated by the product of the corresponding vector elements. For example, an element (1) (2) (3) xijk of a 3-mode rank-one tensor is xijk = ai aj ak . This tensor property has a great benefit in dimensionality reduction as the tensor can be encoded as the outer product of a set of vectors. A tensor decomposition algorithm in Section 3.1.2 will employ this property for tensor factorization. The N -mode product of a tensor X with a matrix U ∈ RJ×mN is defined as follows (X ×N U)i1 ,··· ,iN −1 ,j,iN +1 ,··· ,in = m N X xi1 ,··· ,in uj,iN , (3.3) iN =1 where (X ×N U) is of size m1 × · · · × mN −1 × J × mN +1 × · · · × mn . This tensor operation is important for a tensor decomposition which produces a core tensor and multiple basis matrices. The process of unfolding or reordering the elements of an n-mode tensor into a matrix is known as matricization, unfolding or flattening. The unfolding of a tensor X along mode k is denoted X(k) . This operation maps the tensor into a matrix of size (mk , m1 × · · · × mk−1 × mk+1 · · · × mn ). An element xi1 ,··· ,in is mapped into a matrix X(k) (ik , j) where j = 1+ n X (i − 1) p p=1,p̸=k p−1 Y mq . (3.4) q=1,q̸=n The Kronecker product is a matrix product between A ∈ RP ×Q and B ∈ RM ×N which is defined as follows a1,1 B a1,2 B . . . a1,q B a2,1 B A⊗B = . . . a2,2 B . . . a2,q B , .. .. .. . . . (3.5) ap,1 B ap,2 B . . . ap,q B where the resulting matrix A ⊗ B is of size P Q × M N. Note that this product is an extension of the element-wise multiplication of the two matrices. The Khatri-Rao product is the column-wise Kronecker product. The Khatri-Rao product between A ∈ RP ×Q and B ∈ RM ×Q is denoted by A ⊙ B. The resulting matrix is of size (P M × Q) and expressed as follows 40 Chapter 3 • Tensor Decompositions and Sparse Representation a1,1 b1 a1,2 b2 ... a1,q bq a2,1 b1 A⊙B = .. . a2,2 b2 .. . ... .. . a2,q bq .. . . (3.6) apm,1 b1 apm,2 b2 . . . apm,q bq The rank of a 3-mode tensor X , denoted by rank(X ), is the smallest possible number of rank-one tensors in which an entry of the tensor xijk can be expressed as xijk = R X air bjr ckr , (3.7) r=1 where R is the tensor rank, ai ∈ Rm1 , bi ∈ Rm2 , and ci ∈ Rm3 . Equation (3.7) is a generalized form of the rank-one tensor defined on a 3-mode tensor where R = 1. This can also be extended to an n-mode tensor. The tensor operations and properties introduced here will be used in later sections where we explain tensor decomposition methods. 3.1.2 Tensor Decomposition Methods 3.1.2.1 CP Decomposition (CPD) The history of the CP decomposition dates back to 1927 and has been discovered by several authors [93, 94, 95, 96, 97, 98, 99, 100]. There are several naming variants, e.g., Canonical Polyadic, PARAFAC, CANDECOMP, since it was discovered by various studies. Later in 2000, Henk A. L. Kiers [100] named it CP(CANDECOMP/PARAFAC) decomposition, CPD for short. The CPD seeks for solving a minimization problem such that an n-mode tensor X is estimated from sum of rank-one tensors. The generalized form of the tensor optimization is expressed as follows arg min ∥X − X̂ ∥ s.t. X̂ = X̂ R X (2) (n) λr a(1) r ◦ ar ◦ · · · ◦ ar , (3.8) r=1 where λr is its corresponding weight of the rth rank-one component. To illustrate the summation expression of the minimization problem above, Figure 3.1 shows an example of a three-mode tensor approximated by the CPD method with R components. In this way, a three-mode tensor is modeled as a weighted sum of rank-one tensors. Each of the elements of X̂ can be accessed and computed by Equation (3.7). 3.1 • Tensor Decompositions 41 Figure 3.1 — The CP decomposition of a three-mode tensor X with R components. One useful property of CPD is the uniqueness [101, 102], as its conditions are much weaker and relaxed than, for instance, Tucker Decomposition. The discussions on the CPD uniqueness can be found in literature [103, 104, 105, 106]. Here, we also provide an algorithm to solve Equation (3.8) using an Alternating Least Squares (ALS) approach, see Algorithm 1. 1 Input: X , R Result: return λ, A(1) , · · · , A(n) Initialize A(1) , · · · , A(n) T 2 3 4 5 6 7 8 9 10 T Compute Gram matrices G(1) ← A(1) A(1) , · · · , G(n) ← A(n) A(n) repeat for i = 1 to n do S ← G(1) ∗ · · · ∗ G(i−1) ∗ G(i+1) ∗ · · · ∗ G(N ) A(i) ← X(i) (A(N ) ⊙ · · · ⊙ A(i−1) ⊙ A(i+1) ⊙ · · · ⊙ A(1) )S† Normalize columns of A(i) to store column norms in λ Update the Gram matrices G(i) end until criteria met; Algorithm 1: An ALS algorithm for computing CPD of an n-mode tensor X with R components where S† is the pseudo inverse of S and λ is its corresponding weight. Note that solving Algorithm 1 Line 6, by default, employs SVD but it can be replaced by other matrix decompositions, e.g., Cholesky decomposition or QR decomposition [107]. In Section 4.3.1, CPD is utilized for a BRDF model in which aligning BRDFs on the PDV parameterization accommodates the use of CPD. 3.1.2.2 Higher-Order SVD (HOSVD) Higher-Order Singular Value Decomposition (HOSVD) is another class of tensor decompositions, where matrix SVD is utilized. It was originally introduced by 42 Chapter 3 • Tensor Decompositions and Sparse Representation Tucker [108, 109] and has appeared in several other pubcliations under different names [110, 111, 112, 113]. The method decomposes a tensor into a core tensor G and matrices along each mode. The formulation is expressed as follows X = G ×1 U(1) ×2 U(2) ×3 · · · ×n U(n) , (3.9) where {U(i) }ni=1 are orthogonal matrices. Note that the size of the core tensor G is equal to the second mode (column) of each {U(i) }ni=1 . The above expression was defined in Equation (3.3) but here the orthogonal (typically orthonormal) matrices are considered. Truncated HOSVD, like truncated SVD of a matrix, can also be computed; however, the resulting factorization is not necessarily optimal [88]. In Section 4.3.2, we will introduce a variant of HOSVD where we construct an ensemble of HOSVD to represent multiple BRDF classes. Unlike HOSVD, the orthogonal matrices U(i) are learned from a given set of BRDFs. Our proposed method also adopts the concept of sparse representation, presented in a later section, to encode a BRDF in the form of a linear combination of orthogonal matrices U(i) and sparse coefficients G. 3.2 3.2.1 Sparse Signal Representation Sparse Representation The concept of sparse representation has emerged in the field of signal processing and adopted by a variety of fields, e.g., image processing, audio, and medical imaging. It was adopted to visual data by a number of publications [114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124]. Sparse representation, also known as sparse coding or sparse approximation, formulates the signal recovery problems using a linear combination of basis functions and coefficients. Generally, an observed 1D signal, for instance, x ∈ Rm can be approximately decomposed into two basic elements, i.e., a set of bases D ∈ Rm×n and coefficients s ∈ Rn . This is analogous to solving a linear system of equations x = Ds. The matrix D is referred to as an overcomplete dictionary, m < n, whose columns are called atoms. The goal of sparse representation is to find the sparsest coefficient s satisfying the underdetermined system of linear equations, where there are infinitely many solutions under the ℓ2 norm. This allows any high-dimensional signal to be encoded with a small number of coefficients. Considering the sparse signal recovery, the signal x is τ sparse if it can be estimated with the following formulation. arg min ∥x − Ds∥22 s.t. ∥s∥0 ≤ τ, s (3.10) where ∥s∥0 counts the non-zero elements of the coefficient vector s. The sparsity τ 3.2 • Sparse Signal Representation 43 is strictly small, τ ≪ n, compared to the size of atoms, since we wish the signal x to be compressible. The above constraint can be rewritten in another form using an estimated noise or error parameter ϵ as follows arg min ∥s∥0 s.t. ∥x − Ds∥22 ≤ ϵ. s (3.11) Consequently, solving the equations above gives us a set of sparse coefficients. This implies that the observed signal x can be reconstructed with fewer non-zero coefficients. Given the observed signal x and the dictionary D, however, the above problems are non-convex and NP-hard due to the ℓ0 norm. An exhaustive search of the possible solutions is required to solve a combinatorial nτ least square problem. To address this problem, the condition can be relaxed by replacing the ℓ0 norm with an ℓ1 norm. This leads to solving a convex problem with Linear Programming (LP) algorithms. These theoretical works [125, 126, 127, 128, 129, 130] were the pioneer works which conceptually proved the LP solutions of the above equations. The geometric interpretations of ℓp norm can be found in [131], demonstrating the relationship between sparsity and ℓp norm minimization. Hence, one goal of sparse representation is to seek algorithms to solve the combinatorial and non-convex optimization with relaxed conditions. There exist two major categories of sparse recovery algorithms which rely on the norm used for the regularizer. The first group of algorithms solves Equation (3.11) with Greedy methods. The popular methods in this category are, for example, matching pursuit (MP) [132] and orthogonal matching pursuit (OMP) [133, 134]. The greedy algorithms iteratively search for the best atom (support) from the dictionary for each iteration based on heuristic conditions, e.g., approximated residuals. There are several MP and OMP variants proposed to improve performance, e.g., Regularized OMP [135, 136] and Compressive Sampling OMP [137]. Here, we refer to [138, 139, 140, 141, 142, 143, 144, 145] for more extensions. Another category of sparse approximation solvers exploits convex relaxation [146] with ℓ1 -norm regularization. The convergence guarantee of ℓ1 minimization makes this group of algorithms strong but they suffer from the high-computational complexity compared to the greedy algorithms. The examples of this group are, for instance, basis pursuit (BP) [147], Gradient Projection Sparse Reconstruction (GPSR) [148], interior-point based methods [149, 150], alternating direction method (ADM) [151], proximity-based algorithms [152, 153, 154, 155] and SpaRSA [156]. One can classify the algorithms into more groups, however, here we only mention two major categories. We direct the reader to literature [131, 157, 158] for a comprehensive review of sparse recovery algorithms. Obtaining a sparse signal by solving Equation (3.11) poses a number of challenges. Firstly, one challenge is the dictionary design for sparse representation. Typically, a sparse signal recovery algorithm is applied when a dictionary D is given. However, 44 Chapter 3 • Tensor Decompositions and Sparse Representation in most cases, the dictionary is not given and the choice of the dictionary is critical to the reconstruction quality. One classical choice for the dictionary is the pre-specified dictionary based on the transform methods, e.g., the fast Fourier transform (FFT) [159], Discrete Cosine Transform (DCT) [160], wavelet transforms [161, 162, 163], curvelets transform [164], bandelets [165], and contourlet transform [166]. Another promising way to address this is to consider it as a machine learning task where a dictionary is learned from a training set. In the next section, we will provide an overview of dictionary learning. One of the most important applications of sparse representation is efficient sampling and reconstruction of signals. This opens up the research direction on compressed sensing in which one would like to sample a compressible signal. In Chapter 5, we will introduce the concept of compressed sensing utilized in our work. We also refer the reader to a reference book [167] on compressed sensing. 3.2.2 Dictionary Learning A motivation of dictionary learning is to find a representative set of bases D by learning it from collected training data. Several types of dictionary learning methods have been proposed, e.g., online dictionary learning [168, 169], discriminative dictionary learning [170, 171] and joint dictionary learning [172, 173]. One can also categorize the methods into supervised learning and unsupervised learning [131, 174]. Generally speaking, dictionary learning utilizes a method for solving an equation similar to Equation (3.11) with the ability to update or search for a representative dictionary D with respect to the sparsity, see Equation (3.12) below. The common approach for learning the dictionary D is to iteratively update atoms until certain conditions are met. One of the most commonly used dictionary learning methods is K-SVD [175]. The algorithm can be seen as a generalized Kmeans clustering algorithm classified as an unsupervised learning method. K-SVD seeks to solve the following objective function arg min ∥X − DS∥22 s.t. ∥Si ∥0 ≤ τ, ∀i ∈ 1, . . . , N , (3.12) D,S where X ∈ Rm×p is a training data matrix whose columns are p known signals, D ∈ Rm×k is the learned dictionary, and S ∈ Rk×p is the sparse coefficient matrix. The user-defined parameter τ is the sparsity parameter. When k > m, the learned dictionary is called overcomplete. The joint optimization problem, Equation (3.12), can be solved in an iterative manner by freezing either parameter D or S at a time and updating the other. K-SVD employs a sparse recovery algorithm, OMP, when the dictionary D is fixed. When S is fixed, it becomes a regression problem which can be solved using SVD 3.2 • Sparse Signal Representation 45 decomposition. This iterative two-stage approach is commonly used in dictionary learning algorithms. Notice that the number of atoms k influences K-SVD in the way that if k is large, it leads to a more representative dictionary but at a high computational cost. The unsupervised dictionary learning has been widely adopted in applications of visual data reconstruction, e.g., image denoising [114, 176, 177] and image compression [178]. For applications with labeled training data, supervised dictionary learning algorithms [170, 171, 179, 180, 181, 182] can address the problem. The dictionary learning algorithms, in the early stage, did not account for data coherence along different dimensions. The multi-dimensional visual data sets, e.g., light field data, BRDF and BTF, have shown to have correlations along different dimensions. Moreover, vectorizing multi-dimensional data into a 1D signal does not capture the underlying coherent structure and it may significantly increase the training time. To address this problem, the tensor-based dictionary learning algorithms [183, 184, 185, 186, 187] are considered as an alternative approach to this type of training data. Additional algorithms in this category are CPD-based dictionary learning [188, 189, 190, 191] and Tucker-based dictionary learning [192, 193, 194, 195] where the algorithms utilize similar formulations of Equation (3.8) or Equation (3.9) based on multi-dimensional representations with sparsity constraints. Therefore, it is reasonable to utilize the tensor-based dictionary learning method for handling BRDF modeling. We will show the performance of our proposed BRDF models in Chapter 4 where we employ a tensor decomposition method and a dictionary learning method to represent high-dimensional reflectance data. Chapter 4 Surface Reflectance Modeling and its applications As discussed in previous chapters, the surface reflectance is a key for the pipeline of generating realistic images. Solving the light simulation accounts for the degree of realism obtained from the rendering algorithm and the light-material interaction. For each material, light interaction is distinctly determined by a special class of reflectance functions such as the BRDF, the BTDF and the BSSRDF. Indeed, one research problem that this thesis addresses is to efficiently encode such high dimensional data obtained from the measurements. We focus on the homogeneous surface as the spatial location parameters are omitted. More importantly, we consider the use of data-driven approaches to tackle the problem. The emerging of the MERL database [31] gave rise not only to evaluation of BRDF models, but also to leveraging of the measured data for data-driven BRDF models. 4.1 Outline and Contributions In this chapter, we introduce the background knowledge for BRDF modeling, see Section 4.2. The modeling approach has been classified into three major categories. We review each one individually starting from empirical BRDFs to data-driven BRDFs. In Section 4.3, we present our data-driven BRDF models as the contributions of the thesis (Paper C and Paper D). Our non-parametric BRDF models have no assumptions about the underlying physical model, and the number of model parameters are not fixed. Parametric models, on the other hand, have a 48 Chapter 4 • Surface Reflectance Modeling and its applications strong assumption about the model structure, and the number of parameters are predetermined [196]. Our factorized BRDF model, Paper C, is a non-parametric model whose number of coefficients corresponds to data resolution. The unique characteristics of the PDV parameterization is adopted to align BRDFs to achieve a low-rank factorization. In Paper D, a learning-based method is employed with some additional dictionary properties. The resulting BRDF model is a non-parametric model whose number of coefficients is flexible according to the required accuracy and storage. Lastly, in Section 4.4, we present some useful BRDF applications derived from Paper C and Paper D, as well as a BRDF editing method presented in Paper A. 4.2 4.2.1 BRDF Models The Empirical BRDF Model In the early stage of 3D computer graphics, empirical models were used to approximately imitate specular highlights on rough surfaces. This group of BRDF models are empirically developed from the observations with the mathematical frameworks supported. The earliest models of this kind are Phong [26] and BlinnPhong [27] BRDF models which employ the cosine angle between directions. The Phong model demonstrates that the specular highlight can be modeled by a pair of the viewing direction, V , and the perfect reflection direction, R, while the Blinn-Phong model utilizes a pair of the normal vector, N , and the half vector, H. The energy-conservation constraint was later introduced by Lewis [197], where the model limits the diffuse and specular coefficients to 1.0. The following equations are the Phong BRDF model, the Blinn-Phong BRDF model and the Lewis BRDF model, respectively. ρs (ωi , ωo ) = ks (R · V )n , (4.1) ρs (ωi , ωo ) = ks (N · H)n , (4.2) ρ(ωi , ωo ) = kd + ks ρs (ωi , ωo ), (4.3) where n is the exponent controlling the specular highlight, kd and ks are diffuse and specular coefficients, respectively, with the constraint that kd + ks = 1.0, and (ωi , ωo ) is a pair of incoming and outgoing directions. These models are widely used in simple shading models due to inexpensive computation. Later, another modified Phong BRDF model [198] was derived so that it follows the physically plausible rules and can be used in a path tracing renderer with importance sampling. The equation is expressed as follows 4.2 • BRDF Models 49 1 n+2 (R · V )n . + ks π 2π | {z } {z } | ρ(ωi , ωo ) = kd dif f use (4.4) specular Equation (4.4) has both the diffuse and specular components where the diffuse component is described by the Lambertian model and the specular component is described by the Phong model. The term n+2 2π was derived to meet the energy conservation. It is common that many BRDF models separate the diffuse and specular components because the Lambertian reflectance can efficiently approximate the diffuse component and it is nearly a constant. One of the well-known BRDF models was proposed by Gregory J. Ward [56]. It was designed to represent both isotropic and anisotropic BRDFs by using Gaussian distributions. Moreover, the proposed models also support the Monte Carlo sampling. Several variants [199, 200] were proposed to correct the normalization term for more physically plausible constraints. The Ward BRDF models for isotropic and anisotropic materials are, respectively, as follows exp −tan2 1 1 ρ(ωi , ωo ) = kd + ks q · π 4πα2 cos(θi )cos(θo ) 2 1 1 ρ(ωi , ωo ) = kd + ks q · π cos(θi )cos(θo ) θh 2 α (4.5) , 2 2 exp −tan (θh ) cos θh 2 αx 4παx αy + sin θh 2 αy !! , (4.6) where α is the Gaussian parameter for the specular highlight in the isotropic BRDF model, αx and αy parameters control the specular highlight for the anisotropic BRDF model, and θi , θo and θh are elevation angles of incoming, outgoing and halfway directions, respectively. All the α parameters can be thought of as the standard deviation of the surface slope. The α parameter in Equation (4.5) handles the size of the circular highlight seen in isotropic materials which is invariant to the rotation while the αx and αy in Equation (4.6) govern the highlight of anisotropy seen in anisotropic materials. Please note that due to the grazing angle problem in the anisotropic Ward BRDF model, a similar anisotropic BRDF model was proposed by Neumann et al. [201] where q the cosine term cos(θi )cos(θo ) was replaced by max(cos(θi ), cos(θo )). Neumann et al. [201] also proposed modifications of physically plausible BRDF models for the Phong model and the Blinn-Phong model to represent metallic materials. The empirical approach has opened a research direction for mathematical BRDF frameworks. A BRDF framework, which was mathematically derived for general BRDF models with importance sampling support, was proposed by Neumann et al. [59]. The resulting models are physically plausible for both isotropic and 50 Chapter 4 • Surface Reflectance Modeling and its applications anisotropic BRDFs. The key idea of the models is their basic BRDF which is defined as the distance vector on a projected vector domain. Another BRDF framework was derived by using the half vector, proposed by Edwards et al. [62]. This framework shares a similar idea from Neumann et al. [59] in the way that the projected vector on the unit disk was used to derive the shape of surface reflectance with a pair of the half vector and the normal vector. The framework puts the energy conservation as a constraint to ensure that the resulting empirical BRDF model is physically plausible. At near-grazing angles, light is scattered from the media largely due to the Fresnel effect which had not been considered until Ashikhmin and Shirley [57] proposed their BRDF model. This effect can be modeled using the electro-magnetic wave of light reflection and transmission. Schlick [202] introduced an approximation to the Fresnel term, Fr (cos θh ), as follow Fr (cos θh ) = FR0 + (1 − FR0 )(1 − (cos θh ))5 , FR0 = η1 − η2 η1 + η2 (4.7) !2 (4.8) , where Fr is the Fresnel reflection, FR0 is the surface reflectance at the normal incidence, and η1 and η2 are indices of refraction (IOR) of the two medium. The approximation in Equation (4.7) was later used in the Ashikhmin and Shirley’s model to derive the physically plausible normalization term as shown in Equation (4.10) below. The complete form of the model appears as ρs = ks (N · H)n F (cos θh ), max(cos(θi ), cos(θo )) r (4.9) !5 !5 cos(θ 28Rd ) cos(θ ) i o 1 − 1 − , ρd = (1 − Rs ) 1 − 1 − 23π ρ = ρs + ρd , 2 2 (4.10) (4.11) where ks is an arbitrary specular coefficient, and Rd and Rs are constants for diffuse and specular reflections, respectively. Its variant [203] was proposed to support the Monte Carlo sampling and hardware-friendly implementation. So far, we have reviewed a number of well-known empirical BRDF models. Note that the empirical BRDF models are derived mathematically to meet the effects of light scattering instead of using physical theories to support the assumptions. A benefit of using the mathematical derivation is the freedom of parameters and constants in the models. In the next subsection, we will go through the BRDF models whose assumptions have a more physically-accurate meaning. 4.2 • BRDF Models 51 4.2.2 The Analytical BRDF Model When material surfaces are modeled by using theoretical physics to predict physical surface phenomena, we categorize the BRDF model as an analytical model or a physically-based model. The most favoured physically-based model widely used in computer graphics is the microfacet-based BRDF model. The microfacet theory suggests that the rough surfaces be modeled as a collection of microfacets defining the light distributions at any given point. The Torrance-Sparrow model [204] and Cook-Torrance model [28], as shown in Equation (4.12) appear to be the early physically-based BRDF models derived by using the microfacet theory, defined as ρs (ωi , ωo ) = D(ωh )G(ωi , ωo )Fr (ωo ) , 4 cos(ωi )cos(ωo ) (4.12) !2 tan(α) 1 , 4 exp − m m cos α ! 2 (N · H)(N · V ) 2 (N · H)(N · L) G(ωi , ωo ) = min 1, , , (V · H) (V · H) D(ωh ) = 2 (4.13) (4.14) where D is the light distribution function, G is the geometrical shadowing and masking term, and Fr is the Fresnel term. The Cook-Torrance BRDF model proposed the use of Beckmann distribution function as shown in Equation (4.13), where m is the Beckmann parameter and α is the roughness parameter. The geometrical term, G, denotes the approximation of micro structure of surfaces which blocks and attenuates light when it enters the microscopic scale of the surface. The G term of the Cook-Torrance model is presented in Equation (4.14). The Fresnel term, Fr , accounts for the Fresnel effect appearing near the grazing angle. The microfacet-based BRDF model is rather well-known in computer graphics and rendering for decades. Most of the variants tweak and adjust the distribution term D and the geometrical term G. For example, the microfacet-based model and the GGX distribution, in Equation (4.15), proposed by Walter et al. [205] was adopted to model refraction accounting for the BSDF model (BRDF + BTDF). The SGDbased BRDF model [206] proposed more accurate distribution using Shifted-Gamma Distribution (SGD), in Equation (4.16), for single-layered materials. D(ωh ) = χggx (N · H)α2 π cos4 (θh )(α2 + tan2 (θh ))2 χsgd (θh ) D(ωh ) = P22 (tan2 (θh )), π cos4 (θh ) , (4.15) (4.16) where χggx (N · H) is equal to 1 if N · H > 0 and 0 otherwise, χsgd (ωh ) is equal to 1 if ωh < π/2 and 0 otherwise, P22 (x) is a positive function of the variable x ∈ [0, ∞). 52 Chapter 4 • Surface Reflectance Modeling and its applications Note that the GGX distribution, Equation (4.15), looks similar to the SGD distribution, Equation (4.16), as the P22 (x) is a generalization function for the distribution, which can be the GGX distribution, the Beckmann distribution or the Gaussian distribution. The SGD-based BRDF model [206] proposed the generalization function as shown below p−1 P22 (x) = α · Γ(1 − p, α) 2 exp − α α+x α2 + x p (4.17) , where Γ is the incomplete gamma function, and α and p are the distribution slope and the exponent parameters, respectively. Holzschuch et al. [207] proposed another variant of the microfacet-based BRDF model by extending the diffraction term, using an approximation of the CookTorrance diffraction. Some variants [208, 209] have proposed the use of different normalization terms, the denominator term, to steer the near grazing angle lobe. The Oren-Nayar BRDF model [210] improves the diffuse surfaces by assuming the V-shaped cavity as micro structures. The resulting approximation of the model is represented by the following equation ρ(ωi , ωo ) = where A = 1 − 2 kd (A + B max(0, cos(ϕi − ϕo ))sin(α)tan(β)), π m , 2 2(m +0.33) B= 2 0.45m , 2 m +0.09 (4.18) α = max(θi , θo ), β = min(θi , θo ), and m is the standard deviation of the microfacet orientation angle in radians. Inspired by Rayleigh-Rice theory, the ABC BRDF models [60] were derived based on the observations that the measured BRDF isocontours, projected on the unit disk, are radially symmetrical. The ABC BRDF models are presented in the following equations. kd + S( Dp )Fr (ωo ), π √ S( 1 − N · H)G(ωi , ωo )Fr (ωo ) k , ρ(ωi , ωo ) = d + π cos(θi )cos(θo ) ρ(ωi , ωo ) = where S(f ) is the ABC model with arbitrary parameter f and S(f ) = (4.19) (4.20) A 2 C. (1+Bf ) The parameters A, B, C are obtained from the fitting process. The projected distance vector, Dp , is computed from the projected incoming and outgoing directions as shown in Figure 2.5. The proposed smooth surface BRDF model in Equation (4.19) represents glossy materials and metals quite efficiently as these materials exhibits the symmetrical 4.2 • BRDF Models 53 scattering on the projected unit disk. The microfacet-based ABC model in Equation (4.20), on the other hand, works well on dielectric or plastic materials where rough surfaces are dominant. The single-lobe scattering models sometimes cannot accurately represent layered BRDFs which exhibit multiple scattering from multi-layered materials such as car paint, pearlescence and iridescence. There are several BRDF models proposed for layered materials. The Beard-Maxwell reflection model [211] proposed the physically-based reflection model based on the microfacet theory. The reflection model has an additional term representing the volumetric scattering under the material surface as follows ρ = ρf s + ρvol . |{z} surf ace (4.21) |{z} volume Its variant was presented by Westlund and Meyer [212], whose BRDF model is based on the microfacet model. Weidlich et al. [213] proposed a layered BRDF model similar to the Beard-Maxwell model but with an absorption term using the Bouguer-Lambert-Beer law. A hybrid layered BRDF model [214] utilized the multi-lobe scattering, the microfacet-based BRDF model, and an image-based fitting technique to represent flakes on metallic car paints. Clearly, most of the models presented here are microfacet-based BRDF models and widely used in physically-based rendering and shading. As measured BRDFs became more available, the evaluation of BRDF models has been shifted toward the use of measured BRDFs. In the next subsection, we then review the data-driven BRDF models as they leverage the measured data and mathematical tools for BRDF modeling. 4.2.3 The Data-Driven BRDF Model The development of reflectance acquisition methods, presented in Section 2.5, has brought up the attention to reflectance modeling using measured data. The key idea of a data-driven model is to define the model as a linear combination of basis functions using a set of coefficients. This representation does not normally rely on any physical model, but rather computing the coefficients through data with the accommodation of the chosen basis functions. In fact, the BRDF models presented in Section 4.2.1 and Section 4.2.2 have a fixed number of parameters. In contrast, the number of parameters of a data-driven BRDF model are not fixed but predetermined by the model accuracy or the memory footprint as needed. Spherical harmonics are frequency-space basis functions used widely in computer graphics and vision. The basis function represents angular information on the unit 54 Chapter 4 • Surface Reflectance Modeling and its applications sphere. The spherical harmonic has been used in BRDF modeling [66, 67, 215] and BRDF shading [68, 216, 217]. Some example applications of the spherical harmonic, other than the BRDF modeling, are face recognition [218, 219], an spherical harmonic framework for illumination [220], environment map representation [221] and real-time rendering [222, 223]. A comprehensive review of the spherical harmonic in illumination modeling and computer vision can be found in [224]. Another frequency-space basis function is the Fourier transform [70, 71, 225]. These frequency-space BRDF models require storing the coefficients, depending on the expansion orders, in a tabular form. The rational function constructed by the ratio of two polynomials can be used to approximate BRDFs [226]. The rational BRDF model requires to predetermine the number of coefficients for each polynomial. In this thesis, one of our contributions belongs to the category of separable factorization functions. As we focus on isotropic BRDFs, the measured data is stored in a 3D matrix. Given that the provided data is stored in a 3D regular grid, the factorized BRDF model can be defined as follows ρ(x, y, z) = L X F(1,i) (x)F(2,i) (y)F(3,i) (z), (4.22) i=1 where F(1,i) , F(2,i) , F(3,i) are 1D factors of the ith term and L is the number of factorized terms. The factorized BRDF model has an assumption that any BRDF can be approximated by rank-1 factorization whose factors are 1D non-parametric functions. This holds true when the measured data is aligned on the parameterization which makes the BRDF separable as presented by Kautz et al. [54] and McCool [61]. Moreover, one can further reduce the data dimensionality into 2D for BRDF representations. Lawrence et al. [39] realigned a measured BRDF into a 2D matrix and applied the Non-negative Matrix Factorization (NMF) to obtain two 1D factors. It was later observed by Romeiro et al. [227], who utilized the bivariate representation on the Half-Diff parameterization by taking the average over ϕd dimension, that MERL BRDFs [31] can be approximated with the bivariate representation. They then proposed a BRDF inference framework from a known illumination and a sphere image. In case of the 2D data, we can drop out the third factor F(3,i) shown in Equation (4.22) to perform the NMF method. Proposed by Bilgili et al. [40], the Tucker decompsition [109] was employed to model BRDFs. Their BRDF model does recursive factorization of the BRDF illumination channel on the Half-Out parameterization and do BRDF fitting to obtain RGB color reflection. However, their BRDF model suffers from the color shifting which we believe is inherent to the BRDF fitting. Machine learning is another branch of techniques which has been used in BRDF 4.3 • Our Data-driven BRDF Models 55 modeling. The probabilistic methods such as Gaussian process [228, 229] and directional statistics [230] have been adopted to learning BRDF model parameters. Another machine learning approach which we adopt for our data-driven BRDF model is the dictionary-learning approach. The BRDF can be thought of as a signal observed by light-material interactions. Principal Component Analysis (PCA) is an example of an unsupervised representation learning. It was first utilized on measured data by Matusik et al. [31] when they published the MERL database. Nielsen et al. [41] employ PCA for their BRDF sampling method. However, their sampling method is sensitive to the training data when low number of samples is used leading to artifacts often found in reconstructing diffuse materials. They suggest that their BRDF sampling could improve by performing PCA on different classes of materials. Later Sun et al. [74] proposed the diffuse-specular separation with the PCA method for diffuse and specular dictionaries. The basis function of PCA is the learned dictionary consisting of the orthonormal basis matrix and its atom variance as X = U ΣV T , (4.23) where X ∈ RM ×N is the BRDF data, U ∈ RM ×K is the orthonormal basis matrix, Σ ∈ RK×K is the scaling factor and V T ∈ RK×N is the coefficient matrix whose columns are orthonormal. The above equation factorizes the data using Singular Value Decomposition (SVD) method. We can define the dictionary as Q = U Σ. The size of the dictionary Q depends on the explained variance parameters (number of columns of U ). The explained variance of the PCA harnesses the quality of the learned dictionary and thus the quality of the reconstructed BRDFs. Here, the major problem of using PCA is the large memory footprint of the dictionary as it grows according to the number of training data in X. A data-driven BRDF model proposed by Bagher et al. [231] is considered a hybrid method which assumes the microfacet-based BRDF as the underlying model. They proposed a special weighting function to preprocess the measured data. The weighted BRDFs and the microfacet-based BRDF are then optimized to obtain coefficients of the distribution term D, the geometrical term G and the Fresnel term Fr . The number of coefficients are determined by the data resolution used during the optimization process. 4.3 Our Data-driven BRDF Models Here, we present the factorized BRDF model and the dictionary-based BRDF model as our contributions. The factorized BRDF model utilizes the fact that isotropic 56 Chapter 4 • Surface Reflectance Modeling and its applications BRDFs are separable when the data is aligned on a suitable parameterization. We propose the use of the Projected Deviation Vector parameterization (PDV) with regards to observations from previous works [59, 60]. As a result, our factorized BRDF models can be used in BRDF modeling, BRDF editing and efficient BRDF acquisition. Secondly, the dictionary-based BRDF model was inspired by Miandji et al. [232], that presents the compression of light-field data using Aggregated Multi Dictionary Ensemble (AMDE). With sparsity constraints imposed in the dictionary learning, our dictionary-based BRDF model is a sparse non-parametric BRDF model which has much smaller dictionary size than PCA. Additionally, we show that the encoded BRDFs can be interpolated using our dictionaries and our model selection method works well on the BRDF domain accounting for variability of BRDF data sets. 4.3.1 The Factorized BRDF Model 4.3.1.1 Motivation The BRDF parameterization has played a significant role in BRDF modeling as shown in a number of works [39, 40, 58, 59, 60]. They showed that a suitable parameterization facilitates the aligned measured data in BRDF modeling. The observations made by Löw et al. [60] indicate that the measured isotropic BRDFs [31] have the iso-contour on the unit disk and the hemisphere as shown below, Figure 4.1(a) and (d). The iso-contour curves have symmetrical circular shapes preserved along the incident angles on the unit disk. We then further investigate the visualizations of BRDF parameterizations as shown below in Figure 4.1. Figure 4.1(a), (b) and (c) illustrate the projections of coordinates of each parameterization on the unit disk by varying the last two parameters as explained in Section 2.3. The Half-Diff and Half-Out parameterizations do not preserve the coordinate shapes around the grazing angle on the unit disk. Moreover, the coordinates on the hemisphere are different from iso-contour curves found in the measured isotropic BRDFs. To convert the measured BRDFs from the standard parameterization, we provide the conversion algorithms presented below in Algorithm 2 and Algorithm 3. One issue occurs in dp sampling as the BRDF is highly non-linear and that the BRDF specular lobe is important. We present the quantization based on the inversion method in Paper B and Paper C. This method normalizes the sum of the non-linear energy across all BRDFs in the MERL database and the result is non-linear samples of the dp parameter. The following equation is the normalized BRDF energy along the dp parameter. PM P P Ej = m i k ρm (i, j, k) , M (4.24) 4.3 • Our Data-driven BRDF Models 57 (a) PDV at θR = 30◦ (b) Half-Diff at θh = 30◦ (c) Half-Out at θo = 30◦ (d) PDV at θR = 70◦ (e) Half-Diff at θh = 70◦ (f) Half-Out at θo = 70◦ Figure 4.1 — Illustrations of coordinates of each parameterization on the unit disk and on the hemisphere. Figures (a), (b) and (c) are the coordinates of the PDV, the Half-Diff and the Half-Out parameterizations respectively on the unit disk while figures (d), (e) and (f) are the aligned coordinates of the PDV, the Half-Diff and the Half-Out parameterizations respectively on the hemisphere. Each curve of the PDV (a),(d) is dp = 0.044 unit apart. Each curve of the Half-Diff (b),(e) is θd = 2◦ apart. Each curve of the Half-Out (c),(f) is θh = 2◦ apart. 1 2 3 4 5 6 7 8 Input: (θi , ϕi , θo , ϕo ) Result: return the PDV parameters(θR , dp , ϕdp ) θR = θo ϕi = ϕi − ϕo ϕo = 0.0 Rp = (sin(θo )cos(ϕo + π), sin(θo )sin(ϕo + π)) Lp = (sin(θi )cos(ϕi ), sin(θi )sin(ϕi )) Dp = Lp − Rp dp = len(Dp ) ϕdp = atan2(Dp .y, Dp .x) Algorithm 2: Standard-to-PDV parameterization conversion where Ej is an element of Enormalized = {Ej |j = 1, 2, ..., NE }, NE is the number of linear dp resolution, ρm is the BRDF value of mth material, and M is the number of BRDFs in the database. The plot of normalized BRDF energy of the MERL database is shown in Figure 4.2(a). 58 Chapter 4 • Surface Reflectance Modeling and its applications 1 2 3 4 5 6 7 8 9 10 11 12 13 14 Input: (θR , dp , ϕdp ) Result: return the Standard parameters(θi , ϕi , θo , ϕo ) Rp = (−sin(θR ), 0.0) Lp = (dp cos(ϕdp ) + Rp .x, dp sin(ϕdp ) + Rp .y) if len(Rp ) > 1.0 or len(Lp ) > 1.0 then return null else ϕi = atan2(Lp .y, Lp .x) L .x p θi = abs(asin( cos(ϕ )) i) π if θi > 2 then return null else θo = θR ϕo = 0.0 end end Algorithm 3: PDV-to-Standard parameterization conversion (a) The normalized energy (b) dp step size Figure 4.2 — The normalized energy curve (a) is computed on 2000 linear dp samples. The dp step size (b) from the inversion method demonstrates the non-linear curve. The resulting non-linear dp resolution is 90 samples with a non-linear step size, Figure 4.2(b). Note that our dp sampling can be applied individually on each BRDF to increase the model accuracy, however, here we take the average of all BRDFs for the dp step size for general purpose modeling. 4.3.1.2 Our BRDF model Our 3D separable BRDF model utilizes the 3D PDV representations of the measured isotropic BRDFs. Our BRDF model iteratively encodes the BRDFs into three 1D 4.3 • Our Data-driven BRDF Models 59 factored functions using the rank-1 CPD method, see Section 3.1.2. We define our BRDF model as follows ρt (θR , dp , ϕdp ) = L X F1,i (θR )F2,i (dp )F3,i (ϕdp ), (4.25) i=1 ρt = log(ρ + 1.0), (4.26) where L is the number of iterative factorized terms, F1,i , F2,i and F3,i are rank-1 factors of θR , dp and ϕdp , respectively. Here, the high dynamic range values of the BRDF is handled by a logarithmic transformation, Equation (4.26). One condition for our 1D factors is that the first term, L = 1, is non-negative to preserve the physical meaning of the BRDF but the remaining terms L > 1 does not have such constraint. Paper C demonstrates the increase of our model accuracy as L increases. 4.3.1.3 Importance sampling on PDV We propose a variance reduction technique as discussed in Section 2.1.2.2 to integrate our model in a Monte Carlo ray tracing. The inversion method which requires a cumulative distribution function (CDF) of the BRDF is employed for the importance sampling setup. The non-negative factors at L = 1 are used to construct the cumulative distribution function (CDF) as the normalized reconstructed BRDF of the first term can be viewed as the probability density function (PDF). Therefore, once a BRDF is represented by our BRDF model, the PDF is computed from the normalized values of L = 1 reconstruction. This can be done on-the-fly once the BRDF model is loaded up to the renderer. p(dpk |θR ) = ρ(θR , dpk , ϕdp )dpk PNdp , (4.27) j=1 ρ(θr , dpj , ϕdp )dpj ∆dpj where Ndp is the dp resolution, ∆dpj is the dp step size, and ρ(θR , dp , ϕdp )dp is the L = 1 reconstructed BRDF. To choose the directions, the CDF is constructed as follows Nd P (dp |θR ) = Xp p(dpk |θR )∆dpk . (4.28) i=1 During the rendering, we choose sample directions based on a given probability. The inversion method randomly draws two uniform random samples (ξ1 , ξ2 ). We can then generate the ϕdp and dp parameters given (ξ1 , ξ2 ) and θR as follows 60 Chapter 4 • Surface Reflectance Modeling and its applications ϕdp = 2πξ1 , dp = P −1 (4.29) (ξ2 |θR ). (4.30) As our BRDF model is defined on PDV space, the Jacobian of the mapping is needed to transform the sampling space. The probability distribution of sampling an incoming light direction ωi given an outgoing direction ωo is expressed as p(ωi |ωo ) = p(dp |θR )pϕ (ϕdp )|JT |. (4.31) Equation (4.31) can be computed using a uniform PDF of pϕ (ϕdp ) and a PDF from Equation (4.27). We derive the Jacobian transformation JT to avoid numerical instabilities as follows √ 2cos(θi ) JT ≈ q . (4.32) 2 − cos(2θi ) − cos(2θo ) + 4cos(ϕi − ϕo )sin(θi )sin(θo ) (a) Lawrence et al. [39] PSNR = 33.40dB (b) Bilgili et al. [40] PSNR = 35.27dB (c) Our BRDF model [3] PSNR = 37.37dB Figure 4.3 — BRDF model evaluations on the Princeton scene rendered using PBRT [9]. The scene contains 3 BRDFs which are blue-metallic-paint, nickel and yellow-matte-plastic from MERL. The inset represents the absolute error image of each BRDF model and PSNR of each rendered image is reported under the image. Figure 4.3 shows the evaluations of our BRDF model with other factorized BRDF models. Our factorized model in Figure 4.3(c) uses L = 10, resulting in around 87 KB per material. Additionally, our importance sampling indicates our model efficacy is comparable to other methods. We present the evaluation of importance sampling below, Figure 4.4. The visual error metric used here are Peak Signalto-Noise Ratio (PSNR) for rendered images and Mean Squared Error (MSE) for importance sampling efficiency. For more evaluation results, please refer to Paper C. 4.3 • Our Data-driven BRDF Models 61 (b) Lawrence et al. sampling (MSE = 4.0662) (c) Bilgili et al. sampling (MSE = 3.6134) (d) Our model sampling (MSE = 3.9474) Figure 4.4 — The Princeton scene rendered for visual comparisons of importance sampling strategy. All images were rendered at 256 samples per pixel. The log MSE values are computed using the rendered images and its reference. 4.3.2 The Sparse Non-parametric BRDF Model 4.3.2.1 Motivation Regarding the discussion in Section 4.2.3, we show that data-driven BRDF models have different complexity for the basis functions. Some models have a small storage requirement for the basis functions such as spherical harmonics and wavelets [221, 233], while the others have greater cost of storage for the basis such as PCA or decomposition-based BRDFs. The trade-off between the model complexity and the storage cost opens an opportunity to create a non-parametric BRDF model to solve this challenge. Moreover, the increasing measured materials allow us to leverage machine learning for developing a dictionary-based BRDF model. Indeed, a BRDF is multi-dimensional, typically 3D or 4D. Using one-dimensional PCA or SVD does not account for multi-way arrays and neglects the correlations between dimensions. Moreover the resulting dictionary size is large which limits the applicability. Therefore, our proposed dictionary-based BRDF model is based on multi-dimensional dictionaries to handle such high-dimensional reflectance data. The proposed BRDF model learns a BRDF manifold in the form of multiple collections of dictionaries using a dictionary ensemble. A set of dictionaries are basis functions which can be interpreted as a BRDF manifold representing a set of 62 Chapter 4 • Surface Reflectance Modeling and its applications BRDFs. To obtain multiple ensembles, we impose a different sparsity constraint on each ensemble resulting in covering a wide range of material classes. Additionally, we observed that each class of BRDFs corresponds to a different sparsity. We thus employ the sparse representation, see Section 3.2, to put a sparsity constraint in the learning process. Our learned dictionaries and the sparsity constraints allow us to have flexible number of coefficients depending on the desired reconstruction quality. More importantly, the encoded BRDFs can be recovered using only the target dictionary set leading to fast reconstruction. Lastly, the BRDF parameterization is still an open research problem. The BRDF model should not be limited by a parameterization. To summarize, our proposed dictionary BRDF model thus have the following properties. • Small dictionary size : Our proposed dictionary size depends on the data resolution unlike PCA whose size grows according to the number of training BRDFs. • Multi-dimensional data : Since the reflectance functions have a range of model complexity depending on the dimensionality, we can extend our dictionary-based model to represent high-dimensional data such as BTF, SVBRDF and BSSRDF. • Representativeness of dictionary : Once the trained dictionaries are obtained, any new material which is not included in the training set can be encoded without retraining the dictionaries. • Sparsity : The number of coefficients, i.e., sparsity, provides a trade-off between storage cost and quality. • Fast reconstruction : Due to the orthogonality of dictionaries, the fast local reconstruction is achievable. • Parameterization independence : The learning process is not limited by the parameterization used for storing BRDFs. The model can be freely extended to any suitable parameterization. 4.3.2.2 An Aggregated Multi Dictionary Ensemble for BRDFs The dictionary must handle any arbitrary BRDF dimensionality as N -mode tensors. An appropriate choice, thus, points to the method proposed by Miandji et al. [232]. They proposed the Aggregated Multi Dictionary Ensemble (AMDE) method which trains an ensemble of multi-dimensional dictionaries. We thus propose a variant of the AMDE that addresses the challenges associated with data-driven BRDF modeling. Indeed, the BRDF has many unique properties such as being highly non-linear and having a high dynamic range, hence it is not straightforward to 4.3 • Our Data-driven BRDF Models 63 use the AMDE as proposed in [232]. First, the BRDF must be transformed prior to the learning process. However, the choice of the transformations is a challenge as studied in the literature [3, 41, 60, 74, 77, 79]. Here, we select the log-based transformations since they were observed to perform well. Each BRDF is then transformed as follows ρt1 (ωh , ωd ) = log(ρ(ωh , ωd ) + 1), (4.33) ρt2 (ωh , ωd ) = log(ρ(ωh , ωd ) · cosMap(ωh , ωd ) + 1), (4.34) where cosMap(ωh , ωd ) is a weighting function mapping from the input directions (ωh , ωd ) to (ωi , ωo ) with the weight max(cos(θi ) · cos(θo ), ϵ), where ϵ = 0.001. cosine-weighted- (ρt1 ) log (ρt2 ) 44.72dB 48.90dB 45.16dB 41.41dB Reference glossy-red-paper cardboard log-plus Figure 4.5 — A diffuse material, cardboard (first row), and a glossy material, glossy-red-paper (second row), were modeled using our BRDF model with both transformations. The reference images are on the right column. The absolute error images appear on the right of each reconstructed result. Each error image is multiplied by 10.0 and normalized for visual comparison. The number presented of each rendered image is Signal-to-Noise Ratio (SNR). We refer to (ρt1 ) transformation in Equation (4.33) as log-plus and (ρt2 ) transformation in Equation (4.34) as cosine-weighted-log. Figure 4.5 demonstrates the visual results and the error introduced by each transformation. The log-plus transformation performs better on glossy materials, glossy-red-paper, while the cosine-weighted-log transformation performs better on diffuse materials, cardboard. More results regarding data transformations are presented in Paper B and Paper C. 64 Chapter 4 • Surface Reflectance Modeling and its applications The parameterization used here is the Half-Diff parameterization, i.e., the MERL parmeterization [31] with resolution 90×90×180. Given a RGB BRDF, we separate the BRDF into three BRDFs, i.e., one for each color channel. The transformations of each BRDF using ρt1 and ρt2 give two data points to our training set. Hence, the number of training data points is 6 times as large as the size of the training BRDFs. For multi-spectral BRDFs, one can get more BRDF data points depending on the number of provided spectral information, however the available measured spectral BRDFs are limited. This can be further investigated in the future work. N l We define the training data more concretely as follows. Let {X (i) }i=1 be a set (i) m1 ×m2 ×m3 of Nl BRDFs, where X ∈ R . As an example, for MERL BRDFs, we have m1 = 90, m2 = 90 and m3 = 180. Our training condition is that the BRDF resolution must be the same for all data points. Then, each BRDF is represented by the following decomposition X (i) = S (i) ×1 U(1,k) ×2 U(2,k) ×3 U(3,k) , where n U(1,k) ∈ Rm1×m1 , U(2,k) ∈ Rm2×m2 , U(3,k) ∈ Rm3×m3 o (4.35) is an orthonormal dictionary, the coefficient tensor S (i) ∈ Rm1 ×m2 ×m3 , and k ∈ {1, . . . , K} is the number of dictionaries in the ensemble. The notation ×n denotes the n-mode product between a tensor and a matrix [88]. Each trained ensemble consists of K multi-dimensional dictionaries denoted by oK n U(1,k) , U(2,k) , U(3,k) . The parameter K is a user-defined parameter and we k=1 suggest that K = 8 is suitable for commonly used measured BRDF databases [31, 35]. Another user-defined parameter is τl which harnesses the sparsity by the constraint S (i) ≤ τl in the training. The following optimization problem is defined for 0 training a BRDF dictionary ensemble U (j,k) min ,S (i,k) subject to Nl X K X Mi,k X (i) − S (i,k) ×1 U(1,k) ×2 U(2,k) ×3 U(3,k) ,Mi,k i=1 k=1 U(j,k) S T (i,k) K X 0 U(j,k) = I, ∀k = 1, . . . , K, ∀j = 1, . . . , 3, ≤ τl , Mi,k = 1, ∀i = 1, . . . , Nl , 2 F (4.36a) (4.36b) (4.36c) (4.36d) k=1 where the matrix M ∈ RNl ×K is a binary membership matrix representing BRDF clusters, where each BRDF in the training set is represented by one multidimensional dictionary. The constraints from Equations (4.36b) - (4.36d) are imposed during the training. 4.3 • Our Data-driven BRDF Models 65 Equation (4.36b) ensures the orthonormality of each dictionary, Equation (4.36c) enforces the sparsity of the coefficients, and Equation (4.36d) clusters each BRDF by one dictionary only. This membership clustering improves the representation error in the way that BRDFs, having high similarity, should belong to only one class and be represented by the same dictionary. Our observations in Paper D have shown that the pre-clustering method proposed in Miandji et al [232] cannot be applied to the BRDFs for two reasons. First, the number of available measured data is not sufficient for training after preclustering. Second, the size of each BRDF, 1458000 elements, is impractical for the pre-clustering method. Moreover, the BRDFs exhibit a variable degree of sparsity. For example, a small number of coefficients is adequate for diffuse materials while a highly glossy BRDF requires a higher number of coefficients. This corresponds to our observation in Paper D. A small value of τl results in an efficient model for low-frequency materials, or diffuse materials. On the other hand, when τl is larger, the learned dictionaries will capture high-frequency materials while diffuse-like materials will be degraded due to over-fitting. This observation has also been witnessed in a number of BRDF models [39, 41, 67, 68, 70, 74, 234]. We then present the training method using multiple sparsity levels. Each ensemble is trained with one sparsity value that fits a certain frequency of a material class, e.g., a group of plastics. The results are combined to form one bigger ensemble. The learned aggregated ensemble, as a result, covers a wider spectrum of BRDF signals. Although the number of ensembles has an impact to the size of our aggregated ensemble, we can still achieve low memory capacity with multiple ensembles. The size of our dictionary is determined by the data resolution (m1 , m2 , m3 ), the number of dictionary sets K, and the number of coefficients, τl . In Paper D, we trained on 4 ensembles with 8 dictionaries. The storage cost of 32 dictionaries required for our aggregated ensemble is (4 ∗ 8(90 ∗ 90 + 90 ∗ 90 + 180 ∗ 180) ∗ 4)/10242 = 5.93MB, assuming 4 bytes for each element. Our dictionary size is much smaller than using PCA and our model performs much better on both visual quality and BRDF reconstruction error. For more evaluation details, refer to Paper D. 4.3.2.3 BRDF Encoding and Decoding Once our dictionary ensemble is obtained from the learning process, each new BRDF can be encoded and decoded. To encode an unobserved BRDF, we seek for the sparse set of coefficients such that Mean Squared Error (MSE) of the reconstructed BRDFs is the lowest among the dictionary candidates. Each candidate is obtained using Equation (4.37) which projects a transformed BRDF onto each dictionary. Since we have two transformed BRDFs, the candidates are twice as many as the number of dictionaries. The projected coefficients are then nullified according to a user-defined testing sparsity Ŝ (i,k) ≤ τt . The projection step is formulated as 0 66 Chapter 4 • Surface Reflectance Modeling and its applications Ŝ (i,k) = Y (i) ×1 U(1,k) T ×2 U(2,k) T ×3 U(3,k) T , (4.37) where Ŝ (i,k) is the coefficients of data point i projected onto dictionary k, Y (i) is a BRDF that was not included in the training set. The decoder is simply the reconstruction from the given sparse coefficients as follows Ŷ (i) = Ŝ (i,k) ×1 U(1,k) ×2 U(2,k) ×3 U(3,k) , where Ŷ (i) (4.38) is the reconstructed BRDF. The obtained candidates Ŝ (i,k) , ∀k ∈ {1, ..., K} from Equation (4.37) are used to reconstruct the unobserved BRDFs and we select the dictionary corresponding to the least reconstruction error defined as Y (i) − Ŝ (i,k) ×1 U(1,k) ×2 U(2,k) ×3 U(3,k) 2 F . (4.39) After we select the best dictionary candidate, here we only store the non-zero locations and thecorresponding values of the selected sparse coefficients as tuples 1 2 3 lt , lt , lt , Ŝli1 ,l2 ,l3 , where lt1 , lt2 , lt3 are indices of non-zero elements, and Ŝli1 ,l2 ,l3 are t t t t t t the corresponding values. This candidate selection is done separately for each transformation. As a result, we have two sets of sparse coefficients after this encoding step. Up to this point, we have two BRDF models each associating with a BRDF transformation. This still leaves us the problem of choosing the best BRDF model. This is a challenging open research problem. 4.3.2.4 BRDF model Selection Given two BRDF models, the most reliable method to evaluate them is to render and compare visual results. However, rendering is rather a time-consuming and computationally expensive task. Using BRDF reconstruction MSE, the resulting model selection is inefficient and almost completely incorrect regarding to our observations in Paper D. We observe that this problem is similar to the BRDF fitting which seeks to find a qualitative BRDF error metric. The perceptual study of BRDF-space metrics by Lavoué et al. [79] demonstrates the logarithm function shows high correlation to the perceptual rendering performance. However, using this logarithm function may push the model selection biased toward our learned dictionaries and coefficients as our candidates are trained with log-based transformations. An alternative choice is a Gamma mapping function Γ(ρ, γ) = ρ1/γ inspired by Bieron and Peers [77]. We propose our BRDF model selection using the Gamma-mapped-MSE. The default γ parameter is 2.0 but this number can be fine-tuned per BRDF. 4.3 • Our Data-driven BRDF Models 67 Our BRDF model Bagher et al. [231] 52.70dB 34.06dB Reference Figure 4.6 — Rendering results of red-metallic-paint form the MERL test set. Our BRDF model was selected from the candidate pool by the Gamma-mapped-MSE. The absolute error images are multiplied by 10.0 and normalized between two BRDF models for visual comparison. The number presented for each rendered image is Signal-to-Noise Ratio (SNR). Figure 4.7 — Plots of average rendering SNR of test materials from MERL, EPFL and DTU data sets when reconstructed with increasing number of coefficients. We performed evaluations against non-parametric BRDFs. Indeed, our number of coefficients τt is variable depending on the degree of accuracy. For a fair comparison, the experiments were performed when the coefficients of all evaluated methods are the same size. We used τt = 262 as our testing sparsity. In Figure 4.6, we show an example of visual quality of our BRDF model and a result from Bagher et al. [231]. We additionally experimented with the effect of varied number of coefficients τt on MERL [31], EPFL [35] and DTU [41] data sets. The plots in Figure 4.7 clearly demonstrate the increasing performance when the number of coefficients is increased. Paper D reports additional performance analysis of our BRDF model along with other non-parametric BRDFs. 68 Chapter 4 • Surface Reflectance Modeling and its applications 4.4 4.4.1 BRDF Applications Differential apperance editing Predictive rendering [235] is becoming increasingly popular in many applications such as product visualization and architectural design. This requires highly-efficient material representations and flexibility for material editing by artists to be creative on designing appearance. However, working with measured BRDFs is cumbersome since it is difficult to tweak and edit the material freely. BRDF models are, thus, often employed for BRDF editing [236, 237, 238, 239], since a small number of parameters is involved in the editing step. Moreover, editing on BRDF parameter space is more feasible to control the BRDF shape according to the interpretable parameters such as roughness and specular lobe parameters. In addition, when analytical BRDF models are physically plausible, the edited results also carry the physically-accurate properties. Measured BRDF Edited BRDF Fitted BRDF parameters Residual BRDF Edited BRDF parameters Figure 4.8 — The differential editing pipeline for BRDF editing and interpolation. In Paper A, we present a BRDF editing method using differential edits on measured BRDFs. Analytical BRDF models are main components for our method since most of them were derived under physically plausible assumptions. Our method begins with a BRDF fitting process to obtain parameters from a chosen analytical BRDF model. The user then edits the BRDF shape on the parameter space to produce the desired material. After this editing process, both fitted parameters and edited parameters are used to approximate the BRDF basis and coefficients. In this case, we represent them as BRDF vectors in the Half-Diff parameterization with MERL encoding. We then obtain the residual vector as follows f N e N {cri }N i=1 = |{ci }i=1 − {ci }i=1 |, (4.40) where N is the number of non-zero elements of the coefficient space, {cri } is the residual coefficients, {cfi } is the coefficients from the BRDF fitting, and {cei } is the coefficients from parameter editing. We then update the measured data coefficients using the following expression 4.4 • BRDF Applications 69 r cei = max(0, cm i + ci ), (4.41) cm i where is the ith measured BRDF coefficient, and BRDF coefficient. cei is the edited measured Note that the updated BRDF is computed based on the measured BRDF. This is to preserve the intrinsic reflection properties of the measured data while the user can tweak the desired BRDF shapes such as specular lobe and diffuse reflection with differential edits cri . The pipeline of our proposed differential editing is presented in Figure 4.8. BRDF interpolation is applicable with our proposed method. Given two measured BRDFs, we edit the BRDFs on the parameter space. Our interpolation technique gradually adjust each measured BRDF individually by differential edits. This step thus produces two difference vectors, i.e., two residual BRDFs. Each difference vector generates an edited measured BRDF. The result of this step is two edited measured BRDFs. The final interpolation result is the weighted average between the edited materials where the weights are proportional to the distance in the parameter space. 4.4.2 BRDF interpolation in sparseland In Paper D, we demonstrate the BRDF interpolation in the coefficient space of our learned dictionaries. Here, we show the BRDF interpolation between two BRDFs under two scenarios. The first scenario is to interpolate two BRDFs under the same dictionary and the other scenario is to interpolate across different dictionaries, i.e., assuming that each BRDF is modeled using a distinct dictionary. Let B1 , B2 , B3 be three BRDFs that are defined under dictionaries U(1,1) , U(2,1) , U(3,1) , U(1,1) , U(2,1) , U(3,1) , and U(1,2) , U(2,2) , U(3,2) , respectively. The linear interpolation, denoted by I, between B1 and B2 is then expressed as I = (1 − α)B1 + αB2 (4.42a) (1,1) (2,1) + αS2 ×1 U ×2 U = (1 − α)S1 ×1 U ×2 U (1,1) (1,1) = ((1 − α)S1 + αS2 ) ×1 U (3,1) ×3 U (2,1) ×3 U(3,1) (2,1) ×2 U (4.42b) (3,1) ×3 U , (4.42c) where S1 and S2 are the sparse coefficients of B1 and B2 , respectively, and α is the interpolation weight. Note that Equation (4.42c) holds when the interpolation is under the same dictionary. This can be applied to bilinear and bicubic interpolations. 70 Chapter 4 • Surface Reflectance Modeling and its applications To interpolate two BRDFs across two distinct dictionaries, we need an intermediate dictionary which represents both BRDFs. A solution to the intermediate dictionary is to choose one dictionary as the reference basis and transform the other dictionary and its coefficients. Let R(i,k)→(i,j) be an orthonormal transformation matrix that transforms the orthonormal matrix U(i,k) to U(i,j) , then we have U(i,j) = U(i,k) R(i,k)→(i,j) . This expression holds due to the following proposition. Proposition 4.1. Let A and B be two distinct orthonormal matrices. Then the solution of min ∥B − AR∥2F subject to RT R = I, (4.43) R T is R = A B. The proof of Proposition 4.1 can be found in [240]. We then obtain the linear interpolation between B1 and B3 , denoted I, by the following expression I = (1 − α)B1 + αB3 = (1 − α)S1 ×1 U (4.44a) (1,1) (1,2) + αS3 ×1 U = (1 − α)S1 ×1 U + αS3 ×1 U (2,1) ×2 U ×2 U (2,2) ×2 U (1,1) (1,1) (2,1) (2,1) + α S 3 ×1 R (4.44b) (3,1) ×3 U (1,1)→(1,2) R(2,1)→(2,2) ×3 U(3,1) R(3,1)→(3,2) (1,1) = (1 − α)S1 ×1 U ×3 U ×3 U(3,2) ×2 U R (3,1) (2,1) ×2 U (1,1)→(1,2) ×3 U ×2 R(2,1)→(2,2) ×3 R(3,1)→(3,2) {z | (1,1) = (1 − α)S1 + αS̃3 ×1 U } S̃3 ×1 U(1,1) ×2 U(2,1) ×3 U(3,1) , (4.44c) (3,1) (2,1) ×2 U (4.44d) (3,1) ×3 U , (4.44e) where (4.44d) follows from (4.44c) due to a fundamental property of the N -mode product [88]. Figure 4.9 — An example of BRDF interpolation on coefficient space of the dictionaries. The interpolation coefficient α from left to right is in [0.0, 1.0] incremented by 0.1. The left most BRDF is cc-amber-citrine-rgb from EPFL and the right most BRDF is silver-metallic-paint2 from MERL 4.5 • Summary and Future Work 71 We observe that S̃3 is the transformation of S3 with the transformation matrices that transform U(i,1) to U(i,2) , for i = 1, 2, 3. The result from our BRDF interpolation is not necessary sparse, however, we can achieve the desired sparsity by re-projecting the interpolated BRDF onto the reference dictionary. Figure 4.9 shows an example of our BRDF interpolation from a layered material to a glossy material. Our dictionary ensemble BRDF model is a method of learning the manifold structure of BRDFs. Each of learned dictionaries represent a chart (a local coordinate frame) of the BRDF manifold. In our work, the BRDF manifold contains 32 local coordinate frames represented by the orthogonal dictionaries. Given two BRDFs that lie within the same local coordinate frame, we can generate a new BRDF using Equation (4.42a). Doing a BRDF interpolation across two local coordinate frames can be done using Equation (4.44e). 4.5 Summary and Future Work This chapter presented BRDF topics in several aspects. We started from an overview of BRDF models followed by our proposed data-driven BRDF models. By utilizing special characteristics of the PDV parameterization, an isotropic BRDF can be decomposed into 1D factors. This results in a simple factorized BRDF model representing glossy BRDFs accurately. We showed that our factorized BRDF model performs better than other factorized models on both model accuracy and importance sampling efficiency. The other proposed BRDF model employs a dictionary-learning method, leveraging the use of BRDF databases for training. By expanding into multiple ensembles with different sparsity constraints, our sparse dictionary BRDF model can efficiently represent BRDFs with a small number of coefficients. Due to the flexibility of our sparse dictionary BRDFs, we can achieve higher accuracy with minimal storage cost. In the last section, we presented BRDF editing and interpolation techniques under different models. Differential BRDF editing was presented to work with measured BRDFs. Owing to the physically plausible BRDF model and the BRDF fitting, the edited measured BRDF still preserves reflection properties from the measurements, while the user can intuitively edit the desired BRDF shape based on the model parameters. We also demonstrated the BRDF interpolation on sparse coefficient space with the learned dictionaries. Even if two BRDFs lie on two distinct dictionaries, it is possible to linearly interpolate the sparse coefficients of BRDFs because of the orthonormality constraint. In the future, we would like to extend the sparse dictionary BRDF to more complex reflectance data especially BTF and SVBRDF. We believe that the resulting models could benefit from the dictionary properties as well as the coefficient-space 72 Chapter 4 • Surface Reflectance Modeling and its applications interpolation in many aspects, e.g. arbitrary dimensions, fast reconstruction, data augmentation. Moreover, perceptual studies of the BRDF metric is a key to BRDF fitting and BRDF model selection. To close this gap, BRDF transformations have to be explored and investigated further. More importantly, BRDF parameterization for more complex materials, e.g. layered materials and BSSRDF, is an interesting area of study for efficient representations. Chapter 5 Sparse Sampling for BRDF Acquisition The demand of 3D digital content creation has increased over decades as rendering technology has evolved, facilitating the use of computer graphics in many applications. Material appearance, primarily measured data, is of importance for generating realistic images in predictive rendering. However, acquiring BRDF data has long been a challenging research problem as most capturing techniques are very time-consuming. This is due to the high dimensionality of the BRDF. For example, the MERL meaterial acquisition [31] takes approximately 3 hours to capture reflectance data of a sample material wrapped on a sphere. The time required prohibits the practicality of the acquisition system. Moreover, the spherical material sample facilitates the sampling rate per captured image. The downside is the unreliability of near-grazing measurements, and some materials may be difficult to be wrapped on the spherical objects. Each BRDF acquisition system, discussed in Section 2.5, has different benefits and drawbacks. In this thesis, we address the BRDF data acquisition problem by leveraging the available BRDF databases. Our goal is to achieve sparse BRDF measurement in order to speed up the acquisition process with minimized loss of accuracy. 5.1 Outline and Contributions This chapter presents our contributions towards sparse BRDF acquisition systems (Paper B and Paper E). We introduce two sparse BRDF sampling procedures 74 Chapter 5 • Sparse Sampling for BRDF Acquisition under different models in Section 5.2. The section splits into two parts. The first part, Section 5.2.1, presents a BRDF factor measurement inspired by observations from Löw et al. [60]. Under a suitable BRDF parameterization, the BRDF factor measurement can efficiently reconstruct isotropic BRDFs, specifically glossy materials, with a low sampling rate. This requires only two simple mechanical arms with two degrees-of-freedom. The second part presents the Fast and Robust Optimal Sampling Technique (FROST) for BRDF acquisition. The problem is formulated using the compressed sensing framework. To find the sparse BRDF sampling locations, FROST represents BRDFs in the form of a dictionary and a set of sparse coefficients, and solves for the optimal sampling pattern under the Multiple Measurement Vector (MMV) signal model. FROST is a deterministic method that is fast and more robust compared to previous methods that rely on heuristics or random sampling. 5.2 The sparse BRDF measurement 5.2.1 The BRDF factor measurement using PDV 5.2.1.1 Motivation Observations from Löw et al. [60] have shown the radial symmetry of isotropic BRDFs along the projected vector on the unit disk. This inspired us to further investigate the PDV parameterization. The study has confirmed that isotropic BRDFs behave similarly to the observations. This implies that isotropic BRDFs can be well-approximated with rank-1 factorization in the PDV parameterization. Therefore, once the BRDF is aligned in the PDV, we can utilize the CP decomposition to obtain our factorized BRDF models on 2D BRDF representations, see Section 3.1.2. Moreover, our factorized BRDF model is not recursive as in Paper C. As opposed to the ABC BRDF model [60], we propose a simple and efficient data-driven BRDF model that is suitable for a BRDF factor acquisition. The acquisition is sparse due to the fact that only two factors are measured and the reconstruction error from the factor measurement is minimized. 5.2.1.2 The separable BRDF models Our 2D separable BRDF model has an assumption that isotropic BRDFs can be approximated by two factors which are denoted by (θR , dp ). The radial symmetry has an implication that isotropic BRDFs are invariant to the ϕdp parameter of the PDV parameterization. Due to the high dynamic range property of a BRDF, as discussed in Section 2.4 on page 30, the assumption holds under the logarithmic transformation of BRDFs. The isotropic BRDFs are then transformed using the following equation 5.2 • The sparse BRDF measurement 75 ρt = log(ρ + 1.0). (5.1) In Paper B, we introduce the concept of the BRDF factor measurement using the 2D BRDF model. Our 2D separable BRDF model for the sparse BRDF measurement is expressed as follows ρt (θR , dp , ϕdp ) = F1 (θR )F2 (dp ), (5.2) where F1 (θR ) is a basis function associated with θR , and F2 (dp ) is a basis function associated with dp . This measurement acquires only two factors of the 2D data matrix. Each factor is obtained by a configuration, as shown in Figures 5.1. This requires only two degreesof-freedom for the measurement setup on a relatively simple gonioreflectometer. The first factor is measured by moving two mechanical arms away from the normal and the other factor is measured by moving a sensor away from a light source. (a) F1 (θR ) measurement (c) F2 (dp ) measurement Figure 5.1 — Two configurations of the factor measurement. Figure (a) demonstrates the perfect reflection measurement for F1 while Figure (b) measures F2 when the light source is fixed and the sensor moves away from the light source along the dashed line. ©2017 IEEE. The obtained factor BRDF values will appear in the 2D BRDF matrix as illustrated below, Figure 5.2(a), the blue row and column. These values are depicted as two blue curves shown in Figure 5.2(b). The F1 curve measures BRDF values associated to θR , i.e., the light scattering at perfect reflection angles while the F2 curve measures BRDF values associated to dp , i.e., the shape of scattering lobe (specular lobe). To estimate the missing BRDF values zi,j , we derive a simple relation. The components we have from our factor measurement are formulated in the following expressions BRDF (sr-1) 76 Chapter 5 • Sparse Sampling for BRDF Acquisition (a) The factor measurement on the 2D BRDF matrix (b) The BRDF measurement curves Figure 5.2 — Illustrations of the proposed factor measurement. Figure (a) shows the factors obtained on the 2D BRDF matrix corresponding to the measured curves shown on figure (b). The predicted BRDFs zi,j are on the dashed curve. The measured BRDF values are on the blue curves. zi,1 = F1 (xi )F2 (y1 ), (5.3) zm,1 = F1 (xi )F2 (y1 ), (5.4) zm,j = F1 (xm )F2 (yj ), (5.5) zi,j = F1 (xi )F2 (yj ) = zm,j zi,1 zm,j zi,1 = , F2 (y1 ) F1 (xm ) zm,1 (5.6) where zi,j is the value at ith row and jth column, m is the measurement index corresponding to the fixed light direction, xi is the parameter at ith row, and yj is the parameter at jth column. We have provided additional experiments on the impact of the fixed light location for F2 factor measurement in Paper B. Once all missing values of the 2D BRDF matrix are estimated, the 2D matrix can be used to approximate the 3D BRDF with respect to the above assumption. 5.2.1.3 Results To evaluate our factor measurement, we simulated the measurement system using materials from the MERL database. As seen in Figure 5.3, our method performs better compared to other parameterizations both on the absolute error images and rendering SNR values. For more evaluation results, refer to Paper B The proposed BRDF factor measurement utilizes the properties of the PDV parameterization. It facilitates the isotropic BRDF acquisition using sparse measurements (Paper B), and it can be used as a factorization model (Paper C ). In Paper C, 5.2 • The sparse BRDF measurement 77 Our factor measurment using PDV 2D BRDF model using Half-Diff 2D BRDF model using Half-Out (a) SNR = 31.84dB (b) SNR = 22.06dB (c) SNR = 25.59dB Figure 5.3 — We show the reconstructions of red-metallic-paint from the MERL. Figures (a), (b) and (c) are rendered images of each reconstruction method with the error images appeared on the right corner. The rendering SNR of each method is placed under each image. (a) (b) Figure 5.4 — An example of the BRDF factor measurement of chrome. The left plot (a) measures the Fresnel effect and the right plot (b) indicates the specular lobe width from the perfect reflection. we also show that our factorized BRDFs using L = 1 could be useful in terms of BRDF editing. Since each factor is independently measured, each BRDF has two measured factors. Each factor can be replaced by a factor from other BRDFs generating a new BRDF. Moreover, the user can also edit each factor intuitively. F1 describes the Fresnel effect and the specular reflection magnitude, while F2 controls the width of the specular lobe. This interpretation is clearly seen in Figure 5.2(b) and Figure 5.4. The curves in the figures correspond to the BRDF visualization as observed by Löw et al. [60]. 78 Chapter 5 • Sparse Sampling for BRDF Acquisition Here, we show the BRDF factor measurement of the chrome material from the MERL database in Figure 5.4. The F1 values are relatively high and steep neargrazing angles, while F2 values control the scattering from the perfect reflection vector. Our factorized BRDF models, hence, have great benefits in many aspects, i.e. modeling, measurement and editing, under a simple and intuitive BRDF parameterization. 5.2.2 A Fast and Robust Optimal Sampling Technique : FROST 5.2.2.1 Motivation The sparse BRDF measurement can be casted into several problems, e.g. adaptive sampling, inverse rendering and efficient sampling. These problems are addressed in many different aspects while the possible outcomes are accurate reflectance data from real-world objects. Adaptive Sampling [18, 35, 241, 242] requires an initial estimate of the signal to determine the initial material properties. This initial estimation is later used to identify sampling region adaptively. Inverse rendering [243] is a wide research topic and complex under distinct conditions, e.g., lighting, geometry and reflectance [227, 230, 244, 245, 246]. The problem aims to retrieve scene parameters observed from the inputs, e.g., an image with known geometry of an object and unknown lighting, and reuses the outcomes in a new fashion such as scene relighting, 3D scene reconstruction and BRDF recovery. Here, we address FROST as an efficient sampling method [41, 241, 247, 248]. Such methods learn a basis function, leveraging the available BRDF databases, and reconstruct unobserved BRDFs from a sampling pattern. The number of samples is relatively small, around 20 - 100 samples, compared to typically more than a million of samples from dense BRDF measurements. The efficient sampling method proposed by Nielsen et al. [41] utilizes a learningbased method, i.e. PCA, resulting in the principal component matrix Q as a basis function. Recall from Equation (4.23), the principal component used in [41] is a scaling of the coefficient matrix Q = V Σ. The Q matrix is used to obtain an optimal set of sampling directions and reconstruct any unobserved BRDF. However, a major drawback of this method lies in the heuristic optimization scheme with random initializations. Their optimization tries to minimize the condition number of Q̂, i.e., sampled rows of Q. Minimizing the condition number reduces the sensitivity to noise [249] of the matrix leading to finding a well-conditioned Q̂. This does not necessarily correlate with the reconstruction error. Furthermore, the algorithm randomly selects rows to construct Q̂. This randomness does not guarantee consistency in producing sample locations, leading to a distinct result for each run. 5.2 • The sparse BRDF measurement 79 Figure 5.5 — The FROST pipeline for BRDF sampling and reconstruction. The dictionary is trained for FROST and reused for BRDF reconstruction. FROST finds optimal sampling directions. The sampled BRDF values are used in the reconstruction. We propose a framework addressing this problem, called FROST. FROST reformulates the optimization process under the compressed sensing framework which has a user-defined parameter indicating the required number of samples. Indeed, FROST solves the sensing matrix design problem by finding an optimal sampling pattern based on the input training data. One important property of FROST is the shared optimal samples across the BRDFs. This is how we integrate the Multiple Measurement Vector (MMV) in our sensing model. Due to the compressed sensing capability, FROST optimizes the sample locations deterministically with low computational complexity. Therefore, FROST is designed to be a fast and robust method which resolves the randomness of previous methods. Moreover, to achieve higher efficient sampling, the training BRDFs may be split into multiple categories depending on the corresponding material classes, e.g., diffuse and glossy BRDFs. However, we show here that even without the prior training classification, our method can achieve accurate BRDF reconstructions. To the best of the author’s knowledge, FROST is the first efficient BRDF sampling method with theoretical guarantees. The BRDF sampling pipeline of FROST begins with a BRDF dictionary trained on measured BRDFs. Given the trained dictionary, FROST finds an optimal set of locations by a user-defined parameter for the number of samples. This optimization process is fast and produces a consistent set of optimal locations for BRDF sampling. The obtained locations are used to sample BRDF values of an unobserved BRDF from a BRDF acquisition device. Later, the trained dictionary is reused together with the sampled BRDF values to reconstruct the full BRDF. The illustration of the pipeline is shown in Figure 5.5. Next, we will go through each step of the pipeline in the subsequent parts of this 80 Chapter 5 • Sparse Sampling for BRDF Acquisition subsection. The BRDF dictionary is a part of this process in which it computes the sparse representation of the BRDF signal. The trained dictionary is independent of BRDF parameterization and dimensionality. However, the representation should be sufficiently accurate for the reconstruction. The chosen dictionary here is PCA as it is simple and efficient for training and reconstructing. We then explain the formulation of FROST under the compressed sensing and the MMV model. Lastly, the BRDF reconstruction from sparse measurements is presented. 5.2.2.2 The Dictionary The design choice of the trained dictionary used in FROST is not limited by a single dictionary-learning method. Since the sensing matrix is designed with the capability of invertible dictionary, hence the only requirement is that the dictionary should be invertible. We select PCA as it is a simple model for the dictionary learning. Regarding the high dynamic range property of measured BRDFs, we perform the log-relative mapping introduced by Nielsen et al. [41] before the PCA decomposition. The mapping is expressed as follows ! ρ(ωh , ωd ) · cosMap(ωh , ωd ) + ϵ ρt (ωh , ωd ) = log , ρref (ωh , ωd ) · cosMap(ωh , ωd ) + ϵ (5.7) where cosMap is a weighting function similar to Equation (4.34), ϵ = 0.001, and ρref is the median of the training BRDFs. Recall Equation (4.23), and let X ∈ Rn×t be a BRDF matrix containing a set of t vectorized measured BRDFs for training. Each BRDF has n = 90 × 90 × 180 values on the Half-Diff parameterization. After X is constructed, it is preprocessed by the log-relative mapping followed by mean subtraction to obtain X − µ = T ∈ Rn×t , where µ ∈ Rn×1 is the mean of the transformed X. We then perform a truncated Singular Value Decomposition (SVD) as T = U ΣV T , where U ∈ Rn×p and V ∈ Rt×p are orthonormal basis matrices, and Σ ∈ Rp×p is the diagonal matrix containing the scaling values, where p is the number of singular values. Now let us define the dictionary D = U Σ and its corresponding coefficients as S = V T . We can express the training BRDF decomposition using PCA as follows T = DS + δ, (5.8) where δ is the approximation error of the truncated SVD and it is related to the number of principal components p. 5.2.2.3 Compressed sensing Compressed sensing is a branch of signal processing field which aims for finding solutions of sparse sampling and reconstruction. The required sampling rate of compressed sensing for the exact recovery is far fewer than the Nyquist-Shannon 5.2 • The sparse BRDF measurement 81 sampling theorem [250, 251]. It was originally discovered by Emmanuel Candés [252, 253]. Later, David Donoho [254] published the theoretical framework and used the term Compressed Sensing for the first time. Let us define mathematical definitions necessary for the compressed sensing framework. Let x be a 1D signal of length n which we want to reconstruct from its measurements y. Let Φ ∈ Rm×n be a measurement or sensing matrix mapping from Rn to Rm , where m is the number of samples and m < n. Applying the sensing matrix to the signal, we obtain the signal measurements y = Φx + γ, where γ is the measurement noise. The sensing matrix Φ can be defined as a point sampling operator, i.e., one that selects a predefined subset of elements from x. We use a point sampling operator in designing FROST. We would like to integrate the concept of sparse representations to formulate the underdetertermined system of linear equations. Let D ∈ Rn×k be an overcomplete dictionary, where each column defines a basis vector, then the signal can be expressed as x = Ds under the sparsity constraint ∥s∥0 < τ . Hence the compressed sensing seeks for solving the following optimization, ŝ = arg min ∥s∥0 s.t. ∥y − ΦDs∥22 ≤ ϵc , s (5.9) where ϵc is related to the noise γ. Once the solution ŝ is obtained, the signal can be reconstructed as x̂ = Dŝ. The above equation states that we want to find the sparse coefficients of an individual signal with the known measurements y and the known ΦD. In case we have p signals, i.e., {x(1) , x(2) , ..., x(p) }, each signal is solved individually. This model is called the Single Measurement Vector (SMV) where each signal is recovered independently and without any assumption on the support of s, i.e., the location of non-zero elements in s. In the sparse BRDF measurement, we would like to design an optimal sampling operator so that we can perform the measurement efficiently and effectively. Particularly, having a shared sampling operator for a set of BRDFs, under the compressed sensing model, can facilitate the acquisition process in terms of designing a device and accelerating the measurement. Similar to what was described in Section 5.2.1, the design of the acquisition device can be cost-efficient (two degrees-of-freedom) and time-effective (fast measurements) with the shared sampling locations. To address this problem, we assume that the supports of all signals are the same. This model is known as the Multiple Measurement Vector (MMV) or the Simultaneous Sparse Approximation [255, 256]. Equation (5.9) can be extended naturally to obtain the MMV model. This can be done by replacing the 1D signal and its coefficients with matrices. Let X ∈ Rn×p be a matrix where each column is a vectorized signal and S ∈ Rk×p be a coefficient 82 Chapter 5 • Sparse Sampling for BRDF Acquisition matrix where each column contains a sparse coefficient vector. Thus, the MMV formulates the sparse representation as X = DS. The sensing matrix can be applied explicitly to obtain the measurements of all observed signals Y = ΦX. Here, the compressed sensing under the MMV solves the following optimization problem, Ŝ = arg min ∥S∥r0 s.t. ∥Y − ΦDS∥22 ≤ ϵc , (5.10) S where ∥S∥r0 counts the number of non-zero rows of S. Note that having row sparsity for S implies that all signals in X have the same support set. The optimization problem is solved across all signals simultaneously. There are several algorithms [257, 258, 259, 260, 261] to solve the optimization in Equation (5.10). In our experiments, we employ Simultaneous Orthogonal Matching Pursuit(SOMP) introduced by Tropp et al. [255]. It is a greedy algorithm which is a generalization of Orthogonal Matching Pursuit (OMP [133, 134]). 5.2.2.4 FROST Typically, the BRDF measurement is a point sampling procedure where pairs of directions (a light source and a sensor) are identified and used in the acquisition process. FROST is a framework addressing the efficient BRDF sampling problem for BRDF acquisition. The method determines the optimal sampling directions such that a small number of samples is sufficient for accurate BRDF reconstruction. Here, we introduce terms and expressions by connecting the BRDF acquisition and compressed sensing to formulate FROST. Let b be a vectorized BRDF signal of length n = 90 × 90 × 180 = 1458000 and y = Φb be BRDF samples. Sensing matrix design in the context of compressed sensing is formulated as follows arg min ∥s∥0 s.t. ∥y − ΦDs∥22 ≤ ϵc , and Φ ∈ Ψ, (5.11) Φ where Ψ is a set of all possible sensing matrices. Note that compared to Equation (5.9), here we are solving for Φ rather than S. We address the problem of efficient BRDF sampling by seeking the optimal sensing matrix Φ ∈ Rm×1458000 corresponding to the dictionary D ∈ R1458000×k from the set of possible solutions Ψ as expressed in Equation (5.11). One solution to this problem is heuristic gradient descent algorithms used in [41, 241, 248, 262]. The algorithms draw random samples to initialize the search for a possible solution depending on the metric used in the optimization scheme. Particularly, the method proposed by Nielsen et al. [41] minimizes the condition number of the randomly selected dictionary rows. There are a number of disadvantages for Nielsen et al.’s method. First, the method suffers from the randomness which produces an inconsistent sampling pattern over each run, i.e., it converges to a different local 5.2 • The sparse BRDF measurement 83 minima. Second, the condition number of ΦD does not necessary correlate to the reconstruction error. Third, the heuristic search is a time-consuming and computationally-expensive task due to the randomness and the gradient descent method. Moreover, the search is parameterization dependent which requires extra implementation if another BRDF parameterization is used. FROST overcomes these issues, while providing theoretical guarantees. FROST reformulate the sensing matrix design problem as sparse representation under the MMV model. FROST is deterministic and computationally-inexpensive due to the algorithms, e.g., SOMP, used to solve the MMV model. Moreover, solving for the optimal point sampling pattern by FROST is parameterization independent since the problem is defined on the dictionary space. As mentioned previously, D must be invertible. We start by inverting Equation (5.8) to obtain S = D−1 T − D−1 δ. (5.12) In this context, the terms S and D−1 are known, while T is the unknown variable. Note that rows of T are the locations of measured BRDFs. The term ΦD denotes choosing rows of D as discussed previously in the context of compressed sensing. As a result, for D−1 , finding the support set corresponds to finding the row samples of D. This is the key observation utilized in designing FROST. The objective of this formulation is to find the sparse solution of T with the known D , leading to column selection of D−1 . By substituting S and D−1 in Equation (3.11), see Section 3.2.1, we then can formulate the sensing matrix design problem as follows. Φ = arg min ∥T ∥r0 s.t. ∥S − D−1 T ∥22 ≤ ϵc . (5.13) supp(T ) Notice that the term containing δ is negligible if the reconstruction is not sensitive to the noise, otherwise it is related to the error ϵc in Equation (5.13). Thus, we omit the term from the minimization. We employ SOMP to solve the minimization problem, Equation (5.13). Algorithm 4 below presents SOMP’s pseudo-code. Indeed, FROST requires only two user-defined parameters, where p denotes the number of principal components in case of PCA and m denotes the number of required samples. These two parameters affect FROST’s error, which directly links to the approximation term δ. Figure 5.6 below illustrates the quality of the sensing matrix showing the inverse MSE of reconstruction error when m and p are increased. The higher the inverse MSE is, the more accurate the reconstruction is. We set p = m for simplicity. In fact, Figure 5.6 verifies the expected behavior of p and m as an increased p introduces less approximation error and an increased m provides more information for signal reconstruction. 84 Chapter 5 • Sparse Sampling for BRDF Acquisition 1 2 3 4 5 Input: BRDF coefficient matrix S ∈ Rp×t , BRDF dictionary inverse D−1 ∈ Rp×n , and user-defined number of samples m ≥ 1 Result: Support set supp(T) supp(T )0 ← ∅ ; // Initialize the support set. (0) R ←S ; // Initialize the residual. l←0; while l < m do j ← argmax (D−1 )Ti R(l−1) ; // Atom selection 1 i 6 7 8 9 supp(T)l ← supp(T)l−1 ∪ {j} ; R(l) ← S − (D−1 )(D−1 supp(T) supp(T) l+1 † l+1 )S; // Support set update // Residual update l ← l+1 ; end 140 195 185 175 165 155 145 135 125 115 105 95 85 75 65 55 45 35 25 15 5 120 100 80 60 40 20 5 15 25 35 45 55 65 75 85 95 10 5 11 5 12 5 13 5 14 5 15 5 16 5 17 5 18 5 19 5 number of samples Algorithm 4: Simultaneous Orthogonal Matching Pursuit (SOMP) for solving Equation (5.13) principal components Figure 5.6 — Inverse MSE of reconstructed BRDFs according to the number of principal component p and the number of samples m. The obtained sensing matrix Φ can be used to strategically place sampling locations for acquiring an unseen BRDF. More importantly, FROST reuses Φ in the BRDF reconstruction step as the final procedure of the pipeline (Figure 5.5) to recover an unseen BRDF signal. 5.2.2.5 BRDF Reconstruction So far, we have described the training details and the optimal sensing matrix solution with the FROST framework. This section provides the details of the 5.2 • The sparse BRDF measurement 85 BRDF signal recovery in the pipeline. The BRDF reconstruction requires two steps. First, we acquire an unobserved BRDF according to m paris of directions identified by Φ. The sampled BRDF is expressed as b̂ = Φb. Second, we apply Φ on to the trained dictionary to get D̂ = ΦD. Since we set m = p, the BRDF reconstruction is then defined as a least square problem as follows arg min ∥b̂ − ΦDc∥22 , c (5.14) where c is the BRDF coefficients to be obtained, i.e., the corresponding coefficients for principal components. The least square problem can be solved by using ridge regression as suggested in [41, 263]. This results in the coefficient vector c, which is used to reconstruct the BRDF b̃ = Dc. However, in case we have m < p, the minimization problem becomes solving an underdetermined system. The solution can be found using the ℓ0 and ℓ1 regularizers as previously discussed in Chapter 3 and Section 5.2.2.3. 5.2.2.6 Results We evaluated FROST in comparison to the method proposed by Nielsen et al. [41] as it is the state-of-the-art of for BRDF point sampling. Due to the limited number of measured BRDFs, we have combined MERL [31] and EPFL [35] databases to enlarge the training size. Moreover, we conducted a 10-fold cross-validation by randomly splitting training and testing materials. For solving the MMV minimization problem, Equation (5.13), we employed SOMP implemented by Jean-François Determe [264] in MATLAB. Since the BRDF sampling locations of each run produced by Nielsen et al.’s method are inconsistent, we ran their method for 5 trials. The summary of the overall reconstruction quality with respect to the number of samples is reported below in Figure 5.7(a), starting from m = 5 to 60 with an incremental step of 5. Each line represents the average reconstruction errors of the 10-fold cross-validation where the yellow line indicates the errors from Nielsen et al.’s method. With an equal size for the training set, our method significantly outperforms Nielsen et a.l [41], especially when the number of samples is increased. The results indicate that when the training size is reduced from t = 131 materials (blue) to t = 80 materials (red), our result is comparable to Nielsen et al.’s method, when it is trained on t = 131 materials. We thus expect the higher reconstruction accuracy when a larger training set is used compared to Nielsen et a.l [41]. Hence, this plot motivates the use of FROST over Nielsen et al.’s method even if the training set is smaller. Figure 5.7(b) demonstrates the robustness of the proposed method in more details. The high variations of each box from Nielsen et al’s method (green) clearly present the wide spectrum of its reconstruction quality due to randomness. While Nielsen et al.’s method produces vastly different results for 86 Chapter 5 • Sparse Sampling for BRDF Acquisition 25 200 FROST-BRDF Nielsen et al. FROST-BRDF with t=131 FROST-BRDF with t=80 Nielsen et al. with t=131 150 Inverse MSE Inverse MSE 20 15 10 50 5 0 100 0 10 20 30 40 Number of samples 50 60 (a) The average reconstruction errors of each method with respect to the number of samples. 0 rs Ou al Tri 1 al Tri 2 al Tri 3 al Tri 4 al Tri 5 ge era Av (b) The reconstruction errors of each method on an individual test set when m = 40 samples. Figure 5.7 — Plots of experimental results between Nielsen et al. [41] and FROST. The left plot (a) illustrates the reconstruction accuracy (inverse MSE) when the number of samples is increased. The errors are average of all 10-fold cross-validation tests. The right plot shows the statistical quantities of the reconstruction accuracy. Each box represents 25 percentile (bottom of the box), 75 percentile (head of the box), maximum value (top of the dashed line), minimum value (bottom of the dashed line) and its mean (star symbol inside the box). each run leading to unstable reconstruction quality, FROST produces more reliable results. m FROST Nielsen et al. 2015 5 0.14 76.01 10 0.47 248.67 20 1.86 772.74 30 4.21 1600.46 40 7.49 2772.74 50 12.05 4357.08 60 17.67 6366.88 Table 5.1 — Average optimization time measured in seconds. Furthermore, FROST solves the minimization problem in seconds which is about 360 times faster than Nielsen et al.’s method at m = 60 samples. This is clearly seen in Table 5.1 where we report the optimization time in seconds. FROST demonstrates significant advantages in accelerating the sensing matrix pattern search compared to the random heuristic search proposed by Nielsen et al.’s method. Moreover, the heuristic search requires traversing inside the parameterization, therefore it needs extra implementation if another parameterization is used. When a small number of samples is used in the sampling step, sometimes ring artifacts appear on the rendered images of reconstructed BRDFs. This was also reported in [41] where they suggested to either separate diffuse and glossy materials 5.2 • The sparse BRDF measurement 87 Reference Nielsen et al. [41] FROST-BRDF Figure 5.8 — Rendered images of the reconstructed vch-silk-blue-rgb with m = 10 samples. The red boxes demonstrate the same regions, magnified on the right corner, where the ring artifacts noticeably appear on Nielsen et al.’s method. Each absolute error image is placed on the right column of each method. in training or increase the number of samples to attenuate these artifacts. However, the results obtained from FROST show much less severity in terms of ringing artifacts. We believe that one root cause of such artifacts is related to local minima which is often found by the random heuristic search. We provided a clear example of this type of artifacts in Figure 5.8. The figure presents the rendered images of vch-silk-blue-rgb when the BRDF is acquired with 10 samples. The ring artifacts in the red regions appear on Nielsen et al.’s method. Figure 5.9 provides three examples of reconstructed BRDFs. A diffuse material, green-latex, a glossy material, gold-metallic-paint2, and a layered material, cmwhite-rgb, were reconstructed with m = 20 samples. Most of the errors appear on highlights and grazing angle regions. This is due to the fact that such regions are difficult to measure and sensitive to noise both from sensor and the measurement methods. However, FROST still achieves higher reconstruction quality than the current state-of-the-art. Particularly for the layered material, which is rather challenging to represent and acquire, FROST performs quite effectively. Note that each error image is normalized and multiplied by 10 for visual comparisons. 88 Chapter 5 • Sparse Sampling for BRDF Acquisition gold-metallic-paint2 cm-white-rgb Nielsen green-latex 34.45dB 25.75dB 51.62dB 41.76dB 32.16dB Reference FROST-BRDF 44.98dB Figure 5.9 — Three examples of reconstruced BRDFs, green-latex, goldmetallic-paint2 and cm-white-rgb, from each method with m = 20 samples. The first row shows reconstructed BRDFs from Nielsen et al.’s method [41] while the second row shows reconstructed BRDFs from FROST. The last row shows the reference fro each material. The absolute error image is presented on the right column of each material where the error scale is on the right most column. We also report rendering SNR values under each of reconstructed BRDFs. 5.3 Summary and Future Work This chapter presented sparse BRDF measurement methods using two proposed models. In Section 5.2.1, we utilized the circular symmetry found in the PDV characteristics for the BRDF factor measurement. The PDV parameterization thus enables the 2D BRDF separable model to faciliate the measurements with two simple acquisition configurations. As a result, the acquisition device which permits only two degrees-of-freedom can be used to acquire and reconstruct a BRDF quite faithfully, specifically for glossy BRDFs. In Section 5.2.2, we proposed the sparse BRDF measurement which adopts compressed sensing as a sensing matrix design framework, called FROST. FROST reformulates the efficient BRDF point sampling measurement into a sensing matrix design problem. To this end, SOMP was employed to find the optimal sampling pattern. This leads to a fast and robust method compared to the current state-of-the-art while we also achieve higher BRDF reconstruction quality. In the future, as the digital trend keeps growing, the sparse reflectance measurement 5.3 • Summary and Future Work 89 will become more important. More compact BRDF representations and intuitive BRDF parameterizations could be keys and a future venue for BRDF modeling and acquisition systems, as shown in Section 4.3.1 and Section 5.2.1. The adaptive PDV parameterization admitting other radial symmetries than circular shape is also an interesting venue which might be flexible and suitable for more complex reflectance data. Moreover, we would like to extend and further study FROST for more complex reflectance data and diverse types of dictionaries. We believe that FROST initiates a new research direction for not only BRDF acquisition but also any high-dimensional visual data. Furthermore, we would like to further study FROST for capturing in the wild. Chapter 6 Summary and Outlook This thesis has presented several contributions in two major research areas. i.e., appearance modeling and BRDF sampling. The content of this thesis was organized into three major parts. The first part (Chapter 1) introduced the readers to the fundamentals of computer graphics, the research questions which are addressed by this thesis, and the overview of contributions. The second part (Chapter 2 and Chapter 3) provided preliminaries of BRDFs and relevant methods. The components of the rendering pipeline, e.g., light transport algorithms, surface reflection and data acquisition, as well as data-driven methods, e.g., tensor decompositions and sparse representation, were given to the readers to develop familiarity with the research content. The third part (Chapter 4 and Chapter 5) has presented the proposed models for answering the research questions we set out to address. The data-driven models are the primary key to overcome the issues on both BRDF representation and acquisition. This final chapter will provide a snapshot of thesis’s contributions where the contents get consolidated and some future research directions are discussed. For BRDF modeling and representation, we identified that non-parametric BRDF models form a great basis for modeling of captured real-world BRDFs. The first set of BRDF models was achieved by an iterative CPD method. Our factorized BRDF models can efficiently and accurately represent isotropic BRDFs with support of importance sampling. The motivation behind our factorized BRDF models was based on observations made when analyzing the PDV parameterization. The second set of BRDF models was motivated by the recent results of multi-dimensional dictionary learning. We obtained promising results using sparse representation and 92 Chapter 6 • Summary and Outlook dictionary learning. By learning the dictionaries from material databases, we were able to encode BRDFs with a flexible number of coefficients. Moreover, BRDF interpolation and editing techniques were proposed. The differential editing technique facilitated measured BRDF editing regarding to the BRDF parametric fitting. We also derived the BRDF interpolation technique on our learned dictionaries. The practicality of BRDF acquisition has long been a problem for acquiring BRDFs. We addressed this problem with two sparse acquisition models. The first acquisition setup adopted the factorized BRDF model and the observations made under PDV parameterization. As a result, we were able to efficiently recover isotropic BRDFs, especially glossy materials, with only two factor measurements. The second BRDF acquisition model was formulated using compressed sensing as the underlying framework. We proposed a BRDF acquisition framework (FROST) which reformulated the problem into finding the optimal sampling pattern. FROST employed the dictionary learning method and SOMP algorithm enabling a fast and robust optimal sampling pattern. Regarding the proposed models, we find the connections between BRDF modeling and acquisition. The essence of sparse BRDF sampling and reconstruction has addressed both BRDF representation and acquisition as a single research goal. In most cases, BRDF models are not physically interpretable to the real measurements. In contrast to sparse BRDF acquisition, the addressable methods tend to relate sparse measurements with measurement models, e.g., signal recovery. The proposed sparse BRDF acquisition models have shown accurate BRDF reconstruction results, however, our BRDF models still perform much better in terms of data compression or representation. We believe that the sparse BRDF sampling is a future trend in the development of BRDF research. Furthermore, the number of measured BRDFs is a key for data-driven approaches. We would like to apply FROST to create a large collection of materials for future research. We also would like to improve and extend FROST to other reflectance data such as SVBRDF and BTF as our future contribution. High-dimensional visual data can not only be seen in surface reflectance data but also other domains, e.g., medical images, light field images and spectral images. Data compression, representation and reconstruction are essential for developing the pipelines related to other domain applications. Here, we believe that tensor decompositions, sparse representation, and compressed sensing are important tools with strong mathematical foundations. These frameworks are classical machine learning approaches as compared to deep learning, modern machine learning. The integrations between classical frameworks and modern high-computational learning approaches are complements to each other and their cross-fertilization is an interesting venue for future work. Bibliography [1] A. Tsirikoglou, J. Kronander, P. Larsson, T. Tongbuasirilai, A. Gardner, and J. Unger, “Differential appearance editing for measured brdfs,” in ACM SIGGRAPH 2016 Talks, ser. SIGGRAPH ’16. New York, NY, USA: Association for Computing Machinery, 2016. [2] T. Tongbuasirilai, J. Unger, and M. Kurt, “Efficient brdf sampling using projected deviation vector parameterization,” in 2017 IEEE International Conference on Computer Vision Workshop (ICCVW). Los Alamitos, CA, USA: IEEE Computer Society, oct 2017, pp. 153–158. [page 31] [3] T. Tongbuasirilai, J. Unger, J. Kronander, and M. Kurt, “Compact and intuitive data-driven brdf models,” The Visual Computer, vol. 36, no. 4, pp. 855–872, 2020. [pages 28, 31, 60, and 63] [4] T. Tongbuasirilai, J. Unger, C. Guillemot, and E. Miandji, “A sparse nonparametric brdf model,” ACM Trans. Graph., apr 2022. [page 31] [5] E. Miandji, T. Tongbuasirilai, S. Hajisharif, B. Kavoosighafi, and J. Unger, “Frost-brdf : A fast and robust optimal sampling technique for brdf acquisition,” To be sumitted to ACM Trans. Graph., 2023. [6] T. Wilaiprasitporn, A. Ditthapron, K. Matchaparn, T. Tongbuasirilai, N. Banluesombatkul, and E. Chuangsuwanich, “Affective eeg-based person identification using the deep learning approach,” IEEE Transactions on Cognitive and Developmental Systems, vol. 12, no. 3, pp. 486–496, 2020. [7] A. Tsirikoglou, G. Eilertsen, and J. Unger, “A survey of image synthesis methods for visual machine learning,” Computer Graphics Forum, vol. 39, no. 6, pp. 426–451, 2020. [page 2] [8] T. Whitted, “An improved illumination model for shaded display,” Commun. ACM, vol. 23, no. 6, pp. 343–349, jun 1980. [pages 3 and 4] [9] M. Pharr, W. Jakob, and G. Humphreys, Physically Based Rendering: From Theory to Implementation, 3rd ed. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc., 2016. [pages 3, 4, 21, 22, and 60] 96 Bibliography [10] B. Bitterli, “Rendering resources,” 2016, https://benedikt-bitterli.me/ resources/. [page 4] [11] J. T. Kajiya, “The rendering equation,” vol. 20, no. 4, pp. 143–150, aug 1986. [pages 4 and 20] [12] J. Menon, B. Wyvill, C. Bajaj, J. Bloomenthal, B. Guo, J. Hart, G. Wyvill, and C. Bajaj, “Implicit surfaces for geometric modeling and computer graphics,” ACM SIGGRAPH-96 Course Notes, pp. 4–9, 1996. [page 5] [13] A. Opalach and S. Maddock, “An overview of implicit surfaces,” 05 1995. [page 5] [14] J. Flórez, M. Sbert, M. Sainz, and J. Vehí, “Improving the interval ray tracing of implicit surfaces,” vol. 4035, 01 2006, pp. 655–664. [page 5] [15] B. R. de Araújo, D. S. Lopes, P. Jepp, J. A. Jorge, and B. Wyvill, “A survey on implicit surface polygonization,” vol. 47, no. 4, may 2015. [page 5] [16] G. Turk and M. Levoy, “Zippered polygon meshes from range images,” in Proceedings of the 21st Annual Conference on Computer Graphics and Interactive Techniques, ser. SIGGRAPH ’94. New York, NY, USA: Association for Computing Machinery, 1994, pp. 311–318. [page 6] [17] P. E. Debevec, C. J. Taylor, and J. Malik, “Modeling and rendering architecture from photographs: A hybrid geometry- and image-based approach,” in Proceedings of the 23rd Annual Conference on Computer Graphics and Interactive Techniques, ser. SIGGRAPH ’96. New York, NY, USA: Association for Computing Machinery, 1996, pp. 11–20. [page 6] [18] H. P. A. Lensch, J. Kautz, M. Goesele, W. Heidrich, and H.-P. Seidel, “Imagebased reconstruction of spatial appearance and geometric detail,” ACM Trans. Graph., vol. 22, no. 2, pp. 234–257, apr 2003. [pages 6 and 78] [19] J. I. Ebert, “Chapter 3 - photogrammetry, photointerpretation, and digital imaging and mapping in environmental forensics,” in Introduction to Environmental Forensics (Third Edition), third edition ed., B. L. Murphy and R. D. Morrison, Eds. San Diego: Academic Press, 2015, pp. 39–64, https:// www.sciencedirect.com/science/article/pii/B9780124046962000035. [page 6] [20] T. Lescoat, M. Ovsjanikov, P. Memari, J.-M. Thiery, and T. Boubekeur, “A survey on data-driven dictionary-based methods for 3d modeling,” Computer Graphics Forum, vol. 37, no. 2, pp. 577–601, 2018. [page 6] [21] L. Xu, R. Wang, J. Zhang, Z. Yang, J. Deng, F. Chen, and L. Liu, “Survey on sparsity in geometric modeling and processing,” Graph. Models, vol. 82, no. C, p. 160–180, nov 2015. [page 6] Bibliography 97 [22] K. Xu, V. G. Kim, Q. Huang, N. Mitra, and E. Kalogerakis, “Data-driven shape analysis and processing,” in SIGGRAPH ASIA 2016 Courses, ser. SA ’16. New York, NY, USA: Association for Computing Machinery, 2016. [page 6] [23] E. Veach, “Robust monte carlo methods for light transport simulation,” Ph.D. dissertation, Stanford, CA, USA, 1998, aAI9837162. [page 6] [24] B. E. A. Saleh and M. C. Teich, Fundamentals of photonics; 1st ed. New York, NY: Wiley, 1991. [page 6] [25] H. W. Jensen, Realistic Image Synthesis Using Photon Mapping. USA: A. K. Peters, Ltd., 2001. [pages 6 and 25] [26] B. T. Phong, “Illumination for computer generated pictures,” Commun. ACM, vol. 18, no. 6, pp. 311–317, jun 1975. [pages 7 and 48] [27] J. F. Blinn, “Models of light reflection for computer synthesized pictures,” in Proceedings of the 4th Annual Conference on Computer Graphics and Interactive Techniques, ser. SIGGRAPH ’77. New York, NY, USA: Association for Computing Machinery, 1977, pp. 192–198. [pages 7, 29, and 48] [28] R. L. Cook and K. E. Torrance, “A reflectance model for computer graphics,” ACM Trans. Graph., vol. 1, no. 1, pp. 7–24, jan 1982. [pages 7, 29, and 51] [29] “Curet: Columbia-utrecht reflectance and texture database,” http://www1. cs.columbia.edu/CAVE/software/curet/. [pages 7 and 36] [30] “Cornell reflectance data,” http://www.graphics.cornell.edu/online/ measurements/reflectance/index.html/. [pages 7 and 36] [31] W. Matusik, H. Pfister, M. Brand, and L. McMillan, “A data-driven reflectance model,” ACM Transactions on Graphics, vol. 22, no. 3, pp. 759–769, jul 2003. [pages 7, 28, 34, 35, 47, 54, 55, 56, 64, 67, 73, and 85] [32] V. R. Haindl M., Filip J., “Digital material appearance: the curse of terabytes,” ERCIM News, no. 90, pp. 49–50, 2012. [page 7] [33] J. Filip, M. Kolafová, M. Havlíček, R. Vávra, M. Haindl, and R. H., “Evaluating Physical and Rendered Material Appearance,” The Visual Computer (Computer Graphics International 2018), 2018. [pages 7 and 36] [34] M. Weinmann, J. Gall, and R. Klein, “Material classification based on training data synthesized using a btf database,” in Computer Vision - ECCV 2014, D. Fleet, T. Pajdla, B. Schiele, and T. Tuytelaars, Eds. Cham: Springer International Publishing, 2014, pp. 156–171. [page 7] 98 Bibliography [35] J. Dupuy and W. Jakob, “An adaptive parameterization for efficient material acquisition and rendering,” Transactions on Graphics (Proceedings of SIGGRAPH Asia), vol. 37, no. 6, pp. 274:1–274:18, nov 2018. [pages 7, 27, 34, 35, 64, 67, 78, and 85] [36] D. Guarnera, G. C. Guarnera, A. Ghosh, I. Hall, and M. Glencross, “Capturing and representing brdfs for virtual reality,” in SIGGRAPH ASIA 2016 Courses, ser. SA ’16. New York, NY, USA: Association for Computing Machinery, 2016. [page 8] [37] G. C. Guarnera, A. Ghosh, I. Hall, M. Glencross, and D. Guarnera, “Material capture and representation with applications in virtual reality,” in ACM SIGGRAPH 2017 Courses, ser. SIGGRAPH ’17. New York, NY, USA: Association for Computing Machinery, 2017. [page 8] [38] S. Rusinkiewicz, “A new change of variables for efficient brdf representation.” in Rendering Techniques, ser. Eurographics, G. Drettakis and N. L. Max, Eds. Springer, 1998, pp. 11–22. [pages 10 and 28] [39] J. Lawrence, S. Rusinkiewicz, and R. Ramamoorthi, “Efficient brdf importance sampling using a factored representation,” in ACM SIGGRAPH 2004 Papers, ser. SIGGRAPH ’04. New York, NY, USA: Association for Computing Machinery, 2004, pp. 496–505. [pages 10, 21, 29, 54, 56, 60, and 65] [40] A. Bilgili, A. Öztürk, and M. Kurt, “A general brdf representation based on tensor decomposition,” Computer Graphics Forum, vol. 30, no. 8, pp. 2427–2439, 2011. [pages 10, 29, 31, 54, 56, and 60] [41] J. B. Nielsen, H. W. Jensen, and R. Ramamoorthi, “On optimal, minimal brdf sampling for reflectance acquisition,” ACM Transactions on Graphics (TOG), vol. 34, no. 6, pp. 186:1–186:11, November 2015. [pages 11, 36, 55, 63, 65, 67, 78, 80, 82, 85, 86, 87, and 88] [42] G. E. Stillman, “21 - optoelectronics,” in Reference Data for Engineers (Ninth Edition), ninth edition ed., W. M. Middleton and M. E. Van Valkenburg, Eds. Woburn: Newnes, 2002, pp. 21–1–21–31. [page 15] [43] G. B. Arfken and H. J. Weber, Mathematical methods for physicists; 4th ed. San Diego, CA: Academic Press, 1995. [page 19] [44] F. Nicodemus, J. Richmond, and J. Hsia, Geometrical Considerations and Nomenclature for Reflectance, ser. (National Bureau of Standards monograph). U.S. Government Printing Office, 1977. [page 23] Bibliography 99 [45] B. Hapke, Theory of Reflectance and Emittance Spectroscopy. Cambridge University Press, 2012. [page 24] [46] C. Donner, J. Lawrence, R. Ramamoorthi, T. Hachisuka, H. W. Jensen, and S. Nayar, “An empirical bssrdf model,” ACM Trans. Graph., vol. 28, no. 3, jul 2009. [page 25] [47] A. King, C. Kulla, A. Conty, and M. Fajardo, “Bssrdf importance sampling,” in ACM SIGGRAPH 2013 Talks, ser. SIGGRAPH ’13. New York, NY, USA: Association for Computing Machinery, 2013. [page 25] [48] P. H. Christensen, “An approximate reflectance profile for efficient subsurface scattering,” in ACM SIGGRAPH 2015 Talks, ser. SIGGRAPH ’15. New York, NY, USA: Association for Computing Machinery, 2015. [page 25] [49] L.-Q. Yan, W. Sun, H. W. Jensen, and R. Ramamoorthi, “A bssrdf model for efficient rendering of fur with global illumination,” ACM Trans. Graph., vol. 36, no. 6, nov 2017. [page 25] [50] C. Castillo, J. López-Moreno, and C. Aliaga, “Recent advances in fabric appearance reproduction,” Computers and Graphics, vol. 84, pp. 103–121, 2019. [page 25] [51] Y. Gitlina, G. C. Guarnera, D. S. Dhillon, J. Hansen, A. Lattas, D. Pai, and A. Ghosh, “Practical measurement and reconstruction of spectral skin reflectance,” Computer Graphics Forum, vol. 39, no. 4, pp. 75–89, 2020. [page 25] [52] M. Haindl and J. Filip, Bidirectional Texture Functions. London: Springer London, 2013, pp. 147–209. [page 26] [53] J. Filip and M. Haindl, “Bidirectional texture function modeling: A state of the art survey,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 31, no. 11, pp. 1921–1940, 2009. [page 26] [54] J. Kautz and M. D. McCool, “Interactive rendering with arbitrary brdfs using separable approximations,” in Proceedings of the 10th Eurographics Conference on Rendering, ser. EGWR’99. Goslar, DEU: Eurographics Association, 1999, pp. 247–260. [pages 28 and 54] [55] R. Roland and K. Reinhard, “A compact and editable representation for measured BRDFs,” University of Bonn, Techreport CG-2010-1, 2010. [pages 28 and 31] [56] G. J. Ward, “Measuring and modeling anisotropic reflection,” in Proceedings of the 19th Annual Conference on Computer Graphics and Interactive 100 Bibliography Techniques, ser. SIGGRAPH ’92. New York, NY, USA: Association for Computing Machinery, 1992, pp. 265–272. [pages 29, 36, and 49] [57] M. Ashikhmin and P. Shirley, “An anisotropic phong brdf model,” Journal of Graphics Tools, vol. 5, no. 2, pp. 25–32, 2000. [pages 29 and 50] [58] A. Ngan, F. Durand, and W. Matusik, “Experimental Analysis of BRDF Models,” in Eurographics Symposium on Rendering (2005), K. Bala and P. Dutre, Eds. The Eurographics Association, 2005. [pages 29, 31, 33, 34, and 56] [59] L. Neumann, A. Neumann, and L. Szirmay-Kalos, “Reflectance Models with Fast Importance Sampling,” Computer Graphics Forum, 1999. [pages 29, 49, 50, and 56] [60] J. Löw, J. Kronander, A. Ynnerman, and J. Unger, “Brdf models for accurate and efficient rendering of glossy surfaces,” ACM Trans. Graph., vol. 31, no. 1, feb 2012. [pages 29, 31, 32, 33, 52, 56, 63, 74, and 77] [61] M. D. McCool, J. Ang, and A. Ahmad, “Homomorphic factorization of brdfs for high-performance rendering,” in Proceedings of the 28th Annual Conference on Computer Graphics and Interactive Techniques, ser. SIGGRAPH ’01. New York, NY, USA: Association for Computing Machinery, 2001, pp. 171–178. [pages 30, 31, and 54] [62] D. Edwards, S. Boulos, J. Johnson, P. Shirley, M. Ashikhmin, M. Stark, and C. Wyman, “The halfway vector disk for brdf modeling,” ACM Trans. Graph., vol. 25, no. 1, pp. 1–18, jan 2006. [pages 30 and 50] [63] M. Stark, J. Arvo, and B. Smits, “Barycentric parameterizations for isotropic brdfs,” IEEE Transactions on Visualization and Computer Graphics, vol. 11, no. 2, pp. 126–138, 2005. [page 30] [64] P. Barla, L. Belcour, and R. Pacanowski, “In praise of an alternative brdf parametrization,” ser. MAM ’15. Goslar, DEU: Eurographics Association, 2015, pp. 9–13. [page 30] [65] I. Mallett and C. Yuksel, “Parameterization of tabulated brdfs,” in Proceedings of the 33rd Computer Graphics International, ser. CGI ’16. New York, NY, USA: Association for Computing Machinery, 2016, pp. 45–48. [page 30] [66] B. Cabral, N. Max, and R. Springmeyer, “Bidirectional reflection functions from surface bump maps,” ser. SIGGRAPH ’87. New York, NY, USA: Association for Computing Machinery, 1987, pp. 273–281. [pages 30 and 54] Bibliography 101 [67] S. H. Westin, J. R. Arvo, and K. E. Torrance, “Predicting reflectance functions from complex surfaces,” SIGGRAPH Comput. Graph., vol. 26, no. 2, pp. 255–264, jul 1992. [pages 30, 54, and 65] [68] J. Kautz, P.-P. Sloan, and J. Snyder, “Fast, arbitrary brdf shading for low-frequency lighting using spherical harmonics,” in Proceedings of the 13th Eurographics Workshop on Rendering, ser. EGRW ’02. Goslar, DEU: Eurographics Association, 2002, pp. 291–296. [pages 30, 54, and 65] [69] P. Gautron, J. Krivanek, S. Pattanaik, and K. Bouatouch, “A Novel Hemispherical Basis for Accurate and Efficient Rendering,” in Eurographics Workshop on Rendering, A. Keller and H. W. Jensen, Eds. The Eurographics Association, 2004. [page 30] [70] H. Xu and Y. Sun, “Compact representation of spectral brdfs using fourier transform and spherical harmonic expansion,” Computer Graphics Forum, vol. 25, no. 4, pp. 759–775, 2006. [pages 30, 54, and 65] [71] W. Jakob, E. d’Eon, O. Jakob, and S. Marschner, “A comprehensive framework for rendering layered materials,” ACM Trans. Graph., vol. 33, no. 4, jul 2014. [pages 30 and 54] [72] G. Fechner, D. Howes, and E. Boring, Elements of Psychophysics [Elemente Der Psychophysik], ser. Henry Holt editions in psychology. Holt, 1966, no. v. 1. [page 31] [73] J. A. Ferwerda, S. N. Pattanaik, P. Shirley, and D. P. Greenberg, “A model of visual adaptation for realistic image synthesis,” in Proceedings of the 23rd Annual Conference on Computer Graphics and Interactive Techniques, ser. SIGGRAPH ’96. New York, NY, USA: Association for Computing Machinery, 1996, pp. 249–258. [page 31] [74] T. Sun, H. W. Jensen, and R. Ramamoorthi, “Connecting measured brdfs to analytic brdfs by data-driven diffuse-specular separation,” ACM Transactions on Graphics (TOG), vol. 37, no. 6, p. 273, 2018. [pages 31, 55, 63, and 65] [75] E. Reinhard, G. Ward, S. Pattanaik, and P. Debevec, High Dynamic Range Imaging: Acquisition, Display, and Image-Based Lighting, ser. The Morgan Kaufmann Series in Computer Graphics. Elsevier Science, 2005. [page 33] [76] G. Eilertsen, “The high dynamic range imaging pipeline : Tone-mapping, distribution, and single-exposure reconstruction,” Ph.D. dissertation, Linköping UniversityLinköping University, Media and Information Technology, Faculty of Science and Engineering, 2018. [page 33] 102 Bibliography [77] J. Bieron and P. Peers, “An adaptive brdf fitting metric,” Computer Graphics Forum, vol. 39, no. 4, pp. 59–74, 2020. [pages 33, 34, 63, and 66] [78] A. Forés, J. Ferwerda, and J. Gu, “Toward a perceptually based metric for brdf modeling,” Final Program and Proceedings - IS and T/SID Color Imaging Conference, vol. CIC’12, pp. 142–148, 01 2012. [page 34] [79] G. Lavoué, N. Bonneel, J.-P. Farrugia, and C. Soler, “Perceptual quality of BRDF approximations: dataset and metrics,” Computer Graphics Forum, vol. 40, no. 2, pp. 327–338, May 2021. [pages 34, 63, and 66] [80] P. Vangorp, J. Laurijssen, and P. Dutré, “The influence of shape on the perception of material reflectance,” in ACM SIGGRAPH 2007 Papers, ser. SIGGRAPH ’07. New York, NY, USA: Association for Computing Machinery, 2007, pp. 77–es. [page 34] [81] S. R. Marschner, S. H. Westin, E. P. F. Lafortune, K. E. Torrance, and D. P. Greenberg, “Image-based brdf measurement including human skin,” in Rendering Techniques’ 99, D. Lischinski and G. W. Larson, Eds. Vienna: Springer Vienna, 1999, pp. 131–144. [pages 34 and 35] [82] S. R. Marschner, S. H. Westin, E. P. F. Lafortune, and K. E. Torrance, “Image-based bidirectional reflectance distribution function measurement,” Appl. Opt., vol. 39, no. 16, pp. 2592–2600, Jun 2000. [page 34] [83] J. Filip and R. Vávra, “Template-based sampling of anisotropic BRDFs,” Computer Graphics Forum, vol. 33, no. 7, pp. 91–99, October 2014. [page 36] [84] J. Filip, R. Vavra, M. Haindl, P. Zid, M. Krupicka, and V. Havran, “Brdf slices: Accurate adaptive anisotropic appearance acquisition,” in Proceedings of the 26th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2013, June 2013, pp. 4321–4326. [page 36] [85] R. Vávra and J. Filip, “Adaptive slices for acquisition of anisotropic BRDF,” Computational Visual Media, no. 4, pp. 55–69, 2018. [page 36] [86] A. Ghosh, W. Heidrich, S. Achutha, and M. O’Toole, “A basis illumination approach to brdf measurement,” International journal of computer vision, vol. 90, no. 2, pp. 183–197, 2010. [page 36] [87] Y. Mukaigawa, K. Sumino, and Y. Yagi, “Multiplexed illumination for measuring brdf using an ellipsoidal mirror and a projector,” in Computer Vision - ACCV 2007, Y. Yagi, S. B. Kang, I. S. Kweon, and H. Zha, Eds. Berlin, Heidelberg: Springer Berlin Heidelberg, 2007, pp. 246–257. [page 36] Bibliography 103 [88] T. G. Kolda and B. W. Bader, “Tensor decompositions and applications,” SIAM Review, vol. 51, no. 3, pp. 455–500, 2009. [pages 38, 42, 64, and 70] [89] M. Itskov, Tensor Algebra and Tensor Analysis for Engineers: With Applications to Continuum Mechanics, ser. Mathematical Engineering. Springer Berlin Heidelberg, 2012. [page 38] [90] T. Needham, Visual Differential Geometry and Forms: A Mathematical Drama in Five Acts. Princeton University Press, 2021. [page 38] [91] J. N. Reddy, Principles of Continuum Mechanics: A Study of Conservation Principles with Applications. Cambridge University Press, 2010. [page 38] [92] V. de Silva and L.-H. Lim, “Tensor rank and the ill-posedness of the best low-rank approximation problem,” SIAM Journal on Matrix Analysis and Applications, vol. 30, no. 3, pp. 1084–1127, 2008. [page 38] [93] F. L. Hitchcock, “The expression of a tensor or a polyadic as a sum of products,” Journal of Mathematics and Physics, vol. 6, no. 1-4, pp. 164–189, 1927. [page 40] [94] F. L. Hitchcock, “Multiple invariants and generalized rank of a p-way matrix or tensor,” Journal of Mathematics and Physics, vol. 7, no. 1-4, pp. 39–79, 1928. [page 40] [95] R. B. Cattell, ““parallel proportional profiles” and other principles for determining the choice of factors by rotation,” Psychometrika, pp. 267–283, 1944. [page 40] [96] R. B. Cattell, “The three basic factor-analytic research designs—their interrelations and derivatives.” Psychological Bulletin, pp. 499–520, 1952. [page 40] [97] R. Harshman, Foundations of the Parafac Procedure: Models and Conditions for an "explanatory" Multimodal Factor Analysis, ser. UCLA working papers in phonetics. University of California at Los Angeles, 1970. [page 40] [98] J. D. Carroll, “Analysis of individual differences in multidimensional scaling via an n-way generalization of “eckart-young” decomposition,” Psychometrika, pp. 283–319, 1970. [page 40] [99] J. Mocks, “Topographic components model for event-related potentials and some biophysical considerations,” IEEE Transactions on Biomedical Engineering, vol. 35, no. 6, pp. 482–484, 1988. [page 40] [100] H. A. L. Kiers, “Towards a standardized notation and terminology in multiway analysis,” Journal of Chemometrics, vol. 14, no. 3, pp. 105–122, 2000. [page 40] 104 Bibliography [101] J. B. Kruskal, “Three-way arrays: rank and uniqueness of trilinear decompositions, with application to arithmetic complexity and statistics,” Linear Algebra and its Applications, vol. 18, no. 2, pp. 95–138, 1977. [page 41] [102] J. B. Kruskal, “Rank, decomposition, and uniqueness for 3-way and n -way arrays,” 1989. [page 41] [103] N. Sidiropoulos and R. Bro, “On the uniqueness of multilinear decomposition of n-way arrays,” Journal of Chemometrics - J CHEMOMETR, vol. 14, pp. 229–239, 05 2000. [page 41] [104] J. M. F. ten Berge, “The typical rank of tall three-way arrays,” Psychometrika, vol. 65, pp. 525–532, 2000. [page 41] [105] X. Liu and N. Sidiropoulos, “Cramer-rao lower bounds for low-rank decomposition of multidimensional arrays,” IEEE Transactions on Signal Processing, vol. 49, no. 9, pp. 2074–2086, 2001. [page 41] [106] L. De Lathauwer, “A link between the canonical decomposition in multilinear algebra and simultaneous matrix diagonalization,” SIAM Journal on Matrix Analysis and Applications, vol. 28, no. 3, pp. 642–666, 2006. [page 41] [107] R. Minster, I. Viviano, X. Liu, and G. Ballard, “Cp decomposition for tensors via alternating least squares with qr decomposition,” 2021. [page 41] [108] L. R. Tucker, “Implications of factor analysis of three-way matrices for measurement of change,” in Problems in measuring change., C. W. Harris, Ed. University of Wisconsin Press, 1963, pp. 122–137. [page 42] [109] L. Tucker, “Some mathematical notes on three-mode factor analysis,” 1966. [pages 42 and 54] [110] P. M. Kroonenberg, “Principal component analysis of three-mode data by means of alternating least squares algorithms,” Psychometrika, vol. 45, no. 4, pp. 69–97, 1980. [page 42] [111] T. Wansbeek, H. Neudecker, and A. Kapteyn, “An approach to n-mode components analysis,” Psychometrika, vol. 51, pp. 269–275, 1986. [page 42] [112] M. A. O. Vasilescu and D. Terzopoulos, “Multilinear analysis of image ensembles: Tensorfaces,” in Computer Vision — ECCV 2002, A. Heyden, G. Sparr, M. Nielsen, and P. Johansen, Eds. Berlin, Heidelberg: Springer Berlin Heidelberg, 2002, pp. 447–460. [page 42] [113] L. D. Lathauwer, B. D. Moor, and J. Vandewalle, “A multilinear singular value decomposition,” SIAM J. Matrix Anal. Appl., vol. 21, no. 4, pp. 1253–1278, 2000. [page 42] Bibliography 105 [114] M. Elad and M. Aharon, “Image denoising via sparse and redundant representations over learned dictionaries,” IEEE Transactions on Image Processing, vol. 15, no. 12, pp. 3736–3745, 2006. [pages 42 and 45] [115] M. Aharon, M. Elad, and A. Bruckstein, “K-svd: An algorithm for designing overcomplete dictionaries for sparse representation,” IEEE Transactions on Signal Processing, vol. 54, no. 11, pp. 4311–4322, 2006. [page 42] [116] R. Raina, A. Battle, H. Lee, B. Packer, and A. Y. Ng, “Self-taught learning: Transfer learning from unlabeled data,” in Proceedings of the 24th International Conference on Machine Learning, ser. ICML ’07. New York, NY, USA: Association for Computing Machinery, 2007, p. 759–766. [page 42] [117] M. Aharon and M. Elad, “Sparse and redundant modeling of image content using an image-signature-dictionary,” SIAM Journal on Imaging Sciences, vol. 1, no. 3, pp. 228–247, 2008. [page 42] [118] J. Mairal, M. Elad, and G. Sapiro, “Sparse representation for color image restoration,” IEEE Transactions on Image Processing, vol. 17, no. 1, pp. 53–69, 2008. [page 42] [119] J. Bagnell and D. Bradley, “Differentiable sparse coding,” in Advances in Neural Information Processing Systems, D. Koller, D. Schuurmans, Y. Bengio, and L. Bottou, Eds., vol. 21. Curran Associates, Inc., 2008. [page 42] [120] J. Yang, K. Yu, Y. Gong, and T. Huang, “Linear spatial pyramid matching using sparse coding for image classification,” in 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009, pp. 1794–1801. [page 42] [121] G. Peyré, “Sparse modeling of textures,” SIAM Journal on Imaging Sciences, vol. 34, pp. 17–31, 2009. [page 42] [122] G. Peyré, J. Fadili, and J.-L. Starck, “Learning the morphological diversity,” SIAM Journal on Imaging Sciences, vol. 3, no. 3, pp. 646–669, 2010. [page 42] [123] J. Mairal, F. Bach, J. Ponce, and G. Sapiro, “Online Learning for Matrix Factorization and Sparse Coding,” Journal of Machine Learning Research, vol. 11, no. 1, pp. 19–60, Jan 2010, revised version. [page 42] [124] E. Elhamifar and R. Vidal, “Robust classification using structured sparse representation,” in CVPR 2011, 2011, pp. 1873–1879. [page 42] [125] D. Donoho and X. Huo, “Uncertainty principles and ideal atomic decomposition,” IEEE Transactions on Information Theory, vol. 47, no. 7, pp. 2845–2862, 2001. [page 43] 106 Bibliography [126] M. Elad and A. Bruckstein, “A generalized uncertainty principle and sparse representation in pairs of bases,” IEEE Transactions on Information Theory, vol. 48, no. 9, pp. 2558–2567, 2002. [page 43] [127] D. L. Donoho and M. Elad, “Optimally sparse representation in general (non-orthogonal) dictionaries via ℓ1 minimization,” in PROC. NATL ACAD. SCI. USA 100 2197–202, 2002. [page 43] [128] R. Gribonval and M. Nielsen, “Sparse representations in unions of bases,” IEEE Transactions on Information Theory, vol. 49, no. 12, pp. 3320–3325, 2003. [page 43] [129] D. L. Donoho and J. Tanner, “Sparse nonnegative solution of underdetermined linear equations by linear programming,” Proceedings of the National Academy of Sciences, vol. 102, no. 27, pp. 9446–9451, 2005. [page 43] [130] E. Candes and T. Tao, “Decoding by linear programming,” IEEE Transactions on Information Theory, vol. 51, no. 12, pp. 4203–4215, 2005. [page 43] [131] Z. Zhang, Y. Xu, J. Yang, X. Li, and D. Zhang, “A survey of sparse representation: Algorithms and applications,” IEEE Access, vol. 3, pp. 490–530, 2015. [pages 43 and 44] [132] S. Mallat and Z. Zhang, “Matching pursuits with time-frequency dictionaries,” IEEE Transactions on Signal Processing, vol. 41, no. 12, pp. 3397–3415, 1993. [page 43] [133] Y. Pati, R. Rezaiifar, and P. Krishnaprasad, “Orthogonal matching pursuit: recursive function approximation with applications to wavelet decomposition,” in Proceedings of 27th Asilomar Conference on Signals, Systems and Computers, 1993, pp. 40–44 vol.1. [pages 43 and 82] [134] J. Tropp, “Greed is good: algorithmic results for sparse approximation,” IEEE Transactions on Information Theory, vol. 50, no. 10, pp. 2231–2242, 2004. [pages 43 and 82] [135] D. Needell and R. Vershynin, “Uniform uncertainty principle and signal recovery via regularized orthogonal matching pursuit,” Foundations of Computational Mathematics, vol. 9, pp. 317–334, 2009. [page 43] [136] D. Needell and R. Vershynin, “Signal recovery from incomplete and inaccurate measurements via regularized orthogonal matching pursuit,” IEEE Journal of Selected Topics in Signal Processing, vol. 4, no. 2, pp. 310–316, 2010. [page 43] Bibliography 107 [137] D. Needell and J. A. Tropp, “Cosamp: Iterative signal recovery from incomplete and inaccurate samples,” Commun. ACM, vol. 53, no. 12, p. 93–100, dec 2010. [page 43] [138] D. Needell, J. Tropp, and R. Vershynin, “Greedy signal recovery review,” in 2008 42nd Asilomar Conference on Signals, Systems and Computers, 2008, pp. 1048–1050. [page 43] [139] T. Zhang, “Adaptive forward-backward greedy algorithm for learning sparse representations,” IEEE Transactions on Information Theory, vol. 57, no. 7, pp. 4689–4708, 2011. [page 43] [140] H. Wu and S. Wang, “Adaptive sparsity matching pursuit algorithm for sparse reconstruction,” IEEE Signal Processing Letters, vol. 19, no. 8, pp. 471–474, 2012. [page 43] [141] Y. Chi and R. Calderbank, “Knowledge-enhanced matching pursuit,” in 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 2013, pp. 6576–6580. [page 43] [142] S. K. Sahoo and A. Makur, “Signal recovery from random measurements via extended orthogonal matching pursuit,” IEEE Transactions on Signal Processing, vol. 63, no. 10, pp. 2572–2581, 2015. [page 43] [143] M. Yaghoobi, D. Wu, and M. E. Davies, “Fast non-negative orthogonal matching pursuit,” IEEE Signal Processing Letters, vol. 22, no. 9, pp. 1229– 1233, 2015. [page 43] [144] A. Adler, “Covariance-assisted matching pursuit,” IEEE Signal Processing Letters, vol. 23, no. 1, pp. 149–153, 2016. [page 43] [145] N. Lee, “Map support detection for greedy sparse signal recovery algorithms in compressive sensing,” IEEE Transactions on Signal Processing, vol. 64, no. 19, pp. 4987–4999, 2016. [page 43] [146] J. Tropp, “Just relax: convex programming methods for identifying sparse signals in noise,” IEEE Transactions on Information Theory, vol. 52, no. 3, pp. 1030–1051, 2006. [page 43] [147] S. S. Chen, D. L. Donoho, and M. A. Saunders, “Atomic decomposition by basis pursuit,” SIAM J. Sci. Comput., vol. 20, no. 1, p. 33–61, jan 1998. [page 43] [148] M. A. T. Figueiredo, R. D. Nowak, and S. J. Wright, “Gradient projection for sparse reconstruction: Application to compressed sensing and other inverse problems,” IEEE Journal of Selected Topics in Signal Processing, vol. 1, no. 4, pp. 586–597, 2007. [page 43] 108 Bibliography [149] L. F. Portugal, M. G. C. Resende, G. Veiga, and J. J. Júdice, “A truncated primal-infeasible dual-feasible network interior point method,” Networks, vol. 35, no. 2, pp. 91–108, 2000. [page 43] [150] S.-J. Kim, K. Koh, M. Lustig, S. Boyd, and D. Gorinevsky, “An interior-point method for large-scale ℓ1 -regularized least squares,” IEEE Journal of Selected Topics in Signal Processing, vol. 1, no. 4, pp. 606–617, 2007. [page 43] [151] J. Yang and Y. Zhang, “Alternating direction algorithms for ℓ1 -problems in compressive sensing,” SIAM Journal on Scientific Computing, vol. 33, no. 1, pp. 250–278, 2011. [page 43] [152] M. Figueiredo and R. Nowak, “A bound optimization approach to waveletbased image deconvolution,” in IEEE International Conference on Image Processing 2005, vol. 2, 2005, pp. II–782. [page 43] [153] A. Beck and M. Teboulle, “A fast iterative shrinkage-thresholding algorithm for linear inverse problems,” SIAM Journal on Imaging Sciences, vol. 2, no. 1, pp. 183–202, 2009. [page 43] [154] A. Y. Yang, S. S. Sastry, A. Ganesh, and Y. Ma, “Fast ℓ1 -minimization algorithms and an application in robust face recognition: A review,” in 2010 IEEE International Conference on Image Processing, 2010, pp. 1849–1852. [page 43] [155] P. L. Combettes and J.-C. Pesquet, “Proximal Splitting Methods in Signal Processing,” in Fixed-Point Algorithms for Inverse Problems in Science and Engineering, Bauschke, H. Burachik, R. Combettes, P. Elser, V. Luke, D. Wolkowicz, and H. (Eds.), Eds. Springer, 2011, pp. 185–212. [page 43] [156] S. J. Wright, R. D. Nowak, and M. A. T. Figueiredo, “Sparse reconstruction by separable approximation,” in 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, 2008, pp. 3373–3376. [page 43] [157] J. Adler, B. Rao, and K. Kreutz-Delgado, “Comparison of basis selection methods,” in Conference Record of The Thirtieth Asilomar Conference on Signals, Systems and Computers, vol. 1, 1996, pp. 252–257. [page 43] [158] A. M. Bruckstein, D. L. Donoho, and M. Elad, “From sparse solutions of systems of equations to sparse modeling of signals and images,” SIAM Review, vol. 51, no. 1, pp. 34–81, 2009. [page 43] [159] J. W. Cooley and J. W. Tukey, “An algorithm for the machine calculation of complex fourier series,” Mathematics of computation, vol. 19, no. 90, pp. 297–301, 1965. [page 44] Bibliography 109 [160] N. Ahmed, T. Natarajan, and K. Rao, “Discrete cosine transform,” IEEE Transactions on Computers, vol. C-23, no. 1, pp. 90–93, 1974. [page 44] [161] E. Simoncelli, W. Freeman, E. Adelson, and D. Heeger, “Shiftable multiscale transforms,” IEEE Transactions on Information Theory, vol. 38, no. 2, pp. 587–607, 1992. [page 44] [162] E. Simoncelli and E. Adelson, “Noise removal via bayesian wavelet coring,” in Proceedings of 3rd IEEE International Conference on Image Processing, vol. 1, 1996, pp. 379–382 vol.1. [page 44] [163] W. He, Y. Zi, B. Chen, F. Wu, and Z. He, “Automatic fault feature extraction of mechanical anomaly on induction motor bearing using ensemble superwavelet transform,” Mechanical Systems and Signal Processing, vol. 54-55, pp. 457–480, 2015. [page 44] [164] J.-L. Starck, E. Candes, and D. Donoho, “The curvelet transform for image denoising,” IEEE Transactions on Image Processing, vol. 11, no. 6, pp. 670–684, 2002. [page 44] [165] E. Le Pennec and S. Mallat, “Sparse geometric image representations with bandelets,” IEEE Transactions on Image Processing, vol. 14, no. 4, pp. 423–438, 2005. [page 44] [166] M. Do and M. Vetterli, “The contourlet transform: An efficient directional multiresolution image representation,” IEEE transactions on image processing : a publication of the IEEE Signal Processing Society, vol. 14, pp. 2091–106, 01 2006. [page 44] [167] S. Foucart and H. Rauhut, A Mathematical Introduction to Compressive Sensing., ser. Applied and Numerical Harmonic Analysis. Birkhäuser, 2013. [page 44] [168] B. Zhao, L. Fei-Fei, and E. P. Xing, “Online detection of unusual events in videos via dynamic sparse coding,” in CVPR 2011, 2011, pp. 3313–3320. [page 44] [169] J. Shi, X. Ren, G. Dai, J. Wang, and Z. Zhang, “A non-convex relaxation approach to sparse dictionary learning,” in CVPR 2011, 2011, pp. 1809–1816. [page 44] [170] M. Yang, L. Zhang, X. Feng, and D. Zhang, “Fisher discrimination dictionary learning for sparse representation,” in 2011 International Conference on Computer Vision, 2011, pp. 543–550. [pages 44 and 45] 110 Bibliography [171] Z. Jiang, Z. Lin, and L. S. Davis, “Learning a discriminative dictionary for sparse coding via label consistent k-svd,” in CVPR 2011, 2011, pp. 1697–1704. [pages 44 and 45] [172] D.-S. Pham and S. Venkatesh, “Joint learning and dictionary construction for pattern recognition,” in 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008, pp. 1–8. [page 44] [173] J. Mairal, F. Bach, J. Ponce, G. Sapiro, and A. Zisserman, “Non-local sparse models for image restoration,” in 2009 IEEE 12th International Conference on Computer Vision, 2009, pp. 2272–2279. [page 44] [174] H. Cheng, Z. Liu, L. Yang, and X. Chen, “Sparse representation and learning in visual recognition: Theory and applications,” Signal Processing, vol. 93, no. 6, pp. 1408–1425, 2013, special issue on Machine Learning in Intelligent Image Processing. [page 44] [175] M. Aharon, M. Elad, and A. Bruckstein, “K-svd: An algorithm for designing overcomplete dictionaries for sparse representation,” IEEE Transactions on Signal Processing, vol. 54, no. 11, pp. 4311–4322, 2006. [page 44] [176] M. Elad and M. Aharon, “Image denoising via learned dictionaries and sparse representation,” in 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’06), vol. 1, 2006, pp. 895–900. [page 45] [177] M. L. Fu, Z. M. Wang, and R. X. Cao, “Image denoising based on k-svd with adaptive dictionary size,” in Modern Technologies in Materials, Mechanics and Intelligent Systems, ser. Advanced Materials Research, vol. 1049. Trans Tech Publications Ltd, 11 2014, pp. 1645–1649. [page 45] [178] O. Bryt and M. Elad, “Compression of facial images using the k-svd algorithm,” Journal of Visual Communication and Image Representation, vol. 19, no. 4, pp. 270–282, 2008. [page 45] [179] J. Mairal, J. Ponce, G. Sapiro, A. Zisserman, and F. Bach, “Supervised dictionary learning,” in Advances in Neural Information Processing Systems, D. Koller, D. Schuurmans, Y. Bengio, and L. Bottou, Eds., vol. 21. Curran Associates, Inc., 2008. [page 45] [180] J. Wright, A. Y. Yang, A. Ganesh, S. S. Sastry, and Y. Ma, “Robust face recognition via sparse representation,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 31, no. 2, pp. 210–227, 2009. [page 45] Bibliography 111 [181] Y. Yang, Y. Yang, Z. Huang, H. T. Shen, and F. Nie, “Tag localization with spatial correlations and joint group sparsity,” in CVPR 2011, 2011, pp. 881–888. [page 45] [182] Z. Jiang, G. Zhang, and L. S. Davis, “Submodular dictionary learning for sparse coding,” in 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012, pp. 3418–3425. [page 45] [183] S. Zubair and W. Wang, “Tensor dictionary learning with sparse tucker decomposition,” in 2013 18th International Conference on Digital Signal Processing (DSP), 2013, pp. 1–6. [page 45] [184] F. Roemer, G. Del Galdo, and M. Haardt, “Tensor-based algorithms for learning multidimensional separable dictionaries,” in 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2014, pp. 3963–3967. [page 45] [185] C. F. Dantas, M. N. da Costa, and R. d. R. Lopes, “Learning dictionaries as a sum of kronecker products,” IEEE Signal Processing Letters, vol. 24, no. 5, pp. 559–563, 2017. [page 45] [186] Z. Shakeri, W. U. Z. Bajwa, and A. D. Sarwate, “Minimax lower bounds on dictionary learning for tensor data,” IEEE Transactions on Information Theory, vol. 64, pp. 2706–2726, 2018. [page 45] [187] R. Xu, Y. Xu, and Y. Quan, “Factorized tensor dictionary learning for visual tensor data completion,” IEEE Transactions on Multimedia, vol. 23, pp. 1225–1238, 2021. [page 45] [188] G. Duan, H. Wang, Z. Liu, J. Deng, and Y.-W. Chen, “K-cpd: Learning of overcomplete dictionaries for tensor sparse coding,” in Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012), 2012, pp. 493–496. [page 45] [189] A. Stevens, Y. Pu, Y. Sun, G. Spell, and L. Carin, “Tensor-Dictionary Learning with Deep Kruskal-Factor Analysis,” in Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, ser. Proceedings of Machine Learning Research, A. Singh and J. Zhu, Eds., vol. 54. PMLR, 20–22 Apr 2017, pp. 121–129. [page 45] [190] X. Zhang, X. Yuan, and L. Carin, “Nonlocal low-rank tensor factor analysis for image restoration,” in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018, pp. 8232–8241. [page 45] 112 Bibliography [191] C. Fraga Dantas, J. E. Cohen, and R. Gribonval, “Learning fast dictionaries for sparse representations using low-rank tensor decompositions,” in LVA/ICA 2018 - 14th International Conference on Latent Variable Analysis and Signal Separation, ser. LNCS, vol. 10891. Guildford, United Kingdom: Springer, jul 2018, pp. 456–466. [page 45] [192] S. Zubair and W. Wang, “Signal classification based on block-sparse tensor representation,” in 2014 19th International Conference on Digital Signal Processing, 2014, pp. 361–365. [page 45] [193] Y. Fu, J. Gao, Y. Sun, and X. Hong, “Joint multiple dictionary learning for tensor sparse coding,” in 2014 International Joint Conference on Neural Networks (IJCNN), 2014, pp. 2957–2964. [page 45] [194] N. Qi, Y. Shi, X. Sun, and B. Yin, “Tensr: Multi-dimensional tensor sparse representation,” in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp. 5916–5925. [page 45] [195] F. Ju, Y. Sun, J. Gao, Y. Hu, and B. Yin, “Nonparametric tensor dictionary learning with beta process priors,” Neurocomput., vol. 218, no. C, p. 120–130, dec 2016. [page 45] [196] J. Salvador, “Chapter 2 - a taxonomy of example-based super resolution,” in Example-Based Super Resolution, J. Salvador, Ed. Academic Press, 2017, pp. 15–29. [page 48] [197] R. R. Lewis, “Making shaders more physically plausible,” Computer Graphics Forum, vol. 13, no. 2, pp. 109–120, 1994. [page 48] [198] E. P. Lafortune and Y. D. Willems, “Using the modified phong reflectance model for physically based rendering,” Tech. Rep., 1994. [page 48] [199] A. Dür, “An improved normalization for the ward reflectance model,” Journal of Graphics Tools, vol. 11, pp. 51 – 59, 2006. [page 49] [200] D. Geisler-Moroder and A. Dür, “A new ward brdf model with bounded albedo,” in Proceedings of the 21st Eurographics Conference on Rendering, ser. EGSR’10. Goslar, DEU: Eurographics Association, 2010, pp. 1391–1398. [page 49] [201] L. Neumann, A. Neumannn, and L. Szirmay-Kalos, “Compact Metallic Reflectance Models,” Computer Graphics Forum, 1999. [page 49] [202] C. Schlick, “An inexpensive brdf model for physically-based rendering,” Computer Graphics Forum, vol. 13, no. 3, pp. 233–246, 1994. [page 50] Bibliography 113 [203] M. Ashikhmin and S. Premoze, “Distribution-based brdfs,” 2007. [page 50] [204] K. E. Torrance and E. M. Sparrow, “Theory for off-specular reflection from roughened surfaces∗,” J. Opt. Soc. Am., vol. 57, no. 9, pp. 1105–1114, Sep 1967. [page 51] [205] B. Walter, S. R. Marschner, H. Li, and K. E. Torrance, “Microfacet models for refraction through rough surfaces,” in Proceedings of the 18th Eurographics Conference on Rendering Techniques, ser. EGSR’07. Goslar, DEU: Eurographics Association, 2007, pp. 195–206. [page 51] [206] M. M. Bagher, C. Soler, and N. Holzschuch, “Accurate fitting of measured reflectances using a Shifted Gamma micro-facet distribution,” Computer Graphics Forum, vol. 31, no. 4, pp. 1509–1518, Jun. 2012. [pages 51 and 52] [207] N. Holzschuch and R. Pacanowski, “A Two-Scale Microfacet Reflectance Model Combining Reflection and Diffraction,” ACM Transactions on Graphics, vol. 36, no. 4, p. 12, Jul. 2017, article 66. [page 52] [208] M. Kurt, L. Szirmay-Kalos, and J. Křivánek, “An anisotropic brdf model for fitting and monte carlo rendering,” SIGGRAPH Comput. Graph., vol. 44, no. 1, feb 2010. [page 52] [209] T. Tongbuasirilai, “Accurate brdf modelling for wide angle scattering,” Master’s thesis, Linköping UniversityLinköping University, Media and Information Technology, The Institute of Technology, 2013. [page 52] [210] M. Oren and S. K. Nayar, “Generalization of lambert’s reflectance model,” in Proceedings of the 21st Annual Conference on Computer Graphics and Interactive Techniques, ser. SIGGRAPH ’94. New York, NY, USA: Association for Computing Machinery, 1994, pp. 239–246. [page 52] [211] J. W. S. L. D. L. S. Maxwell, J. R. Beard, “Bidirectional reflectance model validation and utilization.” [page 53] [212] H. Westlund and G. Meyer, “A brdf database employing the beard-maxwell reflection model,” in Proceedings of the Graphics Interface 2002 Conference, May 2002, pp. 189–201. [page 53] [213] A. Weidlich and A. Wilkie, “Arbitrarily layered micro-facet surfaces,” in Proceedings of the 5th International Conference on Computer Graphics and Interactive Techniques in Australia and Southeast Asia, ser. GRAPHITE ’07. New York, NY, USA: Association for Computing Machinery, 2007, pp. 171–178. [page 53] 114 Bibliography [214] M. Rump, G. Müller, R. Sarlette, D. Koch, and R. Klein, “Photo-realistic rendering of metallic car paint from image-based measurements,” Computer Graphics Forum, 2008. [page 53] [215] F. X. Sillion, J. R. Arvo, S. H. Westin, and D. P. Greenberg, “A global illumination solution for general reflectance distributions,” ser. SIGGRAPH ’91. New York, NY, USA: Association for Computing Machinery, 1991, pp. 187–196. [page 54] [216] P. Mézières and M. Paulin, “Efficient spherical harmonic shading for separable BRDF,” in 14th ACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia (SIGGRAPH ASIA 2021), Special interest group of the Association for Computing Machinery in computer graphics and interactive techniques. Tokyo, Japan: ACM, Dec. 2021, pp. 1–4. [page 54] [217] D. Mahajan, R. Ramamoorthi, and B. Curless, “A theory of spherical harmonic identities for brdf/lighting transfer and image consistency,” in Proceedings of the 9th European Conference on Computer Vision - Volume Part IV, ser. ECCV’06. Berlin, Heidelberg: Springer-Verlag, 2006, p. 41–55. [page 54] [218] R. Basri and D. Jacobs, “Lambertian reflectance and linear subspaces,” in Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001, vol. 2, 2001, pp. 383–390 vol.2. [page 54] [219] L. Zhang and D. Samaras, “Face recognition under variable lighting using harmonic image exemplars,” in 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings., vol. 1, 2003, pp. I–I. [page 54] [220] R. Ramamoorthi and P. Hanrahan, “A signal-processing framework for reflection,” ACM Trans. Graph., vol. 23, no. 4, p. 1004–1042, oct 2004. [page 54] [221] R. Ramamoorthi and P. Hanrahan, “An efficient representation for irradiance environment maps,” in Proceedings of the 28th Annual Conference on Computer Graphics and Interactive Techniques, ser. SIGGRAPH ’01. New York, NY, USA: Association for Computing Machinery, 2001, p. 497–500. [pages 54 and 61] [222] P.-P. Sloan, J. Kautz, and J. Snyder, “Precomputed radiance transfer for real-time rendering in dynamic, low-frequency lighting environments,” in Proceedings of the 29th Annual Conference on Computer Graphics and Interactive Techniques, ser. SIGGRAPH ’02. New York, NY, USA: Association for Computing Machinery, 2002, pp. 527–536. [page 54] Bibliography 115 [223] Z. Ren, R. Wang, J. Snyder, K. Zhou, X. Liu, B. Sun, P.-P. Sloan, H. Bao, Q. Peng, and B. Guo, “Real-time soft shadows in dynamic scenes using spherical harmonic exponentiation,” in ACM SIGGRAPH 2006 Papers, ser. SIGGRAPH ’06. New York, NY, USA: Association for Computing Machinery, 2006, pp. 977–986. [page 54] [224] R. Ramamoorthi, “Modeling illumination variation with spherical harmonics,” in In Face Processing: Advanced Modeling and Methods. Academic Press, 2005. [page 54] [225] I. G. E. Renhorn and G. D. Boreman, “Developing a generalized brdf model from experimental data,” Opt. Express, vol. 26, no. 13, pp. 17 099–17 114, Jun 2018. [page 54] [226] R. Pacanowski, O. Salazar-Celis, C. Schlick, X. Granier, P. Pierre, and C. Annie, “Rational BRDF,” IEEE Transactions on Visualization and Computer Graphics, vol. 18, no. 11, pp. 1824–1835, Mar. 2012. [page 54] [227] F. Romeiro, Y. Vasilyev, and T. Zickler, “Passive reflectometry,” in Computer Vision – ECCV 2008, D. Forsyth, P. Torr, and A. Zisserman, Eds. Berlin, Heidelberg: Springer Berlin Heidelberg, 2008, pp. 859–872. [pages 54 and 78] [228] J. Hao, Y. Liu, and D. Weng, “A brdf representing method based on gaussian process,” in Computer Vision - ACCV 2014 Workshops, C. Jawahar and S. Shan, Eds. Cham: Springer International Publishing, 2015, pp. 542–553. [page 55] [229] S. Georgoulis, V. Vanweddingen, M. Proesmans, and L. Van Gool, “A gaussian process latent variable model for brdf inference,” in 2015 IEEE International Conference on Computer Vision (ICCV), 2015, pp. 3559–3567. [page 55] [230] K. Nishino and S. Lombardi, “Directional statistics-based reflectance model for isotropic bidirectional reflectance distribution functions,” J. Opt. Soc. Am. A, vol. 28, no. 1, pp. 8–18, Jan 2011. [pages 55 and 78] [231] M. M. Bagher, J. Snyder, and D. Nowrouzezahrai, “A non-parametric factor microfacet model for isotropic brdfs,” ACM Trans. Graph., vol. 35, no. 5, jul 2016. [pages 55 and 67] [232] E. Miandji, S. Hajisharif, and J. Unger, “A unified framework for compression and compressed sensing of light fields and light field videos,” ACM Trans. Graph., vol. 38, no. 3, may 2019. [pages 56, 62, 63, and 65] [233] L. Claustres, M. Paulin, and Y. Boucher, “BRDF Measurement Modelling using Wavelets for Efficient Path Tracing,” Computer Graphics Forum, 2003. [page 61] 116 Bibliography [234] B. Tunwattanapong, G. Fyffe, P. Graham, J. Busch, X. Yu, A. Ghosh, and P. Debevec, “Acquiring reflectance and shape from continuous spherical harmonic illumination,” ACM Trans. Graph., vol. 32, no. 4, jul 2013. [page 65] [235] A. Wilkie, A. Weidlich, M. Magnor, and A. Chalmers, “Predictive rendering,” in ACM SIGGRAPH ASIA 2009 Courses, ser. SIGGRAPH ASIA ’09. New York, NY, USA: Association for Computing Machinery, 2009. [page 68] [236] A. Ben-Artzi, R. Overbeck, and R. Ramamoorthi, “Real-time brdf editing in complex lighting,” ACM Trans. Graph., vol. 25, no. 3, pp. 945–954, jul 2006. [page 68] [237] A. Ben-Artzi, K. Egan, F. Durand, and R. Ramamoorthi, “A precomputed polynomial representation for interactive brdf editing with global illumination,” ACM Trans. Graph., vol. 27, no. 2, may 2008. [page 68] [238] X. Sun, K. Zhou, Y. Chen, S. Lin, J. Shi, and B. Guo, “Interactive relighting with dynamic brdfs,” ACM Trans. Graph., vol. 26, no. 3, pp. 27–es, jul 2007. [page 68] [239] T.-W. Schmidt, F. Pellacini, D. Nowrouzezahrai, W. Jarosz, and C. Dachsbacher, “State of the art in artistic editing of appearance, lighting, and material,” Computer Graphics Forum, vol. 35, no. 1, pp. 216–233, feb 2016. [page 68] [240] J. C. Gower and G. B. Dijksterhuis, Procrustes Problems. Oxford University Press, 01 2004. [page 70] [241] R. Vávra and J. Filip, “Minimal sampling for effective acquisition of anisotropic brdfs,” Comput. Graph. Forum, vol. 35, no. 7, pp. 299–309, oct 2016. [pages 78 and 82] [242] M. Fuchs, V. Blanz, H. P. Lensch, and H.-P. Seidel, “Adaptive sampling of reflectance fields,” ACM Trans. Graph., vol. 26, no. 2, pp. 10–es, jun 2007. [page 78] [243] G. Patow and X. Pueyo, “A survey of inverse rendering problems,” Computer Graphics Forum, vol. 22, no. 4, pp. 663–687, 2003. [page 78] [244] F. Romeiro and T. Zickler, “Blind reflectometry,” in Computer Vision – ECCV 2010, K. Daniilidis, P. Maragos, and N. Paragios, Eds. Berlin, Heidelberg: Springer Berlin Heidelberg, 2010, pp. 45–58. [page 78] [245] S. R. Marschner, “Inverse rendering for computer graphics,” Ph.D. dissertation, USA, 1998, aAI9839924. [page 78] Bibliography 117 [246] M. Aittala, T. Weyrich, and J. Lehtinen, “Practical svbrdf capture in the frequency domain,” ACM Trans. Graph., vol. 32, no. 4, jul 2013. [page 78] [247] W. Matusik, H. Pfister, M. Brand, and L. McMillan, “Efficient Isotropic BRDF Measurement,” in Eurographics Workshop on Rendering, P. Dutre, F. Suykens, P. H. Christensen, and D. Cohen-Or, Eds. The Eurographics Association, 2003. [page 78] [248] K. Reinhard, H. Matthias B., S. Heinz Christian, and B. Dennis den, “PatchBased Sparse Reconstruction of Material BTFs,” Journal of WSCG, vol. 22, no. 2, pp. 83–90, 2014. [pages 78 and 82] [249] I. C. F. Ipsen and T. Wentworth, “The effect of coherence on sampling from matrices with orthonormal columns, and preconditioned least squares problems,” SIAM J. Matrix Anal. Appl., vol. 35, no. 4, pp. 1490–1520, 2014. [page 78] [250] H. Nyquist, “Certain topics in telegraph transmission theory,” Transactions of the American Institute of Electrical Engineers, vol. 47, no. 2, pp. 617–644, 1928. [page 81] [251] C. Shannon, “Communication in the presence of noise,” Proceedings of the IRE, vol. 37, no. 1, pp. 10–21, 1949. [page 81] [252] E. Candes, J. Romberg, and T. Tao, “Robust uncertainty principles: Exact signal reconstruction from highly incomplete frequency information,” 2004. [page 81] [253] E. Candes, J. Romberg, and T. Tao, “Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information,” IEEE Transactions on Information Theory, vol. 52, no. 2, pp. 489–509, 2006. [page 81] [254] D. Donoho, “Compressed sensing,” IEEE Transactions on Information Theory, vol. 52, no. 4, pp. 1289–1306, 2006. [page 81] [255] J. A. Tropp, A. C. Gilbert, and M. J. Strauss, “Algorithms for simultaneous sparse approximation. part i: Greedy pursuit,” Signal Processing, vol. 86, no. 3, pp. 572–588, 2006, sparse Approximations in Signal and Image Processing. [pages 81 and 82] [256] J. A. Tropp, “Algorithms for simultaneous sparse approximation. part ii: Convex relaxation,” Signal Processing, vol. 86, no. 3, pp. 589–602, 2006, sparse Approximations in Signal and Image Processing. [page 81] 118 Bibliography [257] M. M. Hyder and K. Mahata, “A robust algorithm for joint-sparse recovery,” IEEE Signal Processing Letters, vol. 16, no. 12, pp. 1091–1094, 2009. [page 82] [258] D. L. Donoho, Y. Tsaig, I. Drori, and J. L. Starck, “Sparse solution of underdetermined systems of linear equations by stagewise orthogonal matching pursuit,” IEEE Trans. Inf. Theor., vol. 58, no. 2, pp. 1094–1121, feb 2012. [page 82] [259] J. D. Blanchard, M. Cermak, D. Hanle, and Y. Jing, “Greedy algorithms for joint sparse recovery,” IEEE Transactions on Signal Processing, vol. 62, no. 7, pp. 1694–1704, 2014. [page 82] [260] J. Shang, Z. Wang, and Q. Huang, “A robust algorithm for joint sparse recovery in presence of impulsive noise,” IEEE Signal Processing Letters, vol. 22, no. 8, pp. 1166–1170, 2015. [page 82] [261] N. Han, S. Li, and J. Lu, “Orthogonal subspace based fast iterative thresholding algorithms for joint sparsity recovery,” IEEE Signal Processing Letters, vol. 28, pp. 1320–1324, 2021. [page 82] [262] Z. Xu, J. B. Nielsen, J. Yu, H. W. Jensen, and R. Ramamoorthi, “Minimal brdf sampling for two-shot near-field reflectance acquisition,” ACM Trans. Graph., vol. 35, no. 6, nov 2016. [page 82] [263] V. Blanz, A. Mehl, T. Vetter, and H.-P. Seidel, “A statistical method for robust 3d surface reconstruction from sparse data,” in Proceedings. 2nd International Symposium on 3D Data Processing, Visualization and Transmission, 2004. 3DPVT 2004., 2004, pp. 293–300. [page 85] [264] J.-F. Determe, “Somp simulation framework v0.5.3,” 2015, accessed: 2022-01. [page 85] Publications Publications The publications associated with this thesis have been removed for copyright reasons. For more details about these see: https://doi.org/10.3384/9789179295585 FACULTY OF SCIENCE AND ENGINEERING Linköping Studies in Science and Technology Dissertation No. 2272 Linköping Studies in Science and Technology, Dissertation No. 2272, 2023 Department of Science and Technology Linköping University SE-581 83 Linköping, Sweden www.liu.se Data-Driven Approaches for Tanaboon Tongbuasirilai Sparse Reflectance Modeling and Acquisition Data-Driven Approaches for Sparse Reflectance Modeling and Acquisition Tanaboon Tongbuasirilai 2023