UNIVERSITEIT•STELLENBOSCH•UNIVERSITY jou kennisvennoot • your knowledge partner Drone-based traffic flow estimation and tracking using computer vision by Andrew de Bruin: 16458826 Report submitted in partial fulfilment of the requirements of the module Project (E) 448 for the degree Baccalaureus in Engineering in the Department of Electrical and Electronic Engineering at the University of Stellenbosch Supervisor: Dr. Marthinus Johannes Booysen Department of Electrical & Electronic Engineering, Stellenbosch University. November 2014 DECLARATION i Declaration I, the undersigned, hereby declare that the work contained in this report is my own original work unless indicated otherwise. Signature Date ACKNOWLEDGEMENTS ii Acknowledgements To my Mother and Father for the continuous support and encouragement offered throughout my four years of study. To Dr. M.J. Booysen for the guidance, support and supervision he provided throughout the project. To Prof. T. Jones for organising the Parrot AR Drone, as well as for increasing my interest in control systems throughout my four years of study. To Prof. W.H. Steyn for his continuous guidance. To Dr. C.E. van Daalen for his assistance with control systems. To Dr. S.J. Andersen for his assistance with the traffic flow component. To Chané Bunning for helping with the design of the poster, as well as for the continuous support and encouragement she offered throughout the process. To my lab colleagues, Alexander Chiu and Jonathan Brown, for their motivation and good company. To Trintel and MTN for providing funding for the project. DEDICATION iii Dedication Science is a wonderful thing if one does not have to earn one’s living at it. Albert Einstein ABSTRACT iv Abstract Traffic flow estimation is a technique by which traffic engineers analyse the extent to which a particular road segment can accommodate traffic. Decisions regarding the need for road upgrades, the installation of speeding cameras and even general security upgrades are made based on these results. Since traffic cameras are usually installed throughout urban areas, exploiting this established infrastructure would allow for the seamless integration of an autonomous traffic flow estimation system. This can be achieved by designing a highly flexible and adaptive system, that would allow for the analysis of traffic footage from various input sources. The purpose of this project was to design and implement a system that allowed for autonomous traffic flow estimation using computer vision techniques. The system was required to compute traffic flow metrics and upload the information in real-time to an online dashboard. The Mixture of Gaussian (MoG) background subtraction technique was implemented throughout the project as the primary method of vehicle detection. Shadows were detected based on certain chromaticity characteristics, before being subsequently removed from the foreground mask. Relative vehicle velocities were computed based on optical flow tracking techniques. Results showed both the vehicle detection as well as the velocity computations to be around 95% accurate under ideal conditions, and around 80% under non-ideal illumination conditions. A particularly attractive alternative to the static pole-mounted traffic cameras, is the use of a fully autonomous aircraft. In this way, simple landing platforms could eventually replace expensive traffic cameras. This would allow for the analysis of traffic flow in more rural areas where the necessary infrastructure is often non-existent. The system designed to automate the aircraft flight control was implemented in the discrete-time domain using augmented difference equations. The method of least squares was used to obtain a model of the plant so that appropriate controllers could be designed. Ultimately, the Ziegler-Nichols tuning method was used to obtain the practical controller parameters. Results obtained during the testing procedure showed that the aircraft was able to land on the designated platform. The inclusion of the automated aircraft is intended to add additional functionality to the primary aim of this project. The system was designed in a modular fashion, with each submodule contributing additional functionality. The modules were then integrated and thoroughly tested to ensure optimal system performance. Results of the testing showed the system to be fully functional, and proved the concept of autonomous traffic flow estimation to be a viable option. UITTREKSEL v Uittreksel Meting van verkeersvloei, is ’n tegniek wat deur verkeersingenieurs gebruik word om die mate wat ’n spesifieke pad segment verkeer kan akkommodeer, te analiseer. Die resultate kan gebruik word vir besluitneming aangaande pad opgraderings, die installering van spoedkameras en selfs vir algemene sekuriteitsopgradering. Hierdie outonome verkeersvloeimetingstelsel, kan sonder veel moeite in ’n bestaande kamera infrastruktuur in veral beboude areas geïmplementeer word. Dit word vermag deur ’n hoogs buigbare en aanpasbare stelsel te ontwerp vir die analisering van verkeersopnames wat van verskeie intreebronne verkry word. Die doel van hierdie projek was om ’n stelsel te ontwerp en implementeer wat voorsiening maak vir outonome verkeersvloeimeting deur van rekenaarvisietegnieke gebruik te maak. ’n Vereiste was dat die stelsel verkeersvloei eienskappe visueel kan bereken, en die inligting dan op ’n aanlyn beheerpaneel in reële tyd oplaai. Die ‘Mixture of Gaussian’ (MoG) agtergrond-aftrekking tegniek, was regdeur die projek geïmplementeer as die primêre metode van voertuig opsporing. Skaduwee opsoring was gebaseer op sekere chromatisiteit-eienskappe, voordat dit van die voorgrond masker verwyder is. Relatiewe voertuigsnelhede word voorgestel deur berekende optiese-vloei opsporings tegnieke. Die resultate het getoon dat beide die voertuig opsporing, asook die snelheid berekeninge, in die omgewing van 95% akkuraat was onder ideale toestande, en in die omgewing van 80% onder nie-ideale beligting toestande. ’n Baie aantreklike alternatief tot die kameras wat op statiese pale gemonteer word, is die gebruik van ’n volkome outonome model vliegtuig. Duur verkeerskameras kan hierdeur vervang word met eenvoudige landingsplatforms. Hierdie metode sal die analisering van verkeersvloei in veral die vêrafgeleë platteland toelaat, waar infrastruktuur dikwels afwesig is. Die stelsel wat ontwerp is om die vliegtuig se vlug-beheer te outomatiseer, was geïmplementeer in die diskrete tyd domein, deur gebruik te maak van aangevulde verskil vergelykings. Die kleinstekwadrate metode van was gebruik om ’n model van die aanleg te verkry, sodat doelgerigte beheerders ontwerp kon word. Die Ziegler-Nichols verstellings metode was uitsluitlik gebruik om die praktiese beheerder parameters te verkry. Die resultate wat tydens die toets metode verkry is, het bewys dat die vliegtuig op ’n aangewese platform kon land. Die insluiting van die geoutomatiseerde vliegtuig, is bedoel om bykomende funksionaliteit tot die primêre doel van die projek te voeg. Die stelsel was in ’n modulêre patroon ontwerp waar elke sub-module ’n bykomende funksionaliteit bygedra het. Die modules was geïntegreer en behoorlik getoets om optimale werkverrigting van die stelsel te verseker. Toetsresultate het getoon dat die stelsel ten volle funksioneel was, en het bewys dat die konsep van ’n outonome verkeersvloei bepaling ’n haalbare moontlikheid is. Contents Declaration i Acknowledgements ii Dedication iii Abstract iv Uittreksel v Contents vi List of Figures ix List of Tables xi Nomenclature xii 1 Introduction 1.1 Project background . 1.2 Objectives . . . . . . 1.3 Methodology . . . . 1.4 Demonstration video 1.5 Report layout . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1 2 3 3 3 2 Research and Associated Literature 2.1 Computer vision support for traffic detection . . 2.1.1 Camera calibration and Pose estimation . 2.1.2 Background subtraction . . . . . . . . . . 2.1.3 Optical flow tracking . . . . . . . . . . . . 2.2 Traffic metric computations and online reporting 2.2.1 Traffic flow computations . . . . . . . . . 2.2.2 Trintel online dashboard . . . . . . . . . . 2.3 Control systems and supporting computer vision 2.3.1 System identification . . . . . . . . . . . . 2.3.2 PID control . . . . . . . . . . . . . . . . . 2.3.3 Parameter estimation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 4 5 5 6 6 7 7 8 8 9 9 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vi CONTENTS vii 3 System Overview 3.1 Functional overview . . . . . . . 3.2 System architecture . . . . . . . 3.3 Integrated hardware . . . . . . . 3.3.1 Hardware overview . . . . 3.3.2 Hardware communication 3.4 Software functional modules . . . 3.5 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 10 11 11 11 13 14 15 4 Detailed System Design 4.1 Traffic flow estimation . . . . . . 4.1.1 Road profile creation . . . 4.1.2 Background modelling . . 4.1.3 Shadow removal . . . . . 4.1.4 Object detection . . . . . 4.1.5 Optical flow tracking . . . 4.1.6 Traffic flow computations 4.1.7 Online dashboard . . . . . 4.2 Drone control . . . . . . . . . . . 4.2.1 Target tracking . . . . . . 4.2.2 Target references . . . . . 4.2.3 System identification . . . 4.2.4 Theoretical design . . . . 4.2.5 Practical implementation 4.2.6 Landing algorithm . . . . 4.3 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16 16 17 18 20 23 24 27 29 29 30 31 32 35 37 38 38 5 Results 5.1 Traffic flow estimation . . . . . . . . . . 5.1.1 Background modelling techniques 5.1.2 Shadow removal . . . . . . . . . 5.1.3 Vehicle velocity estimation . . . 5.1.4 Traffic metric computations . . . 5.1.5 Night-time results . . . . . . . . 5.2 Drone control . . . . . . . . . . . . . . . 5.2.1 Target tracking . . . . . . . . . . 5.2.2 Automated landing . . . . . . . . 5.3 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39 39 40 40 41 42 43 43 44 46 47 6 Conclusion and Future Development 48 6.1 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48 6.2 Future development . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49 References 51 Appendices 54 Appendix A: Project planning schedule 55 Project plan . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56 Appendix B: Project specification 57 Project overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58 Functional specifications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59 CONTENTS viii Interfaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Performance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59 60 Appendix C: Outcomes compliance 61 Outcomes compliance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62 Appendix D: Computer vision D1 . . . . . . . . . . . . . . . . D2 . . . . . . . . . . . . . . . . D3 . . . . . . . . . . . . . . . . D4 . . . . . . . . . . . . . . . . D5 . . . . . . . . . . . . . . . . D6 . . . . . . . . . . . . . . . . D7 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66 67 68 69 70 71 72 73 Appendix E: Control E1 . . . . . . . . . . E2 . . . . . . . . . . E3 . . . . . . . . . . E4 . . . . . . . . . . E5 . . . . . . . . . . E6 . . . . . . . . . . E7 . . . . . . . . . . E8 . . . . . . . . . . E9 . . . . . . . . . . E10 . . . . . . . . . E11 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74 75 77 78 79 80 81 82 83 84 85 86 . . . . . . . . 87 88 89 90 91 92 93 94 95 systems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Appendix F: Software flow charts F1 . . . . . . . . . . . . . . . . . . F2 . . . . . . . . . . . . . . . . . . F3 . . . . . . . . . . . . . . . . . . F4 . . . . . . . . . . . . . . . . . . F5 . . . . . . . . . . . . . . . . . . F6 . . . . . . . . . . . . . . . . . . F7 . . . . . . . . . . . . . . . . . . F8 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . List of Figures 3.1 3.2 3.3 4.1 4.2 4.3 4.4 4.5 4.6 4.7 4.8 4.9 4.10 4.11 4.12 4.13 4.14 4.15 4.16 4.17 5.1 5.2 5.3 5.4 5.5 6.1 A.1 D.1 D.2 D.3 D.4 D.5 D.6 D.7 E.1 E.2 E.3 System architecture. . . . . . . . . . . . . . . . . . . Drone movement. . . . . . . . . . . . . . . . . . . . . Hardware integration and communication methods. . Traffic direction learning. . . . . . . . . . . . . . . . Running average comparison. . . . . . . . . . . . . . Stage 3 of road profile creation. . . . . . . . . . . . . Vehicle shadows. . . . . . . . . . . . . . . . . . . . . Cast shadow detection. . . . . . . . . . . . . . . . . Morphological transformations. . . . . . . . . . . . . Object detection. . . . . . . . . . . . . . . . . . . . . Scaling factor calibration. . . . . . . . . . . . . . . . Velocity vectors. . . . . . . . . . . . . . . . . . . . . Colour thresholding. . . . . . . . . . . . . . . . . . . Tracking targets. . . . . . . . . . . . . . . . . . . . . Target position concepts. . . . . . . . . . . . . . . . Aircraft thrust to angular acceleration relationship. . Pitch and Roll plant response. . . . . . . . . . . . . Yaw and Gaz plant response. . . . . . . . . . . . . . Open-loop plant dynamics. . . . . . . . . . . . . . . Simplified block diagram. . . . . . . . . . . . . . . . Cast shadow detection results. . . . . . . . . . . . . Night-time system results. . . . . . . . . . . . . . . . Practical step response. . . . . . . . . . . . . . . . . Pitch controller response comparison. . . . . . . . . Landing coordinate scatter plot. . . . . . . . . . . . Advanced landing platform . . . . . . . . . . . . . . Project plan. . . . . . . . . . . . . . . . . . . . . . . Computer vision hierarchy. . . . . . . . . . . . . . . Test video snapshots. . . . . . . . . . . . . . . . . . . Velocity estimation experimental setup. . . . . . . . Vehicle velocity calculation. . . . . . . . . . . . . . . Online dashboard. . . . . . . . . . . . . . . . . . . . Traffic flow graphs. . . . . . . . . . . . . . . . . . . . Graphical User Interface. . . . . . . . . . . . . . . . Drag diagram. . . . . . . . . . . . . . . . . . . . . . Full simulation block diagram (Pitch/Roll). . . . . . Full simulation block diagrams (Yaw and Gaz). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 12 14 17 18 18 21 22 23 24 26 27 30 31 32 33 34 35 36 36 41 44 45 46 47 50 56 67 68 69 70 71 72 73 75 77 78 ix LIST OF FIGURES E.4 E.5 E.6 E.7 E.8 E.9 E.10 E.11 F.1 F.2 F.3 F.4 F.5 F.6 F.7 F.8 Inner and outer-loop step response. . . . . . . . . . . . . . . . . . . . . . . . . Closed-loop step response of the theoretical controller (Yaw). . . . . . . . . . Closed-loop step response of the theoretical controller (Gaz). . . . . . . . . . Constant oscillation during ZN tuning. . . . . . . . . . . . . . . . . . . . . . . Practical step response of the plant with ZN tuned parameters (Pitch/Roll). Practical step response of the plant with ZN tuned parameters (Yaw). . . . . Integral wind-up. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . UAV On Screen Display. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Flow chart: Road profile creation. . . . . . . . . . . . . . . . . . . . . . . . . Flow chart: Background modelling. . . . . . . . . . . . . . . . . . . . . . . . . Flow chart: Shadow removal. . . . . . . . . . . . . . . . . . . . . . . . . . . . Flow chart: Traffic speed and volume analysis. . . . . . . . . . . . . . . . . . Flow chart: Metric computations and sending. . . . . . . . . . . . . . . . . . Flow chart: Target tracking. . . . . . . . . . . . . . . . . . . . . . . . . . . . . Flow chart: PID controller. . . . . . . . . . . . . . . . . . . . . . . . . . . . . Flow chart: Landing algorithm. . . . . . . . . . . . . . . . . . . . . . . . . . . x . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79 80 81 82 83 84 85 86 88 89 90 91 92 93 94 95 List of Tables 4.1 4.2 4.3 5.1 5.2 5.3 5.4 5.5 6.1 B.1 C.1 Shadow removal designs. . . . . . . . . . . . Level Of Service characterisation. . . . . . . Example AT commands. . . . . . . . . . . . Background modelling results. . . . . . . . . Shadow removal results. . . . . . . . . . . . Vehicle speed results: Test vehicle 1. . . . . Vehicle speed results: Test vehicle 2. . . . . Traffic flow estimation results. . . . . . . . . Objectives and achievements cross-reference System specifications. . . . . . . . . . . . . Outcomes compliance cross-reference table. . . . . . . . . . . . . . . . . . . . . . . . . table . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22 28 29 40 41 42 42 43 49 60 65 xi NOMENCLATURE xii Nomenclature Acronyms IR TMC XML UAV TLS CLS TFDC DT CT TF PID SISO QDR PRC BS MoG RA TMS PHF LOS GUI GPRS USB GSM AT CG RF HD 720P AVC SDK FFMPEG GPS HSV Infra-Red Traffic Management Center Extensible Markup Language Unmanned Aerial Vehicle Total Least Squares Constrained Least Squares Transfer Function Determination Code Discrete-Time Continuous-Time Transfer Function Proportional Integral Derivative control Single Input Single Output Quarter Decay Response Process Reaction Curve Background Subtraction Mixture of Gaussians Running Average Time Mean Speed Peak Hour Factor Level Of Service Graphical User Interface General Packet Radio Service Universal Serial Bus Global System for Mobile ATtention commands Center of Gravity Radio Frequency High Definition 720 Progressive scan lines Advanced Video Coding Software Development Kit File Format Moving Picture Experts Group Global Positioning System Hue Saturation Value NOMENCLATURE RGB QR IMU RL HIL ROI GMM FPS HUD DAT CSV FOV RoC FFC xiii Red Green Blue Quick Response Inertial Measurement Unit Root Locus Hardware In the Loop Region Of Interest Gaussian Mixture Model Frames Per Second Heads Up Display DATa file Comma Separated Values Field Of Vision Rate of Climb Front-Facing Camera NOMENCLATURE xiv List of symbols used z G(z) u(t) e(t) Kc TD TI Θ Φ Ψ X̄ C y T TP Mp FT FF m g Fx a ρ ν CD A aD aF anett Ts µk Σk β1 β2 τS τH CF 1 CF 2 My I θ̈ e Complex variable relating to the Z-Transform Discrete transfer function Control signal Error signal PID controller gain Derivative time Integral time Pitch angle Roll angle Yaw angle TF variable result Regressor Measured response Matrix transpose Peak time Maximum overshoot Force due to thrust Forward force Mass Gravitational acceleration Forces in the x direction Acceleration Mass density Velocity Drag coefficient Reference area Deceleration due to drag Forward acceleration Nett acceleration Sample Time Class mean Class covariance Lower luminosity threshold Upper luminosity threshold Saturation threshold Hue threshold Velocity conversion factor: stage 1 Velocity conversion factor: stage 2 Angular moment Moment of inertia Angular acceleration Rotation vector [s] [s] [degrees] [degrees] [degrees] [s] [%] [Newton] [Newton] [kg] [m/s2 ] [Newton] [m/s2 ] [kg/m3 ] [m/s] [m] [m/s2 ] [m/s2 ] [m/s2 ] [s] [meters/pixel] [N.m] [kg.m2 ] [rad/s2 ] CHAPTER 1. INTRODUCTION 1 Chapter 1 Introduction: Drone-based traffic flow estimation 1.1 Project background According to the National Traffic Information System, there are currently around 11 million registered vehicles on South African roads [1]. This number is increasing at an alarming rate, which requires that roads be upgraded continually. The study of traffic flow estimation is used to evaluate how well a particular road segment is accommodating traffic, as well as to determine the priority of road upgrades. Traffic flow monitoring in urban areas allows traffic engineers to determine typical road usage. This information is then used to plan for future road developments and to modify traffic control strategy. Current traffic monitoring techniques make use of intrusive static sensors in the form of inductive loop detectors, IR detectors and radar guns [2]. Visual monitoring is often done manually, with the operator watching hours of video footage while counting the cars as they pass through an area. Two of the significant problems associated with the above-mentioned techniques, is that they are both intrusive and time-consuming. Traffic cameras are mounted around most urban areas and are used primarily for security reasons. In the City of Cape town alone, there are around 300 traffic cameras streaming live video directly to the TMC database. The cameras cover the majority of the roads throughout Cape Town, and would therefore provide unparalleled access to essential video data. There are some areas throughout Cape Town that are not yet monitored by traffic cameras. The cameras and related infrastructure are expensive to install, and would require many man hours to complete. A particularly attractive solution to this problem, would be to erect simple landing platforms that would allow an automated Unmanned Aerial Vehicle (UAV) to conduct fully autonomous traffic flow analysis. CHAPTER 1. INTRODUCTION 2 Various methods of traffic flow theory have been investigated throughout the years. A rather intriguing approach, is to apply fluid dynamic principles to arrive at a qualitative description of the flow-density curve as described by H. Greenberg in [3]. For this project, the traffic flow methods explained by the Transportation Research Board in the Highway Capacity Manual [4], will be used as the analysis criteria. The aim of this project is to make use of pure computer vision techniques to automatically compute traffic statistics along road segments. The project will focus primarily on uninterrupted flow in the form of freeways and national highways. A characterising feature of uninterrupted flow, is that it has no fixed cause of delay or interruption external to the traffic steam [4]. That is to say, that there are no delays caused by traffic control devices such as traffic lights or road signs. The idea is to make the system as flexible as possible to maximise the capabilities of the estimation techniques. In order to include the functionality of an autonomous UAV, some form of controller is required. A feedback control system, working in tandem with computer vision techniques, was designed and tested so as to include functionality of the automated UAV. One of the main reasons for optimising flexibility comes from the idea of using both pole-mounted traffic footage, as well as footage obtained from other sources such as, but not limited to, the UAV’s onboard camera. 1.2 Objectives Main objective: Autonomous estimation of key traffic flow descriptors This project objective is divided into a traffic flow estimation component and a drone control component. The former performs the traffic analysis through computer vision, and the latter provides a novel means for obtaining traffic footage. The following list gives a comprehensive description of the key objectives to be achieved upon completion of the project: 1: Traffic flow estimation Objective 1.1: Remove any occlusion effects that might hinder performance Objective 1.2: Identify vehicles on the road Objective 1.3: Count the number of vehicles passing over a particular road segment Objective 1.4: Determine the relative velocities of the vehicles Objective 1.5: Automatically compute traffic flow metrics Objective 1.6: Upload the information in real-time to an online dashboard CHAPTER 1. INTRODUCTION 3 2: Automated drone control Objective 2.1: Design a target tracking system Objective 2.2: Design a control system which will automate UAV flight control Objective 2.3: Design and implement an automated landing algorithm for the UAV 1.3 Methodology The software in this project was designed according to a highly interactive design technique associated with modular programming. A truly modular environment is one in which modules are constructed, debugged, verified, and compiled in a module-by-module fashion [5]. Due to the highly complex and interactive nature of this project, the modular systems approach provides robust design techniques to ensure optimised communication between functional modules. In essence, this project can be separated into two primary phases. The first phase involves the design of a highly flexible computer vision system, used to automate traffic flow analysis from various video sources. The second phase is associated with the design of a feedback control system, used to automate the UAV target tracking and landing functionality. The software was developed in Python due to its mathematical manipulation capabilities and seamless hardware connectivity. All control system simulations were designed in Matlab Simulink before being implemented in the Python environment. 1.4 Demonstration video A demonstration video of the complete system is available online at: http://goo.gl/jT7lke 1.5 Report layout Associated literature is explored in chapter 2, while chapter 3 explains the system overview. The detailed system design of phase 1 (Computer vision) and phase 2 (UAV control system) are explained in chapter 4. Project results are explained in chapter 5, followed by a general conclusion and discussion on future developments in chapter 6. CHAPTER 2. RESEARCH AND ASSOCIATED LITERATURE 4 Chapter 2 Research and Associated Literature Techniques and methods used in exploring the problems addressed in this project were examined in the course of this study. This chapter seeks to explain prior research conducted, with a specific focus on the techniques selected for use in the project. 2.1 Computer vision support for traffic detection Computer vision, in its simplest form, is a discipline that studies how to interpret and understand real-world scenes through the eyes of a non-human entity. In essence, computer vision tries to replicate human vision using computer hardware and software at different levels. Figure D.1 on page 67 shows the typical computer vision hierarchical structure. Computer vision, in its broader sense, is applied throughout almost every known discipline. A few of these applications are listed below: • Military: Seeker missiles, vision guided Rocket Propelled Grenades (RPG), UAV flight systems, helicopter pilot interactive HUD, sense-and-avoidance systems etc. • Security: Vision based alarm systems, foreign object detection, facial recognition, retinal scanning etc. • Entomology: 3D modelling of insects, surface area identification, insect swarm pattern recognition etc. • Transportation: Smart Vehicle safety systems, collision avoidance, remote sensing, driver vigilance monitoring etc. • Robotics: Simultaneous Localization And Mapping (SLAM), obstacle avoidance, assembly, Human Robot Interaction (HRI) etc. • Industrial Automation: Object sorting, industrial inspection, document understanding etc. CHAPTER 2. RESEARCH AND ASSOCIATED LITERATURE 5 OpenCV is an Open Source Computer Vision library originally designed by Intel in 1999 [6]. It provides a library of highly optimised, portable functions that are designed primarily for computer vision applications. Sections 2.1.2 and 2.1.3 explore the two core computer vision techniques implemented in this project. 2.1.1 Camera calibration and Pose estimation The human brain is capable of successfully interpreting a three-dimensional scene based on a two-dimensional image [7]. Using pure computer vision techniques and machine learning algorithms, various methods of replicating this human perception capability have been proposed. The concept of Pose estimation allows for three-dimensional interpretation of a two-dimensional pictured scene [7]. Pose estimation relies on the correct estimation of the camera pose i.e. the location and orientation of the camera. A mathematical camera model is used to successfully project 3D points onto an image plane. The camera model consists of both intrinsic and extrinsic parameters that uniquely describe various aspects of the camera setup. The intrinsic parameters (K R3×3 ) depend on certain properties of the manufactured camera e.g. focal length and optical centres. These parameters are obtained using camera calibration techniques. Extrinsic parameters of the camera model ([R|t] R3×4 ) relate 3D points from their representation in a real-world coordinate system to their representation in the camera coordinate system [7]. This relation describes the camera position or Pose, and therefore contains information regarding the relative translation (t) and rotation (R) of the coordinate axes. p = K[R|t]P (2.1) The point mapping is shown in equation 2.1, where p represents the 2D points and P the corresponding real-world coordinates. 2.1.2 Background subtraction Background subtraction (BS) techniques are widely used to separate moving objects from a static background scene [8]. BS is seen as a fundamental step in applications involving traffic monitoring, human motion capture, video surveillance, etc. [8]. One of the key steps in the background subtraction process is to generate a model of the static background scene. Foreground segmentation can only commence once a static background model is realised. There are a number of different BS techniques described by M. Piccardi in [9]. Most common among these techniques are Running Gaussian Average, Mixture of Gaussians, Temporal Median Filter, Kernel Density Estimation and Eigenbackgrounds. Among these, the Mixture of Gaussian (MoG) technique is the most widely used. A MoG distribution is used to model each individual pixel resident in a frame. The pixels are classified as part of the static background based on their persistence and variance. In other words, pixels that remain in the scene for a longer period of time than other pixels (persistence), and whose luminance does not vary much with time (variance) are assumed to be part of the background. The background subtraction algorithm needs to solve two problems simultaneously with every new frame. The first task is to assign the new observed value xt to the best matching distribution, while the second task involves updating the model parameters [8]. CHAPTER 2. RESEARCH AND ASSOCIATED LITERATURE 2.1.3 6 Optical flow tracking Optical flow tracking provides an elegant way of determining the apparent motion of image objects between consecutive frames. The optical flow tracking algorithm creates a 2D vector field, where each vector is a displacement vector showing the movement of points between consecutive frames [10]. This method was used to determine the displacement of the pixels representative of moving vehicles. If the displacement of the pixels is known, and the frame rate is also known, the relative velocity of the moving vehicle can be determined. For computational reasons, it is highly impractical and extremely inefficient to track absolutely every moving pixel in the frame. A method proposed by J. Shi and C. Tomasi in [11] explains a feature selection criterion that is optimal by construction. They propose a way of monitoring the quality of image features during tracking by using a measure of feature dissimilarity [11]. The OpenCV platform makes use of a "Good features to track" method which finds the N strongest corners in the image by the Shi-Tomasi method. Once the tracking features have been obtained, the points can be used by the optical flow tracking algorithm to determine relative pixel displacement between consecutive frames. Several assumptions are made for the optical flow tracking algorithm. The first is that the pixel intensities of an object remain constant between consecutive frames. The second assumption is that neighbouring pixels have a similar motion [10]. The optical flow equation is represented by equation 2.2: [10] fx = u= ∂f ∂f ; fy = ∂x ∂y dy dx ;v = dt dt (2.2a) (2.2b) In equation 2.2 ∂x and ∂y represent the pixel displacement in the x and y directions respectively. The time taken between each frame is represented by ∂t, while f x and f y are readily available by computing the image gradients. However, (u, v) is an unknown and thus equation 2.2 cannot be solved with two unknown variables [10]. B. Lucas and T. Kanade proposed a method of solving the optical flow equation in [12] by using the least square fit method. The OpenCV platform makes use of the methods proposed in [12] to receive optical flow vectors of the apparent object motion. 2.2 Traffic metric computations and online reporting Traffic engineering is a branch of civil engineering and makes use of engineering principles to design and develop highly optimised traffic systems. The main objective of traffic engineers is to develop safe and efficient traffic flow systems [13]. Traffic engineers make use of traffic flow metrics to evaluate a particular road segment’s ability to accommodate traffic. The need for road upgrades and traffic control devices, such as traffic lights, are then determined based on further analysis of the traffic metrics. To meet the objectives stated in section 1.2, traffic metrics must be uploaded to an online dashboard. The online dashboard provides a graphical display platform where the traffic flow metrics can be accessed via a web browser. CHAPTER 2. RESEARCH AND ASSOCIATED LITERATURE 2.2.1 7 Traffic flow computations Volume and flow rate are two metrics that quantify the amount of traffic passing a specific point on a lane or roadway during a given interval of time [4]. Time intervals are usually expressed in terms of annual, daily, hourly or sub-hourly periods [4]. Flow rate describes the equivalent hourly rate at which vehicles pass a given point on a roadway section during a sub-hourly time interval (usually a 15 minute interval) [4]. It is important to distinguish between volume and flow rate as these two concepts are often confused. Volume can be explained as being the total number of vehicles passing a point on a roadway section, while flow rate represents the number of vehicles passing a point during a sub-hourly interval, but expressed as an equivalent hourly rate [4]. Volume and flow rate are variables that help quantify demand, and are representative of the number of vehicles that desire to use a given system element (roadway) during a specific time period [4]. Although volume and flow rate reveal important information regarding a specific roadway, these are not the only factors that are considered by traffic engineers when analysing a roadway element. In this project, there are several traffic metrics that are computed before being uploaded to the online dashboard. These metrics include Time Mean Speed (TMS), Peak Hour Factor (PHF), Density, Flow Rate, Volume and Level of Service (LOS). TMS is the arithmetic average of the observed vehicle velocities. PHF is the ratio of total hourly volume to the peak Flow Rate. Density is the number of vehicles occupying a given length of a lane or roadway at a particular instant. LOS is a performance measure used primarily to describe the relative traffic condition of an uninterrupted flow element (national highway) [4]. 2.2.2 Trintel online dashboard The Trintel system is described as an online telemetry and control visualisation platform. The platform allows control over devices connected via the mobile network through a GSM modem. A scalable Machine to Machine (M2M) architecture is provided by the system, and accommodates seamless web development through an interactive service. Metrics that are required to be uploaded to the platform are initialised via an XML editor called AirVantage, provided by the Sierra Wireless group. Once the metrics have been initialised, commands are sent to the wireless modem in the form of AT strings. The Sierra Wireless modem is connected to the serial port on the Linux Ground Station (laptop computer) via a USB to Serial converter. The modem prepares the AT strings as packets that can then be sent over the GSM network. Once received, the Trintel platform interprets the data and stores the information in a centralised database. The interactive online web development environment, provided by the SMART platform, accommodates the development of a visual dashboard. The developer is able to create a custom dashboard layout in a way that would best represent the type of information being sent to the platform. The information stored in the database is then displayed on the visual dashboard in a graphical format. The platform can be accessed remotely, and only requires the device to have an active Internet connection. This feature allows the user to view the dashboard, in real-time, from anywhere in the world. Alerts in the form of text messages and emails are an added functionality. Users can CHAPTER 2. RESEARCH AND ASSOCIATED LITERATURE 8 define device-specific conditions under which these alerts should be triggered. Alerts are usually triggered to inform the user of important information pertaining to impending critical system failures. In addition to displaying device information, commands can also be sent from the dashboard to the device through a similar protocol. Switch commands, triggered from the dashboard, are logged as jobs to be serviced on the device MSISDN number. Since commands cannot be pushed to an inactive device, the modem is required to poll the platform as a request for any pending jobs relating to the specific MSISDN identifier. The service was sponsored by Trintel and MTN. 2.3 Control systems and supporting computer vision The automated target tracking and landing functionality of the UAV requires a fully functional feedback control system. A control system is an interconnection of components working together to provide a desired function [14]. A feedback control system relies on measurements of the plant response, which are then fed back in a closed-loop system to the controller. Due to the nature of the plant (UAV) and the accompanying software, a digital control system was required. A digital control system uses digital hardware, usually in the form of a programmed digital computer as the heart of the controller [14]. Analogue controllers on the other hand, are composed of analogue hardware and make use of continuous-time signals. Continuous-time signals, as the name suggests, are defined for all time, while discrete-time signals are defined only at discrete time intervals [14]. These discrete intervals of time are usually evenly spaced and determined by the sampling time of the system. The sample time is a design choice based on the bandwidth of the system. Discrete-time signals are associated with digital controllers, and are implemented on a digital machine based on sampled measurements taken at regular intervals. Before a reliable control system can be developed, a mathematical model of the plant needs to be obtained. For practical systems, this is often the most challenging step of the design process as non-linearities and external entities often hinder the modelling accuracy. 2.3.1 System identification Total Least Squares (TLS) and Constrained Least Squares (CLS) are methods used for system identification. Transfer Function Determination Code (TFDC) is a frequency domain system identification code, for determining the SISO transfer function of a system, from experimentally derived frequency response data [15]. The method makes use of experimentally obtained data, which is applied to the TFDC system identification algorithm to determine the mathematical model of the plant. The data is obtained by first generating a random binary sequence to serve as reference inputs to the plant. The response of the plant to the random reference inputs is measured, and used to obtain a mathematical model of the plant. Using frequency response data as input, TFDC computes the numerator and denominator coefficients of the discrete-time transfer function, G(z), given by [15]: G(z) = αn z n + αn−1 z n−1 + · · · + α1 z + α0 βn z n + βn−1 z n−1 + · · · + β1 z + β0 (2.3) CHAPTER 2. RESEARCH AND ASSOCIATED LITERATURE 9 The TF, described by equation 2.3, is represented in the z-domain and is therefore characteristic of a discrete-time system. The use of the z-transform technique runs parallel to that of the Laplace transform used in the analysis of continuous-time systems. The method of least squares computes the coefficients in equation 2.3 by finding an iterative, weighted least squares solution to the equations. The resulting solution vector contains the coefficients of the assumed transfer model [15]. 2.3.2 PID control Proportional-Integral-Derivative (PID) is a widely used feedback control system which is versatile enough to control a variety of industrial processes [14]. The PID controller is described in the time domain by the following equation: u(t) = Kc 1 e(t) + TI Z t 0 de(t) e(t)dt + TD dt (2.4) Where e(t) is the error and u(t) is the controller output signal. From equation 2.4 it is apparent that the PID controller has three adjustable parameters that need to be optimised. The different optimisation techniques will be explored in section 2.3.3. The derivative action anticipates the error, initiates an early corrective action, and tends to increase the stability of the system [14]. The integral term ensures a zero steady state error in the presence of disturbances and changes in the set-point. 2.3.3 Parameter estimation In order to make use of a PID controller, the Proportional, Integral and Derivative gain parameters need to be obtained. The theoretical and systematic approach used to obtain these gain parameters is to sketch a Root Locus of the plant model and then design the controller to meet the desired closed-loop specifications. This design method is highly dependent on an accurate mathematical model of the plant which is quite often a challenge to obtain for physical objects. Obtaining a plant model is often a cumbersome exercise and in extreme cases almost impossible. Non-linearities, such as drag effects and other external disturbances, make modelling the plant problematic. Tuning methods like the one proposed by J.G. Ziegler and N.B. Nichols, provide an elegant solution to this design limitation. The Ziegler-Nichols tuning method was proposed by J.G. Ziegler and N.B. Nichols around 1940 [14]. The method proposes a way of obtaining the unknown parameters presented in equation 2.4. An advantage of using this method of parameter tuning, is that it does not rely on a mathematical model of the plant. The Ziegler-Nichols tuning method is based on the determination of ultimate gain and period. Once the ultimate gain and period is obtained, the parameters can be tuned for a specified response [14]. The parameters can be tuned based on either the quarter-decay ratio (QDR) response or the process reaction curve (PRC) [14]. CHAPTER 3. SYSTEM OVERVIEW 10 Chapter 3 System Overview This chapter provides a high-level description of the overall system structure. It describes the capacity of the system, what it entails, the component selection, the specific external interactions as well as the overall system integration. 3.1 Functional overview The automated drone-based traffic flow estimation system can effectively be separated into two parts. The first part consists of the computer vision system used to detect and calculate vehicle velocities for calculation of the traffic statistics. The second part involves the automated target tracking and landing system for the UAV. A centralised Graphical User Interface (GUI) was developed in the Python programming environment using the TKinter GUI library. Figure D.7 on page 73 shows a screenshot of the main user interface. The file drop-down menu, in the top left corner, houses all of the functionality of the system. The drop down menu displays the following options: Road Profile Creation, Drone Control, Traffic Tracking From File, Traffic Tracking From Drone, Plot Traffic Data and Open Data Files. The GUI contains various text input fields, which enable the user to specify the source video to be analysed, and to provide the system with a reporting interval for data submission. The GUI was designed based on two key concepts; convenience and simplicity. From the drop down menu, the user can select the Drone Control option which will activate the drone control system and initiate the UAV flight sequence. Once the UAV has landed on the platform, the traffic tracking algorithm will perform in the exact same way as it would from a pole-mounted camera. CHAPTER 3. SYSTEM OVERVIEW 3.2 11 System architecture A generalised system architecture diagram is shown in figure 3.1. The diagram depicts the hardware components, as well as the software functional modules that will be discussed in sections 3.3 and 3.4 respectively. The system was designed in such a way as to maximise modularity, and it is clear from figure 3.1 that the system has been seamlessly integrated. A few lower-level modules have been omitted from the diagram in order to minimise clutter. These modules, however, play a vital role in the functioning of the system and are used primarily for submodule communication. Figure 3.1: System architecture. 3.3 3.3.1 Integrated hardware Hardware overview This particular project had a focus more directed toward software development as opposed to hardware integration. Hardware did, however, play a role in this project and it is essential that these components are explained for completeness of the report. There are three main hardware components that were vital to the successful operation of the system. The primary hardware component is referred to as the Linux ground station, which can simply be described as the Python environment running on a laptop computer. The ground station controls the entire system and is responsible for all cross-platform communication to be discussed in section 3.3.2. Throughout this report there is mention of the Trintel Platform. The platform can be described as an online telemetry and control visualisation platform. The platform provides an CHAPTER 3. SYSTEM OVERVIEW 12 interactive service which allows the user to operate and control a host of devices registered on the network. In addition to being able to control the devices, the platform provides an intuitive representation of the data, in a dashboard-like format, using graphs and various other display methods. To handle transmission of the data from the base station to the Trintel Platform, the Sierra wireless fastrack modem was chosen. An image of the modem is shown in figure 3.3. The Sierra wireless modem makes use of an ARM946 processor running at 104M Hz/26M Hz. The modem supports GSM, GPRS and EDGE technology, and makes use of OpenATr for handling the AT commands. Throughout the preceding chapters of this report, there has been mention of a UAV/Drone for which a control system has been designed. The function of the drone is to provide additional functionality to the system by enabling an automated tracking and landing capability. The Parrot AR Drone 2.0 was chosen as the UAV platform for this project. The Parrot AR Drone is a quadrotor [16]. The mechanical structure is comprised of four rotors attached to the ends of a crossbar structure. The battery, processing unit and RF hardware is located at the center of the device so as to maximise stability by localising the Center of Gravity (CG). To prevent the inertia of the rotors from causing the aircraft to spin, a helicopter relies on the tail rotor to supply the opposing force. In the case of a quadrotor, there is no tail rotor, and so another technique is employed to stabilise the aircraft. The quadrotor consists of two pairs of rotors each spinning in the opposite direction. One pair spins clockwise, while the other spins anti-clockwise [16]. Figure 3.2 shows the basic drone movements: Figure 3.2: Drone movement. Figure adapted from [16]. There are four basic manoeuvres that need to be controlled simultaneously during the drone’s automated flight. The manoeuvres are obtained by changing pitch, roll, yaw and Rate of Climb (RoC). Figure 3.2 shows the relative directions of each of these parameters. To control the pitch angle (θ), the front and back rotor speeds are varied to provide the required pitch reference. The left and right rotor speeds are varied in the same way to control the roll angle (Φ). To control the yaw movement (Ψ), opposing motor pair speeds are varied correspondingly to provide the required rate of rotation [16]. CHAPTER 3. SYSTEM OVERVIEW 13 Information from the various onboard sensors is required for the implementation of the feedback control system discussed in chapter 4. The inertial measurement unit (IMU), located within the drone’s main hull, provides the software with pitch, roll and yaw measurements [16]. An ultrasound telemeter provides altitude measurements for altitude stabilisation and assisted vertical speed control [16]. The drone’s downward facing camera provides ground speed measurements for automatic hovering and trimming [16]. The feedback PID control system, discussed in chapter 4, makes use of the sensor measurements to control the reference commands sent to the drone. Once computed, the input reference commands are sent back to the drone as a percentage of the maximum set-point value. The maximum tilt angles are set at 12◦ , and the maximum rate of rotation is set at 90deg/s. The gaz reference is also sent as a fraction of the maximum climb rate of 1m/s. In addition to the onboard sensor measurements, the feedback control system makes use of the drone’s live video feed from the HD 720P front-facing camera (FFC). The Python SDK, used to interface with the drone, was designed according to the Parrot AR Drone version 1.0 architecture. Since the version 2.0 makes use of H.264 embedded video stream, the video decoding library had to be rewritten before it was compatible for use with this project. The video pipeline works as follows: 1. Drone sends video in PAVE (H.264 wrapper) format 2. Python handles socket connection and unwraps PAVE format data 3. H.264 data gets sent to ffmpeg in a pipe 4. ffmpeg sends raw frame data back in pipe 5. Python reads the raw frame data and converts to Numpy arrays 6. Numpy arrays are interpreted and manipulated by the OpenCV platform Once the video stream is successfully decoded, a checkerboard tracking algorithm is used to determine the relative distance to the center of the target. The relative distance is then used in the outer-loop of the control system (see figure 4.17) to control the position of the drone relative to the target. A more detailed explanation is given in chapter 4. 3.3.2 Hardware communication Figure 3.3 shows the hardware components and their corresponding methods of communication. The ground station communicates with a GSM modem via a USB-Serial connection. Commands are sent from the ground station to the modem as simple AT strings. The modem interprets these strings, and prepares the IP packets to be sent over the mobile network. The modem transmits the data at regular intervals specified by the user. Data is transmitted to the Trintel Database via the GSM network, where it is interpreted by the Trintel platform, and displayed graphically on the dashboard. The ground station communicates with the drone via its Wi-Fi module as shown in figure 3.3. The SDK network library handles the network interfacing between the ground station and the drone. As mentioned in section 3.3 above, reference angles and angular velocity references CHAPTER 3. SYSTEM OVERVIEW 14 Figure 3.3: Hardware integration and communication methods. are sent to the drone as fractions of a maximum. The commands are sent via the Wi-Fi network as AT strings which are interpreted by the drone’s onboard processor. The thrust generated by each motor is dependent on the speed of rotation of the blades and the corresponding blade pitch. The blade pitch remains constant, while the speed of the motors is directly proportional to the supply voltage. The drone’s processor and embedded onboard control system regulate the voltage supply to each of the four motors, thereby controlling the drone’s relative movements. 3.4 Software functional modules Figure 3.1 depicts the manner in which the software modules are integrated and shows the crossmodule communication pipelines. It is, however, important to first discuss the internal structure of these modules before moving to the system design chapter where extensive references to the modules and submodules are made. From the GUI module, there are three main modules to which the flow of control is offloaded. The three modules shown in figure 3.1 are: Utilities, Traffic tracking algorithm and Drone control. The Utilities module is primarily comprised of helper functions used by higher order submodules. Some of the helper functions include OSwalker, getRoadProfile, getHSV, setHSV, sendToServer and trainBackground to name a few. The Utilities module handles all communication with the wireless modem, and is called extensively by the GUI module. The Traffic Tracking module is responsible for two primary functions. The first pertains to the shadow removal algorithm. The second function contained within this module, handles both the vehicle detection as well as the optical flow tracking algorithms that form the basis of this project. The Traffic Tracking Algorithm can be seen as the project’s core module, as it provides the basic functionality required for object detection and all corresponding speed calculations. CHAPTER 3. SYSTEM OVERVIEW 15 The balance of the modules discussed throughout the report, provide some degree of additional functionality to the project’s core module. The Drone Control module contains all functions relating to the autonomous flight of the drone. The main function within this module makes use of the checkerboard tracking function, the PID controller function and the AR Drone library to send reference inputs to the drone. The sendToServer function, within the Utilities module, is used to send import information regarding the drone’s status (signal strength, battery percentage, etc.) to the Trintel Platform. 3.5 Discussion Throughout the design phase of the project, cognisance was taken of the intricate system interactions so as to ensure a streamlined post-integration procedure. Care was taken to maximise the modularity of each subsystem so that the system as a whole could be optimised to its fullest extent. CHAPTER 4. DETAILED SYSTEM DESIGN 16 Chapter 4 Detailed System Design The aim of this chapter is to describe a comprehensive and detailed design of each subsystem. The design phase is separated into two subsystem designs. The first detailed design is concerned with the traffic flow estimation process, with specific focus on the supporting computer vision techniques. The second subsystem design focuses on the control system and additional computer vision techniques used in the automation of the UAV flight control. The various options and design trade-offs are critically discussed throughout each section. 4.1 Traffic flow estimation The traffic flow estimation system provides the fundamental component of this project. The algorithm is required to automatically detect the number of vehicles that pass through a given area, as well as to determine their relative velocities. Once the vehicles are detected and their velocities estimated, they are classified according to relative blob area. A particularly challenging part of this project was to design a system that relied entirely on visual references. The system should therefore not make use of any intrusive hardware in the form of Inductive loops or Infra-Red counters. The possible integration of this hardware, however, would alleviate many difficulties with regards to obtaining traffic data. The idea behind this project was to design a non-intrusive system that made use of existing traffic cameras (and platforms) placed around a city. It is important to note that traffic cameras need not be the only source of video feed. As mentioned in section 1.1, the idea is to eventually incorporate a UAV into the system that can autonomously fly to remote locations which might not currently have an established traffic camera network. The system is therefore required to be extremely flexible in order to accommodate a variety of different video sources. The system relies heavily on highly adaptive computer vision techniques to compute all traffic metrics. CHAPTER 4. DETAILED SYSTEM DESIGN 4.1.1 17 Road profile creation One of the primary aims of this project was to design a system that would be able to adapt to any environment. Every road location is completely unique in the way in which traffic cameras are placed. This causes a potential problem, especially when computing relative vehicle velocities as well as classifications based on relative vehicle sizes. An elegant and particularly robust solution was developed to deal with this problem. The idea was to create and save road profiles that would store all location-specific information. Due to the static nature of the pole-mounted traffic cameras, the road profiles would only need to be generated once for each location. If the drone is to be used for traffic analysis, a location profile would have to be generated each time it lands to accommodate for orientation-specific parameters. In an attempt to make the road profile creation process a more user-friendly experience, an interactive, self learning method was designed. The method involved a three-stage creation process with the first stage being fully autonomous, and the last two requiring some basic user input. Once the user has input the necessary parameters, the system stores the location-specific data in a uniquely identifiable DAT file. The functions that deal with the road profile creation process form part of the Utilities module explained in section 3.4. As mentioned above, there are three stages that form part of the road profile creation process. The first stage involves determining the particular traffic direction required by the Traffic Tracking algorithm. This step is designed to be fully autonomous, in the sense that optical flow techniques are used to detect vehicle trajectories automatically. A more detailed explanation of optical flow tracking is discussed in section 4.1.5. Figure 4.1 shows a single frame taken during the traffic direction learning process. Figure 4.1: Traffic direction learning. The red arrows superimposed on the forward moving vehicles are characteristic of the relative vehicle trajectories. Based on the direction of these vectors, a likelihood function determines the traffic direction based on a binary classification technique. The relative vehicle direction is classified as either being horizontal (as in figure 4.1), or vertical, if the traffic arrives from the top of the frame. CHAPTER 4. DETAILED SYSTEM DESIGN 18 Stage two requires a certain degree of human input. Specific regions of interest (ROI) are required by the system to track and count vehicles within the user defined area. Before the user is requested to specify the ROI, a location-specific background scene is generated by the system. An elegant way of obtaining the static background is to use the method of a running average. This technique effectively removes all dynamic objects from the corresponding static scene. The idea behind using a running average is to compute the average pixel value after a consecutive number of frames. The more persistent pixel values, characteristic of the static scene, will influence the weighted average more than the brief pixel value of the dynamic object. Figure 4.2b shows the result of the running average in obtaining the static background scene. (a) Original frame. (b) Running average. Figure 4.2: Running average comparison. The user is now required to "drag and drop" lines over the static background to specify the ROI. In addition to this, stage three requires that the user fill in a form containing various location-specific parameters. Figure 4.3 shows an example of stage three. A software flow diagram of the road profile creation process is shown in Appendix F1 on page 88. Figure 4.3: Stage 3 of road profile creation. 4.1.2 Background modelling The key concept behind this project entails being able to successfully identify and track objects in a video stream. The Background Subtraction (BS) technique, for use in computer vision, is designed to successfully differentiate a moving object from its corresponding static background scene. The BS technique was used in this project for the detection and tracking of passing vehicles. CHAPTER 4. DETAILED SYSTEM DESIGN 19 Background modelling is one of the primary and most challenging tasks for background subtraction [17]. A successful BS algorithm should be robust against sudden illumination changes, as well as other external environmental effects. In particular, the BS algorithm should be able to ignore the movements of small background elements, e.g. movement of waving leaves in nearby trees [17]. As an initial attempt to model the static background, a simple running average of the video frames was used to segment the scene. It was later found that this method was incapable of dealing with the sudden illumination changes characteristic of an outdoor environment. Further research and investigation lead to the conclusion that a more complex background modelling technique was required. Various methods have been proposed for successful background modelling. The method of choice for this project in particular, involves the use of a Mixture of Gaussians (MoG). A single Gaussian density function for each pixel is not enough to cope with non-stationary background objects, such as waving tress and other natural elements [18]. The achieved background tries to create a model based on the varying pixel intensities. A mixture of N Gaussian distributions is then generated to model each individual pixel. Background pixels are characterised based on their persistence and variance. A particularly challenging part of the background modelling process, involves the optimal tuning of the parameter set. Various methods have been proposed for the optimal tuning process. One such example makes use of an automatic tuning strategy based on partial swarm optimisation. Equation 4.1a represents the Gaussian Mixture Model (GMM) equation: p(x) = K X πk η(x|µk , Σk ) (4.1a) k=1 Where the multivariate Gaussian distribution is given by η(x|µk , Σk ) = 1 1 exp − (x − µk )T Σ−1 k (x − µk ) 1/2 2 |2πΣk | (4.1b) Where πk represents the mixture coefficients, µk the class mean, and Σk the class covariance. Background modelling consists of two primary phases. Phase one is responsible for background initialisation. A good approximation of the static background scene is obtained by making use of the aforementioned running average technique. Figure 4.2b shows the result of the running average algorithm in approximating a background scene. The approximated background is a good initialisation point for the background modelling process. Phase two is responsible for the model update. As the illumination environment changes throughout the operational life of the model, updates are required to ensure model accuracy. A typical example that illustrates the importance of the model update, would be the incorporation of the sun as it moves across the sky. Traffic flow analysis requires the observation of traffic at a specific location over an extended period of time. Throughout the day, the sun moves across the sky, and with it, the shadows on the ground. If the illumination effects, caused primarily by occlusions, are not successfully incorporated into the background model, misclassification of foreground objects will start to occur. As mentioned above, each individual pixel is modelled using a mixture of Gaussian distributions. To put this computational complexity into perspective, an 800 × 600 resolution video CHAPTER 4. DETAILED SYSTEM DESIGN 20 source contains 480 000 individual pixels. If each pixel is to be modelled by a mixture of N Gaussian distributions, the modelling process is seen to be extremely computationally expensive. The OpenCV library provides a highly optimised background subtraction algorithm called BackgroundSubtractorMOG2, which makes use of a mixture of Gaussian distributions to model individual background pixels. A particular advantage of this method, is that the algorithm automatically determines the appropriate number of Gaussian distributions for each pixel. This allows for improved adaptability in environments with many illumination changes. The BackgroundSubtractorMOG2 function was used in this project for the detection of dynamic objects. A running average is first obtained to initialise the background model with a representative background scene. Each successive frame is then sent to the BS function. The function returns the foreground mask and updates the background model accordingly. The rate at which the background model updates to accommodate new background features, is dependent on the Learning Rate parameter. A suitable value for the learning rate was optimised through empirical investigation and was ultimately selected as LR = 0.005Hz. If the learning rate of the modelling process is set too high, vehicles that are stationary for more than a few seconds are incorporated into the background model. This is an unwanted occurrence since vehicles tend to become stationary when traffic is congested. If, however, the learning rate is set too low, new background features are not incorporated into the background model fast enough, leading to degraded detection performance. Figure 4.4b shows the result of the background subtraction process with an optimised learning rate. Results of the background modelling process are explained in section 5.1.1. A software flow diagram of the process is shown in Appendix F2. 4.1.3 Shadow removal The use of the BS algorithm, discussed in section 4.1.2, does not provide a complete solution with regards to object detection. A particular disadvantage of using the MoG technique, is that the object shadows tend to be classified as part of the foreground. The reason for this, is that shadows share the same movement patterns as the objects that create them. Shadows also tend to exhibit similar pixel intensity characteristics as the corresponding foreground objects [19]. Shadows are mainly of two types; Self shadow and Cast shadow [17]. A self shadow is one that resides primarily on the object surface itself [17]. Self shadows are perceived as the darker regions on the object surface in a direction opposite to that of the illumination source [17]. A cast shadow is classified by the dark region projected onto the ground surrounding the object. This is caused by the occlusion of light due to the object position relative to the illumination source. For this project, the removal of cast shadows was the primary shadow removal objective. One of the major problems caused by cast shadows occurs when two vehicles are particularly close together. Without the implementation of a shadow removal technique, the two nearby vehicles are detected as a single object rather than two individual vehicles. This leads to erroneous results, especially during the vehicle counting process. Figure 4.4 shows a typical example of the problems caused by cast shadows. It is apparent from figure 4.4b that the two objects in the foreground mask now resemble one single entity. The blob detection algorithm, discussed in section 4.1.4, relies on the complete separation of objects in order to accurately distinguish between individual vehicles. In order to CHAPTER 4. DETAILED SYSTEM DESIGN (a) Original colour frame. 21 (b) Foreground mask. Figure 4.4: Vehicle shadows. ensure successful separation of the vehicles in the foreground mask, it was apparent that some form of shadow removal technique was required. The OpenCV library contains a method specifically designed for shadow removal operations. It was found, however, that the method provided by OpenCV is incapable of dealing with stronger cast shadows. This eventually lead to the design and implementation of a custom shadow removal algorithm. Several methods have been proposed for the successful detection of shadow pixels. These methods include detection based on Intensity, Chromaticity, Physical properties, Geometry and Temporal features to name but a few. The shadow detection technique selected for this project is based on the Chromaticity characteristics of shadows. Chromaticity is a measure of colour that is independent of intensity [19]. When a shadow moves over a red coloured object for example, the object becomes darker red but has the same chromaticity. This colour transition model, where the intensity is reduced while the chromaticity remains the same, is usually referred to as linear attenuation [19]. This method of shadow detection requires the selection of a colour space with a more pronounced separation between chromaticity and luminosity. The HSV colour space provides a natural separation between these two characteristics [19] and was therefore selected for use in this project. The idea behind the method of chromaticity is to detect shadows based on their pixel characteristics. There are three primary characteristics that distinguish shadows pixels from their non-shadow counterparts. It is known that the intensity of a shadow pixel (V in HSV) is lower than that of the corresponding background pixel [19]. Furthermore, it known that a shadow cast on the background does not change the pixel hue (H in HSV), and that a shadow pixel often exhibits lower saturation (S in HSV) characteristics [19]. A pixel p is therefore considered a shadow pixel if the following three conditions are satisfied [19]: β1 ≤ (FpV /BpV ) ≤ β2 (4.2a) (FpS − BpS ) ≤ τS (4.2b) |FpH − BpH | ≤ τH (4.2c) CHAPTER 4. DETAILED SYSTEM DESIGN 22 Where Fp and Bp represent the current and background frame pixel p respectively. The values of β1 , β2 , τS and τH represent threshold values that are optimised empirically [19]. The shadow detection function designed for this project consists of a three phase approach. Phase one involves the determination of the background pixels. This is readily available from the MoG modelling function discussed in section 4.1.2. Phase two involves the assessment of the individual pixel characteristics according to the conditions represented by equations 4.2a, 4.2b and 4.2c. Phase three involves the successful removal of the shadow pixels from the corresponding foreground mask. It is important to note that the shadows are not removed from the HSV image. Instead, the above-mentioned algorithm is used to detect the shadow pixel coordinates and subsequently remove those pixels from the foreground mask. Once the shadows pixels are removed, the foreground mask is put through a bilateral filter to remove any noise pixels. The bilateral filtering stage is also used to smooth the mask in preparation for the blob detection. As mentioned above, the threshold values are determined empirically. Figure 4.5 shows four separate design results, each using different experimental threshold values. The threshold values corresponding to each of the designs are shown in table 4.1. The original colour frame corresponding to the shadow design process is shown in figure 4.4a. (a) Shadow design 1. (b) Shadow design 2. (c) Shadow design 3. (d) Final shadow design. Figure 4.5: Cast shadow detection. Table 4.1: Shadow removal designs. Parameter Design 1 Design 2 Design 3 Final Design β1 0 0 0 0 β2 0.2 0.6 0.8 1 τs 1 1 0.4 0 τh 1 1 0.1 0.2 CHAPTER 4. DETAILED SYSTEM DESIGN 23 Results of the shadow removal process are explained in section 5.1.2. A software flow diagram of the shadow removal process is shown in Appendix F3. 4.1.4 Object detection The shadow detection and removal technique, discussed in section 4.1.3, can be seen as one of the pre-processing stages in the object detection process. During the shadow removal process, it often occurs that the objects in the foreground mask are decayed to some extent. This is more apparent with darker vehicles and ones that have a similar colour to that of the road surface. The shadow removal algorithm will sometimes classify the darker vehicle pixels erroneously as shadow pixels and subsequently remove them from the mask. In order to address the problem of object decay, the next pre-processing stage involves the application of morphological transformations. These transforms are used to improve foreground object density and ultimately to enhance detection performance. The two morphological transformations used in this project are the Dilate and Erode transformations. Figure 4.6 shows the result of applying the transformations. (a) Original image. (b) Erode. (c) Dilate. Figure 4.6: Morphological transformations: Image courtesy of [20]. The Erode transformation introduces a second level of filtering after the bilateral filter discussed in section 4.1.3. In addition to removing noise pixels, the Erode transformation also tends to reduce object density as can be seen from figure 4.6b. The Dilate transformation seen in figure 4.6c, is often used in conjunction with the Erode transformation as it tends to restore the object density. Finally, the foreground mask is put through a Gaussian blurring filter to remove the sharp edges of the vehicle object, ultimately making them appear more blob-like. Blob detection is a common object detection and tracking method used in a wide variety of applications. The algorithm works by firstly converting the source image to binary images using frame thresholding. The connected components are then extracted from every binary image using OpenCV’s findContours method [21]. Object contours are determined through the application of border-following algorithms proposed by Satoshi Suzuki in [22]. The blob detector algorithm identifies and estimates the centres of all the closed contour blobs in the image frame. Obtaining the centres of each of the blob objects is a key component in the object tracking process. Essentially, this allowed for the successful determination of the vehicle positions (and ultimately vehicle trajectories) as they move across the static background scene. Figure 4.7 shows an example of the object detection system. A vehicle tracking algorithm was designed to compute and store the x and y pixel coordinates of the individual vehicle centres. The main reason for implementing the tracking algorithm, is to enable the traffic flow estimation system to count the number of vehicles passing through a particular ROI. A counting algorithm was designed based on the relative vehicle positions obtained during the blob detection stage. The algorithm was designed to be as computationally CHAPTER 4. DETAILED SYSTEM DESIGN 24 Figure 4.7: Crosshair identifies the approximate center of the vehicle object. inexpensive as possible in an attempt to improve real-time performance of the system. The counting algorithm was designed as follows: 1. Determine and store the number of objects in the ROI for the current frame 2. For the next frame, determine if the number of objects in the ROI has changed 3. If the number has not changed, do nothing 4. If, however, the number has changed, increment the vehicle counter by the difference between the number of objects in the current and previous frame A software flow diagram of the counting algorithm is shown in Appendix F4. A limitation of the above-mentioned approach is observed when one vehicle enters the ROI at the exact same time that another vehicle leaves it. It would then seem as though the number of vehicles in the ROI had not changed between the current and previous frames. This would result in the vehicle counter not being incremented accordingly, even though a vehicle had indeed passed through the ROI. Having said this, it is assumed highly improbable that the pixel positions of two vehicle centres will leave and enter a ROI at the exact same time. The video frames contain around 480 000 pixels, thereby minimising the likelihood of this occurrence. It is important to remind the reader that this project is concerned with traffic flow estimation theory. A minimum accuracy of 75% has therefore been established as the target requirement for this project. Results of the vehicle counting are explained in sections 5.1.1 and 5.1.2. 4.1.5 Optical flow tracking Traffic flow estimation theory does not only depend on the number of vehicles passing through a specific road location, but on the relative velocities of the vehicles as well. Vehicle velocities are usually obtained using radar guns, inductive loops and IR counters [4]. However, these methods are seen as intrusive, as additional hardware needs to be incorporated into the existing road structure. A particularly attractive alternative is to use the existing camera infrastructure to automatically compute relative traffic velocities. Equation 4.3 represents the basic velocity equation based on the rate of change of object displacement. CHAPTER 4. DETAILED SYSTEM DESIGN v= 25 dp dt (4.3) In equation 4.3, dp represents the object displacement. Therefore, the idea is primarily to obtain the relative object displacements across successive frames. Various methods have been proposed as a solution to this problem as discussed in [23]. The method implemented throughout this project was Optical flow tracking. Optical flow operates under two primary assumptions. The first assumption is based on the fact that the pixel intensities of an object should remain constant between consecutive frames [24]. The second assumption is that neighbouring pixels will have a similar motion to that of the pixel under observation [24]. Let I(x, y, t) represent a pixel with coordinates (x, y) observed at time t. The same pixel is then observed dt seconds later. Based on the assumption that the pixel intensities remain constant, and assuming that the pixel has traversed dx in the x direction, and dy in the y direction, equation 4.4a is said to represent the shifted pixel. I(x, y, t) = I(x + dx, y + dy, t + dt) (4.4a) Now taking the Taylor series approximation of equation 4.4a, removing the common terms, and dividing by dt, the following equation is observed [24]: fx u + fy v + ft = 0 (4.4b) The optical flow equations (equation 2.2 on page 6) contain two unknown variables that need to be determined. Lucas and Kanade proposed a method to solve the optical flow equation in [12] by taking a 3 × 3 pixel area around the point of observation. This resulted in having to solve 9 equations with two unknown variables (over-determined) [24]. A solution was obtained using the least square fit method [24] and the equation is shown below: u Σfx2 Σfx fy = v Σfx fy Σfy2 " # " #−1 " −Σfx ft −Σfy ft # Any optical flow algorithm involves complex mathematical calculations conducted on a large number of individual pixels, therefore the implementation of a highly optimised algorithm is necessary to ensure real-time performance. The OpenCV platform includes an optical flow tracking method based specifically on the method proposed by Lucas and Kanade in [12]. The method requires unique feature points on the objects in order to track pixels accurately. According to J. Shi and C. Tomasi in [11], corners of an object are good features to track and are therefore used in the optical flow tracking process. The corner coordinates are passed to the optical flow tracking method with each successive frame. The optical flow method returns the new coordinates of the corner pixels in the next frame. The pixel displacements denoted by dx and dy, can be computed as the difference between the current and previous frame corner coordinates. According to equation 4.3, velocity v is also dependent on the change in time between two successive frames. The frame rate of the video provides information regarding the time delay CHAPTER 4. DETAILED SYSTEM DESIGN 26 between two successive frames, and is therefore the only link to relative object velocity. Frame rate is a measure of the number of frames per second. For example, let a video feed have a frame rate of 30 frames per second (30 fps). This translates to dt = 33.33ms time delay between each successive frame. With dp and dt known, the relative velocity vectors can be computed. In order to improve the accuracy of the velocity calculation, the vectors are filtered to remove any unlikely velocity estimates. It is safe to assume that all corner points on a single vehicle are moving at the same velocity. Based on this assumption, vectors that are more than one standard deviation from the mean velocity vector are flagged as erroneous. The erroneous vectors are consequently removed from the final velocity computation. The pixel displacement dp, obtained from the optical flow method, is a measure of pixel displacement. This means that the velocity calculation will result in a velocity measure with units in pixels/second rather than km/h. It was therefore apparent that some form of scaling factor needed to be introduced into the calculation. Figure 4.8: Scaling factor calibration. Figure 4.8 shows how the scaling factor is calibrated. Real-world reference objects, such as the lines on the road, are analysed to determine their pixel representative lengths. From this information, a scaling factor is generated which converts measures of pixels to meters. For this project, the velocity calculations undergo a two-stage conversion process. Stage one involves the conversion from pixels to meters, while stage two is responsible for converting m/s to km/h. The following equations demonstrate the two-stage process: CF 1 = v= meters pixels CF 2 = 3.6 dp pixels = dt s (4.5a) (4.5b) v × CF 1 = pixels meters meters × = s pixels s (4.5c) v × CF 2 = meters km × 3.6 = s h (4.5d) CF 1 and CF 2 represent the first and second stage conversion factors respectively. CF 1 is generated based on real-world measurements while CF 2 is a known constant that converts m/s to km/h. CHAPTER 4. DETAILED SYSTEM DESIGN 27 Figure 4.9: Velocity vectors. Figure 4.9 shows the velocity vectors superimposed onto the moving vehicles. The vehicle velocities in each ROI are displayed on the Heads Up Display (HUD) toward the top of the frame. The HUD also displays the total vehicle count as well as a running average of the vehicle velocities. All computations are done in real-time and estimates are updated with every passing vehicle. Results of the velocity estimation are explained in section 5.1.3. A software flow diagram of the optical flow tracking process is shown in Appendix F4. 4.1.6 Traffic flow computations Autonomous traffic flow estimation is recognised as the fundamental core of this project. Determining the total vehicle count and respective vehicle velocities, in sections 4.1.4 and 4.1.5, was a necessary step in computing the traffic flow metrics. After conducting various meetings with the transport department at Stellenbosch University, it was concluded that the following traffic metrics would be useful in describing uninterrupted traffic flow data: Time Mean Speed (TMS), Volume, Flow Rate, Density, Peak Hour Factor (PHF) and Level of service (LOS). TMS is directly associated with vehicle velocity. The TMS equation is shown below: TMS = N 1 X vi N i=1 [km/h] (4.6) Where N is the total number of vehicles and v their corresponding velocities. TMS simply represents the arithmetic mean of the observed vehicle velocities. Volume and Flow rate are two metrics that are often confused. Volume represents the number of vehicles passing through a particular road element measured in sub-hourly intervals. Flow rate is the number of vehicles passing through a road element measured in sub-hourly intervals, but expressed as an hourly measure. The equation for the volume metric is considered trivial, as it is simply representative of the number of vehicles passing through a ROI. The equation for flow rate is shown below: F lowRate = V olume Vsub_hourly_interval [veh/h] (4.7) CHAPTER 4. DETAILED SYSTEM DESIGN 28 Peak hour factor is described as the ratio of total hourly volume to the peak flow rate within the hour [4]. Flow rate is computed by equation 4.8 [4]. P HF = P eak Hourly volume f low rate (within the (4.8) hour) The term Peak flow rate represents the maximum flow rate that occurred during the subhourly period of the analysis hour [4]. Density describes the total number of vehicles occupying a certain road element at a particular instant [4]. Equation 4.9 describes the density equation: Density = ν S [veh/km] (4.9) Where ν is the flow rate (veh/h) and S the average travel speed (km/h). Density is a critical parameter for uninterrupted flow analysis since it characterises the quality of traffic operations [4]. Density is indicative of the roadway occupancy, and describes the amount of physical space that separates subsequent vehicles. This parameter also reflects the freedom to manoeuvre within the traffic stream [4]. The Level Of Service pertaining to a particular road element is indicative of how well traffic is being accommodated [4]. The LOS measure consists of 6 levels; LOS A describes free flowing traffic and F that of congested traffic. There are a number of different metrics that are used to define LOS as described in [4]. For this project, the number of vehicles moving through a particular road element (Volume) was selected as the LOS characterisation parameter. The following table provided by the HCM [4] shows how the LOS is characterised. Table 4.2: Level Of Service characterisation. LOS Double lane Volume Triple lane Volume Quad lane Volume A V < 1230 V < 1900 V < 2590 B 1230 < V < 2030 1900 < V < 3110 2590 < V < 4250 C 2030 < V < 2930 3110 < V < 4500 4250 < V < 6130 D 2930 < V < 3840 4500 < V < 5850 6130 < V < 7930 E 3840 < V < 4560 5850 < V < 6930 7930 < V < 9360 F V > 4560 V > 6930 V > 9360 As can be seen from table 4.2, the volumes that characterise each LOS are dependent on the number of lanes that the road element consists of. The number of lanes is specified by the user during the road profile creation step, and subsequently stored in the location-specific DAT file. The function that computes traffic flow metrics runs on a separate thread to that of the main traffic flow algorithm. The function is called intermittently from the main function according to the time interval specified by the user during system start up. Once the traffic metrics are computed, the information is sent to the online dashboard where it is displayed graphically. The reason for running the function on a separate thread, is due to the time delay caused by the GSM modem sending procedure. The sending procedure takes around 3 seconds to complete and would therefore degrade real-time performance if the function were run on the same thread as the main system. Results of the traffic flow computations are explained in section 5.1.4. A software flow diagram of the traffic flow computation process is shown in Appendix F5. CHAPTER 4. DETAILED SYSTEM DESIGN 4.1.7 29 Online dashboard The traffic metrics discussed in section 4.1.6, are required by traffic engineers in their analysis of road segments. In order to make this information both easily accessible and highly intuitive, traffic metrics are uploaded to an online dashboard in real-time via MTN’s GSM network. The SMART Trintel platform, developed by Trinity Telecom, was used as the dashboard development environment for this project. The dashboard is fully customisable and relies solely on the developer for the initialisation of all metrics. The metrics are initialised via the AirVantage configuration tool which is simply an XML editor. Once all of the metrics and alert systems have been initialised, the XML file is uploaded to the Trintel platform where it is subsequently interpreted, and used to configure device-specific variables. The traffic metric data is sent from the ground station, via a serial communication port, to the GSM modem. Commands are sent in the form of simple AT strings before being interpreted and separated into the IP packets to be sent via the mobile network. Traffic data is not the only information that the system continuously uploads to the dashboard. Other important system information, such as the UAV signal strength, battery percentage and system temperature are also uploaded. The Trintel platform allows the user to define specific circumstances under which text and email message alerts will be triggered. For this project, a few alarms were put in place to protect vital system components from potential failure. Text message alerts are triggered when the UAV battery percentage drops below 20%, as well as when the ground station temperature rises above 60◦ C. Table 4.3 shows a few examples of the AT strings used to send traffic data and other important information to the online platform. Figure D.5 on page 71 shows the online dashboard that has been developed for this project. The location-specific GPS coordinates, entered by the user during the road profile creation stage, are also uploaded to the Trintel dashboard. The road location is consequently depicted on a Google Maps widget that has been built into the dashboard. Table 4.3: Example AT commands. Function AT Commmand Send volume ROI1 "AT+AWTDA=d,aid.traf,1,volumeROI1,INT32,volume" Send PHF "AT+AWTDA=d,aid.traf,1,phfROI1,INT32,phf" Drone comm link active "AT+AWTDA=d,aid.drone,1,comm,INT32,1" Drone battery critical alert "AT+AWTDA=e,aid,102,Battery Critical" A software flow diagram of the data submission process is shown in Appendix F5. 4.2 Drone control The general idea behind the inclusion of the autonomous drone for this project, was to provide some form of autonomy for future traffic analyses. The idea is that the drone will eventually fly to pre-determined destinations using an onboard GPS system. Once the drone is within visual range of the tower, a unique identifier in the form of a checkerboard pattern will be used as a reference for the onboard tracking system. When the tracking system has stabilised the drone CHAPTER 4. DETAILED SYSTEM DESIGN 30 in front of the target, the automated landing system will land the drone on the platform below. The drone’s front-facing camera (FFC) can then be used to conduct fully autonomous traffic flow estimations. The drone’s FFC is used as the primary video source for the target detection system. For this project, the aim was to develop the automated tracking and landing algorithm, with the functionality of GPS-guided flight forming part of future developments. A more detailed discussion of future developments is given in chapter 6. 4.2.1 Target tracking There are a number of ways of identifying and tracking targets by making use of various computer vision techniques. Three alternative methods of tracking were investigated throughout the development phase of the project. The first method involved a colour tracking algorithm, which used OpenCV’s inRange function to threshold all colours within a pre-determined colour range. Figure 4.10 shows a result of the inRange function when trained to detect blue objects. (a) Coloured shapes. (b) Blue object detected. Figure 4.10: Colour thresholding. Before the inRange function can be applied, each frame needs to be converted from the RGB to the HSV colour space. The function splits the HSV image into three channels, namely Hue, Saturation and Value (intensity). For the example shown in figure 4.10, only blue colours should show up in the object detection frame. Once the blue target area has been obtained, the contours of the object are used to determine the position of the target in 3D space. Colour thresholding is extremely dependent on the surrounding lighting conditions. Under poor lighting conditions, grey objects tend to be identified as blue and in the same way, red objects as orange. This limitation could potentially lead to false target detection. If false detection were to occur, the drone would simply fly toward the falsely detected object and ultimately cause system failure. Due to the limitation on the surrounding lighting conditions, a more robust target tracking method was required. The second target tracking technique involved a QR bar code scanner. Figure 4.11a shows an example of a QR code. A QR bar code scanning library called Zbar, was investigated for its ability to successfully track targets identified by a unique QR code. This method presents even more limitations than the aforementioned HSV colour tracking technique. The main limiting factor of the bar code scanner can be attributed to the extensive amount of time it takes to decode the QR marker. Through experimental investigation, it was found that the scanner CHAPTER 4. DETAILED SYSTEM DESIGN 31 function took around 800ms to decode each frame. This does not seem like a large delay, but if the control system is running at a sampling time of 30ms then a delay of 800ms is substantial. (b) Checkerboard target. (a) QR code target. Figure 4.11: Tracking targets. The third and final tracking technique involved a checkerboard scanner. Figure 4.11b shows an example of the unique checkerboard target that could eventually identify the landing platforms. In order to detect whether a checkerboard shape is currently in the frame, each frame is converted to a greyscale image to maximise the distinction between the black and white checkered squares. The frame is then put through a binary thresholding function before a pattern recognition algorithm is used to identify the location of the checkerboard. Coordinates of the checkered squares are returned in a random order. It is imperative, however, to ensure that these coordinates are returned in the exact same order after every frame. The reason for this requirement is discussed in section 4.2.2. To achieve the abovementioned requirement, the points are filtered according to their relative positions. A two-stage filter was designed, with the first stage sorting the x positions in ascending order, and stage two, sorting the y coordinates accordingly. Logic was then introduced to match the corresponding x and y coordinates, and to ensure that the order of the coordinate matrix remained constant after every frame. The matrix shown below contains the filtered coordinate positions of the checkered squares. p1(x, y) p2(x, y) p3(x, y) M = p4(x, y) p5(x, y) p6(x, y) p7(x, y) p8(x, y) p9(x, y) The coordinates are then used to determine the relative position of the target center. In addition to determining the center point of the target, the matrix coordinates are also used to determine relative target distance, as well as the rotation of the target about the Z axis. Section 4.2.2 explains how these parameters are obtained from the matrix coordinates before being used to provide reference inputs for the P/PID controllers discussed in section 4.2.5. A software flow diagram of the target tracking process is shown in Appendix F6. 4.2.2 Target references Matrix M, shown in section 4.2.1, is used to obtain three important parameters regarding the relative target position. The center of the target is obtained by simply computing the mean value of the x and y coordinates within matrix M. The relative target distance is obtained by computing the standard deviation of the coordinates. A closer target will have a higher standard CHAPTER 4. DETAILED SYSTEM DESIGN 32 deviation in terms of the pixel locations than a more distant one. Finally, the rotation of the target about the Z axis is determined using the following equation: Rotation = q (p1x − p7x )2 + (p1y − p7y )2 − q (p3x − p9x )2 + (p3y − p9y )2 (4.10) Using the drone’s FFC, the tracking algorithm running on the ground station detects, interprets and computes new reference inputs based on the information obtained from the coordinate matrix M . Figure 4.12 shows the relative target position concepts. Figure 4.12: Target position concepts. Lower 3D image adapted from [25]. 4.2.3 System identification A mathematical representation of a plant model is required before any form of controller can be designed. In order to obtain a model of the plant, step inputs were applied to the drone and the output measurements from the IMU and ultrasonic altitude sensors were recorded over time. Since the dynamics of the roll, pitch, yaw and climb rate (gaz) differ extensively, four different plant models were required. A step reference input of 12◦ was applied to the drone in order to model the roll dynamics. Angle φ was then recorded over time and the result is shown in figure 4.14a. There are a host of non-linearities, as well as sensor noise, that contribute to the strange response represented by figure 4.14a. It is virtually impossible to visually obtain an accurate model based on trial and error techniques. In order to obtain an accurate enough approximation of the plant model, a more systematic and robust system identification technique was investigated. CHAPTER 4. DETAILED SYSTEM DESIGN 33 In order to obtain an adequate model of the roll step response, the method of least squares estimation (LSQ) was implemented. An experimental Python script was used to generate a random binary sequence. The binary sequence would serve as the reference input to the drone’s roll command. In the same way that angle φ was measured for the single step response, the response of the drone to the binary sequence was recorded along with the relative reference angles. The least squares estimation formula is represented by equation 4.11b. Using the measured data and equation 4.11b, the values of α and β from equation 2.3 are computed and thus the discrete transfer function obtained. The cost function for the LSQ estimation technique is represented by equation 4.11a J= N 1X 1 1 e(k)2 = eT e = (y − C X̄)T (y − C X̄) 2 k=0 2 2 (4.11a) Minimising the above cost function yields X̄ = (C T C)−1 y (4.11b) The method of least squares does not provide a complete solution to obtaining the plant transfer function. The order of the plant is still a variable that has to be obtained through careful investigation of the system as whole. The dynamics of the pitch and roll movement can be described as a relationship between thrust and angular acceleration of the aircraft. Figure 4.13 depicts the typical roll/pitch relationship between thrust and angular acceleration. Figure 4.13: Aircraft thrust to angular acceleration relationship. Taking the sum of the moments around the center of rotation, the following equation is realised: X My = I θ̈e (4.12) Where I represents the moment of inertia, θ̈ the angular acceleration, and e the unit vector perpendicular to the torque axis. Using the relationship described by 4.12, the uncontrolled plant dynamics can be modelled by a second-order function. The drone is controlled by an onboard PID controller [16] which should be taken into account when investigating the appropriate order of the plant. Equations 4.13a and 4.13b represent the continuous-time transfer functions of a second-order plant and a PID controller respectively: G(s) = ωn2 s2 + 2ζωn s + ωn2 (4.13a) CHAPTER 4. DETAILED SYSTEM DESIGN D(s) = CL = 34 s2 Kd + sKp + Ki s (4.13b) D(s)G(s) 1 + D(s)G(s) (4.13c) Equation 4.13c was used to incorporate the controller and essentially close the feedback control loop. Based on the equations represented by 4.13, it was concluded that the best representation of the controlled plant would be that of a third-order system. Figure 4.14b shows the comparison between the physical step response and the step response of the least squares estimation of the plant. The best correspondence between the physical and mathematical response was achieved when the DT transfer function, shown in equation 4.14, was realised. Groll/pitch (z) = 0.007z 2 + 0.024z + 0.800 θ(z) = 3 θ(z) z − 0.969z 2 − 0.454z + 0.497 (a) Roll step response. (4.14) (b) LSQ System identification. Figure 4.14: Pitch and Roll plant response. From figure 4.14b it is observed that the peak time (TP ) and the maximum overshoot (Mp ) of the physical plant are mapped accurately by the mathematical plant model. Furthermore, it can be seen that both the model and the physical plant response settle around the same value. The same procedure was followed to obtain the pitch response denoted by angle θ. Postexperimental investigation of the pitch step response revealed that the pitch and roll models agreed surprisingly well. It was decided that the same model would be used to represent both the pitch (θ) and roll (φ) dynamics of the plant. Figure 4.15a shows the response of the plant to an angular velocity reference of 90deg/s. The response shown in figure 4.15a is characteristic of a simple integrator, and is accurately described by the model shown in equation 4.15a. Figure 4.15b shows the response of the plant to a Rate of Climb (RoC) of 0.8m/s. The gaz response is also characteristic of an integrator, and is accurately described by the plant model shown in equation 4.15b. CHAPTER 4. DETAILED SYSTEM DESIGN 35 Gyaw (z) = Ψ(z) 2.77 = z−1 Ψ̇(z) (4.15a) Ggaz (z) = A(z) 24 = ν(z) z−1 (4.15b) (a) Yaw model response. (b) Gaz model response. Figure 4.15: Yaw and Gaz plant response. 4.2.4 Theoretical design In order to conduct accurate simulations, there are some non-linearities in the real world that need to be modelled and successfully incorporated into the design procedure. The drag factor is one such non-linearity that influences the way the physical plant responds to reference inputs. A detailed calculation of the non-linear drag force is shown in Appendix E1 on page 75. Furthermore, before any discrete controller designs can be implemented, a well-informed decision needs to be made with regards to sampling time. The discrete-time system, represented by equation 4.14, was used to determine an appropriate sampling time. Figure 4.16a shows the frequency response of the open-loop system. It was found that the −3dB cut-off frequency (ωc ) was in the region of 7rad/s. According to the Nyquist sampling rule, this translates to a maximum sampling time of 450ms. Taking delay factors into account (video processing and network speeds), an appropriate sampling time of 30ms was decided upon. Figure 4.17 shows a simplified version of the simulation block diagram used to control the pitch/roll movement of the drone. The full simulation block diagram can be seen in Appendix E2 on page 77. The simulation block diagrams used to simulate the yaw and gaz movements are shown in Appendix E3 on page 78. Figure 4.16b depicts the Root Locus (RL) of the open-loop plant Groll/pitch (z). Three independent controllers are required to control the roll/pitch, yaw and gaz movements of the drone. The RL design method was initially used to design discrete P/PI controllers based on pre-defined closed-loop specifications. The simplified block diagram, shown in figure 4.17, was CHAPTER 4. DETAILED SYSTEM DESIGN (a) Bode plot G(z). 36 (b) Root Locus G(z). Figure 4.16: Open-loop plant dynamics. Figure 4.17: Simplified block diagram. used to design and simulate the theoretical controllers for the pitch/roll movements of the drone. It is apparent from the figure that the block diagram consists of both an inner and outer-loop controller. The inner-loop controller is responsible for controlling velocity reference inputs to angular outputs. The system was first linearised by using the small angle approximation of the plant and the second-order Pade approximation of the time delays (transport and video processing). Once linearised, the RL method was used to design the theoretical PI controller described by equation 4.16a. The outer-loop controller is responsible for controlling position reference inputs to velocity outputs. Once the inner-loop controller had been designed, equation 4.13c was used to close the feedback loop so that an appropriate outer-loop controller could be designed. Once the system had been linearised, the RL design method was used to obtain an appropriate outerloop proportional controller. The transfer function of the outer-loop controller is described by equation 4.16b. The controllers designed for the yaw and gaz movements followed a more direct approach. Figures E.3a and E.3b on page 78 show theoretical block diagrams for the yaw and gaz models of the plant. The proportional controller for the yaw model controls reference angles to angular rate. In the same way, the proportional controller for the gaz model controls reference altitudes to RoC. The RL design method was used to obtain the theoretical P controller values represented CHAPTER 4. DETAILED SYSTEM DESIGN 37 by equations 4.16c and 4.16d. Dinner (z) = θ(z) 0.825z − 0.8094 = v(z) z−1 (4.16a) Douter (z) = v(z) = 0.49872 p(z) (4.16b) Dyaw (z) = Ψ̇(z) = 0.111 Ψ(z) (4.16c) Dgaz (z) = ν(z) = 0.00545 A(z) (4.16d) The step response of the controlled plant with both the inner and outer-loop controllers, is shown in figure E.4 on page 79. The step response of the plant with the yaw and gaz controllers is shown in figures E.5 and E.6 on pages 80 and 81 respectively. 4.2.5 Practical implementation The first step in the practical implementation of the controller, is to determine appropriate values for the controller constants. These parameters are readily available from the theoretical simulations discussed in section 4.2.4. As with any practical control system, theoretical simulations do not always provide a complete solution. It was found that the values obtained in the theoretical simulation worked quite poorly when used to control the physical hardware. This can be attributed to inaccurate plant models, as well as other non-linear effects that have not been taken into account. In an attempt to better the practical control of the drone, a PID tuning technique was investigated. The Ziegler-Nichols tuning method, based on the ultimate gain and period technique, was selected as the primary tuning methodology. A particular advantage of using this technique, is that it does not require a mathematical model of the plant. Instead, the technique is carried out with Hardware In the Loop (HIL) investigations. This allows for the parameters to be tuned according to the actual plant dynamics, thereby contributing to the design of a more effective practical controller. The method involves increasing the proportional gain of the controller until a constant oscillation is observed (see Appendix E7). The proportional gain and period of oscillation are then used to determine the values of Kc , TD and TI according to predefined formulae discussed by M. Gopal in [14]. The practical implementation of the PID controllers were written as difference equations in a Python script. The Drone control method contains the digital controller difference equations that are used to control the drone’s flight dynamics. The method also communicates with the libardrone library to send and receive plant-specific parameters. The following difference equations were written according to practical implementation guidelines explained in [14]: u(k) = Kc e(k) + 1 S(k) + D(k) TI (4.17a) CHAPTER 4. DETAILED SYSTEM DESIGN S(k) = S(k − 1) + Ts e(k) D(k) = αTD TD D(k − 1) − [y(k) − y(k − 1)] αTD + Ts αTD + Ts 38 (4.17b) (4.17c) Where u(k) is the discrete control signal, e(k) the error signal, y(k) the measured plant output, Ts the sampling time and α the filter parameter. The above equations are an augmented set of the theoretical difference equations, in the sense that they include some additional features. The features include a robustness toward the ’derivative kicks’ that occur - right after the controller set-point is changed [14]. A second feature includes a limitation on the derivative gain with the aid of the filter parameter α. According to M. Gopal in [14], the filter parameter α is not adjustable. Instead, it is built into the design of the controller and is usually in the order of 0.05 to 0.3. The filter parameter was chosen as α = 0.3 to provide maximum filtering. The results of the practical controller are shown in Appendix E8 for the roll/pitch movement and in Appendix E9 for the yaw movement. During the testing phase of the practical controller, the concept of integral wind-up was encountered. Figure E.10 in Appendix E10 shows a plot of the angle φ when integral wind-up occurred. Needless to say, this resulted in the implementation of restriction logic to safeguard against this occurrence. A software flow diagram of the practical PID controller is shown in Appendix F7. The results of the tracking algorithm are discussed in section 5.2.1 4.2.6 Landing algorithm The flight controller explained throughout this chapter, is designed to center and stabilise the drone infront of the checkerboard identifier. The reason for stabilising the drone, is so that it can ultimately land on the platform. A landing algorithm was designed to initiate a landing sequence based on pre-defined events. The design of the algorithm was based on the following concept: 1. Drone centred on target in X, Y and Z direction? 2. If error in X, Y and Z are within acceptable threshold -> set stable flag 3. If the drone has been stable on the target for more than 5 seconds -> initiate landing sequence A flow diagram of the algorithm is shown in Appendix F8 on page 95. The results of the automated landing system are discussed in section 5.2.2. 4.3 Discussion It is apparent that the design of both the traffic flow estimation system and the control system were conducted both thoroughly and systematically. It is, however, not sufficient to assume that the system performs as intended. Testing of the individual subsystems, as well as the final integrated system, was necessary in order to obtain a quantitative measure of system performance. These results are explained in chapter 5. CHAPTER 5. RESULTS 39 Chapter 5 Results The aim of this chapter is to discuss and reflect upon the results observed throughout the system testing procedure. As with the detailed design chapter, this section once again deals with the two distinct subsystems individually. The individual subsystems were tested independently before the final integration and testing was completed. Throughout the testing phase, a concious effort was made to ensure that the system and all associated submodules were tested thoroughly. In each case, care was taken to ensure absolute fairness when comparing the various design options. 5.1 Traffic flow estimation System testing and results analysis is an essential part of determining the efficacy of the methods used in the project. For this project, the performance of the computer vision techniques were tested using four carefully selected video sources. The test videos were chosen to test the system under various degrees of tracking difficulty. Test videos 1 and 2 were chosen as the baseline comparison tests. Video 1 was used throughout the system design phase, and therefore the inclusion of video 2 as a baseline test ensures an unbiased testing environment. The conditions in both videos are close to ideal as the cast shadows are not too severe and the vehicle trajectories are mostly linear. Figures D.2a and D.2b on page 68 show snapshots of test videos 1 and 2. Test video 3 was chosen due to the lower overall illumination caused by bad weather conditions. The weather in the video is overcast, and even consists of heavy precipitation at certain time intervals. Test video 3 also consists of non-linear vehicle trajectories. This video was selected primarily to test the system under low lighting and bad weather conditions, as well as to test the efficacy of the tracking system with non-linear vehicle motion. A snapshot of the test video is shown in figure D.2c on page 68. Test video 4 was chosen due to the position of the illumination source during the analysis hour. The sun lies close to the horizon which causes strong cast shadows. Stronger cast shadows CHAPTER 5. RESULTS 40 result in degraded tracking performance even after the shadow removal process. This video was selected primarily to test system performance under strong shadow conditions. A snapshot of the video is shown in figure D.2d on page 68. 5.1.1 Background modelling techniques During the design phase, two background modelling techniques were investigated. The first method involved using the running average (RA) of the frames to effectively distinguish between dynamic objects on the corresponding static background scene. The second method involved using a Mixture of Gaussian (MoG) distributions to successfully model the individual background pixels, while simultaneously updating the model with each successive frame. To determine the modelling performance, the actual number of vehicles in each of the test videos was counted manually. The system was then tested by determining the measured number of vehicles when using both the running average method as well as the Gaussian modelling method. Table 5.1 shows the results of the background modelling tests. Table 5.1: Background modelling results. RA Test 1 Actual count Count 2 MoG Accuracy Count 2 Accuracy Video 1 36 34 94% 36 100% Video 2 81 74 91% 79 98% Video 3 29 53 17% 25 86% Video 4 42 79 12% 50 81% Perhaps the most important conclusion to be drawn from the results in table 5.1, is the fact that under ideal illumination conditions, the running average method performs sufficiently well. Under poor lighting conditions, however, the method fails horribly. This can be attributed to clusters of noise pixels being falsely detected as moving blobs (vehicles). The MoG adaptive background modelling technique seems to perform far better than the RA method. The results reflect an apparent robustness toward illumination effects, with a 100% accuracy rating under ideal conditions and around 80% under non-ideal conditions. 5.1.2 Shadow removal The same testing technique, used in section 5.1.1, is used to test the efficacy of the shadow removal functionality. In each case, the actual vehicle count is compared to that of the measured vehicle count; with and without shadow removal. Table 5.2 shows the results of the shadow removal testing. The MoG background modelling technique was used during the testing of the shadow removal. The results from table 5.2 conclude that shadow removal improves counting accuracy by at least 15%. The most obvious reason for this performance increase is attributed to the way that the system interprets shadow characteristics. Without the shadow removal functionality, separate vehicles in close proximity to one another are sometimes counted as a single vehicle (see 1 2 Test videos 3 and 4 represent worst case conditions. Results obtained with shadow removal algorithm engaged. CHAPTER 5. RESULTS 41 figure 4.4). In other cases, the shadow is seen as a completely separate moving entity, leading to a single vehicle being counted twice. In addition to the effect on the counting algorithm, shadows tend to cause vehicle misclassification by deforming the relative blob area. Figure 5.1 shows a comparison of the foreground mask before and after the shadow removal process. (b) Shadow removal. (a) Foreground mask. Figure 5.1: Cast shadow detection results. Table 5.2: Shadow removal results. With shadows 5.1.3 1 Shadows removed 2 Actual count Count Accuracy Video 1 36 48 67% 36 100% Video 2 81 53 65% 79 98% Video 3 29 20 69% 25 86% Video 4 42 25 60% 50 81% Test Count Accuracy Vehicle velocity estimation The optical flow tracking method, designed in section 4.1.5, is used to compute the velocities of the vehicles passing through a specific ROI. The accuracy of velocity computations has a direct impact on the validity of the traffic flow metrics. It is therefore essential to determine the efficacy of the velocity estimation system in order to quantitatively assess the validity of the system as a whole. In order to test the accuracy of the velocity computations, the actual vehicle velocities were compared to that of the system measured velocities. This would require prior knowledge of the exact speed of the vehicles in the video which was not available. The videos obtained from the TMC contained no velocity information, and could therefore not be used for the system testing. Figure D.3a in Appendix D3 shows the test rig used to obtain traffic footage with known vehicle velocities. Figure D.3b in Appendix D3 shows an image of one of the test vehicles with the test rig shown in the distance. Two test vehicles were driven past the pole-mounted camera at speeds of 10, 20, 30 and 40km/h. Vehicles of different colours were used so that the system could be thoroughly tested. One of the test vehicles was gunmetal grey - a colour similar to that of the road surface, and known to be problematic for the tracking system. The second test vehicle was a sky-blue colour, easily tracked by the system. 1 2 Test videos 3 and 4 represent worst case conditions. Results are the same as table 5.1, since shadow removal was conducted during background modelling tests. CHAPTER 5. RESULTS 42 The velocity scaling factor, discussed in section 4.1.5, is required by the system for the location-specific velocity computations. To calibrate the system, two white lines 1m long and separated by an 80cm gap were drawn on the road surface. The relative pixel distance of the lines were then used to compute the scaling factor described in section 4.1.5. Figure D.3c in Appendix D3 shows the white calibration lines drawn on the road surface. The results of the velocity computations for the first test vehicle are shown in table 5.3. Figures D.4a and D.4b in Appendix D4 show both the object detection marker and velocity vectors superimposed on the video frame. The results of the velocity computations for the second test vehicle are shown in table 5.4. Figures D.4c and D.4d in Appendix D4 show the object detection marker and velocity vectors for the second test vehicle. Table 5.3: Vehicle speed results: Test vehicle 1. Run Direction Estimated Speed (km/h) Speedometer Reading (km/h) Difference (km/h) Accuracy 1 Right–>Left 11.4 10 −1.4 86% 2 Left–>Right 21.5 20 −1.5 93% 3 Left–>Right 28.6 30 +1.4 95% 4 Right–>Left 41.8 40 −1.8 96% Table 5.4: Vehicle speed results: Test vehicle 2. Run Direction Estimated Speed (km/h) Speedometer Reading (km/h) Difference (km/h) Accuracy 1 Left–>Right 10.95 10 −0.95 91% 2 Right–>Left 19.68 20 +0.32 98% 3 Left–>Right 33.54 30 −3.54 88% 4 Right–>Left 37.53 40 +2.47 94% On observation of tables 5.3 and 5.4, it is clear that the system was able to compute the velocities of both test vehicles. Furthermore, it is evident that the tracking system was robust to vehicle colour variations, as well as to relative vehicle direction. In conclusion, it can be said that the results obtained from the test rig are accurate enough to verify the validity of the velocity computations. 5.1.4 Traffic metric computations Testing of the traffic computations followed a slightly different approach. Two major challenges were encountered when testing the accuracy of the traffic metrics generated by the system. The first problem was that there were no fixed references to compare how accurately the traffic metrics were being calculated. The second problem was that the maximum length of the traffic footage, obtained from the TMC, was in the region of 1 hour. Traffic flow metrics are usually computed throughout the course of an entire day. A video length of one hour would only be CHAPTER 5. RESULTS 43 able to provide information regarding traffic flow within the specific analysis hour. Table 5.5 shows the result of the data file generated by the system while analysing test video 3. 1 Table 5.5: Traffic flow estimation results. 22 : 49 Time stamp (hh:mm) 22 : 59 23 : 09 23 : 19 23 : 29 Spot Speed ROI1 (km/h) 86.89 86.42 84.02 83.67 84.74 Spot Speed ROI2 (km/h) 66.18 67.56 66.86 67.56 66.94 Volume ROI1 (veh) 176 176 183 214 173 Volume ROI2 (veh) 215 162 219 242 224 Flow Rate ROI1 (veh/h) 1056 1056 1098 1284 1038 Flow Rate ROI2 (veh/h) 1290 972 1314 1452 1344 Density ROI1 (veh/km) 12.15 12.22 13.07 15.35 12.25 Density ROI2 (veh/km) 19.49 14.39 19.65 21.49 20.08 The results in table 5.5 were generated based on a 10 minute time interval chosen during system start up. As can be seen from the results, traffic metrics are computed and stored separately for each of the corresponding ROI’s. Traffic metric graphing functionality is built into the system and is accessed via the main GUI. The system automatically collects data from the corresponding CSV files before generating the plots for each ROI. Figure D.6 in Appendix D6 shows a more graphical representation of the results generated by the system’s graphing software. 5.1.5 Night-time results Up until this point, all design and testing has been conducted during daylight hours between 08h00 and 17h00. In order to thoroughly test the system, night-time videos were provided as the footage to be analysed. It was found that the bright headlights of the vehicles were problematic for the object detection and counting algorithms. The reflection of the headlights on the road surface deformed the shape of the objects in the foreground mask. Clusters of vehicles in close proximity appeared as though they were all connected. This resulted in a large reduction in the counting accuracy (85% to 30%). A simple solution to this problem was to track and count the vehicles from a different angle. It was noticed that when traffic was observed from the back, the bright headlights had much less of an illumination effect. Although tracking and counting accuracy were not as high as during daytime hours, the system was indeed able to successfully analyse night-time video footage. Figure 5.2 shows the resulting foreground mask when traffic is observed from the front (figure 5.2b) and from the back (figure 5.2d). 5.2 Drone control The control system, designed in section 4.2, is responsible for the automation of the tracking and landing process. Although not the main focus of this project, the drone automation system was tested for the purpose of a more thorough system evaluation. The ability of the drone to 1 Double-lane road segment CHAPTER 5. RESULTS 44 (a) Night-time frame: From the front. (b) Night-time foreground mask: From the front. (c) Night-time frame: From the back. (d) Night-time foreground mask: From the back. Figure 5.2: Night-time system results. track the target and minimise the error signal would give an indication of the tracking system performance. The second indicator would be determined by the efficacy of the automated landing algorithm to successfully land the drone on an 80 × 80 cm platform. 5.2.1 Target tracking The response of the drone to various step inputs would give an indication of the target tracking performance. Step inputs were generated by moving the target to the edge of the camera FOV before switching on the control system. This would essentially emulate a step input to which the response could be observed. The purpose of the tests were to obtain a quantitative measure of the practical control system performance. A reference input was applied to each of the individual control systems which independently control the four degrees of freedom. Figure 5.3 shows the results of the step response for each of the four controllers. The results depicted in figure 5.3 show the correction of the relative distances (X, Y, Z, Rotation) to the target centre. Figure 5.3a shows the step response of the roll controller. As can be seen from the figure, the error in the Y direction is minimised within 1.5s. The figure also indicates that the final value of the response settles on the 0 error line. During the testing phase of the control system, it was found that if the maximum pitch angle of the drone was set higher than 5 degrees, the target would move out of the camera FOV. Reducing the maximum forward velocity of the drone, would essentially reduce the maximum tilt angle, thus ensuring that the target never leaves the FOV. The first option was to reduce the maximum forward velocity by lowering the proportional outer-loop controller gain. It was later determined that this was quite a poor design decision. Reducing the controller gain, CHAPTER 5. RESULTS 45 (a) Roll step response. (b) Pitch step response. (c) Gaz step response. (d) Yaw step response. Figure 5.3: Practical step response. ultimately lowers the bandwidth of the system for smaller angles, resulting in a less responsive control system. The second option was to introduce a saturation block to the inner-loop velocity controller so that a limit could be placed on the maximum tilt of the drone. A second saturation block was incorporated into the outer-loop controller so that the maximum velocity could be controlled more accurately. Figure 5.4 shows the step response of the two pitch controller designs. It can be seen that the controller with the reduced gain, resulted in a much slower response than the controller with the implementation of saturation logic. The controller with saturation logic is seen to be more responsive and shows an improved bandwidth for smaller angles. If the maximum tilt angle of the plant is limited by a saturation block, the integral term in the PID controller starts to build up over time. Restriction logic was implemented to subtract from the integral term when the tilt angle of the drone was in a saturated state. This would effectively deal with the concept of integral wind-up seen in figure E.10 on page 85. Figure 5.3b shows the step response of the pitch controller with the implementation of saturation logic. The controller minimises the error within 2.5s and has a 2% settling time of 4.5s. Results show that the maximum overshoot is in the region of 9% at a peak time of 3s. It CHAPTER 5. RESULTS 46 Figure 5.4: Pitch controller response comparison. is clear from the response in figure 5.3b that the final value settles on the 0 error line. Figure 5.3c represents the step response of the height controller. The controller minimises the error in the Z direction within 2 seconds. Figure 5.3d represents the step response of the yaw controller. The response shows that the rotation error is minimised within 1.12s. It is also apparent from the plot that a 5% overshoot occurred at a peak time of 1.25s. It is clear from the above-mentioned results that the tracking and control system was indeed capable of automating flight control. It was found that the control system reacted well to external disturbances as well as to auxiliary turbulence effects caused by prop wash. It was found that in almost all cases, the drone was able to stabilise itself above the landing platform. Figure E.11 in Appendix E11 shows an example of the target tracking information displayed on the UAV Heads Up Display (HUD). 5.2.2 Automated landing In order to obtain a quantitative measure of the performance of the landing algorithm, the tracking and landing sequence was conducted 34 times. After each landing, the relative distance from the centre of the platform to the hull of the drone was measured. This measurement would give an indication of the landing accuracy, and would facilitate the calculation of a successful landing probability. Figure 5.5 shows a scatter plot of the aircraft position after each successive landing. The bounding box, depicted in figure 5.5, represents the relative area of the landing platform. The coloured dots on the scatter plot, represent the location of the drone after each successive landing. The relative distances from the platform centre are grouped according to colour (red dots indicative of larger distances). The tests were conducted in semi-ideal conditions where minimal external disturbances were experienced. Prop wash from the aircraft did, however, result in some air turbulence. It is apparent, however, that the control system was able to deal with the disturbances and ultimately CHAPTER 5. RESULTS 47 Figure 5.5: Landing coordinate scatter plot. stabilise the aircraft above the landing platform. The results depicted in figure 5.5 show that out of the 34 test runs conducted, 100% were successful landings. An interesting observation can be made from figure 5.5, in that the deviation in the Y direction is slightly favoured toward the back end of the platform. Upon further investigation, it was found that the drone would drift slightly backward when the motor speeds were decreased during the landing procedure. It is impossible to guarantee that the drone will land on the centre of the platform each and every time. To deal with this limitation, the traffic tracking algorithm was designed to be as flexible and as adaptable as possible so that the position of the drone was not of concern. A background model is generated based on the current position of the camera feed, which inherently minimises the limitations placed on the position of the source. 5.3 Discussion The results discussed throughout this chapter provide tangible evidence that the system performs as intended. It was decided at the start of the project, that an accuracy rating of around 75% would be acceptable. Results confirm that this accuracy rating has been achieved, and in most cases, surpassed by at least 5%. CHAPTER 6. CONCLUSION AND FUTURE DEVELOPMENT 48 Chapter 6 Conclusion and Future Development 6.1 Conclusion The primary aim of this project was to design and implement an automated traffic flow estimation system with the addition of real-time online reporting. With the successful completion of this project, traffic management centres and traffic engineers alike would be able to use the software to automate the currently manual process of traffic flow analysis. The existing traffic camera infrastructure would allow for seamless integration of the system, and provide unparalleled traffic video coverage. The system was designed in a modular fashion with each submodule responsible for a particular system function. Flexibility of the system was a primary concern throughout the planing and design phase of the project. This allowed for the analysis of footage obtained from two very distinct sources. The system is comprised of two primary subsystems. The first subsystem is concerned with automated traffic flow analysis using optimised computer vision techniques. The second subsystem is concerned with the automated flight control of the UAV - used primarily as an alternative to the stationary pole-mounted traffic cameras. Both subsystems were designed and implemented in an incremental fashion, with the lower levels providing basic functionality, and the incremental levels providing additional system features. The subsystems were tested individually to ensure seamless integration, and to lower the risk of software bugs occurring in the final system. The results discussed in chapter 5 clearly demonstrate that the system performs as required. The system functions well under ideal conditions, and proved to be sufficiently robust against natural illumination and other external effects, characteristic of non-ideal conditions. The traffic flow estimation subsystem, as well as the integrated UAV control subsystem, managed to achieve their stated aims. CHAPTER 6. CONCLUSION AND FUTURE DEVELOPMENT 49 The objectives, described in section 1.2, provide the project with both direction and an ultimate goal to be achieved. Achievement of the stated objectives, also provides a way to determine the overall success of the project. The objectives were stated separately for each corresponding subsystem. The first set of objectives was concerned with the success of the traffic flow estimation system, while the second set involved the automation of the UAV flight control. Table 6.1 shows the objectives and achievements cross-reference table. Table 6.1: Objectives and achievements cross-reference table Traffic flow estimation Obj Description Sections in report 1 Remove any occlusion effects that might hinder performance 4.1.3; 5.1.2 2 Identify vehicles on the road 4.1.4; 5.1.1 3 Count the number of vehicles passing through a particular road segment 4.1.4; 5.1.1 4 Determine the relative velocities of the vehicles 4.1.5; 5.1.3 5 Automatically compute traffic flow metrics 4.1.6; 5.1.4 6 Upload the information in real-time to an online dashboard 4.1.7 1 Design a target tracking system 4.2.1; 4.2.2; 5.2.1 2 Design a control system which will automate UAV flight control 4.2.3; 4.2.4; 4.2.5; 5.2.1 3 Design and implement an automated landing algorithm for the UAV 4.2.6; 5.2.2 Automated drone control The successful completion of the aforementioned objectives, help to qualify the overall success of the project. Based on these achievements, it can be concluded that system performed well under all tested circumstances. The accuracy of the system was in the order of 80%, which is higher than the minimum specification rating of 75% stated prior to project commencement. Finally, it can be concluded that the technologies developed throughout this project can be considered a viable option for the successful automation of traffic flow estimation. A demonstration video of the project is available online at: http://goo.gl/jT7lke 6.2 Future development Throughout the report there has been mention of possible future developments, that could one day provide elegant solutions to many of today’s traffic problems. The system, as a whole, seems scalable enough to implement on a city-wide basis. There are, however, a few issues that need to be rectified before it can be implemented on a larger scale. Future development and recommendations for this project can be separated into two primary categories. The first category involves system developments that are absolutely essential for the scalable product. The second category involves the implementation of additional functionality and the optimisation of autonomy. These two categories are discussed below: Perhaps one of the biggest problems with the system, stems directly from the battery powered Parrot AR drone. The drone is powered by a rechargeable, 3-cell, 11.1V, 2000mAh Lithium Polymer battery. During the design phase of the project, it was found that the maximum flight time of the drone was in the region of 10 minutes. If the maximum velocity of the drone at a tilt angle of 12◦ , is around 2.5m/s, the furthest the drone could theoretically fly would about 1.5km. CHAPTER 6. CONCLUSION AND FUTURE DEVELOPMENT 50 This is assuming that there is absolutely no wind, and that the drone could actually achieve its maximum forward velocity. This distance would place a limitation on the area that the drone could actively service. If the drone were unable, somehow, to recharge itself, this flight distance would be at least halved so as to allow for the return flight. A possible solution to this, would be to employ inductive charging plates on the landing platforms themselves. In this way, the drone would be able to recharge itself while conducting traffic analysis from a static platform. The second problem involves the physical structure of the landing platform itself. A horizontal platform is required to ensure that the drone does not simply slide off upon landing. The problem with having a completely horizontal platform, is that the traffic below might not fall within the camera FOV. A possible solution to this problem, is to implement a controllable platform that would allow for the tilt angle of the drone to be varied as required. Another solution would be to modify the drone’s camera mount, so that the camera itself could be swivelled on demand. Figure 6.1 shows a representation of the proposed controllable platform. Figure 6.1: Advanced landing platform The drone control system is limited in the sense that the target is required to be in the line of sight. If the target is not visible, the drone would have no reference, and would simply remain in the hovering state. The possibility of GPS-guided flight could enable the drone to autonomously fly to the platforms without any visual references. The GPS and onboard navigation system would provide the control system with reference inputs, and ultimately guide the drone until the checkerboard target was in the line of sight. The control system designed in this project would then be allowed to take control, and land the drone on the platform below. If GPSguided flight could be implemented, traffic engineers would essentially be able to select the areas that the drone should monitor, and let it compute the traffic data automatically. This would require that all video processing and all traffic flow data be computed by the drone’s onboard processing system. The possibility of Internet access, using mobile networks, would even allow for autonomous data uploads to the online dashboard in real-time. For now, the inclusion of the autonomous drone for traffic tracking purposes seems unrealistic. However, if the system could in fact be implemented, the benefits would be boundless. Perhaps the development of future technologies would allow for the successful integration of drone-based traffic flow estimation. REFERENCES 51 References [1] National department of transport. (2014) Live vehicle population. Last accessed: 2014-0915. [Online]. Available: http://www.enatis.com/index.php/downloads/doc_download/575livevehpopulationclassprov20140331 1.1 [2] R. Sen, A. Cross, A. Vashistha, V. Padmanabhan, E. Cutrell, and W. Thies, “Accurate speed and density measurement for road traffic in india,” in Proceedings of ACM Symposium on Computing for Development (DEV 2013). ACM, January 2013. [Online]. Available: http://research.microsoft.com/apps/pubs/default.aspx?id=189809 1.1 [3] H. Greenberg, “An analysis of traffic flow,” Operations Research, vol. 7, no. 1, pp. pp. 79–85, 1959. [Online]. Available: http://www.jstor.org/stable/167595 1.1 [4] National Research Council (U.S.). Transportation Research Board, HCM 2010 : highway capacity manual. Washington, D.C. : Transportation Research Board, ©2010-, 2010. 1.1, 2.2.1, 4.1.5, 4.1.6, 4.1.6, 4.1.6 [5] T. Yuasa and R. Nakajima, “Iota: A modular programming system,” Software Engineering, IEEE Transactions on, vol. SE-11, no. 2, pp. 179–187, Feb 1985. 1.3 [6] I. Culjak, D. Abram, T. Pribanic, H. Dzapo, and M. Cifrek, “A brief introduction to opencv,” in MIPRO, 2012 Proceedings of the 35th International Convention, May 2012, pp. 1725–1730. 2.1 [7] T. Nöll, A. Pagani, and D. Stricker, “Markerless Camera Pose Estimation - An Overview,” in Visualization of Large and Unstructured Data Sets - Applications in Geospatial Planning, Modeling and Engineering (IRTG 1131 Workshop), ser. OpenAccess Series in Informatics (OASIcs), A. Middel, I. Scheler, and H. Hagen, Eds., vol. 19. Dagstuhl, Germany: Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik, 2011, pp. 45–54. [Online]. Available: http://drops.dagstuhl.de/opus/volltexte/2011/3096 2.1.1 [8] Z. Tang and Z. Miao, “Fast background subtraction and shadow elimination using improved gaussian mixture model,” in Haptic, Audio and Visual Environments and Games, 2007. HAVE 2007. IEEE International Workshop on, Oct 2007, pp. 38–41. 2.1.2 REFERENCES [9] 52 M. Piccardi, “Background subtraction techniques: a review,” in Systems, Man and Cybernetics, 2004 IEEE International Conference on, vol. 4, Oct 2004, pp. 3099–3104 vol.4. 2.1.2 [10] OpenCV. (2014) Opencv documentation. Last accessed: 2014-09-05. [Online]. Available: http://docs.opencv.org/trunk/doc/py_tutorials/py_video/py_lucas_kanade/py_ lucas_kanade.html 2.1.3, 2.1.3 [11] J. Shi and C. Tomasi, “Good features to track,” in Computer Vision and Pattern Recognition, 1994. Proceedings CVPR ’94., 1994 IEEE Computer Society Conference on, Jun 1994, pp. 593–600. 2.1.3, 4.1.5 [12] B. Lucas and T. Kanade, “An iterative image registration technique with an application to stereo vision,” in Proceedings DARPA Image Understanding Workshop, Apr 1981, pp. 121–130. 2.1.3, 4.1.5 [13] W. Homburger, J. Kell, D. Perkins, and B. I. o. T. S. University of California, Fundamentals of traffic engineering, ser. Course notes. Institute of Transportation Studies, University of California, Berkeley, 1992. [Online]. Available: http://books.google. co.za/books?id=wXonAQAAMAAJ 2.2 [14] M. Gopal, Digital control and state variable methods. Mc Graw Hill, 2010. 2.3, 2.3.2, 2.3.2, 2.3.3, 4.2.5, 4.2.5 [15] W. Young and D. Irwin, “Total least squares and constrained least squares applied to frequency domain system identification,” in System Theory, 1993. Proceedings SSST ’93., Twenty-Fifth Southeastern Symposium on, Mar 1993, pp. 285–290. 2.3.1, 2.3.1 [16] S. Piskorski, N. Brulez, and P. Eline, “Parrot ar.drone developer guide,” May 2011. 3.3.1, 3.2, 3.3.1, 4.2.3 [17] A. Jalal and V. Singh. (2012) The state-of-the-art in visual object tracking. Last accessed: 2014-09-11. [Online]. Available: http://www.thefreelibrary.com/Thestate-of-theartinvisualobjecttracking.-a0309792877 4.1.2, 4.1.3 [18] J. Milla, S. Toral, M. Vargas, and F. Barrero, “Computer vision techniques for background modelling in urban traffic monitoring,” in Urban Transport and Hybrid Vehicles, Sep 2010, p. 192. 4.1.2 [19] A. Sanin, C. Sanderson, and B. C. Lovell, “Shadow Detection: A Survey and Comparative Evaluation of Recent Methods,” Pattern Recognition, vol. 45, no. 4, pp. 1684–1695, 2012. 4.1.3, 4.1.3, 4.1.3 [20] OpenCV. (2011) Morphological transformations. Last accessed: 2014-09-12. [Online]. Available: http://docs.opencv.org/trunk/doc/py_tutorials/py_imgproc/py_morphological_ ops/py_morphological_ops.html 4.6 REFERENCES 53 [21] OpenCV. (2011) Structural analysis and shape descriptors. Last accessed: 2014-09-12. [Online]. Available: http://docs.opencv.org/modules/imgproc/doc/structural_analysis_ and_shape_descriptors.html 4.1.4 [22] S. Suzuki and K. be, “Topological structural analysis of digitized binary images by border following,” Computer Vision, Graphics, and Image Processing, vol. 30, no. 1, pp. 32 – 46, 1985. [Online]. Available: http://www.sciencedirect.com/science/article/pii/ 0734189X85900167 4.1.4 [23] A. El-Kalubi, R. Zhou, and H. Sun, “Design simulation model for tracking and speed estimation of moving objects,” in Intelligent Control and Information Processing (ICICIP), 2010 International Conference on, Aug 2010, pp. 271–276. 4.1.5 [24] OpenCV. (2011) Optical flow. Last accessed: 2014-09-12. [Online]. Available: http://docs.opencv.org/trunk/doc/py_tutorials/py_video/py_lucas_kanade/py_ lucas_kanade.html 4.1.5, 4.4, 4.1.5 [25] Petty design. (2011) Sunscreen. Last accessed: 2014-09-12. [Online]. Available: http://www.pettydesign.com/wp-content/uploads/2012/11/Sunscreen_2.jpg 4.12 [26] Rensselaer polytechnic Institute. (2010) 3d vision introduction. Last accessed: 2014-09-18. [Online]. Available: http://www.ecse.rpi.edu/Homepages/qji/CV/3dvision_intro.pdf D.1 Appendices 54 APPENDIX A: PROJECT PLANNING SCHEDULE 55 Appendix A: Project planning schedule APPENDIX A: PROJECT PLANNING SCHEDULE Project plan Figure A.1: Project plan. 56 APPENDIX B: PROJECT SPECIFICATION 57 Appendix B: Project specification APPENDIX B: PROJECT SPECIFICATION 58 Project overview Original project description: Build a stand-alone pole-mounted system to monitor and report traffic flow. The number of vehicles and the flow rate must be determined using computer vision and reported to Trintel’s SMART platform through MTNs cellular network. The project is sponsored by Trintel and MTN. The original topic description states that the pole-mounted system is required to automatically count the number of vehicles, as well as to determine the corresponding flow rate. The information should then be uploaded to Trintel’s online SMART platform, where it should be displayed graphically. The system should rely solely on computer vision techniques and should therefore not make use of any additional measuring equipment. In addition to the expected specifications, various additions were made to enhance the overall value of the system. The system has been expanded by incorporating analysis of video feeds from more than a single stand-alone source. A stand-alone pole-mounted system would be limited in the sense that it would only ever be able to monitor one specific location. The system developed in this project, is able to analyse traffic videos from any location, and consequently compute and upload traffic metrics accordingly. In addition to counting the number of vehicles and computing the corresponding flow rates, various other traffic metrics are computed. The traffic volume, flow rate, density, spot speed, peak hour factor and the level of service of a particular road segment, are metrics that are computed by the system. The addition of an autonomous Unmanned Aerial Vehicle (UAV) adds further functionality to the system. A control system was developed to automate the flight control and landing procedure of the aircraft. The UAV is able to track a unique target and stabilise itself, before landing on a designated platform. Once the aircraft has landed, the live video feed from its front-facing camera serves as the primary traffic footage to be analysed. Traffic metrics are computed on-the-fly, before being uploaded in real-time to an online dashboard via the GSM network. The GUI of the ground station unit was designed to be as user-friendly and as intuitive as possible. The system has been made to automatically learn road-specific parameters (direction of traffic in each ROI), as well as to create and store locationspecific road profiles. APPENDIX B: PROJECT SPECIFICATION 59 Functional specifications Traffic flow estimation • The system makes use of pure computer vision techniques to estimate traffic flow data • The system is able to obtain a static background scene from a video feed • The system is able to automatically detect the traffic direction • The system automatically stores location-specific information • A mixture of Gaussian distributions are used to model individual pixels • Background subtraction is used as the primary vehicle detection technique • Optical flow tracking is used to automatically compute vehicle velocities • The system is able to classify individual vehicles • Key traffic descriptors are automatically generated by the system • Traffic descriptors are uploaded to an online dashboard in real-time Automated drone control • The system is able to identify a checkerboard target • The control system running on the ground station is able to automate the drone • The drone is able to autonomously track the checkerboard target • The landing algorithm is able to succesfully land the drone on the landing platform • The traffic flow estimation system is able to use the drones FFC to compute traffic data Interfaces • The ground station is powered by a 20V 4.5A DC power supply • The GSM modem is powered by a 12V 2A DC power supply • The Parrot AR drone is powered by a 11.1V 2000mAh Li-Po battery • The GSM modem dimensions are 89 × 60 × 30mm • The Parrot AR drone dimensions are 517 × 451 × 150mm • The GSM modem weighs 100g • The Parrot AR drone weighs 420g • The entire system is controlled from the linux ground station GUI • Local traffic videos are stored as .avi files APPENDIX B: PROJECT SPECIFICATION 60 • Location-specific information is stored in local DAT files • The system is able to analyse a live video stream from a USB webcam • A GSM modem is connected to the ground station via a USB-Serial connection • Data is sent to the GSM modem in the form of AT strings • Data is sent from the GSM modem to the online dashboard in the form of IP packets • The Parrot AR drone is connected to the ground station via a secure Wi-Fi network • Commands are sent to the drone in the form of AT strings • The drone streams a live video feed from its FFC to the ground station • Drone sends video in PAVE (H.264 wrapper) format Performance • The system is able to compute traffic flow descriptors in real-time • The system has an accuracy rating of 80% − 100% (dependant on illumination conditions) • Traffic data is uploaded to the online dashboard in real-time • The checkerboard tracking algorithm is able to track the target in real-time • The drone control system is able to minimise the tracking error within 3 seconds A summary of the specifications is available in table B.1. Table B.1: System specifications. Specifications Create location-specific road profiles Analyse traffic footage live from a webcam or from stored video files Automatically count the number of vehicles passing through a ROI Compute vehicle velocities using pure computer vision techniques Automatically compute traffic flow metrics Upload the traffic information in real-time to an online dashboard Store the information locally and display it graphically from the system GUI and web interfaces are available and functional UAV can be controlled manually from the ground station UAV is able to track and follow a target autonomously UAV lands itself fully autonomously on a viewing platform APPENDIX C: OUTCOMES COMPLIANCE 61 Appendix C: Outcomes compliance APPENDIX C: OUTCOMES COMPLIANCE 62 Outcomes compliance Various aspects of the report reflect multiple instances where the outcomes of Project E448 are complied with. Identification of problems, suggestion and implementation of solutions • Dealing with the problem of segmenting dynamic foreground objects from a static background scene • Dealing with occlusion effects that hinder tracking performance • The design and implementation of a tracking algorithm capable of dealing with non-linear object trajectories • The design of a traffic counting algorithm capable of accurately counting vehicles under varying illumination conditions • The design of a velocity estimation system capable of accurately computing vehicle velocities in varying illumination conditions • The implementation of a traffic metric computational system • The design and implementation of an online dashboard for real-time reporting • The design and implementation of a video decoding algorithm • The design and implementation of a visual target tracking system for use in the aircraft automation system • Dealing with the problem of not being able to access all sensor measurements on the drone • Dealing with the issue of obtaining an accurate plant model of the aircraft • The design and implementation of a feedback controller to automate aircraft flight control • The design and implementation of an automated landing algorithm to ensure successful landing of the aircraft on a static platform Application of a knowledge of mathematics and basic engineering science to implement solutions • Knowledge and application of machine learning skills with a specific focus on object modelling • Knowledge and application of computer vision techniques • Implementation of software algorithms to develop highly optimised solutions • Conversions of values between different scales and units • Application of traffic flow engineering techniques in the computation of traffic flow metrics • Knowledge of networking protocols and database structures APPENDIX C: OUTCOMES COMPLIANCE 63 • Application of control system techniques in obtaining accurate plant models • Linearisation of the plant using small angle theory and Pade approximations • Integration of both software and hardware functional modules • Knowledge of data structures • Reading and manipulating sensor values from the aircraft • Following a systematic design approach for the theoretical implementation of a controller • Application and design of digital control systems • Implementing practical control systems in software with the use of augmented difference equations • Use of testing to validate and verify specifications • Knowledge of statistics to analyse accuracy and performance Implementation of solutions through the design of components, subsystems and systems The system, as a whole, was designed and implemented in a modular fashion. Each component and subsystem were designed and tested independently before final integration. This is further evident upon observation of the report structure. Each subsystem is discussed, designed, implemented and tested separately so as to ensure optimal system performance. A further explanation of this approach is detailed in sections 1.3 and 1.5. Gathering information, analysing it critically, and then drawing sensible conclusions • Gathering and analysing information regarding various computer vision techniques • Gathering and analysing information regarding traffic flow estimation techniques • Gathering and analysing information pertaining to various control system techniques • Gathering and analysing information pertaining to particular hardware options • Gathering information regarding camera calibration techniques • Gathering information regarding the Parrot AR drone and associated communication protocols • Using information of data structure formats in the selection of optimised storage procedures • Gathering and analysing information pertaining to traffic camera interfacing software • Gathering information and analysing network protocols for the efficient communication between the ground station and the Parrot AR drone APPENDIX C: OUTCOMES COMPLIANCE 64 • Gathering information pertaining to specific GUI libraries, and selecting the optimal choice for use throughout the project Effective use of aids such as measurement equipment and software in order to verify and analyse designs • Use of Linux Python development environment and its associated debugging functionality • Use of Matlab Simulink for theoretical simulations and other control system development tools • Use of Linux bash scripts to obtain Wi-Fi signal strength as well as system temperatures to be reported to the online dashboard • Use of the AirVantage XML configuration tool for the design and implementation of the dashboard back-end • Use of Trintel SMART platform for the design and implementation of the dashboard frontend Effective reporting of the project, in both written and oral form This report is submitted in partial fulfilment of this outcome. A presentation has been scheduled for the 7/11/2014 to examine the oral reporting component of this outcome. Demonstration of the ability of independent learning There is not much information made available for the automation of traffic flow analysis using computer vision techniques. There is even less information made available for the automation of a Parrot AR drone. This lack of information required various levels of self investigation, and ultimately lead to the development of custom software. • Software development in Python and Linux • Interfacing with an external GSM modem • Interfacing with a webcam • Interfacing with a Parrot AR drone and obtaining a mathematical plant model • Implementation of practical control systems • Sending and receiving AT commands • Reading and interpreting sensor data • XML file manipulation • Web development • Using a computer vision library APPENDIX C: OUTCOMES COMPLIANCE 65 • Computation and analysis of traffic flow data • Manipulation of image data Table C.1 shows the outcomes compliance cross-reference table. Table C.1: Outcomes compliance cross-reference table. ECSA Outcome Sections in Report Identification of problems, suggestion and implementation of solutions. (ECSA outcome #1) 3.3.2; 4.1.3; 4.1.6; 4.2.2; 4.2.5; 4.1.1; 4.1.4; 4.1.7; 4.2.3; 4.2.6 4.1.2; 4.1.5; 4.2.1; 4.2.4; Application of a knowledge of mathematics and basic engineering science to implement solutions. (ECSA Outcome #2) 3.3.2; 4.1.3; 4.1.6; 4.2.2; 4.2.5; 4.1.1; 4.1.4; 4.1.7; 4.2.3; 4.2.6 4.1.2; 4.1.5; 4.2.1; 4.2.4; Implementation of solutions through the design of components, subsystems and systems. (ECSA Outcome #3) 1.3; 1.5; 3.4 Gathering information, analysing it critically, and then drawing sensible conclusions. (ECSA Outcome #4) 5.1.1; 5.1.2; 5.1.3; 5.1.4; 5.1.5; 5.2.1; 5.2.2 Effective use of aids such as measurement equipment and software in order to verify and analyse designs. (ECSA Outcome #5) Chapter 4; Chapter 5 Effective reporting of the project, in both written and oral form. (ECSA Outcome #6) Entire report Demonstration of the ability of independent learning. (ECSA Outcome #9) 2.1.2; 2.1.3; 2.2.1; 2.2.2; 2.3.1; 2.3.2; 2.3.3; 3.3; Chapter 4; Chapter 5; 6.2 APPENDIX D: COMPUTER VISION 66 Appendix D: Computer vision APPENDIX D: COMPUTER VISION D1: Computer vision hierarchical structure Figure D.1: Computer vision hierarchy. Figure adapted from [26]. 67 APPENDIX D: COMPUTER VISION 68 D2: Test video snapshots (a) Video 1 snapshot. (b) Video 2 snapshot. (c) Video 3 snapshot. (d) Video 4 snapshot. Figure D.2: Test video snapshots. APPENDIX D: COMPUTER VISION D3: Velocity estimation experimental setup (a) Velocity test rig. (b) Velocity estimation test vehicle with test rig shown in the distance. (c) System calibration. White lines painted on the road are exactly 1m long and separated by an 80cm gap. Figure D.3: Velocity estimation experimental setup. 69 APPENDIX D: COMPUTER VISION 70 D4: Velocity estimation (a) Test vehicle 1 X-Hair. (b) Test vehicle 1 vectors. (c) Test vehicle 2 X-Hair. (d) Test vehicle 2 vectors. Figure D.4: Vehicle velocity calculation. APPENDIX D: COMPUTER VISION D5: Online dashboard Figure D.5: Online dashboard. 71 APPENDIX D: COMPUTER VISION D6 Traffic flow graphs Figure D.6: Traffic flow graphs. 72 APPENDIX D: COMPUTER VISION 73 Ground station GUI (a) Main GUI. (b) File selection stage. Figure D.7: Graphical User Interface. APPENDIX E: CONTROL SYSTEMS 74 Appendix E: Control systems APPENDIX E: CONTROL SYSTEMS 75 E1: Detailed drag calculation Figure E.1: Drag diagram. FT = mg cos θ (E.1a) FF = FT sin θ = mg tan θ (E.1b) →+ ΣFx = ma (E.1c) FF − FD = ma (E.1d) From E.1b mg tan θ − FD = ma (E.1e) According to the drag equation FD = 0.5ρν 2 CD A with ρ, ν, CD , A representing the mass density of the fluid, the velocity of the object relative to the fluid, the drag coefficient and the reference area respectively. FD = maD (E.1f) Let aD be the deceleration due to drag maD = 0.5ρν 2 CD A (E.1g) From E.1g maD = kν 2 (E.1h) With k = 0.5ρCD A. When the maximum velocity (νmax ) has been obtained at maximum tilt angle (θmax ), the forward acceleration of the aircraft will be zero. It then follows from E.1e and E.1f that mg tan θmax − maD = 0 (E.1i) APPENDIX E: CONTROL SYSTEMS 76 2 kνmax = mg tan θmax (E.1j) mg tan θmax 2 νmax (E.1k) So k= And it can finally be said that anett = FF − FD m anett = g tan θ − (E.1l) kν 2 m (E.1m) With anett representing the nett acceleration of the aircraft. The only unknown in equation E.1m, is the constant value k. This constant is determined by empirical investigation of equation E.1k, and was found to be k = 0.334. APPENDIX E: CONTROL SYSTEMS E2: Simulation block diagram (Pitch/Roll) Figure E.2: Full simulation block diagram of Pitch/Roll control. 77 APPENDIX E: CONTROL SYSTEMS 78 E3: Simulation block diagrams (Yaw and Gaz) (a) Full block diagram of Yaw control. (b) Full block diagram of Gaz control. Figure E.3: Full simulation block diagrams. APPENDIX E: CONTROL SYSTEMS E4: Theoretical CL step response (Pitch/Roll) (a) Inner-loop step response. (b) Outer-loop step response. Figure E.4: Inner and outer-loop step response. 79 APPENDIX E: CONTROL SYSTEMS E5: Theoretical CL step response (Yaw) Figure E.5: Closed-loop step response of the theoretical controller (Yaw). 80 APPENDIX E: CONTROL SYSTEMS E6: Theoretical CL step response (Gaz) Figure E.6: Closed-loop step response of the theoretical controller (Gaz). 81 APPENDIX E: CONTROL SYSTEMS 82 E7: Constant oscillation during ZN tuning Figure E.7: Proportional gain is increased until constant oscillation is observed. Phi is shown as a fraction of the maximum tilt angle of 12◦ . APPENDIX E: CONTROL SYSTEMS 83 E8: Practical step response after ZN tuning (Pitch/Roll) Figure E.8: Practical step response of the plant with ZN tuned parameters. Phi is shown as a fraction of the maximum tilt angle of 12◦ . APPENDIX E: CONTROL SYSTEMS 84 E9: Practical step response after ZN tuning (Yaw) Figure E.9: Practical step response of the plant with ZN tuned parameters. External disturbance at 22.8 seconds. APPENDIX E: CONTROL SYSTEMS 85 E10: Integral wind-up Figure E.10: Integral wind-up. Phi is shown as a fraction of the maximum tilt angle of 12◦ . APPENDIX E: CONTROL SYSTEMS E11: UAV On Screen Display Figure E.11: UAV On Screen Display. 86 APPENDIX F: SOFTWARE FLOW CHARTS 87 Appendix F: Software flow charts APPENDIX F: SOFTWARE FLOW CHARTS F1: Road profile creation Figure F.1: Flow chart: Road profile creation. 88 APPENDIX F: SOFTWARE FLOW CHARTS F2: Background modelling Figure F.2: Flow chart: Background modelling. 89 APPENDIX F: SOFTWARE FLOW CHARTS F3: Shadow removal Figure F.3: Flow chart: Shadow removal. 90 APPENDIX F: SOFTWARE FLOW CHARTS F4: Traffic speed and volume analysis Figure F.4: Flow chart: Traffic speed and volume analysis. 91 APPENDIX F: SOFTWARE FLOW CHARTS F5: Metric computations and sending Figure F.5: Flow chart: Metric computations and sending. 92 APPENDIX F: SOFTWARE FLOW CHARTS F6: Target tracking Figure F.6: Flow chart: Target tracking. 93 APPENDIX F: SOFTWARE FLOW CHARTS F7: PID controller Figure F.7: Flow chart: PID controller. 94 APPENDIX F: SOFTWARE FLOW CHARTS F8: Landing algorithm Figure F.8: Flow chart: Landing algorithm. 95