DATA-ADAPTIVE SPECTRAL ESTIMATION ALGORITHMS AND THEIR SENSING APPLICATIONS By ODE OJOWU JR. A DISSERTATION PRESENTED TO THE GRADUATE SCHOOL OF THE UNIVERSITY OF FLORIDA IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY UNIVERSITY OF FLORIDA 2013 c 2013 Ode Ojowu Jr. ⃝ 2 I dedicate this to God, family and friends. 3 ACKNOWLEDGMENTS This dissertation would not have been possible without the support of several people. I would first of all like to thank my parents, my siblings and friends for the love and moral support they have given me throughout the years. I would also like to thank my advisor, Prof. Jian Li for taking me in as a student, and taking the time and patience to guide me throughout this important phase of my academic career; I will forever be grateful. This dissertation also would not have been possible without the help of some of my close colleagues, lab mates and friends at the Spectral Analysis Lab, which include: William Rowe, Dr. Johan Karlsson, Dr. Duc Vu, Chris Gianelli, Kexin Zhao, Dr. Luzhou Xu, Dr. Hao He, Dr. Jun Ling, Lim Deoksu, Qilin Zhang and Dr. Ming Xue. The daily discussions and advice helped with my work tremendously. I would finally like to thank my committee members, Prof. Henry Zmuda, Prof. Jenshan Lin and Prof. Hugh Fan for their guidance and support, and also for taking the time to be on my committee. I appreciate the sacrifice sincerely. 4 TABLE OF CONTENTS page ACKNOWLEDGMENTS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 LIST OF TABLES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 LIST OF FIGURES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8 ABSTRACT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 CHAPTER 1 REVIEW OF SPECTRAL ESTIMATION AND TECHNIQUES . . . . . . . . . . 13 1.1 Introduction: Spectral Estimation Problem . . . . . . . . . . . . . 1.1.1 Energy Spectral Density . . . . . . . . . . . . . . . . . . . 1.1.2 Power Spectral Density . . . . . . . . . . . . . . . . . . . 1.1.3 Power Spectral Density Estimation . . . . . . . . . . . . . 1.2 Periodogram: Non-parametric Method . . . . . . . . . . . . . . . 1.2.1 Resolution: Periodogram . . . . . . . . . . . . . . . . . . . 1.2.2 Filter-bank Interpretation: Periodogram . . . . . . . . . . . 1.3 Data-adaptive Approaches . . . . . . . . . . . . . . . . . . . . . 1.3.1 CAPON: Non-parametric . . . . . . . . . . . . . . . . . . . 1.3.2 Amplitude and Phase Estimation (APES): Non-parametric 1.3.3 Iterative Adaptive Approach (IAA): Non-parametric . . . . 1.3.4 SLIM and SPICE Algorithms: Non-parametric . . . . . . . 1.3.5 RELAX: Parametric . . . . . . . . . . . . . . . . . . . . . . 1.4 Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.5 Notations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13 16 17 17 18 19 22 23 23 25 26 28 28 29 30 DATA-ADAPTIVE TECHNIQUES FOR NETWORK FREQUENCY EXTRACTION FROM DIGITAL RECORDINGS . . . . . . . . . . . . . . . . . . . . . . . . . . . 31 2.1 2.2 2.3 2.4 Chapter Summary . . . . . . . . . . . . . . . . . . Introduction . . . . . . . . . . . . . . . . . . . . . . Network Frequency Characteristics and Database Extraction Algorithms . . . . . . . . . . . . . . . . 2.4.1 Frequency Domain Analysis (STFT) . . . . 2.4.2 IAA and TRIAA . . . . . . . . . . . . . . . . 2.4.3 Frequency Tracking . . . . . . . . . . . . . . 2.4.4 Matching the Extracted ENF to Database . 2.5 Experimental Results . . . . . . . . . . . . . . . . 2.5.1 Data1 Analysis . . . . . . . . . . . . . . . . 2.5.2 Data2 Analysis . . . . . . . . . . . . . . . . 2.6 Conclusions . . . . . . . . . . . . . . . . . . . . . . 5 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31 31 34 36 36 39 43 44 45 47 49 51 3 DATA-ADAPTIVE RELAX FOR RFI SUPPRESSION FOR THE SYNCHRONOUS IMPULSE RECONSTRUCTION (SIRE) RADAR . . . . . . . . . . . . . . . . . 53 3.1 3.2 3.3 3.4 3.5 Chapter Summary . . . . . . . . . . . . . . . . . . . . . . . Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . SIRE Equivalent Sampling Scheme . . . . . . . . . . . . . Existing RFI Suppression Methods . . . . . . . . . . . . . . Proposed RFI Suppression Method: RELAX and Averaging 3.5.1 Modelling of RFI . . . . . . . . . . . . . . . . . . . . 3.5.2 RELAX Algorithm . . . . . . . . . . . . . . . . . . . . 3.5.3 Multi-snapshot RELAX Algorithm . . . . . . . . . . . 3.6 Autoregressive (AR) Modelling . . . . . . . . . . . . . . . . 3.7 Experimental Results . . . . . . . . . . . . . . . . . . . . . 3.7.1 Simulations . . . . . . . . . . . . . . . . . . . . . . . 3.7.2 Sniff Experimental Data . . . . . . . . . . . . . . . . 3.8 Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53 54 56 60 63 63 64 69 72 74 74 76 79 DATA-ADAPTIVE, SPARSE SUPER-RESOLUTION IMAGING FOR THE SIRE FLGPR RADAR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82 4.1 4.2 4.3 4.4 Chapter Summary . . . . . . . . . . . . . . . . . . . . Introduction . . . . . . . . . . . . . . . . . . . . . . . . Data Model: SIRE Impulse Based FLGPR . . . . . . . Back-projection/Delay-and-sum (DAS) Based Methods 4.4.1 Back-projection/DAS . . . . . . . . . . . . . . . 4.4.2 Sparse: CLEAN Method . . . . . . . . . . . . . 4.5 Super-resolution Methods . . . . . . . . . . . . . . . . 4.5.1 Orthogonal Projection and Time Gating . . . . 4.5.2 SLIM . . . . . . . . . . . . . . . . . . . . . . . . 4.5.3 SPICE . . . . . . . . . . . . . . . . . . . . . . . 4.6 Numerical and Experimental Results . . . . . . . . . . 4.7 Conclusions . . . . . . . . . . . . . . . . . . . . . . . . 5 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82 82 85 87 88 89 90 91 95 96 99 106 CONCLUDING REMARKS AND FUTURE WORK . . . . . . . . . . . . . . . . 107 REFERENCES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109 BIOGRAPHICAL SKETCH . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117 6 LIST OF TABLES Table page 1-1 Notations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30 2-1 Abbreviations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34 2-2 Parameters for the Experiment . . . . . . . . . . . . . . . . . . . . . . . . . . . 45 2-3 Correlation coefficients of Algorithms (Data1) . . . . . . . . . . . . . . . . . . . 47 2-4 Standard Deviation of error for Algorithms (Data1) . . . . . . . . . . . . . . . . 47 2-5 Correlation coefficients of Algorithms (Data2) . . . . . . . . . . . . . . . . . . . 48 2-6 Standard Deviation of error for Algorithms (Data2) . . . . . . . . . . . . . . . . 48 3-1 ARL Parameters for Synchronous Reconstruction Radar. . . . . . . . . . . . . 57 3-2 Suppression Algorithm: RELAX + Averaging . . . . . . . . . . . . . . . . . . . 68 3-3 Suppression Algorithm: M-RELAX + Averaging . . . . . . . . . . . . . . . . . . 71 3-4 RFI Suppression (dB): File 1 (P~ . . . . . . . . . . . . . . . . . . . . . . . . 77 3-5 . . . . . . . . . . . . . . . . . . . . . . . . 77 = 1) RFI Suppression (dB): File2 (P~ = 1) 4-1 SLIM Algorithm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96 4-2 CG SPICE Algorithm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97 4-3 Subspace approximation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 7 LIST OF FIGURES Figure page 1-1 Synthetic aperture radar (SAR) imaging . . . . . . . . . . . . . . . . . . . . . . 14 1-2 Spectrogram: Estimating the Electric Network Frequency (ENF) in audio an recording . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15 1-3 Bartlett window spectrum . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20 1-4 Spectra of two sinusoids with large frequency spacing . . . . . . . . . . . . . . 21 1-5 Spectra of two sinusoids with small frequency spacing . . . . . . . . . . . . . . 21 1-6 Spectrum: Comparison of adaptive methods to the periodogram . . . . . . . . 27 2-1 FDR Distribution in North America . . . . . . . . . . . . . . . . . . . . . . . . . 35 2-2 Segmentation of data for STFT . . . . . . . . . . . . . . . . . . . . . . . . . . . 38 2-3 Matching extracted ENF to database (Data1) . . . . . . . . . . . . . . . . . . . 46 2-4 Power Spectrum of one Frame (Data2): poor resolution of FFT . . . . . . . . . 49 2-5 Power Spectrum of one Frame (Data2): strong interference signal . . . . . . . 49 2-6 Extracted ENF via Frequency Tracking . . . . . . . . . . . . . . . . . . . . . . . 50 2-7 Matching extracted ENF to database (Data1) . . . . . . . . . . . . . . . . . . . 51 2-8 Absolute error of Algorithms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51 3-1 Synchronous Impulse Reconstruction (SIRE) equivalent time sampling . . . . . 58 3-2 Spectrum of SIRE sampling after interleaving compared to the spectrum of regular sampling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59 3-3 Spectrum SIRE sampling pattern: One fast time pulse . . . . . . . . . . . . . . 59 3-4 Spectrum SIRE sampling pattern . . . . . . . . . . . . . . . . . . . . . . . . . . 60 3-5 RFI Suppression (dB): Averaging method (M realizations) for simulated SIRE sampled RFI signals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62 3-6 RFI suppression (SIRE sampling) - using RELAX with P (real-valued) sinusoids estimated . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75 3-7 RFI suppression - RELAX algorithms with P (real-valued) sinusoids estimated and suppressed from sniff data . . . . . . . . . . . . . . . . . . . . . . . . . . . 78 3-8 Echo retrieval (File1) - RELAX with P (real-valued) sinusoids compared to ideal echo signal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78 8 3-9 Echo retrieval (File1) - RELAX with P (real) sinusoids combined with M-RELAX with P~ = 1 real sinusoid, compared to ideal echo signal . . . . . . . . . . . . . 79 3-10 RFI suppression - AR modelling with order q compared to averaging for sniff data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80 3-11 Echo retrieval (File1) - AR modelling with order q, compared to ideal echo signal 80 4-1 Forward looking ground penetrating radar . . . . . . . . . . . . . . . . . . . . . 84 4-2 SIRE FLGPR: 2D aperture for SAR imaging . . . . . . . . . . . . . . . . . . . . 86 4-3 Time gating . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91 4-4 Subspace dimension (s) for high resolution imaging . . . . . . . . . . . . . . . 100 4-5 FLGPR SAR Imaging - detection of weak target . . . . . . . . . . . . . . . . . . 101 4-6 FLGPR Imaging - resolution improvement . . . . . . . . . . . . . . . . . . . . . 102 4-7 Orthogonal projection comparison . . . . . . . . . . . . . . . . . . . . . . . . . 103 4-8 Real data - SIRE FLGPR SAR Imaging . . . . . . . . . . . . . . . . . . . . . . 104 4-9 ROC comparison . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105 9 Abstract of Dissertation Presented to the Graduate School of the University of Florida in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy DATA-ADAPTIVE SPECTRAL ESTIMATION ALGORITHMS AND THEIR SENSING APPLICATIONS By Ode Ojowu Jr. December 2013 Chair: Jian Li Major: Electrical and Computer Engineering Spectral analysis of signals, or the problem of spectral estimation revolves around estimating the distribution of power over frequency of a random signal. It has useful applications in various fields of study (including Speech analysis, Medicine, RADAR and SONAR) due to the fact that the frequency content of an observed signal can provide very useful information in these fields. A well known method for estimating the spectral content of a signal is the Periodogram (developed by Arthur Schuster), which is a data-independent method of estimation. This method is based on computing the Fourier transform of the signal which can be computed efficiently using the Fast Fourier Transform (FFT) algorithm. This method however, is limited by relatively poor resolution and high sidelobe problems, which can lead to degradation in retrieval of the desired information present within the signal. Data-dependent (adaptive) techniques both non-parametric and parametric can offer superior performance over data-independent methods like the periodogram at a cost of increased computational complexity. These data-adaptive approaches however, can lead to improved spectral resolution and lower sidelobes, which can reveal more information about the signal under study. These advantages have led to increased interest in data-adaptive approaches to the problem of spectral estimation. This dissertation revolves around analyzing and applying robust adaptive techniques 10 to real-world problems in a unique, effective and efficient way to achieve superior performance over their data-independent counterparts. The introduction chapter briefly reviews the problem of spectral estimation as well as some of the methods for spectral estimation. We start this dissertation in Chapter 2 with the basic problem of frequency estimation (harmonic retrieval). In this chapter, adaptive techniques are used in the problem of harmonic retrieval in the presence of strong interference. The focus is on the problem of digital audio forensics, where the goal is to extract the embedded network frequency from a digital recording and compare it to a known database for digital audio verification. In the presence of significant interference, extracting the network frequency using the standard method (Periodogram) is ineffective and proves to be challenging due to poor resolution and high sidelobe problems. We therefore use a robust adaptive algorithm (Iterative Adaptive Approach - IAA) to improve the spectral resolution and suppress sidelobes hence effectively separating the network frequency from interference. A frequency tracking method based on dynamic programming is used in addition to this data-adaptive method to extract the Network frequency accurately and hence provide more reliability for the verification process compared to the current standard, which is based on the data-independent Fourier transform. Chapters 3 and 4 are the focus of this dissertation. In these chapters, the remote sensing tool known as the Synchronous Impulse Reconstruction (SIRE) Ultra-wideband radar (currently being built by the Army Research Lab (ARL) for landmine detection) is analyzed and studied. In Chapter 3, we once again apply an adaptive technique for harmonic retrieval. The goal here is to effectively suppress Radio Frequency Interference (RFI) picked up by this UWB radar which samples its returned signals using an equivalent sampling scheme. This equivalent sampling scheme makes RFI suppression difficult due to its irregular and under-sampled data (aliasing). The current method for RFI suppression for this UWB radar is simply averaging multiple realizations 11 of the measured data. In this chapter, we model the aliased RFI signals as a sum of sinusoids and estimate the aliased frequencies and amplitudes accurately using a robust algorithm - RELAX. A direct implementation of this algorithm is computationally intensive, therefore, an efficient method for implementation is presented in this chapter, which takes advantage of this equivalent sampling and improves computation. As RFI suppression is the goal, the estimates are used to reconstruct the aliased RFI samples accurately and are then suppressed from the data without altering the desired radar signals. In Chapter 4, we focus on radar imaging for landmine detection for this SIRE UWB radar. The standard method currently used for this radar is the data-independent backprojection or delay-and-sum (DAS) approach. This method suffers from high sidelobe problems and poor resolution. A recursive sidelobe minimization (RSM) algorithm was recently proposed by the army research laboratory for effective sidelobe reduction. This data-independent approach however, has the same resolution limitation as the backprojection algorithm. As imaging resolution is important for separating desired targets (mines) from clutter, this chapter, focuses on sparse super-resolution imaging techniques for imaging. A new technique for imaging based on applying data-adaptive approaches post significant data reduction as well as interference reduction via an orthogonal projection is proposed in this chapter. This approach is able to achieve an improvement in imaging resolution by a factor of approximately 2, based on simulated experiments. Chapter 5 provides the concluding remarks and possible future work. The contents of Chapter 2 are published in IEEE transactions on information forensics and security Volume 7, no. 4. The contents of Chapter 3 are published in the International Journal of Remote Sensing and Applications (IJRSA) vol 3 Issue 1. The contents of Chapter 5 are to be submitted for publication. 12 CHAPTER 1 REVIEW OF SPECTRAL ESTIMATION AND TECHNIQUES 1.1 Introduction: Spectral Estimation Problem Most phenomena or signals that occur in nature or in practice are typically random in nature, and are best modelled as random signals. Examples of such random signals include but are not limited to speech/audio signals and thermal noise generated by electronic devices. Due to the random fluctuation of these signals, they are best characterized in terms of statistical averages. The autocorrelation function of a random process is a statistical average used for characterizing these random signals in the time domain. The power spectral density (spectrum) provides the frequency content of such signals. Spectral analysis of signals or the spectral estimation problem, involves estimating the frequency content of a random signal. This is done by estimating the power distribution over frequency from a stationary sequence of finite time samples, which is known as the power spectrum of the signal [1–5]. Schuster in the late 19th century pioneered the most well-known spectral estimation techniques called the Periodogram. This harmonic analysis approach allows for detecting and measuring ”hidden periodicities” [6] in the observed data. Spectral estimation can also be performed on non-stationary data, by dividing the data into segments in time (each assumed to be stationary) [7],[8],[9]. A time-varying power spectrum (image) can be displayed to provide information about the signal (also known as the spectrogram [10]). Power spectral estimation has applications in many fields [1–3, 11, 12]. Speech signals which are periodic in nature are analyzed using the spectrogram. This frequency domain analysis provides useful information that can lead to speech recognition and generation. In the sensing fields of RADAR and SONAR, the spectral content of received signals may provide information about the targets of interest [11, 13] in a 13 given scene of interest (see Fig. 1-1). Also the power spectrum of signals may provide information about radio frequency interference in such a signal and hence lead to effective suppression of the interference. In the field of MEDICINE, the power spectrum of electroencephalogram (EEG) signals can be used to evaluate the different sleep cycles in humans [14, 15]. These can are used to investigate and study narcoleptic (disease characterized by the inability to properly regulate sleep-wake cycles) patients [15]. More recently in audio analysis, the spectrogram of the audio signal can indicate the presence of the electric network frequency (see Fig. 1-2), which can be used for digital audio authentication [16]. SAR Image from Phase History data 0 −5 −10 −15 −20 −25 −30 −35 A B Figure 1-1. Synthetic aperture radar (SAR) imaging: (A) Photograph of object at 45o (B) SAR image formed using Spectral estimation (FFT) There are two broad approaches to spectral estimation. The first approach is called the non-parametric method and the other is called the parametric method. The non-parametric methods assumes no prior information about the data, where as the parametric methods assumes a specific model of the data, which then results in a problem of parameter estimation. The parametric methods are more accurate than the classical non-parametric techniques, when the assumed model is accurate. However, they perform poorly when there are inaccuracies in the data-model. 14 Spectrogram of pre−filtered audio signal (ENF Harmonic =180 Hz) 179.5 0 179.6 −5 179.7 −10 179.8 −15 179.9 −20 180 −25 180.1 −30 180.2 −35 180.3 −40 180.4 −45 180.5 0 500 1000 1500 2000 time (secs) 2500 3000 3500 −50 Figure 1-2. Spectrogram: Estimating the Electric Network Frequency (ENF) in audio an recording for forensic analysis (see chapter 1 for more details) In this chapter, the problem of power spectral density estimation of signals is briefly described. Commonly used techniques for spectral estimation within these two broad methods (non-parametric and parametric methods) for estimating the spectrum of a signal, will be briefly discussed. The limitations of these methods in practice will also be briefly discussed. Some data-dependent (adaptive) algorithms (Capon, APES, IAA, SLIM and SPICE which are non-parametric and RELAX which is paramteric) will be mentioned along with their benefits [1, 2, 17] over the classical (data-independent) approaches in practical scenarios. The core of this dissertation is effective and efficient application of data-adaptive techniques to solving real-world problems. Before delving into the problem of spectral estimation of random signals, let us consider the case of spectral estimation of finite length deterministic signals. This analysis is fairly straightforward as deterministic signals are predictable over time [18]. The results will then be extended to the case of random signals. 15 1.1.1 Energy Spectral Density Consider a signal x[n] (discrete) with finite energy, that is, E = ∞ ∑ n=−∞ |x[n]|2 < ∞ (1–1) then its discrete time fourier transform (DTFT) exists and is given by: X (ω ) = ∞ ∑ n=−∞ x[n]e−jωn (1–2) where ω is the angular frequency variable measured in radians per sample. From Parseval’s theorem equation (1–1) can be written as: E = ∞ ∑ n=−∞ |x[n]|2 = 1 ∫ π |X (ω)|2 2π −π (1–3) From the equation above the energy spectral density of x[n] which is the distribution of the energy of the signal of frequency is therefore defined as: Sxx (ω ) = |X (ω )|2 (1–4) Note that the energy spectral density Sxx (ω ) can be written as the Fourier transform of the autocorrelation sequence rxx (k ) of the signal x[n]: Sxx (ω ) = ∞ ∑ n=−∞ rxx (k )e−jωk dω (1–5) x∗ [n]x[n − k ] (1–6) where rxx (k ) = ∞ ∑ n=−∞ The analysis above, is specifically for signals with finite energy (deterministic signals). However, signals typically encountered in applications are characterized as stochastic processes and do not have finite energy and hence do not posses a Fourier transform. These random signals however, posses an average power and can be described by their power spectral density. 16 1.1.2 Power Spectral Density Consider a stationary stochastic process y [n], where E{y [n]} = 0 for all n. The autocovariance function (same as autocorrelation function for stationary stochastic process with mean zero) of y [n] is given by ryy (k ) = E{y ∗ [n]y [n − k ]} (1–7) where E{·} is the statistical average over all realizations. The power spectral density (PSD) of y [n] is defined as (Wiener-Khintchine theorem [1] ): ϕyy (ω ) = ∞ ∑ n=−∞ ryy (k )e−jωk (1–8) This simply the fourier transform of the autocorrelation function. Note that the inverse transform of this PSD gives ryy (k ) as shown below [ ] ∫ π ∞ ∞ ∑ 1 1 ∫ π ϕ (ω)ejωk dω = ∑ jω (k−s) ryy (s) dω = ryy (s)δks = ryy (k ) 2π −π yy 2π −π e s=−∞ s=−∞ were δ denotes the Kronecker delta function. Note that, the average power of the stochastic process y [n] is given by the zero lag of the autocorrelation function ryy (0): E{|y [n]|2 } = ryy (0) = 1 ∫ π ϕ (ω)dω 2π −π yy (1–9) This equation (1–9) leads to the motivation for defining the power spectral density in (1–8). The PSD can also be defined as: 2 N 1 ∑ ϕyy (ω ) = lim E y [n]e−jωn N →∞ N n=1 (1–10) which is equivalent to the definition in (1–8) under the assumption that the autocovaraince sequence (ACS) ryy (k ) decays quickly. 1.1.3 Power Spectral Density Estimation Obtaining the true power spectral density (PSD) ϕyy (ω ) of a random process is impossible from a finite set of measurements. This is due to the fact that one will need 17 to compute an an infinite number of values from a finite set of data, which is an ill-posed problem [1, 2]. The problem of spectral estimation, then becomes getting an estimate ϕ^yy (ω ) of the true PSD ϕyy (ω ) of a random process from a finite sequence of observations of the signal. If the signal is statistically stationary, the longer the observed sequence the more accurate the estimate. However if the signal is statistically non-stationary, then one cannot select and arbitrarily long data length for estimation. This is a major limitation on the quality of the PSD estimate. Recall that the PSD describes how the power of a signal is distributed in frequency. This can then be interpreted physically as filtering the random signal through a narrowband filter around a specific frequency of interest (ωo ). This process is then repeated for all the frequencies of interest (−π ≤ ωo ≤ π). Fourier based methods (computed efficiently using the Fast Fourier Transform (FFT)) of spectral estimation are based on this technique [1] and are discussed next. 1.2 Periodogram: Non-parametric Method As mentioned in the section above, the non-parametric methods of spectral estimation provide an estimate of the power spectral assuming no prior information of the data model. The periodogram which was introduced by Schuster in 1898 to detect ”hidden periodicities” in a signal, is a classical non-parametric method which is widely used for spectral estimation. This fourier based method, along with its modified versions are based directly on the definition in (1–10). The periodogram of a set of N samples of random process {y [n]}N n=1 is given as (the subscript yy in ϕyy (ω ) has been dropped for notational simplicity): 2 N ∑ ^ϕp (ω) = 1 y[n]e−jωn N n=1 (1–11) Note that (1–11) is essentially thesame as the (1–10) with the expectation and limit operation removed. This ommission is due to the fact that only N samples of the signal 18 are available. The periodogram can be computed using the discrete fourier transform of the available samples (which can be efficiently computed using the fast fourier transform (FFT). This yields samples of the PSD estimate at frequencies ωk k = 2πk/N for = 0, 1, . . . , N − 1). Note that equation (1–11) can be written as: ϕ^p (ω ) = N −1 ∑ r^[k ]e−jωk (1–12) y [ n] y ∗ [ n − k ] (1–13) k=−N +1 where 1 r^[k ] = N N −1 ∑ k=−N +1 corresponds to the biased estimates of the ACS sequence. This is referred to as the correlogram. The unbiased estimate of the ACS can also be used to compute the correlogram. One major limitation of the periodogram is limited spectral resolution, which is discussed next. 1.2.1 Resolution: Periodogram One key concept in spectral estimation is spectral resolution, which is the ability to resolve or seperate closely spaced frequency components within a signal. The resolution of the periodogram is one major drawback of this data-independent method of spectral estimation. Note that the expected value of the periodogram can be written as: E{ϕ^p (ω )} = N −1 ∑ k=−N +1 E{r^[k ]}e −jωk = N −1 ∑ k=−N +1 w[k ]r[k ]e−jωk (1–14) where (based on (1–13)) 1 − |k| N w [k ] = 0 for n = ±1, ±2, . . . , ±N otherwise 19 (1–15) is the Bartlett window and r[k ] is the true PSD. Equation (1–14) is the Fourier transform of the product of two time sequences, which correspond to the convolution of their individual Fourier transforms as given in (1–16). ∫ 1 E{ϕ^p (ω )} = 2π ϕ(β )W (ω − β ) π (1–16) −π where W (ω ) is the Fourier transform of the Bartlett window. [ 1 sin(ωN/2) W (ω ) = N sin(ω/2) Figure 3 below shows W (ω ) for N = 10 and N = 20. ]2 (1–17) The 3dB (half-power) main lobe 0 N = 10 N = 30 −5 −10 dB −15 −20 −25 −30 −35 −40 −3 −2 −1 0 1 ω (radians/sample) 2 3 Figure 1-3. Bartlett window spectrum: resolution limitation periodogram (window length = N) width is approximately equal to 4π/2N = 2π/N radians per sample (1/N cycles per sample). The spectral estimate of periodogram ϕ^(ω ) will not be able to resolve peaks in the true PSD ϕ(ω ) that have less than 1/N cycles per sample separation. Increasing the number of observed samples will improve the spectral resolution (not be confused with zero-padding). The estimated spectrum can be computed using the DFT (and efficiently using the FFT as mentioned earlier). Increasing the number of available samples by zero-padding (adding zeros to the end of the signal) can provide more detail in the 20 spectrum computed using the FFT. This results in the interpolation of spectrum, however it does not change the spectral resolution as shown in Figure 1-4 and 1-5. Figures 1-4 = 20 samples) with frequency spacing and 1-5 show the spectrum of sinusoids (N ω = 2π × (0.06) and ω = 2π × (0.02) respectively. Each figure shows different zero-padding factors. The periodogram suffers from relatively poor resolution and high 1 Zeropad (32 samples) 0.5 0 0 5 10 15 20 25 30 1 Zeropad (128 samples) 0.5 0 0 20 40 60 80 100 120 Figure 1-4. Spectra of two sinusoids with frequency spacing ω = 2π × (0.06) 1 Zeropad (32 samples) 0.5 0 0 5 10 15 20 25 30 1 Zeropad (128 samples) 0.5 0 0 20 40 60 80 100 120 Figure 1-5. Spectra of two sinusoids with frequency spacing ω = 2π × (0.02) sidelobe problems as seen in Figure 1-3. These reasons have led to investigation into data-adaptive methods of spectral estimation that can provide improved resolution and sidelobe suppression capabilities. In the next subsection some of these data-adaptive 21 algorithms are discussed. Prior to this discussion we will review the periodogram in different light which leads to one of the well known data-adaptive algorithms known as the CAPON algorithm [19]. 1.2.2 Filter-bank Interpretation: Periodogram Recall that the PSD is the power distribution over frequency of the signal, which as mentioned earlier can be interpreted as passing the signal through a bank of narrowband filters (at different frequencies) and computing the output power (which is then divided by the bandwidth of the filter). In this light, the periodogram estimator ϕ^p (ω ) at a given frequency ω can be written as: 2 2 −1 N N 1 ∑ ∑ ∗ 2 jω (N −n) ϕ^p (ω ) = y [ n ] e = N h [ k ] y [ N − k ] = N |z (N )| ω N n=1 n=0 (1–18) where z (N ) = and ejωk ∗ hω [ k ] = 0 ∞ ∑ n=0 hω [ k ] y [ N − k ] for k = 0, 1 , . . . , N − 1 = (1–20) otherwise Note that the periodogram can be interpreted as filtering the signal y through a narrowband pass filter hω (1–19) = {y [k ]}N k=0 {hω [k ]}N k=0 and selecting just a single output z (N ) of the filtering process hH ω y for power calculation at the specified frequency ( {·}∗ and {·}H correspond to the conjugate (scalar) and conjugate transpose (vector) operation). This fact leaves the periodogram with a large variance irregardless of the data length (N ). The output power divided by the bandwidth (PSD) is then calulcated as E|z [n]|2 / = |z [N ]|2 /, where = 1/N cycles per sample is the filter’s bandwidth. Modified versions of the periodogram such as the Bartlett and Welch which segment (non-overlapping and overlapping respectively) the stationary sequence 22 in question and average the periodograms of the segments can be used to reduce the variance [2]. In terms of the filter-interpretation, these methods can be seen as computing the power with more than one sample (number of segments). However, the periodogram is computed using a reduced length of the data, hence there is a trade-off between statistical variance and resolution. Some data-adaptive non-parametric methods have addressed the limitations of the periodogram by designing a data-adaptive filter, to provide more accurate PSD estimates with better resolution. In the next section methods like the CAPON, APES (Amplitude and Phase Estimation), IAA (Iterative Adaptive Approach) which are data data-adaptive non-parametric approaches are discussed. The data-adaptive parametric approach known as RELAX (strictly for sinusoidal parameter estimation) is also discussed. 1.3 Data-adaptive Approaches In this section, we discuss some well known non-parametric data-adaptive approaches (CAPON, APES) as well as recent non-parametric spectral estimators (IAA, SLIM SPICE). These algorithms improve upon the periodogram spectral estimator in terms of resolution and sidelobe reduction. A parametric approach specifically for estimating parameters of line spectra (sinusoids) known as RELAX is also mentioned and discussed in detail later on in Chapter 3. 1.3.1 CAPON: Non-parametric From the last sub-section, the periodogram output at a specific frequency ω can be interpreted as using a data-independent filter (bandpass filter) with an impulse response −1 {hω [k ] = e−jωk }N k=0 corresponding simply to the Fourier Transform vector. Unlike the data-independent filter used in the periodogram, the CAPON method [19–21] (also known as the minimum variance method) designs a data-dependent (adaptive) bandpass filter hω 1 = {hω [k]}l− k=0 to achieve some specific desired properties 23 (The CAPON method uses overlapping segments of length (l × 1) of the data to improve statistical variance). These properties include: Design a bank of filters hω that pass the frequency component (or sinusoid with frequency) ω undistorted. 2. Filter should also effectively suppress (or minimize) all out-of-bound (any other frequencies) power within the signal. This process can be expressed as follows. Let the output of the filter at any instant 1. n = [0, 1, . . . , N − 1] be given by: z [ n] = l−1 ∑ k=0 h∗ω [k ]y [n − k ] = hH ω yn (1–21) = [y[n], y[n − 1], . . . , y[n − l + 1]]T . The total output power of the filter is H then given as E{|z [n]|2 = hH ω Rhω . Where R = E{yn yn } is the covariance matrix of the where yn data vector. The CAPON filter is designed to meet the properties in the aforementioned steps by minimizing the total output power of the filter subject to the constraint that the frequency ω is filtered without distortion given by the optimization equation (1–22). min hH ω Rhω hω where a(ω ) = subject to hH ω a(ω ) = 1 (1–22) {e−jω }ln=0 is the sinusoid component with frequency ω to be passed undistorted. The resulting filter is given by: hω = R−1 a(ω) aH (ω)R−1 a(ω) (CAPON filter) (1–23) The PSD estimate can then be calculated as filter output power E{|z [n]|2 divided by the bandwidth ≈ 1/(l). ϕ^CAP ON (ω ) = E{|z [n]|2 = aH (ω)Rl −1 a(ω) 24 (CAPON spectral estimate) (1–24) ^ based on the M The sample covariance matrix R = N − l + 1 overlapping segments (each of length l) of the data is used to estimate the covariance matrix and is given by: M −1 ∑ ^R = 1 yn ynH M n=0 (1–25) A very similar algorithm to the CAPON algorithm known as the Amplitude and Phase Estimation algorithm (APES) is described next. 1.3.2 Amplitude and Phase Estimation (APES): Non-parametric Note that in the description of the CAPON algorithm, the filter design was based on passing a single frequency, while suppressing all other out-of-bound frequencies. CAPON achieves the suppression by minimizing the total output power. APES algorithm [22],[23],[24] uses the same idea but suppressing out-of-bound frequencies is achieved by designing a filter such that the filtered sequence is as close as possible to the a sinusoidal signal at the given frequency ω in the least squares (LS) sense. The optimization equation is given by : min α(ω ),hω M −1 ∑ n=0 jωn 2 |hH | ω yn − α(ω )e subject to hH ω a(ω ) = 1 (1–26) The cost function in (1–25) can be re-written as: 1 M −1 ∑ M n=0 jωn 2 ∗ H ^ |hH | = |hH yω − α(ω)~yωH hω + α(ω)|2 ω yn − α(ω )e ω Rhω − α (ω )hω ~ (1–27) = |α(ω) − hHω ~yω |2 +hHω R^ hω − |hHω ~yω |2 Note that the second and third terms in (1–27) do not depend on α(ω ) and therefore the minimization of this cost function with respect to α(ω ) is given by α ^(ω) = hHω ~yω where ∑ ~yω = (1/M ) Mn=0−1 yne−jωn . The optimization problem for designing the filter hω is given as: ^ min hH ω Qω hω hω subject to hH ω a(ω ) = 1 25 (1–28) ^ω where Q = R^ − ~yω ~yωH The APES filter is given by: hω = Q^ −ω 1 a(ω) aH (ω)Q^ −ω 1 a(ω) (APES filter) (1–29) The amplitude spectrum of APES algorithm is given by: α ^(ω) = a(ω)H Q^ −ω 1~y(ω) aH (ω)Q^ −ω 1 a(ω) (APES amplitude spectrum) (1–30) The APES and CAPON algorithms have been shown to provide higher resolution compared to the classical non-parametric methods. The CAPON algorithm minimizes the total output power subject to a constraint which tends to provide spectral estimates that are biased downward due to the noise gain of the filter [23]. The APES algorithm minimizes a least square function requiring the filter output to be as close as possible to the a sinusoid. This provides more accurate spectral estimates. However, in the cases where the data is not stationary for a long period of time (only few snapshots are available), the APES and CAPON methods yield undesirable results. The Iterative Adaptive Approach (IAA) algorithm improves on these algorithms by being able to give good spectral estimates for a few snapshots (even a single snapshot), while providing high spectral resolution, making it very suitable for practical applications. This algorithm is discussed briefly in the next subsection and also in Chapter 1 where it is used. 1.3.3 Iterative Adaptive Approach (IAA): Non-parametric The IAA algorithm [25],[26],[27],[28] for spectral estimation is derived by minimizing a weighted least squares cost function (described in Chapter 1). The spectral estimate for the IAA algorithm for a single snapshot y is given below: α ^(ω) = a(ω)H Q^ −ω 1 y aH (ω)Q^ −ω 1 a(ω) (IAA amplitude spectrum) (1–31) This estimate looks similar to the APES estimate, with the main differences being that the IAA algorithm is iterative and also the computation of the covariance matrix of 26 ^ ωl the noise is given as Q = R^ − ∑K H i=0,i̸=l pi a(ωi )a(ωi ) , where R = APAH and P is a 2 K diagonal matrix with elements corresponding to {pi }K i=0 = {|α(ωi )| }i=0 (powers at each individual frequencies). Note that unlike the CAPON and APES algorithms where the covariance matrices are based on the data samples and computed once. The covariance matrix of the IAA algorithm is dependent on the spectral estimate and hence refined iteratively, with the initial estimates of the spectral powers computed using the periodogram. This refinement allows the IAA algorithm to produce accurate spectral estimates with a single snapshot and hence makes it useful for practical applications (where the available data for estimation is usually limited to a single snapshot). Fig. 1-6 shows the spectrum of spectrum Periodogram (single snapshot) 0.2 0 spectrum Periodogram Sinusoid 1 Sinusoid 2 Sinusoid 3 0.4 −3 −2 −1 0 1 ω (rad/sample) Adaptive Algorithm 2 3 IAA 0.4 0.2 0 −3 −2 −1 0 ω (rad/sample) 1 2 3 Figure 1-6. Spectrum: Comparison of adaptive methods to the periodogram three sinusoids in white noise (SNR = 30dB) with frequencies ω1 ω2 = 0.63 rad/samp, = 1.26 rad/samp and ω3 = 1.33 rad/samp. The CAPON and IAA estimates are poor, due to ill-conditioning of the matrices. However with a single snapshot the IAA spectra is capable of picking out the sinusoids. A comparison of the periodogram to the IAA algorithm in this figure shows how this adaptive technique improves over the periodgoram in terms of spectral resolution and sidelobe suppression. 27 1.3.4 SLIM and SPICE Algorithms: Non-parametric The Sparse Learning via Iterative Minimization (SLIM) [29] and the Sparse Iterative Covariance-based Estimation (SPICE)[30] algorithms are two super-resolution algorithms capable of providing high resolution estimates even in the presence of a single snapshot and coherent sources similar to the IAA algorithm. They both estimate the covariance matrix R iteratively similar to the IAA algorithm and are hence also useful for practical applications. The SLIM approach is a maximum a posteriori approach (MAP) based on the hierrachial model. The goal is to use a sparse prior to promote sparsity in the estimates which is useful for certain applications. The SPICE algorithm on the other hand minimizes a covariance cost function that yields sparse estimates. These two algorithms empirically yield less accurate that the IAA approach. However they provide sparse and higher resolution estimates compared to the IAA algorithm and can be useful in certain applications. They are described in more detail in Chapter 4. The algorithms mentioned above are all non-parametric methods that do not assume a specific model for the data. Next we briefly mention parametric methods, which assume a specific data model for PSD estimation. A robust algorithm (which is later discussed in more detail in Chapter 3) RELAX [31]; which is specific for estimating parameters of line-spectra (sinusoids) is discussed next. 1.3.5 RELAX: Parametric Parametric methods unlike the non-parametric methods assume a specific model for the observed data. These methods essentially estimate the PSD, by assuming the data takes on a specific model and then estimates the parameters of the model. Auto-regressive (AR) methods such as Yule, Prony, Forward-Backward Prony methods are used for estimating parameters for continuous spectra and Eigen-analysis methods (MUSIC, ESPRIT) are used for estimating parameters of line spectra (sinusoids). The AR methods model the data as the output of a linear system driven by white noise and proceed to estimate the parameters of that system. One major limitation of these 28 parametric methods is that they a subject to errors due to poor model specifications. The Eigen-analysis methods (for line spectra) estimate frequency components of sinusoids buried in noise by an eigen-decompostion of the autocorrelation matrix. These methods tend to perform poorly in practical applications due to data model inaccuracies. The RELAX algorithm is an algorithm that is robust, and that estimates parameters of sinusoids in an iterative manner. It estimates the parameters of the sinusoid accurately even with modelling errors and colored noise [31]. This algorithm is described in more detail in Chapter 3 where Radio Frequency Interference (RFI) is modeled as sum of sinusoids. The RELAX algorithm is used there for identifying and suppressing the RFI signals. 1.4 Conclusions In this section, a brief discussion on the problem of spectral estimation is presented. The periodogram which is a data-independent algorithm for spectral estimation and also widely applied in practical applications is briefly discussed. This algorithm is then re-interpreted as a filtering process with a data-independent filter. This re-interpretation has led to some data-dependent (adaptive) filters which provide improved spectral estimates. We discussed some well-known data-adaptive (non-parametric) algorithms (CAPON,APES) and the advantages provided by these data-adaptive approaches over the classical non-parametric methods. However these algorithms perform poorly in the case when only one snapshot of data is available. More recent data-adaptive (non-parametric) algorithms, which are robust and perform well even in the single snapshot case were briefly mentioned and will be discussed in more detail in later chapters. The improved robustness of these algorithms allows for useful applications in a practical setting, while providing better spectral properties (high resolution, lower sidelobes) over the commonly used periodogram. 29 A robust parametric algorithm known as the RELAX for sinusoidal parameter estimation is also mentioned briefly (discussed in more detail in Chapter 3). This algorithm is capable of accurate sinusoidal parameter estimation even in the presence of colored noise making it suitable for practical applications. In this dissertation we focus on solving specific real world problems by analyzing these data-adaptive techniques and coming up with effective and efficient ways to apply them to give superior performance to the standard data-independent approaches. 1.5 Notations Notation: Throughout this dissertation, Boldface upper-case and lower-case letters are used to denote matrices and vectors, respectively. See Table 1-1 for more details on notation. Table 1-1. Notations x X a vector a matrix diag(x) a diagonal matrix with elements of x on the diagonal (·)H (·)T (·)(n) conjugate transpose of a matrix or vector transpose of a matrix or vector n th iteration of a scalar, vector or matrix in algorithm ||·||2 x ^ ℓ2 norm estimate of scalar x , definition 30 CHAPTER 2 DATA-ADAPTIVE TECHNIQUES FOR NETWORK FREQUENCY EXTRACTION FROM DIGITAL RECORDINGS 2.1 Chapter Summary A novel forensic tool used for assessing the authenticity of digital audio recordings is known as the Electric Network Frequency (ENF) criterion. It involves extracting the embedded power line (utility) frequency from said recordings, and matching it to a known database to verify the time the recording was made, and its authenticity. In this chapter, a non-parametric, adaptive, and high resolution technique known as the Time-Recursive Iterative Adaptive Approach (TRIAA), is presented as a tool for the extraction of the ENF from digital audio recordings. A comparison is made between this data dependent (adaptive) filter, and the conventional Short-time Fourier Transform (STFT). Results show that the adaptive algorithm improves the ENF estimation accuracy in the presence of interference from other signals. To further enhance the ENF estimation accuracy, a frequency tracking method based on dynamic programming will be proposed. The algorithm uses the knowledge that the ENF is varying slowly with time to estimate with high accuracy the frequency present in the recording. 2.2 Introduction The use of digital recorders has become more prevalent in the world today due to the advancement in digital technology and the significant progress made in the field of digital signal processing (DSP). Prior to the increased use of digital recorders, forensic audio analysis relied on different techniques of audio authentication. For instance, the magnetic signatures that are left by the erase, record or play heads on the magnetic tape of analog recorders, can be used to verify the authenticity of such recordings. When it comes to digital recordings, alterations can be made very easily without leaving behind such imprints, because digital recorders produce a recording by converting sound variations to a series of numbers, making authentication of these recordings a lot more difficult [32]. The importance of being able to verify the authenticity 31 of a recording can be seen in litigation cases [16] [33], where digital recordings are brought forward as evidence in a trial. Therefore, more reliable methods of verifying the authenticity of digital recordings need to be researched. The Electric Network Frequency (ENF) Criterion was proposed by Grigoras [16], [34] to address the issue of digital audio authentication. The ENF criterion is based on extracting the utility frequency or ENF from a digital audio recording, and matching the extracted frequency estimate to a reference database in order to determine the authenticity, and also time of the digital recording. This process is possible because, in some cases, digital recorders (even some battery powered recorders [35]), can pick up the audible sound that is generated by the oscillation of a power grid’s alternating current at this frequency. The frequency of oscillation is approximately 60 Hz in the United States, whereas in Europe it oscillates at approximately 50 Hz. The corresponding harmonics of this frequency might also be present in the digital recording. The ENF criterion is based on two assumptions. Firstly, the ENF for interconnected networks is the same at all points within the network. Secondly, the frequency varies randomly within a given interconnection, and hence, are not repeatable over a long period of time. [33] There are three known methods of extracting the ENF over time from a digital recording [16], [34]. They are termed: • time/frequency domain analysis - This method is based on computing the spectrogram of the signal and visually comparing it to the database. • frequency domain analysis - This method is based on selecting the frequency location corresponding to the maximum amplitude of the power spectrum of segments (frames) of the data after applying a band-pass filter. • time domain analysis - This method is based on measuring the zero crossings of the signal in the time domain after a bandpass filter has been applied to the recording. Recently in [36], a quadratic interpolation scheme was applied to the frequency domain analysis method to estimate the spectral peak locations (frequencies) more 32 accurately. This reduces the estimation error resulting from the use of a fixed grid size in the spectral estimation process. Besides the time-domain analysis, the above methods estimate the ENF based on computing the Fast Fourier Transform (FFT) of overlapping segments (frames) of the data known as the Short-Time Fourier Transform (STFT) which is limited by the trade-off between time resolution and frequency resolution [2]. Parametric methods such as the Frequency Selective ESPRIT, which give superior resolution compared to the FFT, can also be used successfully to extract the ENF from one frame to another. However, in the presence of significant interference within a given frame, the parametric methods yield poor frequency estimates because of their sensitivity to an assumed data model. This chapter focuses on two methods of extraction. The first, builds upon the frequency domain analysis with quadratic interpolation. However, in place of the FFT, the spectrum is estimated for each segment of the data using a non-parametric and high resolution adaptive algorithm known as the Iterative Adaptive Approach (IAA) [25]. In the presence of interfering signals with frequencies within the range of values the ENF can take on, IAA yields more accurate estimates of the ENF compared to the FFT as a result of the improved spectral resolution and interference suppression capability. The second method involves applying a frequency tracking algorithm based on discrete dynamic programming [37], which takes into account the slowly varying nature of the ENF over time. This tracking algorithm is necessary because, in some frames of the data, the maximum spectral peak might correspond to an interference signal rather than the network frequency signal even within the acceptable ENF limits. The ENF is then estimated inaccurately, which can result in a false diagnosis that the recording in question has been edited. It is worthwhile to point out that, in order for the proposed methods to work, the ENF must be embedded in the recording, which is not always the case especially in some battery operated recorders [35]. This is certainly a drawback of using the ENF criterion 33 for digital authentication. However, if the ENF is embedded in a digital recording, more reliable methods of extraction need to be sought. Table 2-1. Abbreviations APES Amplitude and Phase Estimation ENF Electric Network Frequency ESPRIT Estimation of Signal Parameters by Rotational Invariance FDR Frequency Disturbance Recorder FIAA Fast Iterative Adaptive Approach IAA Iterative Adaptive Approach QN-IAA Quasi-Newton Iterative Adaptive Approach STFT Short-time Fourier Transform TRIAA Time-Recursive Iterative Adaptive Approach Extraction can also be carried out using the harmonics of the ENF signal for the frequency estimation process. In some cases, the harmonics may give better estimates because of a higher signal-to-interference-and-noise ratio compared to the fundamental frequency. The remaining sections of this chapter are organized as follows. In Section 2.3, the network characteristics and the network frequency database are described. In Section 2.4, the IAA and TRIAA algorithms are described along with the frequency tracking algorithm for ENF extraction. In Section 2.5, the experimental results based on a set of digital audio recordings are presented. Finally, Section 2.6 contains the conclusions drawn from the results. Abbreviations: The abbreviations are presented for easy reference in Table 2-1. 2.3 Network Frequency Characteristics and Database The frequency at which alternating current is distributed to various customers from power stations, corresponds to the utility frequency or ENF. For European and most Asian countries the value of this frequency is 50 Hz, while the value is 60 Hz in North America, and several countries in South America. Japan uses both frequencies (50 and 34 Figure 2-1. FDR Distribution in North America 60 Hz) for electricity distribution. This frequency is determined by the speed of rotation of the turbines used to drive the generators at the various power plants [38]. Naturally, the rotation speed is not constant and varies within a certain limit (approximately ±0.05 Hz) depending on the amount of load connected to the network, and amount of power generated at a given time. Experiments carried out in some European countries [16], [39], have shown that this frequency variation is random and unique within specific geographic locations. This uniqueness in frequency variation within a region, coupled with the fact that network frequency is not repeatable over a long period of time is what makes the aforementioned ENF criterion possible. A database of the network frequency is needed in order to match the extracted ENF from a recording for verification. In [16], such a database is created by connecting the sound card of a computer to a transformer which is then connected directly to an AC power outlet. The database currently being built in North America involves deploying several sensors termed frequency disturbance recorders (FDRs), which perform accurate ENF measurements, up to about ±0.0005 Hz. The measured data collected by the FDRs is transmitted over the internet to servers, where it can be analyzed and stored in a system termed the Information Management System (IMS) [40]. This collection forms the Frequency Monitoring Network (FNET). 35 There are two major interconnections in North America and three minor interconnections. These regions have unsynchronized networks (frequency and phase) and are therefore connected via High Voltage Direct Current Lines (HVDC) [41]. The Eastern and Western Interconnections form the major interconnections, while the Quebec, Texas and Alaska Interconnections form the minor. The Alaska Interconnection is isolated, in the sense that it is not connected to any of the other interconnections. It is therefore generally not considered to be part of the North American grid. Fig. 2-1 shows the distribution of the FDRs in Western, Eastern, Quebec and Texas Interconnections. Frequency measurements collected by the FDRs in these interconnections show that the frequency pattern is different at a given time from one interconnection to another. However, the frequency pattern is unique at different locations within each interconnection [42]. The FNET system, therefore, provides a viable ENF database. 2.4 Extraction Algorithms 2.4.1 Frequency Domain Analysis (STFT) Due to the fact that the ENF varies with time, the extraction process involves analysing a non-stationary data sequence. STFT is a common method for time-frequency analysis of signals. This analysis assumes the signal of interest is stationary within short time windows (frames); the FFT of the signal is then computed for each frame. The frequency domain analysis [16] method of extraction is based on this idea. The process involves re-sampling the audio signal to a lower sampling rate, to reduce the computational complexity of the analysis. A bandpass filter with a narrow bandwidth is applied to the signal with center frequency 50/60 Hz as a preprocessing step. The rest of the analysis is described as follows. Let, z = [z0 , z1 . . . zN −1 ]T (2–1) denote the re-sampled and filtered discrete-time signal. This signal is then split into R overlapping frames as shown in Fig. 2-2, with each frame having length M and a shift 36 from frame to frame of length T . Using the frequency domain analysis method, the ENF of the rth frame is estimated by finding the frequency that maximizes the spectrum of each frame which is computed using the FFT based periodogram. In order to get a more accurate estimate of the frequency, quadratic interpolation is used [36], [43]. This interpolation scheme, involves fitting a quadratic model of the form log ϕ^(ω) = m(ω − ωk max − )2 + c (2–2) around the frequency point that maximizes the power spectrum: ωkmax = argmax ϕr (ωk ) (2–3) ωk = 2πk/K, k = 0, 1, . . . , K − 1 corresponds to the frequency grid point of a frequency grid with size K, and ϕr (ωk ) is power spectrum of the rth frame. where ωk The value of ω that maximizes the model (2–2) is taken as the estimated peak of the spectrum. This value is determined by fitting the model to the highest sample of the power spectrum and the two adjacent points with corresponding frequencies (ωk max − 1 , ωkmax , ωkmax +1 ). This value of ω that maximizes the model is: ω = ωk + max (2–4) where = 12 β β− 1 − β1 (ωkmax+1 − ωkmax ) −1 − 2β0 + β1 βℓ , log ϕr (ωkmax +ℓ ), ℓ = −1 , 0 , 1 . (2–5) (2–6) The corresponding frequency estimate of the rth frame in Hz is given by: f^(r) = 2π (ωkmax + ) Fs where Fs is the sampling frequency (in Hz) of the signal. 37 (2–7) Figure 2-2. Segmentation of data for STFT The use of STFT will result in a trade-off between frequency resolution and time resolution. For a given frame length, this trade-off can be optimized by applying a rectangular window to each frame, which will provide the best spectral resolution at a cost of higher side lobes compared to other spectral windows. In order to get improved spectral resolution over FFT, one has to resort to using parametric methods or data-dependent (adaptive) non-parametric methods for spectral estimation. Parametric methods, on the one hand, are not robust against data model errors. On the other hand, non-parametric adaptive methods are more robust, since they do not assume a specific parametric data model. Well-known adaptive methods include the Capon algorithm and the Amplitude and Phase Estimation (APES) algorithm. These algorithms also provide higher resolution and lower sidelobes than the periodogram. However, these methods are inadequate because they require multiple realizations (snapshots) of the random signal, which is not the case with the current data, as only one snapshot is available for frequency estimation. Spatial smoothing (segmenting and spectral averaging of the data) can be used to improve the spectral estimates of the Capon and APES algorithms in the one-snapshot case; but the cost of doing this will be a degradation in the spectral resolution, which is not desirable. The wavelet transform is also a common tool for time-frequency analysis. Contrary to the STFT, which uses a 38 fixed window size, the wavelet transform uses short windows at high frequencies and longer windows at low frequencies. The wavelet transform is therefore not suitable for our problem because we are interested only in a small range of frequencies. IAA is a non-parametric data-dependent algorithm based on Weighted Least Squares (WLS), originally presented in [25] for Direction of Arrival (DOA) estimation in array processing. The IAA algorithm is capable of yielding high resolution and low sidelobes even in the case of a single snapshot [25], hence making it suitable for estimating the ENF in the presence of interferences. 2.4.2 IAA and TRIAA The ENF can be extracted with high accuracy in the presence of interference using the IAA algorithm for a given frame. The proposed ENF extraction process follows (2–2)-(2–7), with the FFT spectral estimate ϕr replaced by the IAA spectral estimate. The IAA and TRIAA [44] used for spectral estimation of non-stationary data will be discussed in this section. = [y0 , y1 . . . yM −1 ]T denote a uniformly sampled stationary data sequence and A = [a(ω0 ), a(ω1 ) . . . a(ωK−1 )], where a(ωk ) = [1, ejω , . . . , e(M −1)jω ]T corresponds to a steering (frequency) vector, and ωk = 2πk/K, k = 0, 1, . . . , K − 1, corresponds to a frequency grid point of a frequency grid with size K. Also let α = [α(ω0 ), α(ω1 ), . . . , α(ωK−1 )]T , with α(ωk ) denoting the complex spectral estimates of y at ωk . The following data model can be formulated: The spectral estimation problem can be set-up as follows. Let y k k y = Aα where the noise contributions of y are taken into account implicitly [25]. 39 (2–8) The IAA algorithm solves for the spectral estimates α by minimizing the following quadratic cost function in (2–9) using weighted least squares (WLS): ||y − a(ωk )α(ωk )||2Q−1 (ωk ) (2–9) where ||x||2Q−1 (ωk ) , xH Q−1 (ωk )x, Q(ωk ) = R − pk a(ωk )aH (ωk ) (2–10) R = APAH (2–11) and P , diag[p0 , p1 , . . . pK−1 ], with pk for k = 0, . . . , K − 1, denoting the power estimate at each frequency grid point, given by |α(ωk )|2 . R1 is the covariance matrix of the data and Q(ωk ) is the covariance matrix of the interference and noise, where interference refers to all the signals at frequency grid points other than the current grid point of interest ωk . Minimizing the cost function in (2–9) with respect to the α(ωk ) for k = 0, . . . , K − 1 gives the following solution: α ^(ωk ) = aH (ωk )Q−1 (ωk )y , aH (ωk )Q−1 (ωk )a(ωk ) k = 0, 1 , . . . , K − 1 (2–12) The solution in (2–12) can be re-written as α ^(ωk ) = aH (ωk )R−1 y , aH (ωk )R−1 a(ωk ) k = 0, 1 , . . . , K − 1 (2–13) using the Woodbury matrix identity2 and (2–10). This prevents the computation of the interference covariance matrix Q−1 (ωk ) for each frequency grid point. Note that the computation of R−1 requires the knowledge of α(ωk ) and vice versa. Hence this algorithm is solved in an iterative manner, with the estimate of α initialized using the 1 R = APAH + σ2 I for ill-conditioned matrices [45] 2 matrix inversion lemma 40 FFT. This iterative algorithm takes about 10 to 15 iterations to converge based on experimental and numerical results. Note also that without accounting for the interference from other frequency grid points (without weighting), minimizing the cost function in (2–9) for K = M gives the Discrete Fourier Transform (DFT) of the signal: α ^(ωk ) = aH (ωk )y M , k = 0 , 1, . . . , M − 1. (2–14) The IAA algorithm described above is used for spectral estimation of stationary data. Analogous to the STFT, the spectral content of a non-stationary data sequence, such as (1), can be estimated using the TRIAA [44]. The signal is split into overlapping frames similar to Fig. 2-2 and the IAA spectral estimate is computed for each frame. However, to reduce the computational complexity, each subsequent frame after the first frame is initialized with the spectral estimate of the previous frame instead of the FFT based periodogram as described in the IAA algorithm. The resulting algorithm yields better spectral resolution and lower side lobes than the STFT. There is still a significant increase in the computational complexity when using the TRIAA algorithm compared to using STFT for spectral estimation. This computational complexity is reduced slightly by reducing the number of iterations in subsequent frames for the TRIAA. This is because convergence of the estimated spectrum will occur in fewer iterations given the current frame is initialized by the spectral estimate of the previous frame. When the dataset is significantly large, the use of this algorithm is still impractical. The bottle-neck of the TRIAA algorithm is in the computation of the denominator in (2–13) for each frame. In [46], [47] the Toeplitz structure of the covariance matrix R is exploited and the computation of R−1 is performed using the Gohberg-Semencul (GS) factorization of this matrix [2]. Moreover, the denominator is obtained via evaluating a polynomial. This reduces the computational complexity of the denominator in (2–13) (which is the 41 bottleneck of the IAA algorithm) from O(M 2 K ) to O(M 2 ) floating point operations (flops) [46] for a given frame, without a loss in performance. The algorithm is termed the Fast IAA (FIAA), which is a significant improvement but still computationally expensive for large datasets. The computational complexity of IAA and FIAA are O(M 2 K ) and O(M 2 + KlogK ), respectively, where M is the data length and K is the grid size, with K >> M . An approximate algorithm to the IAA algorithm with significantly faster computational time is described in [48] and referred to as the Quasi-Newton IAA (QN-IAA). The QN-IAA algorithm estimates the covariance matrix as if it were from a low-order (L) autoregressive (AR) process, where L << M with M being the data (frame) length. The inversion of the lower-order covariance matrix Q ∈ CL×L is carried out in place of R ∈ CM ×M , yielding an approximate solution to the IAA spectral estimate (2–13) with significant reduction in the computational complexity and just a slight degradation in the resolution. The computational complexity of this algorithm is O(L2 + KlogK ). The FIAA or QN-IAA can be used in a time-recursive manner for non-stationary data as is the case with the ENF signal. This algorithm reduces the trade-off between frequency resolution and time-resolution for a given frame length compared to the FFT based periodogram during the ENF extraction process. The extraction process is the same as the frequency domain analysis (2–2)-(2–7) with ϕr replaced by either of the aforementioned algorithms. However, even if a good algorithm is used for frequency estimation based on (2–7), specific frames might be corrupted by interference signals with frequency components within the ENF limits. This could lead to errors in frequency estimation, if the frequency location corresponding to the maximum value of the estimated spectra belongs to an interference signal. A robust method of tracking the ENF that exploits the slowly varying nature of this frequency is needed. The next section describes the proposed frequency tracking algorithm. 42 2.4.3 Frequency Tracking A method of estimating the ENF by tracking it from one frame to another is formulated here from a mathematical point of view. The proposed method uses discrete dynamic programming [37] to find a minimum cost path. A cost function as shown in this section is selected which takes into account the slowly varying nature of the actual network frequency. This cost function penalizes significant jumps in frequency from frame to frame and the corresponding path is used to estimate the ENF. This algorithm involves finding the peak locations from the spectrum of each frame and assigning costs based on the difference between a peak location in one frame and a peak location in another frame. The magnitude of the assigned cost is related to the difference in the frequency from one frame to another. The minimum cost path from the first frame to the last frame is computed to estimate the ENF. To estimate the number of relevant peaks (sinusoids) in a given frame, a model order selection tool known as the Bayesian Information Criterion (BIC) is used. The BIC for complex sinusoids in noise is given by (refer to [2] [49] for a full derivation): ) ( 2nr ∑ a(ωk )^α(ωk )||2 + 5(2nr )lnM. BIC(nr ) = M ln ||y − k=1 (2–15) The number of peaks (real sinusoids) nr , is estimated as the minimizing argument of the above BIC criterion. The first term in (2–15) is a Least-Squares data fitting term, which decreases as the number of estimated peaks nr increases, where as, the second term is a penalty term that prevents ’over-fitting’ of the data model. Once the nr largest peaks and corresponding locations are determined in each frame, the frequency tracking problem is formulated and solved as follows. Assume that for a given frame r, a set of estimated peak locations (frequencies) is denoted by r fr ∈ = {Pr1 , Pr2 , . . . Prnr }. We would like to find a path {fr }R r=1 , such that r and where the difference fr − fr−1 is as small as possible for r = 1, 2, ..., R. This set corresponds to the estimated ENF over all frames and can be obtained as the 43 minimizing argument in the following optimization problem: = J min R ∑ fr ∈ r r=1,...,R r=2 (fr − fr−1 )2 . (2–16) Calculating this cost using an exhaustive search is impractical. However, using dynamic programming [37] the path that minimizes this cost can be computed recursively and efficiently by minimizing the cost from a given frame j < R, to the last frame, denoted by J (j, fj ). J (j, fj ) = R ∑ min fr ∈r r=j +1,...,R r=j +1 (fr − fr−1 )2 , fj ∈ j . (2–17) This optimal cost satisfies the recursive equation, J (j, fj ) = min {(fj +1 − fj )2 + J (j + 1, fj +1 )}, fj ∈ j fj +1 ∈j +1 which can be calculated for j (2–18) = R − 1, R − 2, . . . , 1, with the initialization, J (R, fR ) = 0, fN ∈ N . Note that J = fmin J (1, f1 ), fN ∈ N ∈ 1 1 (2–19) is the cost from the first frame to the last frame R and the set {fr }R r=1 that minimizes this cost function corresponds to the extracted ENF signal as mentioned above. Dynamic programming has a computational complexity of O(R2max ), where R corresponds to the total number of frames and max is the number of spectral peaks in the frame with the maximum number of peaks. 2.4.4 Matching the Extracted ENF to Database Once the ENF signal has been extracted, a method of matching the estimated signal to the database signal is required. The goal is to find the location/time within the database that is similar in pattern to the extracted ENF. In [36] a method based on minimizing the squared error between the ENF and database is used for automated 44 matching. A method of correlation matching proposed in [50] for short digital recordings (10-15 minutes) is used in place of this MSE method. The process of correlation matching is described as follows. Assume that f signal and d = [f1 , f2 , . . . , fR ] is the extracted ENF = [d1 , d2 , . . . , dL] corresponds to the database signal with L > R. The matching process requires finding lmax such that: lmax = argmax c(l), l l = 1, 2, . . . , L − R (2–20) where c(l) is the correlation coefficient between f and the vector [dl , dl+1 , . . . , dl+R−1 ]. An important point to make is that, the maximum correlation coefficient c(lmax ) is used here only for matching the estimated ENF to the database and comparing the accuracy (reliability) of the different algorithms presented. Once a match has been made, determining locations of edits to a recording should be based on the differences between the ENF estimate and the database. Table 2-2. Parameters for the Experiment PARAMETERS Data1 Data2 T (Time Shift) 1s 1s M (Length of Frame) 20s 33s R (Number of Frames) 1800 1800 2.5 Experimental Results The algorithms presented in the previous section are applied to two different digital audio datasets referred to as Data1 and Data2. The two datasets are recorded simultaneously and therefore, should contain the same ENF pattern over time. The first data set (Data1) is acquired by connecting an electric outlet via a voltage divider directly to the internal sound card of a desktop computer, resulting in an ENF signal with a rather high signal-to-interference-and-noise ratio. On the other hand, the second dataset (Data2) is an actual speech recording played from a speaker and picked up by the internal microphone of a laptop computer. 45 Each of these recordings are originally sampled at 44.1 kHz at a bit rate of 16 bits per sample. Each dataset is re-sampled to 441 Hz, hence keeping only the fundamental frequency (1st harmonic) and the two higher harmonics of the ENF. A bandpass filter with a narrow bandwidth around the network frequency is applied to the data to eliminate as much interference as possible without distorting the ENF signal. Based on Fig. 2-2 each data set is split using the values shown in Table 2-2. This set-up results in an ENF estimate every second for a total of 30 minutes for each dataset. An increase in the frame length improves the signal-to-noise ratio of the signal [36] and the spectral resolution at the cost of lower time resolution. Therefore, a larger frame length is used for Data2 which has a weak ENF signal compared to Data1 which has a strong ENF signal. Figure 2-3. Matching extracted ENF to database (Data1 - scaled to 60 Hz) Fig. 2-3 shows the extracted ENF signal (shifted by 0.05 Hz for illustration purposes) from Data1, matched with the truth obtained from the FDRs, when the data set has not been altered in any form (using STFT and (2–7)). Fig. 2-7 shows the extracted ENF using the STFT based method and our proposed method (also shifted for comparison purposes). Tables 2-3 and 2-5 give the maximum correlation coefficient c(lmax ) of the various methods for Data1 and Data2, respectively, also when the signals have not been altered. The maximum correlation coefficient values are used to compare 46 the accuracy of the algorithms and hence determine which is more reliable for ENF estimation. We have also included similar MSE (actually standard deviation) analysis in Tables 2-4 and 2-6 for the datasets, where the MSE is computed by averaging the squared difference between the True ENF and the estimated ENF. It is important to point out that the estimated ENF can sometimes have a constant offset [39], [50]. Therefore, the correlation is the preferred method for accuracy measure. The datasets used for this experiment do not have such an offset. They have also been made available at http://www.sal.ufl.edu/download.html. Table 2-3. Correlation coefficients of Algorithms (Data1) Algorithm Harmonic STFT STFT(Track) TRIAA TRIAA(Track) F-ESPRIT 60 Hz 0.9912 0.9917 0.9895 0.9900 0.9800 120 Hz 0.9911 0.9949 0.9902 0.9946 0.9470 180 Hz 0.9968 0.9968 0.9961 0.9961 0.9962 Table 2-4. Standard Deviation of error for Algorithms (Data1) Algorithm Harmonic 60 Hz 120 Hz 180 Hz STFT STFT(Track) TRIAA TRIAA(Track) F-ESPRIT 2.772e−3 2.650e−3 3.032e−3 2.919e−3 5.364e−3 2.774e−3 1.900e−3 2.145e−3 1.851e−3 2.822e−3 1.999e−3 2.198e−3 1.999e−3 6.570e−3 2.830e−3 2.5.1 Data1 Analysis Fig. 2-3 shows the extracted harmonic (180 Hz) of the ENF signal scaled to 60 Hz and matched (using the location corresponding to the maximum correlation (2–20)) to the actual database frequency obtained from the FDRs. For each of the algorithms used, the third harmonic gave the most accurate results for this dataset as shown in Table 2-3. This is because for a fixed grid size, the estimation error when using the third harmonic is reduced by a factor of three compared to the fundamental frequency. Harmonics with frequencies higher than 180 Hz can be used for the estimation process 47 at a cost of increased computational complexity due to the increased sampling rate. Also from Table 2-3, It can be seen that each of the STFT and TRIAA algorithms, produce accurate estimates of the ENF using (2–7) because of the rather strong ENF signal. The signal at the second harmonic is weak relative to the first and third harmonics, and in a few frames the estimate was inaccurate. However, the frequency tracking algorithm mitigated these inaccuracies successfully by tracking the correct spectral peaks. The parametric method, frequency selective (F-ESPRIT) [2],[51] also yields accurate estimates of the ENF for Data1 when the signal model assumes there is only one sinusoid per frame. However, this method and other parametric methods are not appropriate for ENF estimation in the presence of interference, because they are sensitive to model assumptions. For this dataset, the STFT yields slightly better results, compared to the adaptive method (TRIAA). This can be explained by the fact that the periodogram is optimal for estimating spectral lines (sinusoids) in the presence of white noise when they are well resolved [2]. However, when there are interfering signals present, the poor resolution of the periodogram will yield inaccurate estimates as is the case with Data2, a typical digital recording. Table 2-5. Correlation coefficients of Algorithms (Data2) Algorithm Harmonic STFT STFT(Track) TRIAA TRIAA(Track) F-ESPRIT 120 Hz 0.9125 0.9857 0.9305 0.9907 0.8446 Table 2-6. Standard Deviation of error for Algorithms (Data2) Algorithm Harmonic 120 Hz STFT STFT(Track) TRIAA TRIAA(Track) F-ESPRIT 7.948e−3 3.369e−3 7.225e−3 2.914e−3 1.086e−2 48 −7 2.5 x 10 True frequency (119.963 Hz) FFT IAA Freq. estimate IAA (119.965 Hz) Freq. estimate FFT (119.970 Hz) squared magnitude 2 1.5 1 0.5 0 119.8 119.85 119.9 119.95 120 120.05 frequency (Hz) 120.1 120.15 120.2 Figure 2-4. Power Spectrum of one Frame (Data2): poor resolution of FFT 2.5.2 Data2 Analysis For Data2, the second harmonic (120 Hz) is used to estimate the ENF, because the first and third harmonics are too weak to be used for estimation. Table 2-5 shows the maximum correlation coefficient values for the STFT and TRIAA using (2–7), the frequency tracking algorithm using the spectral peaks of the FFT and IAA and the parametric method (F-ESPRIT) with one assumed sinusoid. The ENF estimation accuracy is improved using the adaptive method (IAA) because of improved spectral resolution for several frames. Fig. 2-4 shows a comparison of the spectrum of one frame of the Data2, where the poor frequency resolution of the FFT results in a relatively poor estimate of the network frequency compared to the IAA algorithm. −7 4 x 10 3.5 True frequency (119.952 Hz) FFT IAA squared magnitude 3 2.5 2 1.5 1 0.5 0 119.8 119.85 119.9 119.95 120 120.05 frequency (Hz) 120.1 120.15 120.2 Figure 2-5. Power Spectrum of one Frame (Data2): strong interference signal 49 Figure 2-6. Extracted ENF via Frequency Tracking (Data2 - scaled to 60 Hz) Fig. 2-7 shows this extracted ENF harmonic using the STFT and (2–7) matched with the database. From this figure, there are several frames where the ENF is estimated inaccurately, due to the fact that the frequency corresponding to the maximum spectral peak for those frames do not correspond to the ENF. This can occur if there is another signal present with frequency within the limits of the acceptable range of the ENF as illustrated in Fig. 2-5. This figure shows that for both spectral estimation techniques used (IAA, FFT) the ENF harmonic estimate using (2–7) will be 120 Hz, whereas the true frequency is approximately 119.95 Hz. This problem can be rectified using our dynamic programming based frequency tracking algorithm presented above. Fig. 2-6 shows the spectral peak locations computed using the TRIAA and the corresponding ENF estimate using dynamic programming. The estimate of the network frequency using this tracking algorithm is then matched to the database in Fig. 2-7, which provides a better match when compared to using (2–7), which can also be seen in this figure, Fig. 2-8 (absolute error) and also from Table 2-5. A few important points to make are that the frequency tracking algorithm uses the peak locations for each frame estimated either by the adaptive algorithm (IAA) or the FFT. The results show that the estimated ENF is more accurate when the peak locations 50 Figure 2-7. Matching extracted ENF to Database (Data2 - scaled to 60 Hz) Figure 2-8. Absolute error of Algorithms: STFT and TRIAA (Track) of IAA are used. This is as a result of the inaccurate estimates in some frames caused by the poor resolution of using FFT. Also, all the numbers presented can be improved upon slightly by using the entire dataset (44.1kHz) for analysis. For example, the STFT maximum correlation of 0.9125 will be improved to 0.9158 without re-sampling, which may not be worth the increased computational complexity. 2.6 Conclusions When it comes to digital audio verification, the reliability of the method used for authentication cannot be overemphasized. This chapter demonstrates a reliable method of extracting the network frequency from a digital recording when the ENF cannot be 51 extracted from some of the frames using the FFT based periodogram either because of poor spectral resolution or a stronger interference signal within said frame. These problems were solved by using an iterative adaptive method (IAA), which provides better spectral resolution than the FFT based approach. Also a frequency tracking method based on dynamic programming was used for accurate extraction of the ENF even in the presence of a strong interference signals within ENF limits. From the results presented, the FFT gives slightly better estimates of the network frequency when the signal-to-interference-plus-ratio is very high as is the case with the first dataset. However, in most digital recordings, there will be significant interferences from the recorded speech signals and other surrounding sounds that could lead to poor estimation performance using the FFT due to its poor resolution and high side lobe problems. As the results have shown, the adaptive techniques and frequency tracking method should be adopted for ENF estimation, especially in challenging environments. 52 CHAPTER 3 DATA-ADAPTIVE RELAX FOR RFI SUPPRESSION FOR THE SYNCHRONOUS IMPULSE RECONSTRUCTION (SIRE) RADAR 3.1 Chapter Summary This next two chapters focus on remote sensing applications, specifically on the Synchronous impulse reconstruction radar (SIRE) UWB radar built by the Army research laboratory for landmine detection. This chapter focuses on suppression of Radio Frequency Interference (RFI) for ultra-wideband (UWB) radar signals, sampled using this synchronous impulse reconstruction (SIRE) time equivalent sampling scheme. This equivalent sampling scheme is based on the Army Research Lab’s (ARL) efforts to build an ultra-wideband (UWB) radar in forward looking mode that samples returned radar signals using low rate and inexpensive analog-to-digital (A/D) converters. The cost effectiveness of this SIRE UWB radar makes it plausible for actual ground missions for detecting buried explosive devices. However, the equivalent time sampling scheme complicates RFI suppression as the RFI samples are aliased and irregularly sampled in real time. In this chapter, the data-dependent RELAX and multi-snapshot RELAX algorithms are presented as an intermediate step to the previously proposed averaging scheme by the Army Research Laboratory, in order to enhance RFI suppression for this sampling scheme. A direct application of the RELAX algorithm is computationally intensive so an efficient method for generating the spectrum of this equivalently sampled data is proposed in this chapter that provides a factor of 10 improvement in computation. The proposed suppression technique involves modelling the narrowband RFI signals as a sum of sinusoids and applying the aforementioned algorithms. The RELAX algorithm improves the RFI suppression performance without altering the target signatures compared to AR modelling. The multi-snapshot RELAX algorithm which provides a more accurate sinusoidal model than the RELAX algorithm, improves on the RELAX algorithm in terms of suppression. However, the target signatures are suppressed as the number of sinusoids increases. The analysis of the algorithms is performed using sniff (passive) 53 data collected using the SIRE radar in addition to simulated wide-band echo signals (point-target signatures). 3.2 Introduction Ultra-wideband (UWB) radar is a commonly used tool for various remote sensing applications. Such applications include but are not limited to the use of low frequency, high bandwidth pulses for detecting improvised explosive devices (IEDs) and land mine targets. The effective detection of land mines and other IEDs could lead to increased safety for various ground related missions [52]. The use of low frequencies in UWB radar is necessary for foliage or ground penetration, whereas the use of wideband pulses are necessary for good resolution (ability to detect targets from clutter) [53]. However, because of these requirements, the data (target returns) collected by the UWB radar will be corrupted by signals in the radio frequency spectrum (specifically the UHF/VHF bands). These signals include FM Radio, TV broadcasts and other narrowband and wideband communication signals. The ability to effectively detect targets is reduced by the presence of these radio frequency signals. General methods of suppressing RFI and their limitations are discussed in detail in [53] for conventional UWB radar, which is the case when the returned signals are sampled regularly at or above the Nyquist rate. Due to the large bandwidth of the returned radar signals, conventional sampling will require high rate analog-to-digital (A/D) converters to digitize the returned signals. These high speed A/D converters are expensive to build and makes practical applications improbable. In other to improve on the cost of UWB radars, the Army Research Laboratory (ARL) is currently working on an equivalent time sampling UWB radar in forward looking mode, referred to as the Synchronous Impulse Reconstruction (SIRE) radar [54]. This radar uses low rate (inexpensive and commercially available) A/D converters to sample the returned signals (approximately 3GHz bandwidth), which makes the radar more feasible for adoption in practice. This equivalent time sampling 54 scheme takes advantage of the fact that the scene is not changing with time, hence aliasing of the returned target signals can be prevented. However, this is not the case with the radio frequency signals which are changing with time. Aliasing and the irregular sampling caused by the time-equivalent scheme becomes an issue when it comes to the subject of RFI suppression as discussed in the next two sections. This chapter focuses on suppression of RFI signals for this equivalent time-sampling scheme. The goal is to model the narrowband interference as a sum of sinusoids in real time and estimate and subtract the sinusoids before averaging to achieve further suppression. A cyclic optimization algorithm known as RELAX [31] is proposed for estimating the parameters of the sinusoids (in an iterative manner). This algorithm is an asymptotic maximum likelihood approach [55] and is computationally and conceptually simple. It has been applied to problems like non-contact vital sign detection for more accurate estimates of respiratory rates and heart rates [56]. It has also been shown to estimate the parameters of sinusoids accurately even in the presence of colored noise [55]. The multi-snapshot RELAX [57] algorithm, which is an extension of the RELAX algorithm, will be used to provide a more accurate sinusoidal model for the SIRE sampling scheme. The RELAX algorithm or multi-snapshot RELAX are implemented as an intermediate step to the already proposed averaging method [58] to achieve further suppression. In Section 3.3, the time equivalent SIRE sampling scheme is described. Section 3.4 briefly describes the limitations of some conventional methods to the SIRE sampled data for RFI suppression. The averaging method proposed in [58] for RFI suppression of SIRE sampled data is also discussed in this section, along with its performance. In Section 3.5, the RELAX algorithm, along with a fast computation of the spectrum of irregularly sampled SIRE data for this algorithm is presented; the multi-snapshot RELAX algorithm is also described in this section. The results are presented in Section 3.7, starting with simulations that show how the RELAX algorithm suppresses aliased 55 sinusoids for simulated data. The sniff (passive data collected using the SIRE UWB radar) is then used to test the effectiveness of the proposed algorithms, which are compared to AR modelling of the interference based on this sampling scheme (see Appendix). Finally, the conclusions of this chapter are presented in Section 3.8. 3.3 SIRE Equivalent Sampling Scheme In this section, the Synchronous Impulse Reconstruction (SIRE) equivalent time sampling technique as detailed in [54] is briefly described. This time equivalent sampling scheme poses some challenges on identifying and hence suppressing RFI sources, due to the fact that the RFI sources are changing with time as will be discussed. The SIRE sampling scheme involves sampling the returned radar signals from a scene at a significantly lower sampling rate fs , (with corresponding sampling period s), than the Nyquist rate, which leads to aliased samples. N aliased samples are collected per pulse repetition interval (PRI) or fast time, and for each subsequent PRI, N more samples are collected with the range profile shifted by e (in time). After K pulse repetition intervals (PRIs) or slow time, a total of K × N aliased samples are collected. These samples are interleaved as shown in Fig. 3-1, which gives an effective sampling rate of fe = 1/e that is equal to, or greater than the Nyquist rate. Because the scene of interest in not changing with time, the returned samples from a given range bin theoretically should also remain unchanged in time. Therefore, the interleaved samples are theoretically effectively sampled above the Nyquist rate and should be unaliased. The measurements from each range profile are typically repeated M times and added coherently to improve the signal-to-noise ratio (SNR). Fig. 3-1 shows the special case of M = 1. Table 1 summarizes the parameters used by ARL in the SIRE radar pertaining to RFI suppression [54]. The RFI signals, which are collected in addition to the desired target returns, on the other hand, are changing with time. Therefore, when the collected data are interleaved, they do not represent the true time samples of the RFI signals. 56 Table 3-1. ARL Parameters for Synchronous Reconstruction Radar. Radar A/D sampling rate = 40 MHz s = 25 ns PRF = 1 MHz PRI = 1 µs N =7 K = 193 K × N = 1351 e = 129.53ps fe = 7.72 GHz fs Radar A/D sampling period Pulse repetition frequency Pulse repetition interval Number of range profiles (per slow time) Interleaving factor Total number of range profiles Effective sampling period Effective sampling rate For instance, consider a complex sinusoid sampled at fe (see Tab. 3-1), with time samples h[n] = ejωo n . The periodogram estimate of the spectrum of h[n] is given by ϕ(ω ) = (1/L)|H (ω )|2 , where H (ω ) = L ∑ ej (ωo −ω)n i=1 (3–1) is the discrete-time Fourier transform (DTFT) of h[n] and L = K × N is the total number of samples. If this complex sinusoid is sampled using the SIRE technique (M = 1), the time samples of the interleaved signal will be given by: h[l(T + 1)] for l = 0, 1, . . . , K − 1 for l = K, . . . , 2K − 1 ~h[l] = h[(l − K )(T + 1) + K ] ··· h[(l − 6K )(T + 1) + 6K ] for l = 6K, . . . , 7K − 1 = (fe/PRF) (the other variables are described in Tab. 3-1). The corresponding ~ (ω)|2 , where H~ (ω) is the DTFT of ~h[l] and is periodogram is given by ϕ~(ω ) = (1/L)|H where T 57 given by: K− ∑1 ~ (ω)= H e l=0 ··· + jωo l(T +1) −jωl e 7∑ K−1 + K− ∑1 ejωo ((l−K )(T +1)+K ) e−jωl l=K ejωo ((l−6K )(T +1)+6K ) e−jωl (3–2) l=6K Therefore, ~ (ω) = H ( 6 ∑ ej (ωo −ω)sK ) (K−1 ∑ s=0 ) ej (ωo (T +1)−ω)r (3–3) r=0 Fig. 3-2 shows the periodogram spectral estimate of the regularly sampled sinusoid ϕ(ω ) and the interleaved SIRE sampled signal ϕ~(ω ). The spectrum of the complex ~ (ω) can sinusoid is not only distorted, but it peaks at a different frequency. Note that H be re-written as: ~ (ω) = H ( 6 ∑ ej (ωo −ω)sK ) (K−1 ∑ ) ej (ωo −ω)r ejωo T r r=0 s=0 (3–4) Figure 3-1. Synchronous Impulse Reconstruction (SIRE) equivalent time sampling. = 2πm/T , where m ∈ Z, then H~ (ω) reduces to H (ω). This condition implies that the frequency (in Hz) of the complex sinusoid f = fe ωo /2π = m × PRI, Therefore, if ωo is an integer multiple of the pulse repetition frequency. Unless this condition is true, interleaving will lead to distortion of the complex sinusoid. 58 0 SIRE (interleaved) Regularly sampled −10 −20 dB −30 −40 −50 −60 −70 0 0.1 0.2 0.3 0.4 Normalized frequency (cycles/sample) 0.5 Amplitude Figure 3-2. Spectrum of SIRE sampled complex sinusoid sampled after interleaving compared to the spectrum of sampled regularly above the Nyquist rate. 1 0.5 Magnitude 0 0 0.2 0.4 0.6 time (s) 0.8 1 −6 x 10 1 0.5 0 0 20 40 60 80 frequency (MHz) 100 120 Figure 3-3. Spectrum SIRE sampling pattern: One fast time pulse (N = 7 samples). A single complex sinusoid, sampled regularly below the Nyquist rate (fs ), should consist of a single peak at an ambiguous frequency in the frequency domain (in a bandwidth of fs ). However, due to the irregular sampling pattern of the SIRE sampling technique, a single sinusoid will be seen as multiple peaks within this bandwidth. Fig. 3-3 shows the SIRE sampling pattern (in real time) and its corresponding spectrum for a single fast time pulse (N samples). As expected, this will result in a sinc like function every 40 MHz (fs ) in the frequency domain. However repeating this sampling pattern K times will correspond to sampling in the frequency domain as seen in Fig. 3-4. Therefore, the spectrum of a single sinusoid sampled using the SIRE 59 sampling scheme will correspond to convolving the spectrum of this sampling scheme with that of a sinusoid resulting in multiple peaks. In the next section, we discuss some of existing algorithms for RFI suppression as Amplitude well as the limitations posed by this sampling scheme, based on the analysis above. 1 0.5 Magnitude 0 0 0.5 1 1.5 time (s) 2 2.5 −6 x 10 1 0.5 0 0 20 40 60 80 frequency (MHz) Figure 3-4. Spectrum SIRE sampling pattern (N × K 100 120 = 1351 samples). 3.4 Existing RFI Suppression Methods One popular technique for RFI suppression based on conventional sampling involves the use of notch filters. This method involves estimating the spectrum of the corrupted signal and removing the spikes in this spectrum using a notch filter. This method works well for narrowband interference sources. However, it will introduce sidelobes in the time-domain [59–61]. Filtering techniques in general, suffer from filter transients and reduced data length. The notch filtering problem is even more severe because of the ambiguity in frequency for the SIRE sampling scheme based on the analysis in the previous section and Fig. 3-2 for the interleaved signals. Also, if the analysis of the corrupted signal is performed in real-time (before interleaving), one interference source will appear to have multiple peaks in the spectrum due to irregular sampling as seen in Fig. 3-4, which makes this method not applicable. Modelling the RFI using AR models can also be used for suppression (see Appendix). The irregular sampling of the SIRE data makes this endeavour challenging. 60 However, the SIRE sampled data is sampled regularly in fast time and slow time, and this can be exploited for AR modelling. There are only N = 7 samples sampled regularly = 193 regularly sampled samples in slow time. These slow time samples can be used for AR modelling with N = 7 snapshots allowing for in fast time, whereas there are K more freedom in the choice of the AR model order (see Appendix for AR modelling of SIRE sampled data). Modelling the narrowband RFI as a sum of sinusoids, and estimating their parameters has been shown to be effective for suppressing RFI with little signal distortion [59],[62]. This method involves estimating the amplitude, frequency and phase of each interfering sinusoid and subtracting the resulting sinusoid from the corrupted data. The effectiveness depends on how accurate these parameters are estimated, and is reduced if the sinusoidal model for the RFI signals starts to breakdown [62]. This occurs when the duration of data is greater than the modulation time (inverse of modulation bandwidth) of the RFI signals. For instance, a 3 kHz narrowband voice channel will have a modulation time of approximately 0.3 ms, whereas wideband TV signals with bandwidth of several kHz will have a much smaller modulation time [62]. If the duration of the processed data is greater than this modulation time, the estimated parameters will change during the acquisition time, leading to less effective suppression. These methods are also computationally expensive when many interference sources are estimated. When the RFI signals are sampled using the SIRE equivalent sampling scheme, estimation of these parameters becomes even more challenging due to the irregular sampling pattern and aliasing introduced, even if the model is accurate. Another technique for RFI suppression is using passive data to adaptively suppress RFI from active radar data by projecting the measured active data to a signal subspace created by the passive data. This method assumes orthogonality between the desired target signatures and the RFI, and has been shown to be effective for RFI suppression 61 in [63] for conventionally sampled data. However, as noted in [54], this method is inadequate for suppression of SIRE sampled data, due to the irregular sampling pattern and aliased samples of the RFI. These challenges have prompted the need for new RFI suppression techniques for the SIRE sampling scheme. The averaging method proposed in [58] and also detailed therein, has been shown to suppress wideband and narrowband interferers. The method is based on repeating the measurements from the same range profile M times and averaging the repeated measurements. The averaged samples are then interleaved and used for generating SAR images. Fig. 3-2 shows the amount of suppression as a function of the number of repeated measurements based on simulated RFI sources. A similar plot can be seen in [58]. An important point to note is that this method of suppression does not take into account any properties of the RFI signal, which is the motivation for improving the performance. −10 Suppression (dB) −15 −20 −25 −30 −35 0 200 400 600 800 Number of Pulses Averaged (M) 1000 Figure 3-5. RFI Suppression (dB): Averaging method (M realizations) for simulated SIRE sampled RFI signals. In this chapter, the averaging method is improved by analyzing the data in ’real-time’ (before interleaving). The aliased samples of the data in ’real-time’ are modelled as a sum of sinusoids, in other to achieve further suppression with little signal distortion. The parameters of the sinusoids are estimated and the resulting sinusoids are subtracted 62 from the data using the RELAX algorithms [31],[57] (to provide accurate estimates of the parameters) before averaging. Based on the analysis in the previous section, a single sinusoid appears as multiple peaks due to the irregular SIRE sampling pattern in ’real-time’. However, in theory, estimating the parameters of a single sinusoid from the maximum peak location of the spectrum, and subtracting this from the data, will correspond to its removal from the spectrum. This will eliminate all the multiple aliased peaks (Fig. 3-4). This analysis will be shown on simulated sinusoids in the results section. The RELAX algorithm and its multi-snapshot counterpart are described in the next section and the steps for RFI suppression are also presented. 3.5 Proposed RFI Suppression Method: RELAX and Averaging 3.5.1 Modelling of RFI The proposed suppression method, entails modelling RFI signals of length L collected in real time (before interleaving) as a sum of P complex-valued aliased sinusoids as described in Eq. (3–5): z= P ∑ p=1 αp a(fp ) (3–5) where αp and fp are the complex amplitude and frequency of the pth sinusoid and [ a(fp ) = 1 ej 2πfp ··· ej 2π(L−1)fp The received measurement signal can be written as y ]T = z + s + n, where z, s, and n, are the RFI signal, desired target returns, and receiver noise, respectively. The target returns have a wide bandwidth relative to the RFI signals and can be modelled as white noise [53] ,[64]. RFI suppression, then, becomes a case of estimating the parameters of multiple sinusoids in the presence of white noise. The non-linear least squares (NLS) approach (an asymptotic Maximum Likelihood approach [55],[2]) estimates these 63 parameters by minimizing the following non-linear least squares cost function in Eq. (3–6), }P { α ^p, f^p p=1 2 P ∑ = argmin y − αpa(fp) P p=1 {α^p ,f^p }p=1 (3–6) where P is the number of sinusoids, which can be estimated using a model-order selection tool like the Bayesian Information Criterion (BIC) [65]. This method can approach the Cramer-Rao bound in performance, but it involves a multi-dimensional search and hence involves complex computations for the case of multiple sinusoids. It can also be sensitive to initializations [2],[66]. The RELAX algorithm can be used for solving the problem in an iterative manner reducing the computational complexity significantly [31]. This conceptually and computationally simple algorithm was shown to estimate sinusoidal parameters accurately and robustly even in the presence of colored noise [55]. The parameters are estimated for the above non-linear least squares fitting problem in an iterative manner as described below. 3.5.2 RELAX Algorithm The RELAX algorithm estimates the parameters as follows: Let yp , y − P ∑ i=1,p̸=i α ^ia(f^i) (3–7) The frequency and complex amplitude estimates of the pth sinusoid are, respectively, estimated by: f^p = argmax |aH (fp)yp|2 (3–8) H = a (Lfp)yp |f =f^ (3–9) fp and α ^p | {z } DTFT of yp 64 p p The RELAX algorithm steps are given by: = 1. Estimate f^1 and α^1 from y. • Step 1: Assume P • Step 2: Assume P = 2. Compute y2 based on estimates from the previous step and estimate f^2 and α ^2 . Compute y1 and re-estimate f^1 and α^1 . Re-iterate previous steps until practical convergence. • • Step 3: Assume P = 3. Compute y3 and estimate f^3 and α ^3 . Re-compute y1 and ^ ^ ^ re-estimate f1 and α ^1 from f2 , α^2 , f3 ,α^3 . Re-iterate until convergence or a fixed number of iterations. Remaining Steps: Continue until P = P^, which is an estimated or desired number. Note that, the frequencies and complex amplitudes in (3–8) and (3–9), respectively, are estimated using the DTFT of the signals yp . This can be efficiently computed using the FFT and zero-padding for conventionally (regularly) sampled data. Based on Fig. 3-3 and the analysis leading to Eq. (3–4), as previously discussed, the interleaving process of a SIRE sampled sinusoid leads to a distortion of that signal except for a specific case, being that the frequency of the sinusoid is an integer multiple of the PRI. The analysis of the RFI using the RELAX algorithm will, therefore, be performed on the data in real-time (before the interleaving process). As will be shown in the results section, the estimated complex amplitudes and frequencies (although possibly ambiguous), can be used to accurately reconstruct the aliased RFI samples and yield effective RFI suppression using the RELAX algorithm. The RELAX algorithm requires the computation of the spectrum of the received samples. For irregularly sampled SIRE data, this spectrum can be computed using an FFT after re-sampling the data (interpolating with zeros). Re-sampling this data to give a regularly sampled data with effective sampling frequency of fe , will lead to a significantly long data sequence with most of the samples being zero. For instance, one realization = 1) of a SIRE sampled data, sampled at fs = 40 MHz , contains N = 7 aliased samples per PRI. After re-sampling to an effective rate of fe = 7.72 GHz, each PRI will consist of T = fe × PRI = 7720 samples. Hence a total of T × K = 7720 × 193 ≈ 1.5 (M 65 million samples per realization. Therefore, applying a direct FFT (with zero-padding) to this re-sampled data to estimate the frequencies and complex amplitudes becomes computationally intensive with a computational complexity of O(T KlogT K ). Note that similar to Fig. 3-4, a single sinusoid sampled using the SIRE sampling technique and re-sampled as discussed above to give an effective sampling rate of fe = 7.72 GHz will repeat itself approximately every 40MHz (A/D rate) in the frequency domain, due to aliasing. In order to reduce the computational complexity of this re-sampling scheme, the regular sampling of the data in both fast and slow time can be exploited and the spectrum can be computed only on a 40 MHz bandwidth to save on computations. The analysis is performed as follows: The spectral estimate for SIRE sampled data in real time (before interleaving) based on parameters in Tab. 3-1 is given by: X (f ) = ∑∑ n xm,n e−j 2πf (mm +nn ) (3–10) m where n = 0, 1, 2, · · · , N − 1 (N = 7) m = 0, 1, 2, · · · , K − 1 (K = 193) n = s = 25 ns, (ADC sampling rate), m = PRI + e . A direct computation of the spectrum in Eq. (3–10) is obviously computationally intensive, especially for a fine grid size in frequency. Assuming f = k1 + k2 f m 66 (3–11) where k1 = 0 , 1, 2, · · · , K 1 − 1 K1 = T = 7720 k2 = 0 , 1, 2, · · · , K 2 − 1 K2 = 1/(mf ) f = fixed grid size (in Hz) Eq. (3–11) is the frequency grid (in Hz) on which the spectrum in Eq. (3–10) will be computed. Note that the choice of K2 determines the grid spacing f and the choice of the k1 values determines the portion of the bandwidth in which the spectrum is to be estimated. For instance, k1 = 0, 1 · · · T = 7720 computes the spectrum over the entire 7.72 GHz (effective sampling rate) bandwidth. For the frequency grid specified in (3–11), the spectrum in (3–10) can be re-written as follows: X (f ) = ∑∑ n xm,n e−j 2π( m +k2 )(mm +nn ) k1 (3–12) m which simplifies to: X (k1 , k2 ) = ∑ e−j 2π( m +k2 f )nn k1 n X (k1 , k2 ) = ∑ −j 2π K2 m k xm,n e 2 m ∑ e−j 2π( m +k2 f )nn Xn (k2 ) k1 (3–13) n From Eq. (3–13), the spectrum is computed by summing up multiple FFTs. Also because the signal is aliased, the spectrum needs only to be computed over a small portion (40 MHz - A/D sampling rate) of the entire bandwidth. The computational complexity of this algorithm is O(N K2 logK2 + K1 K2 N ). Note that the bottle neck of this algorithm is in the second term. When the spectrum is computed over the entire frequency grid, K1 = T = 7720, the computational complexity is on the same order as re-sampling and applying an FFT. However, when the spectrum is computed over a 40 67 = 40), this algorithm drastically improves on the computation. For example, for a frequency grid with spacing (f ) of approximately 2 kHz, the spectrum in MHz bandwidth (K1 Fig. 3-4 was computed in 0.26 secs when the SIRE sampled data was re-sampled and the FFT was applied directly using the MATLAB software. However, the spectrum based on Eq. (3–13) was computed in 0.035 secs on a 40 MHz grid. Table 3-2. Suppression Algorithm: RELAX + Averaging Step 1: RELAX (P sinusoids estimated). - Compute the DTFT of the measured data y from (3–13) and estimate f^1 and α ^1 , using (3–8), (3–9) and (3–13). - Compute y2 using (3–7) and its DTFT using (3–13). Estimate f^2 and α ^2 . Re-estimate f^1 and α ^1 from y1 and iterate. Continue for yp , f^p and α ^p (3 ≤ p ≤ P ) (Section IV.B). Step 2: Reconstruct aliased RFI samples using {f^i }Pi=1 and {α ^i}Pi=1 . Subtract from each realization (y). Step 3: Average residue from each realization and interleave. This spectrum is used in the RELAX algorithm (3–8) and (3–9), to estimate the parameters of the sinusoids present. The RELAX algorithm is applied here to one realization (M = 1) of SIRE sampled data (which correspond to a data with an acquisition time of 0.193 ms based on Tab. 3-1). Therefore a narrowband interference source with a modulation bandwidth of 5 kHz or less can be accurately approximated as a single tone, whereas multiple sinusoids are needed to model an interference source with wider bandwidth. The sinusoidal model begins to break down for very wideband interferers. In the next subsection we propose the multi-snapshot RELAX algorithm for SIRE sampled data that provides a more accurate sinusoidal model for the RFI signals by using fewer samples (smaller modulation time), for suppression. The overall proposed RFI suppression algorithm can be summarized in the following steps as shown in Tab. 3-2 for RELAX. 68 3.5.3 Multi-snapshot RELAX Algorithm The multi-snapshot RELAX algorithm [57] uses N = 7 samples (150 ns acquisition time) for RFI suppression. Interference sources with modulation bandwidth of 6.7 MHz or less can be accurately modelled as sinusoids, which includes wideband interferers like TV broadcasts etc. The multi-snapshot RELAX algorithm (M-RELAX for short) proposed for angle and waveform estimation in [57] is a modification of the originally proposed RELAX algorithm [31]. The algorithm estimates the angle of arrival (using multiple snapshots of the data) and the corresponding waveform for each snapshot. This algorithm is proposed here for RFI suppression of SIRE sampled data to = provide a more accurate sinusoidal model for the RFI signals. Here, each set of N 7 fast time samples is treated as a snapshot. The data is split into K = 193 total snapshots based on the parameters in Tab. 3-1. The frequency of a single tone is estimated by averaging the periodogram of each snapshot and finding the frequency that maximizes the average. The complex amplitudes of each snapshot is estimated by finding the complex value of the spectrum of each snapshot at the estimated frequency. Note that for a single complex sinusoid, K = 193 complex amplitudes are estimated from each snapshot, whereas only one frequency is estimated. The parameters are estimated as given in Eq. (3–14) (modification of NLS for the multi-snapshot case [57]): { }P α ^ p, f^p p=1 2 K P ∑ ∑ = argmin αp (k )a(fp ) x(m) − {αp fp }P p=1 p=1 m=1 (3–14) = [αp(1), αp(2), . . . , αp(K )] contains the estimated complex amplitudes of the pth sinusoid for each of the K snapshots, f^p is the estimated frequency of the where αp pth sinusoid for all snapshots and x(m) is the mth snapshot. These parameters are estimated as follows. Let xp (m) , x(m) − P ∑ i=1,p̸=i 69 α ^i(m)a(f^i ) (3–15) The estimates described above are given by: f^p = argmax fp K ∑ m=1 |aH (fp )xp (m)|2 (3–16) and α ^p(m) = aH (fp )xp (m) | |fp =f^p , L {z } DTFT of xp (m) m = 1 , 2, . . . , K (3–17) The multi-snapshot RELAX algorithm steps are as follows: = 1. Estimate f^1 and α^1 (m) from x(m), for m = 1, 2, . . . , K. • Step 1: Assume P • Step 2: Assume P = 2. Compute x2 (m) based on the estimates from the previous step and estimate f^2 and α ^2 (m), for m = 1, 2, . . . , K. Compute x1 (m) and ^ re-estimate f1 and α ^1 (m), for m = 1, 2, . . . , K. Re-iterate previous steps until practical convergence. • • Step 3: Assume P = 3. Compute x3 (m) using {α ^ p, f^p}2p=1 and estimate f^3 and α ^3 (m), for m = 1, 2, . . . , K. Re-compute x1 (m) and re-estimate f^1 and α^1 (m) from {α ^ p, f^p}3p=2 , for m = 1, 2, . . . , K. Then re-compute x2 (m) and re-estimate f^2 and α ^2 (m) from {α^ p, f^p}p=1,3 , for m = 1, 2, . . . , K. Re-iterate until convergence. Remaining Steps: Continue until P = P^, which is an estimated or desired number. For the SIRE sampled data, the spectrum (DTFT) of each snapshot can be described as follows. Let x(m) = −1=7 {dm (n)}N correspond to the mth snapshot, n=0 where dm (n) denotes the nth sample of the mth fast time pulse (m = 1, 2, . . . K). Note that each snapshot is regularly sampled (at the A/D rate). The spectral estimate can therefore be computed using an FFT multiplied by a corresponding phase shift (over a 40 MHz bandwidth). The spectrum of each snapshot is given by: Xm (f ) = where (f ∈ 6 ∑ n=0 dm (n)e−j 2πf (nn +mm ) (3–18) (0, fs)) is the frequency (in Hz), n = 1/fs and m are the sampling period and the time difference from one snapshot to the next, respectively. Equation (3–18) 70 above can be simplified as follows: Xm (f ) = e−j 2πf (mm ) 6 ∑ n=0 dm (n)e−j 2π fs n (3–19) dm (n)e−j 2π R n (3–20) f which simplifies to: Xm (r) = e−j 2π R ( r for a discrete frequency grid r mm n ) 6 ∑ n=0 r = 0, 1 . . . R − 1. It is important to note that parameter identifiability (maximum number of sinusoids that can be uniquely identified) [67–69], becomes an issue with this approach. Given N = 7 real valued samples, only up = 2 sinusoids (amplitude, frequency, and phase), can be uniquely identified. Estimating more than P = 2 sinusoids will significantly distort the target signatures. to P The overall proposed RFI suppression algorithm can be summarized in the steps as shown in Tab. 3-3 for multi-snapshot RELAX. We also consider Auto-regressive(AR) modelling of the RFI data for suppression, however due to desired signal distortion, this approach is not effective. AR for RFI suppression for the SIRE radar is described in the next section. Table 3-3. Suppression Algorithm: M-RELAX + Averaging Step 1: M-RELAX (P sinusoids estimated) - Compute the DTFT of the mth snapshot x(m) of the measured data y from (3–20) and estimate f^1 and α ^1 (m), using (3–16), (3–17). - Compute x2 (m) using (3–15) for each snapshot and its DTFT using (3–20). Estimate f^2 and α ^2 (m). Re-estimate f^1 and α^1 (m) and iterate. ^p (3 ≤ p ≤ P ). (Section IV.C). Continue for yp , f^p and α Step 2: ^i(m)}Pi=1 for each Reconstruct aliased RFI samples using {f^i }Pi=1 and {α snapshot. Subtract from each realization (y). Step 3: Average residue from each realization and interleave. 71 3.6 Autoregressive (AR) Modelling Auto-regressive (AR) models, which is commonly used for modelling narrowband (”peaky”) signals, can be used for estimating and suppressing RFI signals. The measured signal (RFI, desired target returns, and thermal noise) is modelled as an AR process [62]. The AR modelling (linear prediction modelling) equation is written as: y [ tn ] = − q ∑ i=1 a[i]y [tn − i] + u[tn ] (3–21) where, y [tn ] is the measured data sequence, u[tn ] corresponds to the white noise term at a time instant tn and q corresponds to the AR order, which is determined by the number of spectral peaks and their widths. The assumption is that the first term on the right hand side of Eq. (3–21) corresponds to the RFI signal. The suppression process therefore involves estimating {a[i]}qi=1 and using the coefficients to suppress the RFI signals. Note that Eq. (3–21) can be re-written as: y [tn ] = H (z )u[tn ] (3–22) where H (z ) = 1/A(z ) = 1/(1 + a[1]z −1 + . . . + a[q ]z −q ), with z −1 being the delay operator. The RFI suppression process involves passing the measured data through the inverse filter 1/H (z ) = A^(z ) (from the estimated AR coefficients {^ a[i]}qi=1 ). The well-known methods for solving for the AR coefficients in (3–21) include the Yule-Walker (YW) method, Prony method and the modified Prony method [2]. The YW and Prony methods give similar results for large data samples. However, for smaller data records the Prony method tends to give gives more accurate AR estimates [2]. If both sides of the forward linear prediction equation (3–21) are multiplied by y [tn − tm ], and the expectation is taken, the well-known Yule-Walker equations are obtained. 72 r(1) r[0] · · · r[−q + 1] a[1] . . .. .. = − ... .. . r(n) r [q ] · · · r[0] a[q ] (3–23) which can be re-written as r = −Ra. Where r and R are the covariance vecotr and matrix of the data. The AR coefficients (a) are estimated by solving Eq. (3–23). The Yule-Walker method estimates the coefficients by replacing r with the standard biased autocorrelation sequence (ACS) estimator [2]. The Prony method solves the forward linear prediction equation (3–21) using least squares (LS). The problem reduces to (3–23), with the covariance sequence estimated by the standard unbiased ACS estimator [2]. The Modified covariance (Prony) method (which improves on the Prony method) combines the forward linear prediction in (3–21) and the backward linear equation given below in (3–24) to solve for the AR coefficients using least squares: y [ tn ] = − q ∑ i=1 ab [i]y [tn + i] + ub [tn ] (3–24) where ab [i] = a[i]. This Modified covariance method is applied to the SIRE sampled data which is sampled regularly in fast-time and slow-time. N samples (K = 7 sets of the slow-time regular = 193 samples per set) are used for AR modelling. The Modified covariance 73 equations can be written in matrix form as follows (for each set of slow time samples): y (q ) y [0] y [q − 1] · · · .. .. .. . . . a[1] y (K − 1) y [K − 2] · · · y [K − q − 1] a[2] = − (3–25) .. y [1] . y (0) · · · y [ q ] . . . a[q ] .. .. .. y (K − q − 1) y [K − q ] · · · y [K − 1] where q is the AR order. The equation can be re-written as yn = −Yn a, for n = 1, 2, . . . N with N =7 (3–26) The least-squares solution of this overdetermined linear system of equations is given by: a = −(YnT Yn )−1 YnT yn for n = 1, 2, . . . N with N =7 (3–27) where (YnT Yn )−1 estimates the covaraince matix and YnT yn estimates the ACS in (3–23). A more accurate estimate of the covariance matrix in (3–27) is derived by averaging the N = 7 snapshots. 3.7 Experimental Results 3.7.1 Simulations In this section, a signal consisting of three sinusoids in white noise (SNR = 10dB) is simulated and sampled using the SIRE equivalent scheme based on the parameters = 1). The sinusoids have frequencies = 650.255 MHz, all with amplitudes of 1. in Tab. 3-1, with no repeated measurements (M f1 = 111.111 MHz, f2 = 300 MHz and f3 Note that the samples obtained will also correspond to a signal containing sinusoids with = f1 mod fs, fa2 = f2 mod fs and fa3 = f3 mod fs, as well as a signal containing sinusoids fa1 + kfs , fa2 + kfs and fa3 + kfs (where k ∈ Z and fs is the A/D frequencies fa1 rate). This ambiguity in frequency is caused by aliasing due to the low A/D rate of the radar. The RELAX algorithm can be used to accurately estimate the complex amplitudes 74 of these sinusoids as well as the ambiguous frequencies. This is achieved using the spectrum in (3–13) estimated only on a 40 MHz bandwidth to save on computations. The estimated parameters are then used to reconstruct the aliased samples, in order to suppress the sinusoids through subtraction. Original Signal Amplitude Amplitude 0 −2 1200 1250 1300 Samples Spectrum 1350 0.4 0.2 0 0 10 20 Frequency (MHz) 30 2 0 −2 1200 Magnitude Magnitude Residue 2 1250 0.2 0 40 0 10 20 Frequency (MHz) Magnitude Amplitude −2 1300 Samples Spectrum 1350 0.4 0.2 0 20 Frequency (MHz) 30 2 0 −2 1200 Magnitude Amplitude 0 10 40 Residue 2 0 30 B Residue 1250 1350 0.4 A 1200 1300 Samples Spectrum 40 1300 Samples Spectrum 1350 0.4 0.2 0 0 C 1250 10 20 Frequency (MHz) 30 40 D Figure 3-6. RFI suppression (SIRE sampling) - Signal and spectrum of simulated data containing 3 real-valued sinusoids in white noise after suppression using RELAX with P (real-valued) sinusoids estimated (A) Original data, (B) P = 1, (C) P = 2, (D) P = 3. Fig. 3-6, shows the original signal, its spectrum and the progression of suppression as the number of estimated aliased parameters increases. From Fig. 3-6, we observe that the spectrum of the three sinusoids contains multiple peaks, due to the irregular sampling as described previously. By estimating the ambiguous frequency and complex amplitudes of each of the sinusoids (on only a 40 MHz bandwidth), multiple aliased peaks are suppressed. The purpose of the above simulations is to show the ability of the 75 RELAX algorithm to estimate the ambiguous frequency and complex amplitudes of the sinusoids on a small bandwidth correctly, to effectively suppress the sinusoids (including the multiple aliased peaks). In the next subsection the sniff dataset collected using ARL’s SIRE UWB radar is analyzed and the RFI is suppressed using both the RELAX and multi-snapshot RELAX algorithms. Comparison with AR modelling of the RFI is also provided. 3.7.2 Sniff Experimental Data The sniff data to be analyzed, was collected by ARL using the SIRE UWB radar in passive mode based on the parameters in Tab. 3-1. Each set of data consists of L = K × N = 1351 samples. In this subsection this RFI data will be analyzed using the proposed algorithms. Two sets of RFI data with different energy levels are analyzed. For simplicity they will be referred to as File1 and File2. Each set of the data, consists of M = 88 realizations. The amount of suppression achieved are presented in Table3-4. A wideband echo signal which represents a return from a single point target is simulated. This signal is added to each realizations (M = 88) of the sniff data, in a way that the echo signal adds up coherently. The goal is to show how much distortion is introduced to the desired signals after the application of the RFI suppression algorithms. Fig. 3-7 shows the amount of suppression achieved when the RELAX algorithm with P real-valued sinusoids are suppressed for each realization and the residues are averaged (File1). These results are compared to straightforward averaging, also in this figure. A similar analysis is performed for the multi-snapshot RELAX algorithm and the amount of suppression can be seen in Table 3-4 and 3-5. The average power of the signals {s^(i)}Li=1 , are computed using (3–28): ( L ) ∑ 10 log10 |s^(i)|2 /L i=1 (3–28) From Table 3-4 and 3-5, it is clear that the amount of suppression increases as the number of real-valued sinusoids increases for the RELAX algorithm. This improvement 76 Table 3-4. RFI Suppression (dB): File 1 (P~ Avg. RELAX = 1) 26.61 (P = 4) 27.79 (P = 7) 28.06 (P = 10) 20.89 23.46 (P M-RELAX = 1) *PI (P = 4) *PI (P = 7) *PI (P = 10) 24.19 (P Table 3-5. RFI Suppression (dB): File2 (P~ Avg. = 1) = 1) RELAX M-RELAX = 1) 21.55 (P = 4) 22.30 (P = 7) 22.61 (P = 10) = 1) *PI (P = 4) *PI (P = 7) *PI (P = 10) 18.49 20.02 (P 20.22 (P RELAX/M-RELAX (P~ ) AR (q) = 1) 29.11 (P = 4) 30.38 (P = 7) 31.27 (P = 10) 27.04 (P 23.21 (2) 24.52 (20) RELAX/M-RELAX(P~ ) AR (q) = 1) 23.47 (P = 4) 23.65 (P = 7) 25.97 (P = 10) 22.40 (P 20.03 (2) 20.37 (20) *PI - Parameter identifiability not met. comes at a cost of increased computational complexity. However, the target signatures are left basically unaltered as can be seen in Fig. 3-8. The multi-snapshot RELAX algorithm shows a significant amount of suppression of the data as the number of sinusoids increases. Due to the issue of parameter identifiability discussed in the previous section, estimating more than P sinusoids (in theory), using only N = 2 real-valued = 7 real-valued samples per-snapshot will effectively suppress all the samples to zero. This leads to the suppression of the target energy, as can also be seen in Fig. 3-8. The multi-snapshot RELAX algorithm can be seen to improve on the suppression with little target distortion for P = 1 based on the real RFI data collected using the SIRE radar. This algorithm is combined with the RELAX algorithm to effectively suppress both wideband and narrowband interferers. This improvement is seen in Table 3-4 and 3-5 and Fig. 3-9 shows the reconstructed echo after suppression. A similar analysis is performed for AR modelling. The AR modelling improves on the suppression compared to averaging as can be seen in Fig. 3-10. However, this inverse 77 30 30 Averaging RELAX (P = 10) and averaging Averaging RELAX (P = 1) and averaging 10 Amplitude 10 Amplitude 20 20 0 0 −10 −10 −20 −20 −30 0 200 400 600 800 Samples 1000 1200 −30 0 1400 200 400 600 800 Samples A 1000 1200 1400 B Figure 3-7. RFI suppression - RELAX algorithms with P (real-valued) sinusoids estimated and suppressed from sniff data (File1) compared to averaging. (A) P = 1, and (B) P = 10. Echo signal RELAX M−RELAX 60 40 Amplitude Amplitude 40 20 20 0 0 −20 −20 −40 650 Echo signal RELAX M−RELAX 60 660 670 680 690 −40 650 700 660 670 680 Samples Samples A B 690 700 Echo signal RELAX M−RELAX 60 Amplitude 40 20 0 −20 −40 650 660 670 680 690 700 Samples C Figure 3-8. Echo retrieval (File1) - RELAX with P (real-valued) sinusoids compared to ideal echo signal. (A) P = 1, (B) P = 2, and (C) P = 10. 78 Echo signal RELAX/M−RELAX 60 40 Amplitude Amplitude 40 20 20 0 0 −20 −20 −40 650 Echo signal RELAX/M−RELAX 60 660 670 680 690 −40 650 700 660 670 680 Samples Samples A B 690 700 Echo signal RELAX/M−RELAX 60 Amplitude 40 20 0 −20 −40 650 660 670 680 Samples 690 700 C Figure 3-9. Echo retrieval (File1) - RELAX with P (real) sinusoids combined with M-RELAX with P~ = 1 real sinusoid, compared to ideal echo signal. (A) P = 1, (B) P = 2, and (C) P = 10. filtering technique leaves the desired signal distorted. This distortion is increased as the model order increases (due to filtering transients) as seen in Fig. 3-9. Hence the combined RELAX and multi-snapshot RELAX outperforms the AR approach in terms of both RFI suppression and desired target echo preservation. 3.8 Conclusions In this chapter, we have proposed a method for RFI suppression for the SIRE UWB radar, which is a cost efficient system of sampling returned radar signals used for detecting land mines and IEDs developed by ARL. The low sampling rate and irregular sampling pattern of this radar poses a challenge for Radio Frequency Interference (RFI) suppression as the measured RFI signals will be severely aliased. In this chapter, we have discussed the challenges of RFI suppression for this radar and proposed 79 30 30 Averaging AR−2 10 0 10 0 −10 −10 −20 −20 −30 0 200 400 600 800 Samples 1000 1200 Averaging AR−20 20 Amplitude Amplitude 20 −30 0 1400 200 A 400 600 800 Samples 1000 1200 1400 B Figure 3-10. RFI suppression - AR modelling with order q compared to averaging for sniff data (File1). (A) q = 2, and (B) q = 20. Echo AR−2 60 40 Amplitude Amplitude 40 20 0 −20 −40 650 Echo AR−20 60 20 0 −20 660 670 680 Samples 690 −40 650 700 A 660 670 680 Samples 690 700 B Figure 3-11. Echo retrieval (File1) - AR modelling with order q, compared to ideal echo signal. (A) q = 2, and (B) q = 20. using the RELAX algorithm and its multi-snapshot counterpart as an intermediate step to the already proposed averaging scheme for RFI mitigation, for the SIRE UWB radar. The results show that the RELAX algorithm can suppress RFI further than just averaging without altering desired target echo signals. The RELAX algorithms are easy to implement since they just involve FFTs. They have been shown to outperform AR modelling of the RFI singals. The multi-snapshot RELAX uses a shorter time-duration (and fewer samples) for suppression, which yields a more accurate wideband model of the RFI as sum of sinusoids compared to the RELAX algorithm. However, this algorithm significantly 80 suppresses target signatures as the number of sinusoids increases and is limited to estimating only one sinusoid. Combining this algorithm assuming just one sinusoid, with the RELAX algorithm increases the suppression performance with little signal distortion. 81 CHAPTER 4 DATA-ADAPTIVE, SPARSE SUPER-RESOLUTION IMAGING FOR THE SIRE FLGPR RADAR 4.1 Chapter Summary In the previous chapter, the problem of RFI suppression was presented for the Forward Looking Ground Penetrating Radar (FLGPR) known as the SIRE radar built by the Army Research Laboratory. FLGPR has multiple applications, one of which includes its use for detecting landmines and other buried improvised explosive devices (IEDs). In this chapter, we focus on data-adaptive high resolution imaging for this SIRE FLGPR. The standard method for generating SAR images for this radar is the back-projection algorithm, which is limited by poor resolution and high side-lobes problems. In this chapter, we consider using the Sparse Iterative Covariance-based Estimation (SPICE) and the Sparse Learning via Iterative Minimization (SLIM) algorithms for generating sparse high-resolution images for FLGPR. The pre-processing involves an orthogonal projection of the received measurements to a subspace related to the region of interest for data and clutter reduction. These user-parameter free algorithms are capable of providing sparse results as well as improved resolution synthetic aperture radar (SAR) images. We also examine the well-known CLEAN approach based on a signal model in the time domain for imaging. We show using simulated data that the SPICE/SLIM algorithms provide higher resolution than CLEAN and standard backprojection algorithm. Imaging using real data collected via the Synchronous Impulse Reconstruction (SIRE) radar, a multiple-input multiple-output (MIMO) FLGPR radar developed by the Army Research Laboratory (ARL)) is also used for analysis. 4.2 Introduction The global problem of landmines and other buried improvised explosive devices (IEDs) is affecting both military and civilians alike [70–74], and effective as well as efficient methods for detecting these devices is very important in the world today. Methods for detecting landmines include but are not limited to the use of metal 82 detectors, infrared sensing of the land surface [75], biological sensors such as animals (dogs) [76] and more recently detecting fumes of the mines using lasers to ionize the air [74]. Radar is an excellent tool for remote sensing applications [77]. Ground penetrating radar (GPR) which transmits an electromagnetic wave into the ground and examines the back-scattered returns to determine buried objects has become a useful tool for effectively detecting landmines and IEDs [78–80]. By operating in forward looking mode, GPR can be applied to the problem of landmine detection as it inspects the ground surface with a safe stand-off range as can be seen in Fig 4-1. Impulse based forward looking ground penetrating radar (FLGPR) typically transmits a mono-cycle pulse with typical operating frequency range spanning the UHF, and L bands [54, 79]. The low frequency of GPR provides the necessary ground penetrating properties of the radar and the large bandwidth provides the necessary down-range resolution. The cross-range resolution on the other hand is limited by the antenna beamwidth [81]. Increasing the antenna physical size can improve cross-range resolution. However, this is limited by physical antenna size constraints. Side-looking synthetic aperture radar techniques improve cross-range resolution by synthesizing a virtual aperture much larger than the physical aperture [11, 82–84]. However, in forward looking mode, the cross-range resolution is limited by the physical radar size. A multi-input multi-output (MIMO) radar [85] can be used to enhance this resolution [86][87]. For example, Fig. 4-1 shows the FLGPR for landmine detection built by the Army Research Laboratory (ARL) known as the synchronous impulse reconstruction (SIRE) radar [54]. This radar consists of 2 transmitters and 16 receivers and exploits waveform diversity [54, 86, 88] to enhance cross-range resolution by alternatively transmitting between its two transmitters. The well-known conventional method for imaging for this type of radar is the standard back-projection method [89, 90]. This approach also known as the delay-and-sum (DAS) approach, suffers from high sidelobe problems and is limited by poor resolution. 83 Figure 4-1. Forward looking ground penetrating radar [54] High resolution imaging is important for separating closely spaced targets as well as distinguishing targets from clutter and such imaging techniques should be investigated. In this chapter, we focus on sparse high-resolution imaging for impulse based FLGPR. A signal model in the time domain is established since the transmitted impulse is well localized in time. Based on this model, the well-known CLEAN [91], [92] approach is analyzed for imaging compared to the standard backprojection algorithm. This technique eliminates side-lobes, but it provides no improvement in imaging resolution over the standard backprojection algorithm. Two recently proposed, user-parameter free and data-adaptive methods are considered here for imaging. The Sparse Learning via Iterative Minimization (SLIM) [29] and the SParse Iterative Covariance-based Estimation (SPICE) [93] methods are capable of providing sparse, as well as high resolution imaging results. These methods are applied to the data of significantly lower dimension for impulsed based FLGPR to achieve sparse high resolution imaging results. The data-reduction is achieved via a pre-processing technique, which involves an orthogonal projection of the received data to the subspace spanned by the dominant singular vectors of a steering matrix corresponding to the imaging ROI. An efficient decomposition of the steering matrix is performed using an eigenvalue decomposition of a matrix of much reduced dimension. 84 A conjugate gradient SPICE (CG-SPICE) algorithm is also introduced in this paper similar to the CG-SLIM algorithm [29] to speed up the computation of SPICE. The improvement in imaging results over the standard backprojection method are shown via simulated results and also real measured experimental data. For real experimental data, the SIRE radar developed by ARL [54, 94] is used for analysis. This radar was also presented as a MIMO radar in [88]. The remaining sections of this chapter are organized as follows. In Section 4.3, a proposed data model is presented for forward looking GPR based on the ARL’s SIRE radar. This MIMO radar is also briefly described therein. In Section 4.4, the standard back-projection method for imaging is analyzed as well ARL’s Recursive Sidelobe Minimization (RSM) algorithm which is based on the BP algorithm and iteratively and effectively suppresses sidelobes [54, 95]. Based on the proposed model we also show that the CLEAN approach can be used for sparse imaging for impulse based FLGPR. In Section 4.5 we present the sparse and adaptive methods for improved resolution as well as the pre-processing step of orthogonal projection for clutter and data reduction. Section 4.6 contains the numerical results based on simulated and real data. Finally the conclusions are drawn in Section 4.7. 4.3 Data Model: SIRE Impulse Based FLGPR For impulse based FLGPR, we consider the SIRE radar which is designed by ARL and mounted on an SUV for landmine detection [54]. The radar geometry as seen in Fig. 4-1 consists of two transmitters and sixteen receivers. Each transmitter transmits an impulse with a frequency range of 0.3-3.0 GHz, which determines the downrange resolution. The cross range resolution is determined by the physical 2 m aperture of this radar. This radar can be described as a practical example of a MIMO radar which exploits waveform diversity by transmitting orthogonal waveforms from the two transmit antennas located at the edges of the receive array [88]. These orthogonal waveforms are achieved by alternatively transmitting narrow pulses (in ”ping-pong” mode [96]) from 85 Figure 4-2. SIRE FLGPR: 2D aperture for SAR imaging each transmitter. This creates a virtual aperture which is effectively almost double the physical 2 m aperture of the radar hence improving cross-range resolution [97]. The 2D-aperture of received measurements shown in Fig 4-2 is used for image = 1 · · · K receive measurements for a desired imaging area consisting of i = 1 · · · L targets (pixels). Let rk (t) denote the kth receive measurement. formation where there are k This measurement can described by the following equation: rk (t) = where αk,i target, τk,i = = 1 1 Rt (k,i) Rr (k,i) Rt (k,i)+Rr (k,i) c L∑ +M i=1 αk,i zi s(t − τk,i ) + nk (t) (4–1) is the propagation path-loss, zi is the reflectivity of the ith is the trip time delay from transmitter to target to receiver, where Rt (k, i), Rt (k, i) are the distances of transmitter to target and the target to receiver, respectively. The speed of propagation is given by c and nk (t) is thermal noise associated with kth measurement. Note that the model of the received measurement in (4–1) takes into account the contribution of the M scatterers outside the ROI, i.e., 86 desired imaging area which consists of L pixels. The model in (4–1) can be simplified into the following linear equation: y = Cz + n β +n γ = A B (4–2) = [r1 (0), . . . , r1 (T − 1), . . . , rK (0), . . . , rK (T − 1)]T , which is a vector of received measurements stacked together; β = {βi }Li=1 are the pixel values in the desired imaging grid to be estimated, γ = {γi }M i=1 corresponds to the pixel values outside the ROI and n is the noise vector. The matrix A of dimensions (T K ) × L consists of delayed and scaled where y versions of the transmitted signal, given by: α1,1 s(τ1,1 ) · · · α1,I s(τ1,I ) .. .. ... A = . . αK,1 s(τK,1 ) · · · αK,I s(τK,I ) where the vector s(τk,i ) (4–3) = {s(t − τk,i )}Tt=0−1 is the transmitted impulse delayed by τk,i. This data model is used for FLGPR SAR imaging in this paper. In the next section the backprojection algorithm as well as the CLEAN algorithm are described for SAR imaging. 4.4 Back-projection/Delay-and-sum (DAS) Based Methods The standard backprojection (BP) algorithm is a well-known and widely used algorithm for FLGPR SAR imaging (also known as the delay-and-sum (DAS) algorithm). This algorithm is limited in downrange resolution by the bandwidth of the transmitted impulse and in cross-range resolution by the physical (or virtual) aperture of the radar. One other limitation of this algorithm is that it produces images with high sidelobes. A Recursive Sidelobe Minimization (RSM) algorithm based on the BP algorithm was proposed in [54], [95] for effectively suppressing sidelobes. The CLEAN approach [91] (which is also based on this BP/DAS algorithm) can also be used for eliminating 87 sidelobes [92] as well as accurately estimating weak targets by iteratively subtracting the contributions of stronger targets from the received data based on the proposed data model. In this section, we describe these algorithms as well as analyze the CLEAN approach based on the proposed data model for impulse based FLGPR SAR imaging. 4.4.1 Back-projection/DAS Based on Fig 4-2, the backprojection algorithm is described as follows: Consider the ith pixel in this figure with location (xi , yi , zi ) relative to a predefined reference point or origin. For a specific transmit-receive pair, let (xr,k , yrk , zrk ) and (xt,k , ytk , ztk ) denote the transmitter and receiver locations in this coordinate system for k = 1, . . . , K. The delay due to the transmitted EM pulse from the transmitter corresponding to the kth receive measurement to the ith pixel back to the corresponding receiver is given as: τk,i √ = τacq + 1c ( (xt,k − xi)2 + (yt,k − yi)2 + (zt,k − zi )2 √ + (xr,k − xi)2 + (yt,k − yi)2 + (zt,k − zi)2 ) (4–4) where τacq is the acquisition time delay associated with the radar system. The estimate of the reflection coefficient at the ith pixel given by βi is given as: 1 β^i = K ∑ wk rk (t − τk,i ) (4–5) K k=1 This estimate is simply a summation of delayed receive measurements with the propagation loss compensated by a weighting factor wk . This is referred to as coherent processing of the received measurements which improves SNR by a factor K compared to using a single received measurement. The backprojection/DAS algorithm is limited in resolution and suffers from poor sidelobes. The recursive sidelobe minimization (RSM) algorithm proposed by ARL for sidelobe suppression effectively suppresses sidelobes by generating multiple DAS images with apertures with randomly missing measurements and selecting the minimum value across all images [54]. In the next subsection we analyze a CLEAN approach 88 based on the data model in (4–1) for sparse imaging. These DAS based algorithms do not improve imaging resolution compared to the standard DAS algorithm. We then present new approaches to imaging using the SLIM and SPICE algorithms for improved resolution in imaging after preprocessing via an orthogonal projection of the data. 4.4.2 Sparse: CLEAN Method The data-dependent CLEAN algorithm [91] (also known as matching pursuit) for image formation based on the data model in (4–1) is briefly described and analyzed here for imaging. This technique was introduced to produce ’CLEANer’ images (where prior knowledge of the point spread function was required) [92]. The standard DAS algorithm suffers high sidelobe problems. CLEAN can be used to eliminate the side-lobes of strong returns so that weak targets can be revealed by eliminating contributions of strong targets from the receive measurements. The CLEAN algorithm can therefore be used iteratively to find the pixel location of the strongest target and then subtract all the contributions of that target from the data. The next strongest point is then computed more accurately based on the updated measurements. The RELAX algorithm [31] can therefore be used to get even more accurate estimates as well as improved imaging resolution. The RELAX approach involves estimating the strongest target, subtracting the contributions of this target and estimating the next strongest target. The initial pixel value of the strongest target is then re-estimated based on the new estimate of the next strongest target. These two estimates are then iterated back and forth to achieve more accurate estimates. The process is then repeated for all the targets in the scene of interest. Due to the exponential increase in computation as the number of targets increases, this process proves to be too computationally intensive for practical purposes when there are many scatterers in the ROI. The much faster CLEAN approach is described in the following steps based on the data model in (4–1). 89 • Step 1: Determine the brightest estimate P (io ) and corresponding location io from the backprojection image (based on 4–5). • Step 2: Subtract the contribution of brightest point from the received signals (update receive measurements). – – rk up = rk − λ(io )s(^τk,i0 ) (i ) × 1 λ(io ) = P K α k = 1···K o k,io • Step 3: Generate a new image by filling in the io th pixel with P (i0 ). • Step 4: Use updated received signals to regenerate back-projection image. • Iteration: Repeat previous steps with regenerated image until reaching a predefined threshold. η > Pσ(i2o ) . – – η threshold (typically chosen as 1). σ 2 Noise variance. This CLEAN approach, although effective for accurate and sparse imaging, is limited in resolution, which is similar to the standard backprojection/DAS algorithm. We therefore, investigate super-resolution methods for FLGPR SAR imaging. 4.5 Super-resolution Methods In this section, a new approach for high-resolution imaging is presented for impulse based FLGPR. The 2D aperture in Fig. 4-2 for imaging of the specified grid will result in a data vector in (4–1) y ∈ RT K×1 that is large (on the order of 106 ), making practical applications of adaptive techniques infeasible. Also the availability of a single data vector makes well known high resolution data adaptive approaches, such as the CAPON, APES, as well as subspace based methods [2], not directly applicable. Another challenge is that the data vector will contain clutter reflections from regions outside the imaging ROI, which need to be effectively suppressed prior to imaging. We propose a data filtering and reduction approach via time gating and orthogonal projection to reduce interference from scatterers outside the ROI. This approach involves a singular value decomposition of the steering matrix and a projection of the 90 Figure 4-3. Time gating data using the dominant singular vectors, which effectively reduces the data dimension to practical levels. This projection also filters the data to effectively utilize received energy from only the ROI. A computationally efficient method of this projection is performed via an eigenvalue decomposition to obtain an updated data model. Using the updated model, two recently proposed algorithms (SPICE and SLIM) are used for imaging to produce sparse, accurate and high resolution imaging results, even with a single data vector. The process is described in the next subsections. 4.5.1 Orthogonal Projection and Time Gating −1 For time gating, consider the kth receive measurement {rk (t)}Tt=0 . Based on (4–4), the delays of the pixels in the ROI imaging grid, τ k = [τk,1 , τk,2 , . . . , τk,I ], to the corresponding transmit-receive pair for this measurement can be computed. The minimum and maximum delays between this transmit-receive pair and the pixels in the = min(τ k ) and τk = max(τ k ), respectively. The kth τ by discarding data outside receive measurement can then be updated to {rk (t)}t=τ ROI imaging grid are given by τkmin max kmax kmin these computed delays. Without loss of generality, this procedure is shown in Fig. 4-3 for a colocated transmit-receive pair centered below (with a pre-specified standoff-range) the imaging ROI grid. Interference from the regions below the minimum delay line and above the maximum delay line are discarded. The data vector in (4–1) can then be updated to 91 yg = [r1 (τ1 ), . . . , r1 (τ1 ), . . . , rK (τK ), . . . , rK (τK )]T , and the updated data model is: min max max min yg = Ag β + Bg γ + ng . (4–6) with Ag ∈ RQ×L and Bg ∈ RQ×M (Q < T K). Note from Fig. 4-3, interference from scatterers outside the ROI still contribute to the receive measurements. In this paper, an orthogonal projection of the the data to relevant singular vectors of the data matrix is performed to reduce the effects of the scatterers outside this grid as well as to reduce the data dimension significantly (by a factor of 103 in practical applications), allowing for practical applications of high resolution imaging techniques. This interference reduction via orthogonal projection is performed in a computationally efficient manner via an eigen-decomposition and is motivated and described as follows. Consider the following penalized least squares optimization problem used for square-root LASSO [98] (for notational simplicity, we eliminate the subscript g from (4–6)). argmin ||y − Aβ − Bγ||2 +λ||β||1 β ,γ (4–7) Note here that the sparsity promoting ℓ1 constraint is placed on β (the desired estimates), whereas no constraint is placed of γ due to the fact that even if γ is sparse, the subsequent transformation eliminates this sparsity. Let Bγ = UB B VB T γ , UB γ~ via a singular value decomposition of B. The optimization problem in (4–7) for γ ~ (which is no longer sparse) given β is unconstrained and the solution is given as ^~ γ = UB T (y − Aβ). Given γ^~ , the constrained optimization problem in (4–7) is now given as follows: ^ β = argmin ||(I − UBUB T )(y − Aβ)||2 +λ||β||1 β Since B and hence UB is unknown, we assume that UB and UA , where UA are the singular vectors of A are orthogonally compliment to each other. Then Eq. (4–8) 92 (4–8) becomes: ^ β = argmin ||UA UA T (y − Aβ)||2 + λ||β||1 (4–9a) β ^ = argmin ||UA T (y − Aβ)||2 + λ||β||1 β (4–9b) β Consider the following singular value decomposition of the steering matrix A ∈ RQ×L , with Q > L in practice1 : T A = UV = UA A U~ A 0 [ ] V T (4–10) = UA AVT ~ A ] are the left singular vectors of A and the columns of where the columns of [UA , U V are their right counterparts, and the singular values of A are on the diagonal of the diagonal matrix A . Due to the find grid used for the ROI, some of the singular values of A in A are quite small. By discarding the small singular values of A, we approximate A as A ≈ Us s Vs T . Then the optimization problem in Eq. (4–9b) becomes: ^ β = argmin ||UsT (y − Aβ)||2 +λ||β||1 (4–11) β Then using Us for orthogonal projection yields: Us T y = Us T Aβ + ϵ, ~y = A~ β + ϵ (4–12a) (4–12b) Via a series of simulations, we found that the number of columns in Us is not sensitive to the image formation performance. We choose the dimension of Us by analyzing the 1 Note that since the radar illuminates a large area M > Q, but we do not consider M as the contributions of the clutter (Bγ) are eliminated based on the decomposition 93 metric ||A − Us s Vs T ||F /||A||F . Note here that ~ y ∈ Rs×1 and A~ ∈ Rs×1 where s << Q (a factor of 103 reduction in practical applications). The projection described in (4–12b) can be obtained in a computationally efficient way via an eigenvalue decomposition in lieu of an SVD. Note that in (4–10) UA ∈ RQ×L , which makes the SVD decomposition in this equation computationally intensive (O(Q2 L)) in practical scenarios. Consider the following eigenvalue decomposition2 AT A = VVT where V ∈ RL×L and = 2A . Then UA (4–13) = AV−A 1 , where the diagonal matrix −A 1 can be written as: −1 −A 1 = s 0 0 −1 n (4–14) 1 with the diagonal of the sub-matrix − s consisting of the inverse of the dominant singular values of A. Then Us = AVs −s 1 , with Vs corresponding to the s dominant eigenvectors of AT A. The updated data vector in (4–12b) is given as: ~y = UsT y = (−s 1 )Vs T AT y (4–15) whereas the updated steering matrix is given as: A~ = UTs A = −s 1 Vs T AT A = −s 1 Vs T VVT = s Vs T 2 Complexity of eigenvalue decomposition - O(L3 ) 94 (4–16) The updated data model in (4–12b), obtained using Us can be obtained using the decomposition in (4–13). This decomposition can be performed offline as long as prior knowledge of the imaging grid is known, which is typically the case. A different approach for orthogonal projection that improves on computation is described next. Consider the steering matrix G which corresponds to the imaging ROI with a much coarser grid. The matrix G can then be generated by selecting the appropriate columns of A. The matrix G is used to approximate Us . However, unlike Us this matrix is not semi-unitary. The updated data vector is then given as ~y = (GT G)−1 GT y and the updated steering matrix is given as A~ = (GT G)−1 GT A. This new projection approach, skips the computation AT A and its subsequent decomposition, significantly improving computation at cost of less interference suppression.. Based on the updated model in (4–12b), we present below two recently proposed, data-adaptive and iterative approaches for high resolution FLGPR SAR imaging. They are the Sparse Learning via Iterative Minimization (SLIM) [29] and the SParse Iterative Covariance-based Estimation (SPICE) [93], which is equivalent to square-root LASSO with λ = 1 [99]. These two approaches are user-parameter free and are capable of producing sparse and high resolution estimates when only a single data vector is available for imaging; they are described next. 4.5.2 SLIM The SLIM method [29] is a maximum-aposteriori (MAP) approach for sparse signal recovery. The sparse recovery problem is can be solved by optimizing the ℓ1 optimization cost function in (4–17) based on the linear model in (4–12): ^ β = argmin ||~y − A~ β||2 +λ||β||1 (4–17) β The SLIM algorithm can be considered as an ℓq norm approach norm for 0 < q ≤ 1, that considers the following hierarchial Bayesian model (with a sparsity promoting prior) for 95 Table 4-1. SLIM Algorithm Initialization: ^ (0) with the DAS algorithm and σ^(0) Obtain initial estimate β ^ (0) and (4–20) based on β SLIM (nth) Iteration: Repeat the following steps until convergence ^ (n+1) = A~ diag(|β(n)|2−q )A~ T Step 1: Compute: R = A~ P(n) A~ T + σ(n) I. ^ (n) = P(n) A~ T R^ −(n1)~y. Step 2: Compute: β Step 3: Update: σ ^(n) = L1 ||~y − A~ β^ (n)||22 . estimation [29]. ~ β, σI) ~y|β, σ ∼∏ CN (A 2 e− (|β | −1) f (β ) ∝ q f (σ ) ∝ 1 i + σ(n) I (4–18a) q (4–18b) i (4–18c) SLIM estimates the desired sparse vector β, and the noise variance σ, iteratively by minimizing the negative logarithm cost of the posterior density given by: cq (β, σ ) = L log σ + The choice of q 1 ||~y − A~ β||2 + ∑ 2 (|β |q −1) 2 σ i q i (4–19) = 1 simplifies this cost function to the well-known ℓ1 norm constraint for sparse estimation [29]. Minimizing the cost function in (4–19) yields the following estimates: where P ^ β = PAT (A~ PA~ T + σI)−1~y = PA~ T R−1~y σ ^ = L1 ||~y − A~ β||22 (4–20a) (4–20b) = diag(p) and p = |β|2−q . These estimates are obtained in an iterative manner based on the steps in Tab. 4-1. 4.5.3 SPICE The SPICE method for parameter estimation [30, 93] modifies the model in (4–12) as follows: ~y = A~ β + ϵ = Dx 96 (4–21) Table 4-2. CG SPICE Algorithm Obtain initial estimate x ^j(0) Initialization: j = 1, . . . , L + s. = dTj ~y/dTj dj , and pj(0) = |x^j(0) |/ωj SPICE (nth) Iteration: Repeat the following steps until convergence ^ (n) = Ddiag(p(n) )DT = DP(n)DT . Step 1: Compute R 1 y. Step 2: Compute (using CG): s(n) = R− ( n) ~ CG Initialization: s(n)(0) CG Iterations (mth): = 0; r(0) = q(0) = ~y = (r(Tm) r(m) )/(q(Tm) q(m) ) - s(n)(m+1) = s(n)(m) + α(m) q(m) ^ (n)q(m) - r(m+1) = r(m) − α(m) R - q(m+1) = q(m) + (r(Tm+1) r(m+1) )/(r(Tm) r(m) ) Step 3: Update ^ x(n+1) = P(n) DT R−(n1)~y. j = 1, . . . , L + s. Step 4: Update pj (n+1) = x ^j(n+1) /ωj j = 1, . . . , L + s. - α(m) where D = [A~ , I] and x = [βT , ϵT ]T . This method is a covariance fitting approach to parameter estimation that minimizes the following covariance fitting cost function: ||R1/2 (~ y~yT − R)||F (4–22) where R = E{~ y~yT } = DPDT is the covariance matrix of the data and the diagonal matrix P is now given as: Ps 0 P = 0 Pϵ with Ps (4–23) = diag(ps ) and Pϵ = diag(pϵ) being the diagonal matrices containing the power estimates of β and the noise and interference residue ϵ, respectively. The criterion in (4–22) simplifies to the following minimization problem to estimate both x and p = [ps T , pTϵ ]T . {^ x, p^} = argmin x x,p where ωj T −1 P x+ L+s ∑ j =1 ωj pj s.t. Dx = ~y (4–24) = ||dj ||/||~y|| and ||dj || is the jth column of D. The estimates are given as: 97 ^x = PDT R−1~y |βj | pj = , j = 1, . . . , L + s (4–25a) (4–25b) ωj +s and p = {p }L+s . which are solved iteratively till convergence [30]; with x = {xj }Lj =1 j j =1 To improve on the computationally efficiency of the SPICE algorithm for FLGPR SAR imaging, a conjugate gradient based SPICE algorithm (CG-SPICE) is presented in this paper. This approach is similar to the conjugate gradient SLIM algorithm described in [29], [100]. The steps of this CG-SPICE algorithm are described in Tab 4-2. Based on [101], the SPICE optimization problem can be re-written as: L+s ∑ argmin ||x||1 = |xj | s.t. x^ j =1 Dx = ~y For real-valued data, let ~ hj , max(xj , 0) and hj , −min(xj , 0). Note that xj and |xj |= ~ hj (4–26) = ~hj − hj + hj . The optimization problem in Eq. (4–26) can then be augmented to [102]: argmin uT h h^ where h s.t. D h = ~y and h ≥ 0 (4–27) = [~h1 , . . . ~hL+s , h1 . . . hL+s], D = [D, −D] and u = [1, 1 . . . , 1]T . The linear program in (4–27) can be solved efficiently to provide sparse estimates for FLGPR SAR imaging. This approach is on the same order (computationally) as the cyclic optimization approach in Tab 4-2 implemented using the conjugate gradient, and faster (approximately 2 times) without CG based on numerical simulations. Note that the estimates in SLIM and SPICE have the same form with the difference lying in the estimation of the noise and interference residue. SPICE estimates the reflection coefficients and the noise and interference residue simultaneously with the noise and interference residue variance of each elements not necessarily being equal unlike the case in SLIM. However, simulation results show similar performance of the two algorithms with SPICE being less susceptible to the noise and interference residue. This algorithms are robust and effective for generating sparse results. 98 Table 4-3. Subspace approximation No. of singular vectors - s ||A − Us s Vs T ||F /||A||F 15 0.902 100 0.512 300 0.182 Unlike most well-known adaptive algorithms, SLIM can dynamically estimate the user parameter (which in this case, corresponds to the noise power estimate) of the original LASSO cost function (for sparse parameter estimation) [103] which is sensitive to the choice of this parameter (λ). The SPICE criteria can also be reduced to the criteria in (4–17) with λ = 1 [99] (a special case of the square-root lasso [98] which is insensitive to the choice of the user-parameter λ). These robust user parameter free algorithms are applied here to problem of FLGPR SAR imaging. Analysis is performed on both simulated and real experimentally measured SIRE FLGPR data for imaging, and the results are presented in the next section. 4.6 Numerical and Experimental Results In this section, we perform sparse high resolution imaging for FLGPR using orthogonal projection (using Us ) for clutter and data reduction. The SPICE and SLIM algorithms are considered for high resolution imaging. We also analyze the well-known CLEAN approach for imaging based on the proposed data model and show the ability of this well-known algorithm to yield sparse and accurate results. The CLEAN approach, however, is limited in imaging resolution and does not improve resolution over the standard BP algorithm. The coarse grid approach using G is also analyzed for projection. For FLGPR SAR imaging analysis, we use the SIRE radar designed by ARL for imaging based on the setup in Figs. 4-1 and 4-2. For simulations, we consider an imaging area which has a range swath of 4 m and a cross-range swath of 5 m. A 99 0 35 Eigenvalues Threshold = 15 Threshold = 100 Threshold = 300 cross−range (meters) 30 2 Eigenvalues 25 20 15 10 5 0 0 −5 −10 1 −15 0 −20 −25 −1 −30 −35 −2 200 400 600 Eigenvalue number 800 −2 1000 A −1 0 1 downrange (meters) 2 B 0 0 2 −5 −10 1 −15 0 −20 −25 −1 −30 cross−range (meters) cross−range (meters) 2 −35 −2 −2 −1 0 1 downrange (meters) −40 2 −40 C −5 −10 1 −15 0 −20 −25 −1 −30 −35 −2 −2 −1 0 1 downrange (meters) 2 −40 D Figure 4-4. Subspace dimension (s) for high resolution imaging (A) Eigenvalues of AT A, (B) s = 15, (C) s = 100, and (D) s = 300 minimum standoff distance of 8 m is used for simulations, with a maximum standoff distance of 14 m. Simulation was run with three targets placed at various [x, y, z ] locations in meters marked by the symbol ’X’. The imaging area consists of L = 10000 pixels. An analysis of the subspace dimension, (i.e., the number of dominant singular values of A) is performed first. Targets at locations [0,0,0], [-0.3, 1, 0], and [1.5, 1.5, 0] are simulated. Fig. 4-4 shows the SPICE algorithm applied with various thresholds. From Tab. 4-3, thresholds as large 0.5 based on the criterion ||A − Us s Vs T ||F /||A||F 100 0 0 2 cross−range (meters) cross−range (meters) 2 −10 1 0 −20 −1 −30 −2 −2 −10 1 0 −20 −1 −30 −2 −1 0 1 downrange (meters) 2 −40 −2 −1 0 1 downrange (meters) A 2 B 0 0 2 cross−range (meters) cross−range (meters) 2 −10 1 0 −20 −1 −30 −2 −2 −40 −10 1 0 −20 −1 −30 −2 −1 0 1 downrange (meters) 2 −40 −2 −1 0 1 downrange (meters) C 2 −40 D 0 cross−range (meters) 2 −10 1 0 −20 −1 −30 −2 −2 −1 0 1 downrange (meters) 2 −40 E Figure 4-5. FLGPR SAR Imaging - detection of weak target (A) Back-projection, (B) RSM, (C) CLEAN, (D) SLIM, and (E) SPICE (normalized scale of 0 to 1) yield desirable results with all the targets detected as can be seen in Fig. 4-4. The next analysis involves detecting weak targets buried by the sidelobes of much stronger targets. Three targets are again simulated at locations [0,0,0], [-0.1, 0.8, 0], 101 0 0 2 cross−range (meters) cross−range (meters) 2 −10 1 0 −20 −1 −30 −2 −2 −10 1 0 −20 −1 −30 −2 −1 0 1 downrange (meters) 2 −40 −2 −1 0 1 downrange (meters) A 2 B 0 0 2 cross−range (meters) cross−range (meters) 2 −10 1 0 −20 −1 −30 −2 −2 −40 −10 1 0 −20 −1 −30 −2 −1 0 1 downrange (meters) 2 −40 −2 −1 0 1 downrange (meters) C 2 −40 D 0 cross−range (meters) 2 −10 1 0 −20 −1 −30 −2 −2 −1 0 1 downrange (meters) 2 −40 E Figure 4-6. FLGPR Imaging - resolution improvement (A) Back-projection, (B) RSM, (C) CLEAN, (D) SLIM, and (E) SPICE and [1.5, 1.5, 0], with the strong targets 10 times stronger than the weak target as shown in Fig. 4-5. From this figure we can see that the weak target is buried by the sidelobes of the stronger target using the standard backprojection algorithm. The CLEAN approach, which iteratively subtracts out the contributions of the strongest target 102 0 0 2 −10 1 0 −20 −1 −30 cross−range (meters) cross−range (meters) 2 −2 −2 −10 1 0 −20 −1 −30 −2 −1 0 1 downrange (meters) 2 −40 −2 A −1 0 1 downrange (meters) 2 B 0 0 2 −10 1 0 −20 −1 −30 cross−range (meters) cross−range (meters) 2 −2 −2 −40 −10 1 0 −20 −1 −30 −2 −1 0 1 downrange (meters) 2 −40 −2 C −1 0 1 downrange (meters) 2 −40 D Figure 4-7. Orthogonal projection using (A) Us (High SNR), (B) G (High SNR), (C) Us (Low SNR), and (D) G (Low SNR) from the receive measurements, can effectively and accurately detect this weak target. The SPICE and SLIM algorithms are also applied post orthogonal projection (with threshold ||A − Us s Vs T ||F /||A||F = 0.2) and the weak target is revealed. The CLEAN approach, although effective in providing sparse and accurate results, is limited in resolution and has no improvement over the standard backprojection algorithm. The high resolution imaging methods for FLGPR, provide improvement in resolution over the backprojection-based algorithms (BP, RSM and CLEAN). Fig. 4-6 shows three targets at locations [0,0,0], [0, 0.75, 0], and [1.5, 1.5, 0]. Two of these targets are ’closely’ spaced and are clearly resolved by SLIM and SPICE. The SPICE and SLIM algorithms provide almost a factor of 2 improvement in imaging resolution. 103 220 0 220 210 cross−range (meters) cross−range (meters) 210 0 −10 200 190 −20 180 −30 170 160 −10 200 190 −20 180 −30 170 160 −10 0 10 downrange (meters) −40 −10 0 10 downrange (meters) A −40 B 220 0 cross−range (meters) 210 −10 200 190 −20 180 −30 170 160 −10 0 10 downrange (meters) −40 C Figure 4-8. Real data - SIRE FLGPR SAR Imaging: (A) Back-projection, (B) RSM, and (C) SPICE 104 Receiver Operating Characteristics (ROC) Probability of Detection (PD) 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 Backprojection RSM SPICE 0.2 0.1 0 0 1 2 3 4 5 6 7 8 9 10 11 12 13 Number of False Alarms Figure 4-9. Receiver Operating Curve (ROC) comparison: FLGPR SAR Imaging This experiment is repeated using G for orthogonal projection. The results are compared to using the semi-unitary matrix Us for projection and are shown in Fig. 4-9. A gain in computation is achieved at a cost of less interference suppression. The proposed approach is verified using real experimentally measured data. Results based on real SIRE data provided by the Army research lab can be seen in Fig. 4-8. In this figure a subimages 2m in range are continuously formed based on overlapping 2D apertures to generate the entire image [54]. Based on this figure, significant interference reduction can be seen by the high resolution SPICE compared to the standard backprojection algorithm, with some of the targets of interest marked in red oval circles. A quantitative numerical analysis is performed to show the effectiveness of the high-resolution SPICE approach for FLGPR SAR imaging. Several targets with varying strengths are simulated and the receiver operating characteristics curve (ROC) is shown in Fig. 4-9. This curve which shows the probability of detection versus the number of false alarms is generated by using a simple threshold detector. The image under analysis is segmented into regions, and the maximum pixel value in each region is retained. The threshold is incremented in steps and for each threshold, the number 105 of alarms are recorded. The alarms that fall outside the regions with targets present (based on prior knowledge) are considered false alarms. As shown in Fig. 4-9, when there is full detection, the SPICE approach has less false alarms the the BP algorithms. 4.7 Conclusions In this chapter, we have considered new approaches to imaging for forward looking ground penetrating radar. The pre-processing involves a proposition of a data model in the time domain, which takes into account the contributions of clutter outside the imaging area. An orthogonal projection of the measured data to a subspace spanned by the steering matrix corresponding to the imaging ROI is then used for clutter reduction as well as significant data reduction, making it feasible for practical applications of high resolution methods. The steering matrix decomposition is performed efficiently and depends only on the prior knowledge of the desired imaging area and hence can be performed offline. Two recently proposed, data-adaptive approaches, SPICE and SLIM are used for FLGPR SAR imaging. They are user parameter free algorithms and have the ability to provide sparse and high resolution images using a single data vector, unlike other well-known high resolution methods. The results using simulated data show that SLIM and SPICE provide improvement in resolution close to a factor of two compared to the backprojection based algorithms including BP, RSM, CLEAN. A new conjugate gradient based SPICE algorithm is also introduced in this paper for more efficient computations of the estimates. 106 CHAPTER 5 CONCLUDING REMARKS AND FUTURE WORK Due to limitations of data independent methods for spectral estimation, data-adaptive methods are currently being investigated for improved performance. In this dissertation, we focused on efficient and effective applications of data-adaptive methods to real world sensing problems. In Chapter 2, the basic problem of harmonic retrieval is investigated. The problem pertains to digital audio forensics. The contribution we make to this problem involves coming up with a more reliable and accurate way of estimating the network frequency buried in an audio recording using data adaptive techniques. The proposed approach involves spectral analysis using a robust high resolution algorithm and tracking the network frequency via a dynamic programming approach. The approach yields significant improvement in the estimation of the embedded network frequency when this signal is weak compared to the audio recording (a major challenge for this problem). Chapters 3 and 4 are the focus of this dissertation. In these chapters, the Synchronous Impulse Reconstruction (SIRE) radar, which is a remote sensing tool for landmine detection is analyzed and studied. In Chapter 3, we propose a new approach of Radio Frequency Interference suppression for this radar. This new approach can provide an improvement of close to 7 dB in RFI suppression without distorting the desired target signatures. This approach is implemented in an efficient way by exploiting the equivalent sampling technique of this radar. Chapter 4 focuses on sparse high resolution imaging for this SIRE Forward Looking Ground Penetrating radar (FLGPR). In this chapter, we establish a signal model in the time domain since the transmitted impulse is well localized in time. This data model takes into account the contributions of clutter outside the imaging region of interest (ROI). We propose a pre-processing step of orthogonal projection to mitigate the effects of clutter outside the ROI which is present in the collected data. Recently proposed 107 robust, sparse high resolution algorithms for imaging are then applied to provide improved imaging resolution. We achieve close to a factor of 2 improvement in imaging resolution compared to the standard methods currently used for SAR imaging for this radar. Our current and future work focuses on more efficient ways to implement the pre-processing step of orthogonal projection. In lieu of a decomposition of the steering matrix corresponding to imaging ROI and then projection of the data, we propose a direct projection of the data to the steering matrix corresponding to the ROI with a much coarser grid to improve computation. This matrix approximates the set of orthogonal vectors that spans the subspace of the matrix corresponding to the ROI. Results show significant improvement in computation at cost of less interference suppression. 108 REFERENCES [1] S. Kay, Modern Spectral Estimation: Theory and Applications. NJ: Prentice-Hall, 1988. [2] P. Stoica and R. L. Moses, Spectral Analysis of Signals. Prentice-Hall, 2005. [3] S. Marple, Digital Spectral Analysis with Applications. Prentice-Hall, 1987. Englewood Cliffs, Upper Saddle River, NJ: Englewood Cliffs, NJ: [4] P. Stoica and R. L. Moses, Introduction to Spectral Analysis, Upper Saddle River, NJ, 1997, vol. 1. [5] G. M. Jenkins and D. G. Watts, Spectral analysis. Holden-Day, 1968. San Francisco, CA: [6] A. Schuster, “On the investigation of hidden periodicities with application to a supposed 26 day period of meteorological phenomena,” Terrestrial Magnetism, vol. 3, no. 1, pp. 13–41, 1898. [7] J. Allen, “Short term spectral analysis, synthesis, and modification by discrete Fourier transform,” IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 25, no. 3, pp. 235–238, Jun 1977. [8] L. Du, J. Li, P. Stoica, H. Ling, and S. Ram, “Doppler spectrogram analysis of human gait via iterative adaptive approach,” Electronics Letters, vol. 45, no. 3, pp. 186–188, 29 2009. [9] S. Ram and H. Ling, “Analysis of microDopplers from human gait using reassigned joint time-frequency transform,” Electronics Letters, vol. 43, no. 23, November 2007. [10] S. Haykin, Advances in spectrum analysis and array processing Volume 3. Prentice Hall, 1991. [11] C. Jakowatz, D. Wahl, P. Eichel, D. Ghiglia, and P. Thompson, Spotlight-Mode Synthetic Aperture Radar: A signal processing approach. Norwell, MA: Kluwer Academic Publishers, 1996. [12] E. Gudmundson, P. Stoica, J. Li, A. Jakobsson, and M. Rowe, “Spectral estimation of irregularly sampled exponentially decaying signals with applications to RF spectroscopy,” J. Magn. Resonance, vol. 201, no. 1, pp. 167–176, March 2010. [13] B. Edde, RADAR: Principles, Technology and Applications. NJ: Prentice-Hall, 1993. 109 Upper Saddle River, [14] R. Armitage, C. Landis, and R. H. et al, “Power spectral analysis of sleep EEG in twins discordant for chronic fatigue syndrome.” J. Psychosomatic Research, vol. 66, pp. 51–57, 2009. [15] J. Mukai, “Spectral analysis of all-night sleep EEG in narcoleptic patients and normal subjects,” J Sleep Res., pp. 63–71, 2003. [16] C. Grigoras, “Digital audio recording analysis: The electric network frequency criterion,” The International Journal of Speech Language and the Law, vol. 12, no. 1, pp. 63–76, 2005. [17] L. Xu, J. Li, and P. Stoica, “Adaptive Techniques for MIMO Radar,” in Sensor Array and Multichannel Processing, 2006. Fourth IEEE Workshop on, July 2006, pp. 258–262. [18] J. Proakis and D. Manolakis, Digital Signal Processing: Principles, Alogrithms and Applications. Upper Saddle River, NJ: Prentice-Hall, 2007. [19] J. Capon, “High-resolution frequency-wavenumber spectrum analysis,” Proceedings of the IEEE, vol. 57, no. 8, pp. 1408–1418, Aug. 1969. [20] P. P. Stoica, Z. Wang, and J. Li, “Robust Capon beamforming,” IEEE Signal Processing Letters, vol. 10, no. 6, pp. 172–175, June 2003. [21] E. Larsson, P. Stoica, and J. Li, “Spectral estimation via adaptive filterbank methods: a unified analysis and a new algorithm,” Signal Processing, vol. 82, pp. 1991–2001, March 2002. [22] J. Li and P. Stoica, “An adaptive filtering approach to spectral estimation and SAR imaging,” IEEE Transactions on Signal Processing, vol. 44, no. 6, pp. 1469–1484, Jun 1996. [23] P. Stoica, H. Li, and J. Li, “A new derivation of the APES filter,” IEEE Signal Processing Letters, vol. 6, no. 8, pp. 205–206, Aug. 1999. [24] Y. Wang, J. Li, and P. Stoica, Spectral Analysis of Signals: The Missing Data Case. Morgan and Claypool, 2005. [25] T. Yardibi, J. Li, P. Stoica, M. Xue, and A. B. Baggeroer, “Source localization and sensing: A nonparametric iterative adaptive approach based on weighted least squares,” IEEE Transactions on Aerospace and Electronic Systems, vol. 46, no. 1, pp. 425–443, Jan. 2010. [26] W. Roberts, P. Stoica, J. Li, T. Yardibi, and F. Sadjadi, “Iterative Adaptive Approaches to MIMO Radar Imaging,” IEEE Journal of Selected Topics in Signal Processing, vol. 4, no. 1, pp. 5–20, Feb. 2010. 110 [27] P. Stoica, J. Li, and H. He, “Spectral Analysis of Nonuniformly Sampled Data: A New Approach Versus the Periodogram,” IEEE Transactions on Signal Processing, vol. 57, no. 3, pp. 843 –858, March 2009. [28] P. Stoica, J. Li, and J. Ling, “Missing Data Recovery Via a Nonparametric Iterative Adaptive Approach,” Signal Processing Letters, IEEE, vol. 16, no. 4, pp. 241–244, April 2009. [29] T. Xing, W. Roberts, J. Li, and P. Stoica, “Sparse Learning via Iterative Minimization with Application to MIMO Radar Imaging,” IEEE Transactions on Signal Processing, vol. 59, no. 3, pp. 1088–1101, March 2011. [30] P. Stoica, P. Babu, and J. Li, “SPICE: A Sparse Covariance-Based Estimation Method for Array Processing,” IEEE Transactions on Signal Processing, vol. 59, no. 2, pp. 629–638, 2011. [31] J. Li and P. Stoica, “Efficient Mixed-Spectrum Estimation with Applications to Target Feature Extraction,” IEEE Transactions on Signal Processing, vol. 44, no. 2, pp. 281–295, Feb 1996. [32] R. C. Maher, “Audio Forensic Examination - authenticity, enhancemement, and interpretation,” IEEE Signal Processing Magazine, vol. 26, no. 2, pp. 84–94, Mar. 2009. [33] E. B. Brixen, “Techniques for the Authentication of Digital Audio Recordings,” in Audio Engineering Society Convention 122, May 2007. [34] C. Grigoras, “Applications of ENF criterion in forensic audio, video and telecommunications analysis,” Forensic Science International, vol. 167, pp. 136–176, 2007. [35] E. B. Brixen, “ENF - Quantification of the Magnetic Field,” in Proc. AES 33rd Conf., Audio Forensics - Theory and Practice, Denver, CO, Jun. 2008. [36] A. J. Cooper, “The Electric Network Frequency (ENF) as an aid to authenticating forensic digital audio recordings - an automated approach,” in Proc. AES 33rd Conf., Audio Forensics - Theory and Practice, Denver, CO, Jun. 2008, pp. 1–6. [37] U. Jönsson, C. Trygger, and P. Ögren, “Lecture notes on Optimal Control: Optimization and System Theory,” unpublished. [38] D. P. N. Rodrı́guez, J. Apolinário, and L. W. P. Biscainho, “Audio authenticity: Detecting ENF discontinuity with high precision phase analysis,” IEEE Transactions on Information Forensics and Security, vol. 5, no. 3, pp. 534–543, Sept. 2010. [39] M. Kajstura, A. Trawinska, and J. Hebenstreit, “Application of the Electrical Network Frequency (ENF) Criterion - A case of a digital recording,” Forensic Science International, vol. 155, pp. 165–171, 2005. 111 [40] Y. Liu, “A US-Wide Power Systems Frequency Monitoring Network,” in IEEE Power Systems Conference and Exposition, Oct. 29 - Nov. 1 2006, pp. 159–166. [41] N. G. Hingorani, “High-voltage DC Transmission - A power electronics workhorse,” IEEE Spectrum, vol. 33, no. 4, pp. 63–72, Apr. 1996. [42] Liu, Z. Yuan, P. N. Markham, R. Conners, and Y. Liu, “Wide-area frequency as a criterion for digital audio recording authentication,” in IEEE Power and Energy Society General Meeting, Jul. 2011, pp. 1–7. [43] J. O. Smith and X. Serra, “PARSHL An Analysis/Synthesis program for non-Harmonic Sounds Based on Sinusoidal Representation,” in International Computer Music Conference, San Francisco, CA, 2004. [44] G. Glentis and A. Jakobsson, “Time-Recursive IAA Spectral Estimation,” IEEE Signal Processing Letters, vol. 18, no. 2, pp. 111–114, Feb. 2011. [45] W. Roberts, P. Stoica, J. Li, T. Yardibi, and F. Sadjadi, “Iterative Adaptive Approaches to MIMO Radar Imaging,” Selected Topics in Signal Processing, IEEE Journal of, vol. 4, no. 1, pp. 5–20, Feb. 2010. [46] M. Xue, L. Xu, and J. Li, “IAA Spectral Estimation: Fast Implementation Using the Gohberg-Semencul Factorization,” IEEE Transactions on Signal Processing, vol. 59, no. 7, pp. 3251–3261, Jul. 2011. [47] G. Glentis and A. Jakobsson, “Efficient implementation of iterative adaptive approach spectral estimation techniques,” IEEE Transactions on Signal Processing, vol. 59, no. 9, pp. 4154–4167, Sept. 2011. [48] G. O. Glentis and A. Jakobsson, “Superfast Approximative Implementation of the IAA Spectral Estimate,” IEEE Transactions on Signal Processing, vol. 60, no. 1, pp. 472–478, 2012. [49] P. Stoica, J. Li, and H. He, “Spectral Analysis of Nonuniformly Sampled Data: A New Approach Versus the Periodogram,” IEEE Transactions on Signal Processing, vol. 57, no. 3, pp. 843–858, March 2009. [50] M. Huijbregtse and Z. Geradts, “Using the ENF Criterion for Determining the Time of Recording for Short Digital Audio Recordings,” in Proc. of the 3rd International Workshop on Computational Forensics, ser. IWCF ’09, vol. 1, 2009, pp. 116–124. [51] J. Gunarsson and T. McKelvey, “High SNR Performance Analysis of F-ESPRIT,” in Conference Record of the 38th Asilomar Conference on Signals, Systems and Computers, vol. 1, Nov. 2004, pp. 1003–1007. [52] R. Miles, A. Dogariu, and J. Michael, “Bringing bombs to light,” IEEE Spectrum, vol. 49, no. 2, pp. 38–43, February 2012. 112 [53] T. Miller, L. Potter, and J. McCorkle, “RFI Suppression for Ultra Wideband Radar,” IEEE Transactions on Aerospace and Electronic Systems, vol. 33, no. 4, pp. 1142 –1156, Oct. 1997. [54] L. Nguyen, “Signal and Image Processing Algorithms for the U.S. Army Research Laboratory Ultra-wideband (UWB) Synchronous Impulse Reconstruction (SIRE) Radar.” ARL Technical Report, ARL-TR-4784, April 2009. [55] P. Stoica, A. Jakobsson, and J. Li, “Cisoid parameter estimation in the colored noise case: asymptotic Cramer-Rao bound, maximum likelihood, and nonlinear least-squares,” IEEE Transactions on Signal Processing, vol. 45, no. 8, pp. 2048–2059, Aug 1997. [56] C. Li, J. Ling, J. Li, and J. Lin, “Accurate Doppler Radar Noncontact Vital Sign Detection Using the RELAX Algorithm,” IEEE Transactions on Instrumentation and Measurement, vol. 59, no. 3, pp. 687–695, March 2010. [57] J. Li, D. Zheng, and P. Stoica, “Angle and waveform estimation via RELAX,” IEEE Transactions on Aerospace and Electronic Systems, vol. 33, no. 3, pp. 1077–1087, July 1997. [58] L. Nguyen and M. Soumekh, “Suppression of Radio Frequency Interference (RFI) for Synchronous Impulse Reconstruction Ultra-Wideband Radar,” ser. Proc. SPIE, vol. 5808, 2005, pp. 178–184. [59] R. Lord and M. Inggs, “Approaches to RF Interference Suppression for VHF/UHF Synthetic Aperture Radar,” ser. Proc. COMSIG, 1998, pp. 95–100. [60] D. Vu, L. Xu, M. Xue, and J. Li, “Nonparametric Missing Sample Spectral Analysis and Its Applications to Interrupted SAR,” IEEE Journal of Selected Topics in Signal Processing, vol. 6, no. 1, pp. 1–14, Feb 2012. [61] L. Nguyen and R.Innocenti, “Suppression of sidelobes and noise in airborne SAR imagery using the Recursive Sidelobe Minimization technique,” in Radar Conference, 2010 IEEE, May 2010, pp. 522–525. [62] M. Braunstein, J. Ralston, and D. Sparrow, “Signal processing approaches to radio frequency interference (RFI) suppression,” in, Algorithms for Synthetic Aperture Radar Imagery, A. Giglio, eds., Orlando, FL, USA, vol. 2230, pp. 190–208, April 1994. [63] L. Nguyen, M. Ressler, D. Wong, and M. Soumekh, “Adaptive Coherent Suppression of Multiple Wide-Bandwidth RFI Sources in SAR,” ser. Proc. SPIE, vol. 5427, 2004, pp. 1–16. [64] C. Yu, Y.Zhang, Z. Dong, and D. Liang, “Eigen-Decomposition Method for RFI Suppression Applied to SAR Data,” in Multimedia Technology (ICMT), 2010 International Conference on, oct. 2010, pp. 1–4. 113 [65] P. Stoica and Y. Selén, “Model-order selection: a review of information criterion rules,” IEEE Signal Processing Magazine, vol. 21, no. 4, pp. 36–47, July 2004. [66] X. Huang and D. Liang, “Gradual RELAX algorithm for RFI suppression in UWB-SAR,” Electronics Letters, vol. 35, no. 22, pp. 1916–1917, Oct 1999. [67] M. Wax and I. Ziskind, “On unique localization of multiple sources by passive sensor arrays ,” IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 37, no. 7, pp. 996–1000, Jul 1989. [68] A. Nehorai, D. Starer, and P. Stoica, “Direction-of-arrival estimation in applications with multipath and few snapshots,” Circuits, Syst., Signal Process., vol. 10, pp. 327–342, 1991. [69] J. Li, P. Stoica, L. Xu, and W. Roberts, “On Parameter Identifiability of MIMO Radar,” IEEE Signal Processing Letters, vol. 14, no. 12, pp. 968–971, Dec. 2007. [70] S. M. Bird and C. B. Fairweather, “Military fatality rates (by cause) in Afghanistan and Iraq: a measure of hostilities,” vol. 36, no. 4, pp. 841–846, 2007. [71] R. Gunaratna, “Suicide terrorism: a global threat,” Janes Intelligence Review, vol. 12, no. 4, pp. 52–55, 2000. [72] H. Killers, “The global landmine crisis,” in 1998 report to the US Congress, US Department of State, Bureau of Political-Military Affairs, US GPO, 1998. [73] A. Soroush, F. Falahati, M. Zargar, M. Soroush, S. Khateri, A. Khaji et al., “Amputations due to landmine and unexploded ordnances in post-war Iran,” Arch Iran Med, vol. 11, no. 6, pp. 595–7, 2008. [74] R. Miles, A. Dogariu, and J. Michael, “Bringing bombs to light,” IEEE Spectrum, vol. 49, no. 2, pp. 38–43, 2012. [75] B. N. Nelsen, “A forward looking infrared sensor for landmine detection that incorporates a novel method of identifying regions of interest and a fuzzy inference system,” in Fuzzy Information Processing Society, 1999. NAFIPS. 18th International Conference of the North American, 1999, pp. 408–412. [76] J. A. MacDonald and J. Lockwood, Alternatives for landmine detection. corporation, 2003, no. 1608. [77] M. Skolnik, Introduction to radar systems. 2001. RAND New York, NY: McGraw-hill New York, [78] D. J. Daniels, Ground penetrating radar. London, UK: IET, 2004. [79] S. Vitebskiy, L. Carin, M. Ressler, and F. Le, “Ultra-wideband, short-pulse ground-penetrating radar: simulation and measurement,” IEEE Transactions on Geoscience and Remote Sensing, vol. 35, no. 3, pp. 762–772, 1997. 114 [80] L. Peters, J. Daniels, and J. Young, “Ground penetrating radar as a subsurface environmental sensing tool,” Proceedings of the IEEE, vol. 82, no. 12, pp. 1802–1822, 1994. [81] E. Byron, Radar Principles, Technology, Applications. Jersey: PTR Prentice hall, 1993. Englewood Cliffs, New [82] W. G. Carrara, R. S. Goodman, and R. M. Majewski, Spotlight synthetic aperture radar: signal processing algorithms. Norwood, MA: Artech House Inc., 1995. [83] M. Soumekh, Synthetic aperture radar signal processing. Sons, 1999. [84] M. A. Richards, Fundamentals of radar signal processing. Education, 2005. New york, NY: Wiley & Tata McGraw-Hill [85] E. Fishler, A. Haimovich, R. Blum, D. Chizhik, L. Cimini, and R. Valenzuela, “MIMO radar: an idea whose time has come,” in Proceedings of the IEEE, Radar Conference, 2004., April 2004, pp. 71–78. [86] J. Li and P. Stoica, “MIMO Radar with Colocated Antennas,” IEEE Signal Processing Magazine, vol. 24, no. 5, pp. 106–114, Sept. 2007. [87] D. Bliss and K. Forsythe, “Multiple-input multiple-output (MIMO) radar and imaging: degrees of freedom and resolution,” in Conference Record of the Thirty-Seventh Asilomar Conference on Signals, Systems and Computers, 2003., vol. 1, Nov. 2003, pp. 54 – 59. [88] O. Ojowu, Y. Wu, J. Li, and L. Nguyen, “SIRE: a MIMO radar for landmine/IED detection,” in SPIE Defense, Security, and Sensing. International Society for Optics and Photonics, 2013. [89] H. V. Trees, Detection, Estimation, and Modulation Theory, Optimum Array Processing. New York, NY: John Wiley & Sons, 2004. [90] L. A. Gorham and L. J. Moore, “SAR image formation toolbox for MATLAB,” in SPIE Defense, Security, and Sensing. International Society for Optics and Photonics, 2010. [91] J. Högbom, “Aperture synthesis with a non-regular distribution of interferometer baselines,” Astophys, J. Suppl., pp. 417–426, Aug. 1974. [92] E. Karpat, “CLEAN technique to classify and detect objects in subsurface imaging,” International Journal of Antennas and Propagation, 2012. [93] P. Stoica, P. Babu, and J. Li, “New Method of Sparse Parameter Estimation in Separable Models and Its Use for Spectral Analysis of Irregularly Sampled Data,” IEEE Transactions on Signal Processing,, vol. 59, no. 1, pp. 35–47, Jan. 2011. 115 [94] M. Ressler, L. Nguyen, F. Koenig, D. Wong, and G. Smith, “The Army Research Laboratory (ARL) Synchronous Impulse Reconstruction (SIRE) Forward-Looking Radar.” ser. Proc. SPIE, Unmanned Systems Technology IX, vol. 6561, 2007. [95] L. Nguyen and R. Innocenti, “Suppression of sidelobes and noise in airborne SAR imagery using the Recursive Sidelobe Minimization technique,” in Radar Conference, 2010 IEEE, May 2010, pp. 522 –525. [96] J. Ender and J. Klare, “System architectures and algorithms for radar imaging by MIMO-SAR,” in IEEE Radar Conference, May 2009, pp. 1 –6. [97] J. Li and P. Stoica, MIMO Radar Signal Processing. Sons, 2009. Hoboken, NJ: John Wiley & [98] A. Belloni, V. Chernozhukov, and L. Wang, “Square-root lasso: pivotal recovery of sparse signals via conic programming,” Biometrika, vol. 98, no. 4, pp. 791–806, 2011. [99] P. Babu and P. Stoica, “Connection between SPICE and Square-Root LASSO for sparse parameter estimation,” Signal Processing, vol. 95, pp. 10–14, February 2014. [100] J. Ling, J. Li, P. Stoica, and M. Datum, “Probing waveforms and adaptive receivers for active sonar,” The Journal of the Acoustical Society of America, vol. 129, pp. 3640–3651, 2011. [101] P. Babu and P. Stoica, “A combined linear programming-maximum likelihood approach to radial velocity data analysis for extrasolar planet detection,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011, pp. 4352–4355. [102] P. Stoica and P. Babu, “Algebraic Derivation of Elfving Theorem on Optimal Experiment Design and Some Connections With Sparse Estimation,” IEEE Signal Processing Letters, vol. 17, no. 8, pp. 743–745, 2010. [103] R. Tibshirani, “Regression shrinkage and selection via the lasso,” Journal of the Royal Statistical Society. Series B (Methodological), pp. 267–288, 1996. 116 BIOGRAPHICAL SKETCH Ode Ojowu Jr. was born in Zaria, Nigeria. He came to the United States in 2001 to pursue an academic career. He received a Bachelor of Arts in physics from Grinnell College in 2005, as well as a Bachelor of Science and Master of Science in electrical engineering from Washington University in 2007. He is currently with the Spectral Analysis Lab (SAL) supervised by Prof. Jian Li at the University of Florida. He will receive a Doctor of Philosophy in electrical engineering from the University of Florida in the Fall of 2013. His general research interest lies in the field of signals and systems with a focus on data-adaptive spectral estimation techniques, array signal processing and radar signal processing. 117