What is Frequency? Norden E. Huang Research Center for Adaptive Data Analysis Center for Dynamical Biomarkers and translational Medicine National Central University Zhongli, Taiwan, ROC The 15th Wu Chien Shiung Science Camp, 2012 Definition of Frequency Given the period of a wave as T ; the frequency is defined as 1 T . T This definition is easy for regular sine wave, but not very practical for complicated oscillations. Impossible for complicated waves Need Decomposition Jean-Baptiste-Joseph Fourier Fourier’s work is a great mathematical poem. Lord Kelvin 1807 “On the Propagation of Heat in Solid Bodies” 1812 Grand Prize of Paris Institute 1817 Elected to Académie des Sciences 1822 Appointed as Secretary of Math Section paper published Ever since Fourier’s ground breaking work, people always think of any changes in terms of waves. Fourier’s work is a great mathematical poem. Lord Kelvin Definitions of Fourier Type Frequency : For any data from linear Processes 1. Fourier Analysis : T F ( ) x( t ) e j i jt dt . 0 2. Wavelet Analysis 3. Wigner Ville Analysis Fourier Analysis Fourier proved that the Fourier expansion in terms of trigonometric function x( t ) a j e i j t j is complete, convergent, orthogonal and unique. Therefore, every signal could be think as combination of sinusoidal waves each with constant amplitude and frequency. Are those frequency meaningful? Definition of Power Spectral Density Since a signal with nonzero average power is not square integrable, the Fourier transforms do not exist in this case. Fortunately, the Wiener-Khinchin Theroem provides a simple alternative. The PSD is the Fourier transform of the auto-correlation function, R(τ), of the signal if the signal is treated as a widesense stationary random process: S( ) R( )e 2 i d S( )d 2 ( t ) Fourier Spectrum Surrogate Signal I. Hello The original data : Hello The surrogate data : Hello The Fourier Spectra : Hello Surrogate Signal II. delta function and white noise Non-causality: Event involves both past and future Random and Delta Functions Fourier Components : Random Function Fourier Components : Delta Function The Importance of Phase Can we just use phase to explore physical processes? Yes, to some extent. After more than 15 years of searching, I found the key to nonlinear and nonstationary data analysis is the proper definition for frequency. Then: In search of frequency, I found the methods to quantify nonlinearity and determine the trend. Instantaneous Frequencies and Trends for Nonstationary Nonlinear Data Hot Topic Conference University of Minnesota, Institute for Mathematics and Its Applications, 2011 How to define frequency? It seems to be trivial. But frequency is an important parameter for us to understand many physical phenomena. Traditional Definition of Frequency • frequency = 1/period. • • • • • Definition too crude Only work for simple sinusoidal waves Does not apply to nonstationary processes Does not work for nonlinear processes Does not satisfy the need for wave equations Instantaneous Frequency Velocity distance ; mean velocity time Newton v Frequency dx dt 1 ; mean frequency period HHT defines the phase function d dt So that both v and can appear in differential equations. Prevailing Views on Instantaneous Frequency The term, Instantaneous Frequency, should be banished forever from the dictionary of the communication engineer. J. Shekel, 1953 The uncertainty principle makes the concept of an Instantaneous Frequency impossible. K. Gröchennig, 2001 The Idea and the need of Instantaneous Frequency According to the classic wave theory, the wave conservation law is based on a gradually changing φ(x,t) such that k , ; t k 0 . t Therefore, both wave number and frequency must have instantaneous values and differentiable. This is the true definition of frequency. Hilbert Transform : Definition For any x( t ) Lp , y( t ) 1 x( ) d , t then, x( t )and y( t ) form the analytic pairs: z( t ) x( t ) i y( t ) a( t ) e i ( t ) , where a( t ) x 2 y 2 1 / 2 and ( t ) tan 1 y( t ) . x( t ) The Traditional use of the Hilbert Transform for Data Analysis failed miserably and gave IF a bad break. Traditional View a la Hahn (1995) : Data LOD Traditional View a la Hahn (1995) : Hilbert Traditional Approach a la Hahn (1995) : Phase Angle Traditional Approach a la Hahn (1995) : Phase Angle Details Traditional Approach a la Hahn (1995) : Frequency Why the traditional approach does not work? Hilbert Transform a cos + b : Data Hilbert Transform a cos + b : Phase Diagram Hilbert Transform a cos + b : Phase Angle Details Hilbert Transform a cos + b : Frequency The Empirical Mode Decomposition Method and Hilbert Spectral Analysis Sifting Empirical Mode Decomposition: Methodology : Test Data Empirical Mode Decomposition: Methodology : data and m1 Empirical Mode Decomposition: Methodology : data & h1 Empirical Mode Decomposition: Methodology : h1 & m2 Empirical Mode Decomposition: Methodology : h3 & m4 Empirical Mode Decomposition: Methodology : h4 & m5 Empirical Mode Decomposition Sifting : to get one IMF component x( t ) m 1 h1 , h1 m 2 h2 , ..... ..... hk 1 m k hk . hk c1 . The Stoppage Criteria The Cauchy type criterion: when SD is small than a preset value, where T SD h t 0 k 1 ( t ) hk ( t ) 2 T 2 h k 1 ( t ) t 0 Or, simply pre-determine the number of iterations. Empirical Mode Decomposition: Methodology : IMF c1 Definition of the Intrinsic Mode Function (IMF): a necessary condition only! Any function having the same numbers of zero cros sin gs and extrema ,and also having symmetric envelopes defined by local max ima and min ima respectively is defined as an Intrinsic Mode Function ( IMF ). All IMF enjoys good Hilbert Transform : c( t ) a( t )e i ( t ) Empirical Mode Decomposition: Methodology : data, r1 and m1 Empirical Mode Decomposition Sifting : to get all the IMF components x( t ) c1 r1 , r1 c2 r2 , . . . rn 1 cn rn . x( t ) n c j 1 j rn . An Example of Sifting & Time-Frequency Analysis Length Of Day Data LOD : IMF Orthogonality Check • Pair-wise % • Overall % • • • • • • • • • • • 0.0003 0.0001 0.0215 0.0117 0.0022 0.0031 0.0026 0.0083 0.0042 0.0369 0.0400 • 0.0452 LOD : Data & c12 LOD : Data & Sum c11-12 LOD : Data & sum c10-12 LOD : Data & c9 - 12 LOD : Data & c8 - 12 LOD : Detailed Data and Sum c8-c12 LOD : Data & c7 - 12 LOD : Detail Data and Sum IMF c7-c12 LOD : Difference Data – sum all IMFs Traditional View a la Hahn (1995) : Hilbert Mean Annual Cycle & Envelope: 9 CEI Cases Uniqueness It should be pointed out that in general x( t ) a( t ) e i j ( t ) j is not unique. But the EMD determines the envelope based on the data to make the expansion unique. Phase Function computation no longer based on Hilbert Transform, but through quadrature to account for the full nonlinear wave form distortion and other complications. Properties of EMD Basis The Adaptive Basis based on and derived from the data by the empirical method satisfy nearly all the traditional requirements for basis empirically and a posteriori: Complete Convergent Orthogonal Unique The combination of Hilbert Spectral Analysis and Empirical Mode Decomposition has been designated by NASA as HHT (HHT vs. FFT) Comparison between FFT and HHT 1. FFT : x( t ) aj e i jt . j 2. HHT : x( t ) a j ( t ) e j i j ( t )d . Comparisons: Fourier, Hilbert & Wavelet Speech Analysis Hello : Data Four comparsions D The original data : Hello Additionally To quantify nonlinearity we also need instantaneous frequency. How to define Nonlinearity? How to quantify it through data alone? The term, ‘Nonlinearity,’ has been loosely used, most of the time, simply as a fig leaf to cover our ignorance. Can we be more precise? How is nonlinearity defined? Based on Linear Algebra: nonlinearity is defined based on input vs. output. But in reality, such an approach is not practical: natural system are not clearly defined; inputs and out puts are hard to ascertain and quantify. Furthermore, without the governing equations, the order of nonlinearity is not known. In the autonomous systems the results could depend on initial conditions rather than the magnitude of the ‘inputs.’ The small parameter criteria could be misleading: sometimes, the smaller the parameter, the more nonlinear. Linear Systems Linear systems satisfy the properties of superposition and scaling. Given two valid inputs to a system H, x1 ( t ) and x2 ( t ) as well as their respective outputs y1 ( t ) H{ x1 ( t )} and y2 (t) = H{ x2 ( t )} then a linear system, H, must satisfy y1 ( t ) y1 ( t ) H{ x1 ( t ) x2 ( t )} for any scalar values α and β. How is nonlinearity defined? Based on Linear Algebra: nonlinearity is defined based on input vs. output. But in reality, such an approach is not practical: natural system are not clearly defined; inputs and out puts are hard to ascertain and quantify. Furthermore, without the governing equations, the order of nonlinearity is not known. In the autonomous systems the results could depend on initial conditions rather than the magnitude of the ‘inputs.’ The small parameter criteria could be misleading: sometimes, the smaller the parameter, the more nonlinear. How should nonlinearity be defined? The alternative is to define nonlinearity based on data characteristics: Intra-wave frequency modulation. Intra-wave frequency modulation is known as the harmonic distortion of the wave forms. But it could be better measured through the deviation of the instantaneous frequency from the mean frequency (based on the zero crossing period). Characteristics of Data from Nonlinear Processes d2 x 3 x x cos t 2 dt d2 x x 2 dt 1 x 2 cos t Spring with position dependent cons tan t , int ra wave frequency mod ulation; therefore , we need ins tan tan eous frequency . Duffing Pendulum x d2x 2 x ( 1 x ) cos t . 2 dt Duffing Equation : Data Hilbert’s View on Nonlinear Data Intra-wave Frequency Modulation A simple mathematical model x( t ) cos t sin 2 t Duffing Type Wave Data: x = cos(wt+0.3 sin2wt) Duffing Type Wave Perturbation Expansion For 1 , we can have x( t ) cos t sin 2 t cos t cos sin 2 t sin t sin sin 2 t cos t sin t sin 2 t .... 1 cos t cos 3 t .... 2 2 This is very similar to the solution of Duffing equation . Duffing Type Wave Wavelet Spectrum Duffing Type Wave Hilbert Spectrum Duffing Type Wave Marginal Spectra Quantify nonlinearity Degree of nonlinearity Let us consider a generalized intra-wave frequency modulation model as: x( t ) cos( t sin t ) IF= d 1 cos t dt IF IFz DN (Degree of Nolinearity ) should be I F z 2 1/ 2 . 2 Depending on the value of , we can have either a up-down symmetric or a asymmetric wave form. Degree of Nonlinearity • DN is determined by the combination of δη precisely with Hilbert Spectral Analysis. Either of them equals zero means linearity. • We can determine δ and η separately: – η can be determined from the instantaneous frequency modulations relative to the mean frequency. – δ can be determined from DN with known η. NB: from any IMF, the value of δη cannot be greater than 1. • The combination of δ and η gives us not only the Degree of Nonlinearity, but also some indications of the basic properties of the controlling Differential Equation, the Order of Nonlinearity. Stokes Models d2x 2 2 x x cos t with ; =0.1. 2 dt 25 Stokes I: is positive ranging from 0.1 to 0.375; beyond 0.375, there is no solution. Stokes II: is negative ranging from 0.1 to 0.391; beyond 0.391, there is no solution. Data and IFs : C1 Stokes Model c1: e=0.375; DN=0.2959 2 1.5 1 Frequency : Hz 0.5 0 -0.5 -1 -1.5 IFq IFz Data IFq-IFz -2 -2.5 -3 0 20 40 60 T ime: Second 80 100 Duffing Models d2x 2 3 ax x cos t with ; =0.1. 2 dt 25 Duffing I : a=1; is positive ranging from 0.1 to any number; there is always solution. Duffing II : a=1; is negative ranging from 0.1 to 0.230; beyond 0.230 there is no solution. Duffing O : a=-1; is positive ranging from 0.1 to any number; there is no solution, but the system would become chaotic. Data and Ifs Details Duffing I : e=0.500 3 2.5 2 Frequency : Hz 1.5 1 0.5 0 -0.5 -1 IFq IFz Data IFq-IFz -1.5 -2 60 65 70 75 80 Time: Second 85 90 95 100 Lorenz Model • Lorenz is highly nonlinear; it is the model equation that initiated chaotic studies. • Again it has three parameters. We decided to fix two and varying only one. • There is no small perturbation parameter. • We will present the results for ρ=28, the classic chaotic case. Phase Diagram for ro=28 Lorenz Phase : ro=28, sig=10, b=8/3 30 20 z 10 0 -10 -20 -30 20 10 50 40 0 30 20 -10 y 10 -20 0 x X-Component DN1=0.5147 CDN=0.5027 Data and IF Lorenz X : ro=28, sig=10, b=8/3 50 IFq IFz Data IFq-IFz Frequency : Hz 40 30 20 10 0 -10 0 5 10 15 T ime : second 20 25 30 Lorenz Model dx y x dt dy x z y dt dz xy z dt with (Prandtl number)=10; =8/3; (Rayleigh number) varying Comparisons Fourier Wavelet Hilbert Basis a priori a priori Adaptive Frequency Integral transform: Global Integral transform: Regional Differentiation: Local Presentation Energy-frequency Energy-timefrequency Energy-timefrequency Nonlinear no no yes, quantifying Non-stationary no yes Yes, quantifying Uncertainty yes yes no Harmonics yes yes no How to define Trend ? Parametric or Non-parametric? Intrinsic vs. extrinsic approach? The State-of-the arts: Trend “One economist’s trend is another economist’s cycle” Watson : Engle, R. F. and Granger, C. W. J. 1991 Long-run Economic Relationships. Cambridge University Press. Philosophical Problem Anticipated 名不正則言不順 言不順則事不成 ——孔夫子 Definition of the Trend Proceeding Royal Society of London, 1998 Proceedings of National Academy of Science, 2007 Within the given data span, the trend is an intrinsically fitted monotonic function, or a function in which there can be at most one extremum. The trend should be an intrinsic and local property of the data; it is determined by the same mechanisms that generate the data. Being local, it has to associate with a local length scale, and be valid only within that length span, and be part of a full wave length. The method determining the trend should be intrinsic. Being intrinsic, the method for defining the trend has to be adaptive. All traditional trend determination methods are extrinsic. Algorithm for Trend • Trend should be defined neither parametrically nor non-parametrically. • It should be the residual obtained by removing cycles of all time scales from the data intrinsically. • Through EMD. Global Temperature Anomaly Annual Data from 1856 to 2003 GSTA IMF Mean of 10 Sifts : CC(1000, I) Statistical Significance Test IPCC Global Mean Temperature Trend Comparison between non-linear rate with multi-rate of IPCC Various Rates of Global Warming 0.025 0.02 Warming Rate: oC/Yr 0.015 0.01 0.005 0 -0.005 -0.01 -0.015 -0.02 1840 1860 1880 1900 1920 1940 Time : Year 1960 1980 2000 2020 Blue shadow and blue line are the warming rate of non-linear trend. Magenta shadow and magenta line are the rate of combination of non-linear trend and AMO-like components. Dashed lines are IPCC rates. GSTA Summary Summary GSTA DN Results : 5=CDN 0.5 SGQ SGH gq gh 0.45 Degree of Nonlinearity 0.4 0.35 0.3 0.25 0.2 0.15 0.1 1 1.5 2 2.5 3 3.5 IMF Number : 5=CDN 4 4.5 5 Extend to N-dimension Wave number in spatial data I. Hurricane The numerically simulated vertical velocity at 500 hPa level of Hurricane Rita of 2005. The warm color represents the upward motion and the cold color downward motion. The unit is m/s. The EEMD components of the vertical velocity. In each panel, the color scales are different, the blue lines corresponding to zeros. The final components of hurricane Andrew. Comparisons Fourier Wavelet Hilbert Basis a priori a priori Adaptive Frequency Integral-Transform: Global Integral-Transform Regional Differentiation: Local Presentation Energy-frequency Energy-timefrequency Energy-timefrequency Nonlinear no no Yes: quantify Non-stationary no yes Yes: quantify Uncertainty yes yes no Harmonics yes yes no Scientific Activities Collecting, analyzing, synthesizing, and theorizing are the core of scientific activities. Theory without data to prove is just hypothesis. Therefore, data analysis is a key link in this continuous loop. Data Processing and Data Analysis • Processing [proces < L. Processus < pp of Procedere = Proceed: pro- forward + cedere, to go] : A particular method of doing something. – Data Processing >>>> Mathematically meaningful parameters • Analysis [Gr. ana, up, throughout + lysis, a loosing] : A separating of any whole into its parts, especially with an examination of the parts to find out their nature, proportion, function, interrelationship etc. – Data Analysis >>>> Physical understandings The Job of a Scientist The job of a scientist is to listen carefully to nature, not to tell nature how to behave. Richard Feynman To listen is to use adaptive methods and let the data sing, and not to force the data to fit preconceived modes. Thanks Current Efforts and Applications • Non-destructive Evaluation for Structural Health Monitoring – (DOT, NSWC, DFRC/NASA, KSC/NASA Shuttle, THSR) • Vibration, speech, and acoustic signal analyses – (FBI, and DARPA) • Earthquake Engineering – (DOT) • Bio-medical applications: dynamical biomarkers – (Harvard, Johns Hopkins, NTU, VHT, …) • Climate changes – (NASA Goddard, NOAA, CCSP) • Cosmological Gravity Wave – (NASA Goddard) • Financial market data analysis – (NCU, AS) • Theoretical foundations – (Princeton University and Caltech) Conclusions • Yes, we can still think of any change in terms of waves, but no the simple harmonic kind. • With EMD, we can define the true frequency, quantify nonlinearity and extract trend from any data. • Among other applications, the degree of nonlinearity could be used to set an objective criterion in health monitoring and in natural phenomena; the trend could be used in financial as well as natural sciences. • These are all possible because of adaptive data analysis method. Data Fourier Spectra : Stations #3,4,5,& 6 Phase Angle Phase Angle Reference to Station # 1 Wave Fusion Station #5 The most precise way is utilizing the phase to define Instantaneous Frequency Hilbert Spectrum : Station #1