Advances in Random Matrix Theory: Let there be tools Alan Edelman Brian Sutton, Plamen Koev, Ioana Dumitriu, Raj Rao and others MIT: Dept of Mathematics, Computer Science AI Laboratories World Congress, Bernoulli Society Barcelona, Spain 2/1/2005 Wednesday July 28, 2004 1 Message Ingredient: Take Any important mathematics Then Randomize! This will have many applications! We can’t keep this in the hands of specialists anymore: Need tools! 2/1/2005 2 Tools So many applications … Random matrix theory: catalyst for 21st century special functions, analytical techniques, statistical techniques In addition to mathematics and papers Need tools for the novice! Need tools for the engineers! Need tools for the specialists! 2/1/2005 3 Themes of this talk Tools for general beta What is beta? Think of it as a measure of (inverse) volatility in “classical” random matrices. Tools for complicated derived random matrices Tools for numerical computation and simulation 2/1/2005 4 The Wigner’s Semi-Circle classical & most famous rand eig theorem Let S = random symmetric Gaussian MATLAB: A=randn(n); S=(A+A’)/2; S known as the Gaussian Orthogonal Ensemble Normalized eigenvalue histogram is a semi-circle n=20; %matrix size, samples, sample s=30000; Precised=.05; statements require n→∞ etc.dist e=[]; %gather up eigenvalues im=1; %imaginary(1) or real(0) for i=1:s, a=randn(n)+im*sqrt(-1)*randn(n);a=(a+a')/(2*sqrt(2*n*(im+1))); v=eig(a)'; e=[e v]; end hold off; [m x]=hist(e,-1.5:d:1.5); bar(x,m*pi/(2*d*n*s)); axis('square'); axis([-1.5 1.5 -1 2]); hold on; t=-1:.01:1; plot(t,sqrt(1-t.^2),'r'); 5 sym matrix to tridiagonal form G χ6 χ6 G χ5 χ5 G χ4 χ4 G χ3 Same eigenvalue distribution as GOE: G χ3 χ2 2 O(n) storage !! O(n ) compute χ2 G χ1 χ1 G 2/1/2005 6 General beta G χ6β beta: 1: reals 2: complexes 4: quaternions χ6β G χ5β χ5β GBidiagonal χ4β Version corresponds matrices of Statistics G χ3β χ4βTo Wishart χ3β G χ2β χ2β G χβ χβ G 2/1/2005 7 Tools Motivation: A condition number problem Jack & Hypergeometric of Matrix Argument MOPS: Ioana Dumitriu’s talk The Polynomial Method The tridiagonal numerical 109 trick 2/1/2005 8 Tools Motivation: A condition number problem Jack & Hypergeometric of Matrix Argument MOPS: Ioana Dumitriu’s talk The Polynomial Method The tridiagonal numerical 109 trick 2/1/2005 9 κ(A) Numerical Analysis: Condition Numbers = “condition number of A” If A=UΣV’ is the svd, then κ(A) = σmax/σmin . Alternatively, κ(A) = √λ max (A’A)/√λ min (A’A) One number that measures digits lost in finite precision and general matrix “badness” Small=good Large=bad The 2/1/2005 condition of a random matrix??? 10 Von Neumann & co. Solve Ax=b via x= (A’A) -1A’ b M ≈A-1 Matrix Residual: ||AM-I||2 ||AM-I||2< 200κ2 n ε ≈ How should we estimate κ? Assume, as a model, that the elements of A are independent standard normals! 2/1/2005 11 Von Neumann & co. estimates (1947-1951) “For a ‘random matrix’ of order n the expectation value has been shown to be about X n” ∞ Goldstine, von Neumann “… we choose two different values of κ, namely n and √10n” P(κ<n) ≈0.02 Bargmann, Montgomery, vN P(κ< √10n)≈0.44 “With a probability ~1 … κ < 10n” P(κ<10n) ≈0.80 2/1/2005 Goldstine, von Neumann 12 Random cond numbers, n→∞ Distribution of κ/n 2 x + 4 −2 / x −2 / x 2 y= e 3 x 2/1/2005 Experiment with n=200 13 Finite n n=10 n=25 n=50 n=100 2/1/2005 14 Condition Number Distributions P(κ/n > x) ≈ 2/x P(κ/n2 > x) ≈ 4/x2 Real n x n, nÆ∞ Generalizations: •β: 1=real, 2=complex Complex n x n, nÆ∞ •finite matrices •rectangular: m x n 2/1/2005 15 Condition Number Distributions P(κ/n > x) ≈ 2/x P(κ/n2 > x) ≈ 4/x2 Square, nÆ∞: P(κ/nβ > x) ≈ (2ββ-1/Γ(β))/xβ (All Betas!!) Real n x n, nÆ∞ General Formula: P(κ>x) ~Cµ/x β(n-m+1), where µ= β(n-m+1)/2th moment of the largest eigenvalue of Wm-1,n+1 (β) and C is a known geometrical constant. Complex n x n, nÆ∞ Density for the largest eig of W is known in terms of 1F1((β/2)(n+1), ((β/2)(n+m-1); -(x/2)Im-1) from which µ is available Tracy-Widom law applies probably all beta for large m,n. 2/1/2005shows at least beta=1,2. Johnstone 16 Tools Motivation: A condition number problem Jack & Hypergeometric of Matrix Argument MOPS: Ioana Dumitriu’s talk The Polynomial Method The tridiagonal numerical 109 trick 2/1/2005 17 Multivariate Orthogonal Polynomials & Hypergeometrics of Matrix Argument Ioana Dumitriu’s talk The important special functions of the 21st century Begin with w(x) on I pκ(x)pλ(x) ∆(x)β ∏i w(xi)dxi = δκλ Jack Polynomials orthogonal for w=1 on the unit circle. Analogs of xm ∫ 2/1/2005 18 Multivariate Hypergeometric Functions 2/1/2005 19 Multivariate Hypergeometric Functions 2/1/2005 20 Wishart (Laguerre) Matrices! 2/1/2005 21 Plamen’s clever idea 2/1/2005 22 Tools Motivation: A condition number problem Jack & Hypergeometric of Matrix Argument MOPS: Ioana Dumitriu’s talk The Polynomial Method The tridiagonal numerical 109 trick 2/1/2005 23 Mops (Dumitriu etc.) Symbolic 2/1/2005 24 Symbolic MOPS applications A=randn(n); S=(A+A’)/2; trace(S^4) det(S^3) 2/1/2005 25 Symbolic MOPS applications β=3; hist(eig(S)) 2/1/2005 26 Smallest eigenvalue statistics A=randn(m,n); hist(min(svd(A).^2)) 2/1/2005 27 Tools Motivation: A condition number problem Jack & Hypergeometric of Matrix Argument MOPS: Ioana Dumitriu’s talk The Polynomial Method -- Raj! The tridiagonal numerical 109 trick 2/1/2005 28 RM Tool – Raj! Courtesy of the Polynomial Method 2/1/2005 29 2/1/2005 30 The Riemann Zeta Function 1 ζ( s ) = Γ (s) ∞ ∫ 0 s −1 u du = u e −1 ∞ ∑ k =1 1 ks On the real line with x>1, for example 2 1 1 1 π ζ( 2 ) = 1 + 2 + 2 + 2 + … = 2 3 4 6 May be analytically extended to the complex plane, with singularity only at x=1. 2/1/2005 31 The Riemann Hypothesis ∞ x −1 -1 0 ∞ u 1 1 du = ζ( x ) = ∑ u x ∫ Γ( x ) 0 e − 1 k =1 k -3 -2 ½ 1 2 3 All nontrivial roots of ζ(x) satisfy Re(x)=1/2. (Trivial roots at negative even integers.) 2/1/2005 32 =.5+i* |ζ(x)| along Re(x)=1/2 Zeros 14.134725142 21.022039639 The Riemann Hypothesis 25.010857580 ∞ x −1 ∞ 30.424876126 u 1 1 32.935061588 du = ζ( x ) = ∑ u x ∫ 37.586178159 Γ( x ) 0 e − 1 40.918719012 k =1 k -3 -2 -1 0 43.327073281 48.005150881 49.773832478 ½ 152.970321478 2 3 56.446247697 59.347044003 All nontrivial roots of ζ(x) satisfy Re(x)=1/2. (Trivial roots at negative even integers.) 2/1/2005 33 Computation of Zeros Odlyzko’s fantastic computation of 10^k+1 through 10^k+10,000 for k=12,21,22. See http://www.research.att.com/~amo/zeta_tables/ Spacings behave like the eigenvalues of A=randn(n)+i*randn(n); S=(A+A’)/2; 2/1/2005 34 Nearest Neighbor Spacings & Pairwise Correlation Functions 2/1/2005 35 Painlevé Equations 2/1/2005 36 Spacings Take a large collection of consecutive zeros/eigenvalues. Normalize so that average spacing = 1. Spacing Function = Histogram of consecutive differences (the (k+1)st – the kth) Pairwise Correlation Function = Histogram of all possible differences (the kth – the jth) Conjecture: These functions are the same for random matrices and Riemann zeta 2/1/2005 37 Tools Motivation: A condition number problem Jack & Hypergeometric of Matrix Argument MOPS: Ioana Dumitriu’s talk The Polynomial Method The tridiagonal numerical 109 trick 2/1/2005 38 Everyone’s Favorite Tridiagonal -2 1 1 n2 1 -2 … 1 … … … … 1 1 -2 d2 dx2 2/1/2005 39 Everyone’s Favorite Tridiagonal -2 1 1 n2 1 -2 … G 1 … … … … 1 d2 dx2 2/1/2005 1 +(βn)1/2 G 1 -2 G + dW β1/2 40 Stochastic Operator Limit 2 d − x + 2 dx 2 dW , β ⎛ N(0,2) χ (n −1)β ⎜ ⎜ χ (n −1)β N(0,2) χ (n − 2)β 1 ⎜ β … Hn ~ … ⎜ 2 nβ ⎜ χ 2β ⎜ ⎝ H 2/1/2005 β n ≈ H ∞ n + ⎞ ⎟ ⎟ ⎟ … ⎟, N(0,2) χβ ⎟ ⎟ χβ N(0,2) ⎠ 2 G β n , 41 Largest Eigenvalue Plots 2/1/2005 42 MATLAB beta=1; n=1e9; opts.disp=0;opts.issym=1; alpha=10; k=round(alpha*n^(1/3)); % cutoff parameters d=sqrt(chi2rnd( beta*(n:-1:(n-k-1))))'; H=spdiags( d,1,k,k)+spdiags(randn(k,1),0,k,k); H=(H+H')/sqrt(4*n*beta); eigs(H,1,1,opts) 2/1/2005 43 Tricks to get O(n9) speedup • Sparse matrix storage (Only O(n) storage is used) • Tridiagonal Ensemble Formulas (Any beta is available due to the tridiagonal ensemble) • The Lanczos Algorithm for Eigenvalue Computation ( This allows the computation of the extreme eigenvalue faster than typical general purpose eigensolvers.) • The shift-and-invert accelerator to Lanczos and Arnoldi (Since we know the eigenvalues are near 1, we can accelerate the convergence of the largest eigenvalue) • The ARPACK software package as made available seamlessly in MATLAB (The Arnoldi package contains state of the art data structures and numerical choices.) • The observation that if k = 10n1/3 , then the largest eigenvalue is determined numerically by the top k × k segment of n. (This is an interesting mathematical statement related to the decay of the Airy function.) 2/1/2005 44 Level Densities 2/1/2005 45 Open Problems The distribution for general beta Seems to be governed by a convection-diffusion equation 2/1/2005 46 Random matrix tools! 2/1/2005 47