Sketching and Embedding are Equivalent for Norms Alexandr Andoni (Columbia) Robert Krauthgamer (Weizmann Inst) Ilya Razenshteyn (MIT) 1 Sketching • Compress a massive object to a small sketch • Objects: high-dimensional vectors, matrices, graphs • Similarity search, compressed sensing, numerical linear algebra d • Dimension reduction (Johnson, Lindenstrauss 1984): random projection on a low-dimensional subspace preserves distances n When is sketching possible? 2 Similarity search • Motivation: similarity search • Model similarity as a metric • Sketching may speed-up computation and allow indexing • Interesting metrics: • • • • Euclidean Manhattan, Hamming βπ distances Edit distance, Earth Mover’s Distance etc. 3 Sketching metrics 0 1 1 0 … 1 • Alice and Bob each hold a point from a Alice metric space, x and y π₯ • Both send π -bit sketches to Charlie • For π > 0 and π· > 1 distinguish sketch(π₯) • π(π₯, π¦) ≤ π • π(π₯, π¦) ≥ π·π • Shared randomness, allow 1% probability of error • Trade-off between π and π« Bob π¦ sketch(π¦) Charlie π(π₯, π¦) ≤ π or π(π₯, π¦) ≥ π·π ? 4 Sketches ⇒ Near Neighbor Search • Near Neighbor Search (NNS): • Given π-point dataset π • A query π within π from some data point • Return any data point within π·π from π • Sketches of size π imply NNS with space ππ π and a 1-probe query • Polynomial space whenever π = π(1) 5 Sketching βπ norms • [Kushilevitz-Ostrovsky-Rabani’98]: can sketch Hamming space • [Indyk’00]: can sketch βπ for 0 < π ≤ 2 via random projections using p-stable distributions • For π· = 1 + π one gets π = π(1/π2) • Tight by [Woodruff 2004] • For π > 2 sketching βπ is somewhat hard (Bar-Yossef, Jayram, Kumar, Sivakumar 2002), (Indyk, Woodruff 2005) • To achieve π· = π(1) one needs sketch size to be π = Θ π1−2/π 6 The main question Which metrics can we sketch with constant sketch size and approximation? 7 Beyond βπ norms: embeddings • A map f: X → Y is an embedding with distortion C, if for a, b from X: dX(a, b) / C ≤ dY(f(a), f(b)) ≤ dX(a, b) • Reductions for geometric problems aSketches of size s and approximation D for Y X b f f f(a)s and Sketches of size approximation CDYfor X f(b) 8 Metrics with good sketches: summary • A metric X admits sketches with s, D = O(1), if: • X = βp for p ≤ 2 • X embeds into βp for p ≤ 2 with distortion O(1) • Are there any other metrics with efficient sketches? • We don’t know! 9 The main result If a normed space π admits sketches of size π and approximation π·, then for every ε > 0 the space π embeds into β1−π with distortion π(π π· / π) Embedding into βp, p ≤ 2 d • A normed space: R equipped with a metric (Kushilevitz, Ostrovsky, For norms EMD Examples: Rabani 1998) βπ ’s, matrix norms (spectral, trace), (Indyk 2000) Efficient sketches 10 Application: lower bounds for sketches • Convert non-embeddability into lower bounds for sketches in a black box way No embeddings with distortion O(1) into β1 – ε *in No sketches* of size and approximation O(1) fact, any communication protocols 11 Example 1: the Earth Mover’s Distance • For π₯: Δ × [Δ] → π with zero average, ||π₯||πΈππ· is the cost of the best transportation of the positive part of π₯ to the negative part • Initial motivation for this work • Upper bounds: [Charikar’02, Indyk-Thaper’03, Naor-Schechtman’05, [A.-Do Ba-Indyk-Woodruff’09] • Lower bound also holds for the minimum-cost matching metric on subsets No embedding into β1−π with distortion O(1) [Naor-Schechtman’05] No sketches with D = O(1) and s = O(1) 12 Example 2: the Trace Norm • For an n × n matrix A define the Trace Norm (the Nuclear Norm) βAβ to be the sum of the singular values • Previously: lower bounds only for certain restricted classes of sketches [Li-Nguyen-Woodruff’14] Any embedding into β1 requires distortion Ω π (Pisier 1978) Any sketch must satisfy π π· = Ω π log π 13 The sketch of the proof Good sketches for X Uses that X is a norm Good sketches for β∞(X) β||(π₯1, π₯2, … , π₯π )||= maxi ||π₯π || [A-Jayram-PΔtraΕcu 2010], Direct sum for Information Complexity Absence of certain Poincaré-type inequalities on X Linear embedding of X into β1-ε [Aharoni-Maurey-Mityagin 1985], Fourier analysis π: π → β2 s.t. πΏ ||π₯1 − π₯2 ||π ≤ ||π π₯1 − π π₯2 || ≤ π(||π₯1 − π₯2 ||π ) • πΏ and π are non-decreasing, • πΏ(π‘) > 0 for π‘ > 0 • π(π‘) → 0 as π‘ → 0 Uniform embedding π of X into β2 [Johnson-Randrianarivony 2006], Lipschitz extension ||π₯1 − π₯2 || ≤ 1 ⇒ ||π π₯1 − π π₯2 || ≤ 1 ||π₯1 − π₯2 || ≥ π π· ⇒ ||π π₯1 − π π₯2 || ≥ 10 Weak embedding π of X into β2 Convex duality + compactness 14 Open problems • Can one strengthen our theorem to “sketches with O(1) size and approx. imply embedding into β1 with distortion O(1)”? • Equivalent to an old open problem from Functional Analysis [Kwapien 1969] • Extend to a more general class of metrics (e.g., Edit Distance?) • Other regimes: what about super-constant π , π· ? • Linear sketches with π(π ) measurements and π(π·) approximation? 15