arora-slides - Princeton University

advertisement
What have we learnt about graph expansion
in the new millenium?
Sanjeev Arora
Princeton University &
Center for Computational Intractability
Overview
Last millenium:
•
•
•
•
Central role of expansion and expanders
Recognizing expander graphs via eigenvalues (Cheeger71,Alon-Milman85)
O(log n)-approximation via flows (Leighton-Rao88); region-growing technique;
Close connection to metric embeddings; O(log n) approximation for general
sparsest cut via Bourgain’s Embedding Theorem (Linial-London-Rabinovich, Aumann-Rabani)
This millenium (so far):
• O(√log n )-approximation (A., Rao, Vazirani 04) via both SDP and flows;
• Better metric embeddings; O(√log n )-approximation for general sparsest cut
(Chawla-Gupta-Raecke05, A.-Lee-Naor06)
• Inapproximability results via Unique Games Conjecture (CKKRS06; KV06)
• Lowerbounds for metric embeddings (inspired by PCPs) [KV06; others]; lowerbounds
for SDPs;
• Progress in relating full eigenvalue spectrum to (small-set) expansion (A., Barak,
Steurer10)
(Will not talk about: New understanding of expansion in Cayley graphs of groups, new
algebra-free constructions of optimal expanders, etc.)
Graph Expansion
Important concept: derandomization, network routing, coding theory,
Markov chains, differential geometry, group theory
d-regular graph G
d
vertex set S
expansion(S) =
# edges leaving S
d |S|
a-expander: expansion(S) ≥ a for all S (co-NP-hard to recognize)
α2/2 ≤ λ ≤ 2α [Cheeger, Alon 85, Alon-Milman85].
λ= smallest nonzero eigenvalue of Laplacian of G.
(often us
willtorestrict
attention
sets: |S|, |Sc| > W(n))
Allows
recognize
graphswlog
withtoα “balanced”
= Ω(1) (“expander”)
Approximating expansion via flows
[Leighton-Rao’88]
O(log n)-approximation.
Find largest b s.t. we can
simultaneously route b/n units of
flow between every vertex pair.
(“embed a complete graph”)
b / 2 £ a £ O(logn)b
β⁄n units of flow
bet. each vtx pair
Why α ≥ β/2 :
Total flow out of each subset S
is β⁄n × |S| (n - |S|) ≥ β|S|/2
Why α ≤ O(log n) β:
The LP expressing existence of flow
is feasible if graph diameter is O(1/β).
(uses duality theorem)
In a graph with expansion α,
diameter is O(log n/α).
(Region growing argument:
BFS from S one step at a time;
# of edges increases by (1+α) factor each step;
reach >1/2 the edges in O(log n/α) steps.)
Approximating expansion via expander flows
(A.,Rao, Vazirani 2004)
β units of flow
originating at
each vtx
Route a flow with some demand graph W= (wij)
(wij = flow between i and j)
s.t. W is β-regular and has expansion 0.01
(“expander flow”)
Maximise β.
Easy: α ≥ 0.01 β
(Amount of flow leaving each set S is at least 0.01 β |S|.)
Main claim: α≤ O(β √log n)
Next: Geometry of cuts and how efficiently they can be crossed
Geometry of cuts
Cut semimetric
dS(i,j) = 1 if i, j on opposite sides of the cut,
= 0 else.
1
(gives embedding into a line)
0
Convex combination of cut semimetrics
d(i, j) = ΣS αS dS(i, j)
(Gives embedding into l1 : i  vi
|vi – vj|1= fraction of cuts i, j are across)
Approximating expansion via flows (A.,Rao, Vazirani 2004)
β units of flow
originating at
each vtx
LP
Route
formulation:
a flow with demand graph W= (wij)
(wij = flow between i and j)
W is β-regular and has expansion 0.01
Maximise β.
Check by computing
eigenvalue (“separation
oracle”)
Main claim: α≤ O(β √log n)
Duality Thm 
Feasibility follows if for every distribution (αS) on balanced cuts,
there are Ω(n) disjoint vertex pairs (i1, j1), (i2, j2), … s.t.
(a) d(ir, jr) = O(√log n/ α)
1st structure theorem
(b) ir, jr are across Ω(1) fraction of cuts.
Open: Replace √log n with o(√log n )? (Best lowerbound: log log n [DKSV06])
[ARV04] If G is an α-expander then for every distribution (αS)
on balanced cuts, there are Ω(n) disjoint vertex pairs (i1, j1),
(i2, j2), … s.t.
(a) d(ir, jr) = O(√log n/ α)
(b) ir, jr are across Ω(1) fraction of cuts.
Thoughts on
Structure Thm
sink
1
1
Warmup: If max degree= O(1) and given a single
balanced cut, above is true with O(1/α) instead of
O(√log n/ α)
Pf: Max-Flow Min Cut Thm
4/α
1
1
source
(all other edges
capacity 4/α )
α-expansion  Min Cut = Ω(|S|) = Ω(n) =Max-Flow
Total capacity = O(n/α)
Flow decomposition  Ω(n) flowpaths of length O(1/α)
with one endpoint in S and one in Sc
A flow-based O(√log n)-approximation algorithm for expansion
For β = 1/n, 2/n, 4/n,… do
Try to solve above LP to route a β-regular expander flow in G
If succeed, have verified that expansion ≥ 0.01β
If fail, use [ARV] technique to find a cut of expansion < O(β√log n)
(note: before finding this cut had already verified expansion ≥ 0.01 β/2)
(Note: Can be implemented in O(n1.5) time using matrix multiplicative
weight method [Sh09,AK07,AHK05]. Satyen Kale’s talk.)
Suggested research directions
Nothing special about routing an expander flow; could use any graph
family whose expansion can be verified up to O(1) factor. (Suffices to
solve LP.)
Example: Graphs with a few small nonzero eigenvalues (generalizes expanders,
which have no small eigenvalues) [A., Barak, Steurer’10]
Could also try for o(√log n) approximation in subexponential time.
See David Steurer’s talk….
View 2: Use of math programming relaxations
Semidefinite
Linear
Integer program for c-balanced separator (expansion of sets of size ≥ cn)
Recall:
[LR88]; O(log n)-approximation
Cut semimetric
dS(i,j) = 1 if i, j on opposite sides of the cut,
= 0 else.
[ARV04]: O(√log n) –approximation.
(Main obstacle: understanding vectors satisfying triangle inequality condition).
How to round the SDP: 2nd Structure Theorem
v1, v2, v3, … : unit vectors in Rn, s.t.
avg |vi –vj|2 = Ω(1) (“well-spread”)
|vi –vj|2 + |vj-vk|2 ≥ |vi –vk|2
(l22 property; angle subtended by any two points
on the third is nonobtuse; includes l1 as subcase)
THM: For Δ = Ω(1/√log n) there exist sets S, T of size Ω(n) s.t.
|vi –vj|2 ≥ Δ for all i in S, j in T
Δ
(S, T are Δ-separated)
NB: Implies weak version of 1st Structure Thm: Maxflow Mincut applied to S, T
yields Ω(n) paths of length O(1/α) that cross Ω(1/√log n) fraction of cuts.
Rounding the SDP
S, T: Δ-separated sets of size Ω(n)
Do BFS wrt distance function d(i,j) = |vi –vj|2 ,
starting from S and going until you hit T
Output the level of the BFS tree with least expansion.
Claim: This gives a balanced cut (O, Oc)
s.t. |E(O, Oc)| ≤ SDPOPT /Δ = O(√log n) SDPOPT
d(S, i)
d(S, j)
S
vj
vi
d(S, j) – d(S, i) ≤ |vi –vj|2
Edge (i,j) contributes to cut for ≤ |vi –vj|2 levels,
and each level cuts at least |E(O, Oc)| edges.
O(√log n)-approximation for other cut-like problems
•
MIN-2-CNF deletion and several graph deletion problems. [Agarwal, Charikar,
Makarychev, Makarychev04]
[’04]. Weighted version of S
•
MIN-LINEAR ARRANGEMENT [Charikar, Karloff, Rao’05]
•
General SPARSEST CUT [Chawla-Gupta-Raecke05, A. Lee Naor’06] 0
•
Min-ratio VERTEX SEPARATORS and Balanced VERTEX SEPARATORS[Feige, Hajiaghayi, Lee’04]
•
•
All
Method: SDP rounding using a generalized structure theorem…
Suggested future direction
SDP with triangle inequality corresponds to level 2 of
Lasserre, Lovasz-Schrijver, etc. (see Madhur Tulsiani’s talk)
Use more powerful SDP relaxations from higher levels?
* May need to allow superpolynomial time (rth level  nr time)
* Not currently ruled out under UGC.
Cut problems and embeddings
General Sparsest Cut: Cost matrix (cij) cij ≥0;
Demand matrix (dij) dij ≥ 0;
Find
SDP relaxation:
[LLR94,AR94]: Integrality gap
= Minimum distortion incurred
when embedding l22 metrics into
l1 (= convex combination
of cut semi-metrics)
Geometric Embeddings of Metric Spaces
Finite metric space (X, d)
Geometric space, eg l1
f
f(x)
x
d(x,y)
y
f(y)
Distortion of f : Minimum C s.t. d(x, y) ≤ |f(x) –f(y)| ≤ C d(x,y)
[Bourgain’85, LLR94]: Distortion O(log n) into l1, l2
What if X is itself geometric?
[Chawla-Gupta-Raecke05, A.-Lee-Naor06]: Distortion O(√log n  log log n) for
embedding l22 into l1; and embedding l1 into l2
Embedding theorems in one slide
Tool 1: Padded decompositions [Krauthgamer,Lee, Mendel,Naor04]
Scale S, padding parameter p:
Metric
space
(X, d)
Line embedding
Partition probabilistically into pieces
of diameter ≤ S, s.t. for all x
Pr[x’s partition contains Ball(x, S/p)] ≥ ½
x

0
Map each block to 0 with probability ¼;
Map x to d(x, zero-block)
Tool 2: Use ARV structure theorem to produce padded decompositions
at different scales; combine line embeddings into a single embedding using
“measured descent.”
Proving lowerbounds on distortion
• [Khot-Vishnoi05] log log n lowerbound; construction inspired by PCPs
(hypercontractivity of noisy hypercubes)
• [Lee-Naor],[Cheeger,Kleiner,Naor] (log n)ε lowerbound; construction
based upon Heisenberg group; new notion of differentiation
• [Lee-Muharrami] √log n lowerbound; only for embedding
weak l22 spaces into l1. Elementary construction and analysis.
Open: √log n lowerbound for l22 spaces;
(log n)ε lowerbound for SDP integrality gap of uniform sparsest cut
(ie edge expansion).
Proof of Structure Theorems
Recall:
v1, v2, v3, … : unit vectors in Rd, s.t.
avg |vi –vj|2 = Ω(1) (“well-spread”)
|vi –vj|2 + |vj-vk|2 ≥ |vi –vk|2
For Δ = Ω(1/√log n) there exist sets S, T of size
Ω(n) s.t.
|vi –vj|2 ≥ Δ for all i in S, j in T
(S, T are Δ-separated)
Δ
(Recall: 1st Structure Theorem concerned distributions of cuts,
which correspond to l1 metrics, which are a subcase of l22)
Algorithm to produce two Δ-separated sets (Δ= 1/√log n)
0.01/√d
Easy: Su and Tu are likely to have size Ω(n).
Tu
Delete any vi in Su and vj in Tu s.t. |vi – vj|2 < Δ
(repeat until no such pair remains)
Su
u
If Su and Tu still have size Ω(n) output them.
Main difficulty: Show that whp only o(n) points get deleted.
Obs: Deleted pairs were “stretched”, i.e.,
|vi – vj|2 < Δ,
|<vi – vj, u>| > 0.01/√d
Fact: Pr[|<vi – vj, u>| > 0.01 √Δ/√d]
= exp(-1/Δ) = exp(-√log n). Too large for union bound
Walks in l22 space
|vi –vj|2 + |vj-vk|2 ≥ |vi –vk|2
Δ
Δ
Δ
Δ
r steps of squared-length Δ only take you
a total squared distance rΔ (i.e., distance √r Δ)
Main proof step: Use measure concentration to prove that for most directions
u there is a walk of length r on stretched edges (v1, v2), (v2, v3),.. (vr, vr+1)
so that |<v1 – vr+1, u>| > 0.001 r/√d
Pr[such v1 , vr+1 exist in the point set] < exp(- r/Δ) < 1/n2
Unique Games Conjecture
[Khot03] Given m equations in n variables x1, x2, …, xn of the type
axi + b Xj = a (mod 113)
s.t. (1-ε) fraction are simultaneously satisfiable, it is NP-hard to satisfy
½ of them simultaneously.
Used to prove best inapproximability results for host of problems, including
expansion problems. Inspired SDP integrality gaps (aka embedding lowerbounds).
(See Khot’s talk)
(Expansion strikes back) The Achilles heel of UGC appears to be expansion.
Better understanding of small-set expansion may disprove UGC.
(see Steurer’s talk)
Looking forward to more insight in the next decade!
Thank you!
Download