Lecture 4

advertisement
Quantum Computing
(Fall 2013)
Instructor: Shengyu Zhang.
Lecture 4 Hidden Subgroup Problem 2: Fourier sampling and
query-efficient algorithms.
1. HSP: What and why.
Suppose that 𝐺 is a group and 𝐻 is a subgroup. Recall that cosets of 𝐻 partition the group
𝐺. A function 𝑓: 𝐺 → 𝑆 hides a subgroup 𝐻 ≤ 𝐺 if 𝑓(π‘₯) = 𝑓(𝑦) for π‘₯ and 𝑦 in the
same coset and 𝑓(π‘₯) ≠ 𝑓(𝑦) for π‘₯ and 𝑦 in different cosets.
Definition. Given a black box function 𝑓: 𝐺 → 𝑆 (for some finite set 𝑆) that hides a
subgroup 𝐻 ≤ 𝐺, the Hidden Subgroup Problem (HSP) asks to find 𝐻.
Two important special cases: symmetric group and dihedral group.
Symmetric group 𝑆𝑛 : the set of permutations on 𝑛 elements. An efficient algorithm for HSP
for symmetric group can be used to solve Graph Isomorphism in π‘π‘œπ‘™π‘¦(𝑛) time.
Graph Isomorphism (GI): Given two 𝑛-node graphs 𝐺1 and 𝐺2 , decide whether they are
isomorphic, i.e. whether there exists a permutation πœ‹ ∈ 𝑆𝑛 s.t. πœ‹(𝐺1 ) = 𝐺2 .
Here for a graph 𝐺 = (𝑉, 𝐸), πœ‹(𝐺) = (𝑉, πœ‹(𝐸)) where πœ‹(𝐸) = {(πœ‹(𝑖), πœ‹(𝑗)) ∢ (𝑖, 𝑗) ∈ 𝐸}.
Thus the two graphs are isomorphic if the following holds: (𝑖, 𝑗) ∈ 𝐸1 iff (πœ‹(𝑖), πœ‹(𝑗)) ∈ 𝐸2.
Graph Isomorphism is known to be equivalent to the following variants: Graph Isomorphism
Finding (given two isomorphic graphs, find an isomorphism), Graph Isomorphism Counting
(given two isomorphic graphs, count the number of isomorphisms), Graph Automorphism
Finding (given a graph 𝐺, find its automorphism group 𝐴𝑒𝑑(𝐺) ≝ {πœ‹: πœ‹(𝐺) = 𝐺}). The
decision version of Graph Automorphism (just to decide whether 𝐴𝑒𝑑(𝐺) = {𝟏}) is a clearly
no harder, and possibly easier task.
Graph Isomorphism is a fundamental problem that appears in many disciplines, and it’s one
of the few problems whose complexity isn’t pinned down: It’s not known to be in P, but it’s
also not known to be NP-complete either. Actually, most people believe that it is not
NP-complete. It may be well in P and we just haven’t found an algorithm yet.
Given the equivalence between GI and Graph Automorphism Finding (GA), we can see why
GI reduces to HSP for symmetric group. First reduce GI to GA. Now for GA, given a graph
𝐺, define 𝑓 on 𝑆𝑛 by 𝑓(πœ‹) = πœ‹(𝐺). We claim that 𝑓 hides 𝐴𝑒𝑑(𝐺). Actually,
𝑓(πœ‹) = 𝑓(𝜎) ⇔ πœ‹(𝐺) = 𝜎(𝐺) ⇔ 𝜎 −1 πœ‹(𝐺) = 𝐺
⇔ 𝜎 −1 πœ‹ ∈ 𝐴𝑒𝑑(𝐺) ⇔ πœŽπ΄π‘’π‘‘(𝐺) = πœ‹π΄π‘’π‘‘(𝐺)
The second major application of HSP for non-Abelian groups is the (Approximate) Shortest
Vector problem, which reduces to HSP for dihedral group. We’ll define (Approximate)
Shortest Vector problem and dihedral group next.
Consider ℝ𝑛 and a set of basis, namely 𝑛 linearly independent vectors in it. An
𝑛-dimensional lattice is the set of all integer linear combinations of 𝑛 basis vectors. In the
Shortest Vector problem, we want to find a shortest (but nonzero) vector in the lattice. That is,
we want to use integer linear combination of basis vectors to get a vector that is very close to
the origin. This problem itself is NP-hard, and we consider a relaxed version, that the input is
promised to have only one (up to its sign) vector that achieves the minimum length; all the
other vectors are at least 𝑔(𝑛) times longer. Of course, the larger 𝑔(𝑛) is, the stronger the
promise and thus the easier the problem. It is known that if 𝑔(𝑛) = 𝑂(1) then the problem
is still NP-hard, and if 𝑔(𝑛) = (1 + πœ–)Ω(𝑛) , then the problem becomes in P. The question is
what the complexity is for 𝑔(𝑛) = π‘π‘œπ‘™π‘¦(𝑛). It is conjectured to be hard, and actually
cryptosystems are built based on this computational assumption. What can a quantum
computer do? If HSP for dihedral group is easy, then there is an efficient quantum algorithm
to solve the (Approximate) Shortest Vector problem. The reduction is nontrivial, and we
won’t do it here. (It’s actually one of the reading projects.) We’ll just introduce what a
dihedral group is.
Consider a regular 𝑛-gon on a 2D plane. A symmetry is a rotation or a reflection which
keeps the 𝑛-gon unchanged. A regular 𝑛-gon has 2𝑛 symmetries: 𝑛 rotational ones and 𝑛
reflection ones, the collection of which form the dihedral group 𝐷2𝑛 . So
𝐷2𝑛 = ⟨ π‘Ÿ, 𝑠 ∣ π‘Ÿ 𝑛 = 1, 𝑠 2 = 1, 𝑠 −1 π‘Ÿπ‘  = π‘Ÿ −1 ⟩
(𝐷2𝑛 is generated by elements π‘Ÿ and 𝑠 satisfying the relations π‘Ÿ 𝑛 = 1, 𝑠 2 = 1, 𝑠 −1 π‘Ÿπ‘  =
π‘Ÿ −1.)
2. Detour: density matrices.
Mixed states: Pure states with probabilities.
ο‚Ÿ
|πœ“⟩ → |πœ“⟩⟨πœ“|, rank-1 positive semi-definite (psd) matrix
ο‚Ÿ
{|πœ“π‘– ⟩, 𝑝𝑖 } → ∑𝑖 𝑝𝑖 |πœ“π‘– ⟩⟨πœ“π‘– |, trace-1 psd matrices.
Fact: Any trace-1 psd matrix also corresponds to an ensemble {|πœ“π‘– ⟩, 𝑝𝑖 } of quantum pure states.
Proof. Do the spectral decomposition and use the fact that the trace is 1 to conclude that the sum
of eigenvalues is 1. β–‘
ο‚Ÿ
Unitary transform: 𝜌 → π‘ˆπœŒπ‘ˆ † .
ο‚Ÿ
Measurement: {𝑀𝑖 } with ∑𝑖 𝑀𝑖† 𝑀𝑖 = 𝐼. Then Pr[𝑖] = π‘‘π‘Ÿ(𝑀𝑖 πœŒπ‘€π‘–† ), and the post-measurement
state is 𝑀𝑖 πœŒπ‘€π‘–† /π‘‘π‘Ÿ(𝑀𝑖 πœŒπ‘€π‘–† ).
Why use density matrices: Because only those matter---The exact ensemble of pure states
doesn’t since we can’t distinguish different ensembles with the same density matrix.
ο‚Ÿ
Composition: 𝜌1 ⊗ 𝜌2 .
3. Standard approach, weak and strong Fourier sampling
Standard approach:
1
√|𝐺|
π‘žπ‘’π‘’π‘Ÿπ‘¦ 𝑓
→
∑π‘₯∈𝐺 |π‘₯⟩
1
√|𝐺|
∑π‘₯∈𝐺 |π‘₯⟩|𝑓(π‘₯)⟩
π‘šπ‘’π‘Žπ‘ π‘’π‘Ÿπ‘’ 𝑓(π‘₯)
→
1
√|𝐻|
∑β„Ž∈𝐻 |π‘₯β„Ž⟩ for a random π‘₯ ∈ 𝐺
Writing this in the density matrix form, we have the following
𝜌=
1
1
∑ |π‘₯β„Ž⟩⟨π‘₯β„Ž′ | =
∑ |π‘₯β„Ž⟩⟨π‘₯β„Ž′ |
|𝐺||𝐻|
|𝐺|
π‘₯∈𝐺
β„Ž,β„Ž′ ∈𝐻
where 𝑇𝐻 is a set of representatives.
Weak Fourier sampling:
π‘₯∈𝑇𝐻
β„Ž,β„Ž′ ∈𝐻
𝜌=
1
1
∑ 𝑅(β„Ž)|π‘₯⟩⟨π‘₯|𝑅(β„Ž′ )† =
∑ 𝑅(β„Ž) (∑|π‘₯⟩⟨π‘₯|) 𝑅(β„Ž′ )†
|𝐺||𝐻|
|𝐺||𝐻| ′
π‘₯∈𝐺
β„Ž,β„Ž′ ∈𝐻
=
1
1
1
∑ 𝑅(β„Ž)𝑅(β„Ž′ )† =
∑ 𝑅(β„Žβ„Ž′−1 ) =
∑ 𝑅(β„Ž)
|𝐺||𝐻| ′
|𝐺||𝐻| ′
|𝐺|
β„Ž,β„Ž ∈𝐻
𝐹𝐺
→
π‘₯∈𝐺
β„Ž,β„Ž ∈𝐻
β„Ž∈𝐻
β„Ž,β„Ž ∈𝐻
1
1
∑⊕𝜌∈𝐺̂ (πΌπ‘‘πœŒ ⊗ 𝜌(β„Ž)∗ ) =
⊕ Μ‚ (𝐼 ⊗ ∑ 𝜌(β„Ž)∗ )
|𝐺|
|𝐺| 𝜌∈𝐺 π‘‘πœŒ
β„Ž∈𝐻
π‘šπ‘’π‘Žπ‘ π‘’π‘Ÿπ‘’ 𝜌
→
β„Ž∈𝐻
observe 𝜌 w.p.
1
π‘‘π‘Ÿ (πΌπ‘‘πœŒ
|𝐺|
π‘‘πœŒ
⊗ ∑β„Ž∈𝐻 𝜌(β„Ž)∗ ) = |𝐺| ∑β„Ž∈𝐻 πœ’πœŒ (β„Ž)∗
Question: Does π‘π‘œπ‘™π‘¦ log |𝐺| samples of 𝜌 contain enough info to determine 𝐻?
For Abelian groups: Yes, as previously discussed.
Another class of subgroups that weak Fourier sampling suffices: normal subgroups.
Definition. A subgroup 𝐻 ≤ 𝐺 is normal if 𝑔𝐻 = 𝐻𝑔, for all 𝑔 ∈ 𝐺.
So if 𝐻 is normal subgroup, then for any 𝑔 ∈ 𝐺, β„Žπ‘” = π‘”β„Ž′ for some β„Ž′ ∈ 𝐻. And if β„Ž
runs over 𝐻, then so does β„Ž′.
Theorem. If 𝐻 is a normal subgroup of 𝐺, then the weak Fourier sampling gives 𝜌 with
probability π‘‘πœŒ2 |𝐻|/|𝐺| if 𝐻 ⊆ ker(𝜌), and 0 otherwise.
Proof. If 𝐻 ⊆ ker(𝜌), then one observes 𝜌 w.p.
π‘‘πœŒ
𝑑
𝜌
∑
πœ’ (β„Ž)∗ = |𝐺| ∑β„Ž∈𝐻 π‘‘πœŒ =
|𝐺| β„Ž∈𝐻 𝜌
2 |𝐻|
π‘‘πœŒ
|𝐺|
.
If 𝐻 is not contained in ker(𝜌), then ∃β„Ž∗ ∈ 𝐻 s.t. 𝜌(β„Ž∗ ) ≠ 𝐼. Now note that
(∑ 𝜌(β„Ž)) 𝜌(𝑔) = ∑ 𝜌(β„Ž)𝜌(𝑔) = ∑ 𝜌(β„Žπ‘”) = ∑ 𝜌(π‘”β„Ž′) = 𝜌(𝑔) (∑ 𝜌(β„Ž)) ,
β„Ž∈𝐻
β„Ž∈𝐻
β„Ž∈𝐻
β„Ž′ ∈𝐻
thus by Schur’s lemma, ∑β„Ž∈𝐻 𝜌(β„Ž) = πœ†πΌ for some πœ† ∈ ℝ. But
𝜌(β„Ž∗ ) (∑ 𝜌(β„Ž)) = ∑ 𝜌(β„Ž∗ β„Ž) = (∑ 𝜌(β„Ž)),
β„Ž∈𝐻
β„Ž∈𝐻
Thus ∑β„Ž∈𝐻 𝜌(β„Ž) = 0. So one observes 𝜌 w.p. 0.
β„Ž∈𝐻
β–‘
β„Ž∈𝐻
The algorithm for HSP for normal subgroup is very similar to that for Abelian HSP.
ο‚Ÿ
Use weak Fourier sampling to get 𝜌1 , 𝜌2 , … , πœŒπ‘‡ for 𝑇 = 𝑂(log|𝐺|).
ο‚Ÿ
Output 𝐾𝑇 =∩𝑇𝑑=1 ker(πœŒπ‘‘ )
Theorem. 𝐾𝑇 = 𝐻 with high probability.
1
Proof. If ker(𝐾𝑑 ) ≠ 𝐻, we claim that Pr[𝐾𝑑 ⊆ ker(πœŒπ‘‘+1 )] ≤ 2. Indeed,
Pr[𝐾𝑑 ⊆ ker(πœŒπ‘‘+1 )] =
∑
𝜌:𝐾𝑑 ⊆ker(𝜌)
π‘‘πœŒ2 |𝐻| |𝐺| |𝐻| |𝐻| 1
=
=
≤ .
|𝐺 |
|𝐾𝑑 | |𝐺 | |𝐾𝑑 | 2
|𝐺|
Here we used a fact that for any normal subgroup 𝑁 of 𝐺, ∑𝜌∈𝐺̂:𝑁⊆ker(𝜌) π‘‘πœŒ2 = |𝑁|. β–‘
Strong Fourier sampling:
Weak Fourier sampling fails to provide sufficient information for HSP for 𝑆𝑛 and 𝐷2𝑛 . So
people resort to strong Fourier sampling, which uses the remaining state 𝜌(𝐻) =
1
∑
𝜌(β„Ž).
|𝐻| β„Ž∈𝐻
Question: What basis to use to measure this state?
It turns out that even random basis can already solve some cases, such as HSP for Heisenberg
Group
1
{(𝑏
π‘Ž
0 0
1 0) : π‘Ž, 𝑏, 𝑐 ∈ 𝔽𝑝 }
𝑐 1
However, for symmetric group, even strong Fourier sampling fails. Multi-register
measurement is needed.
4. Query efficient algorithm
When multi-register measurement is used, we can solve HSP using only π‘π‘œπ‘™π‘¦ log |𝐺|
queries to 𝑓, though the computational time is still exponential.
1
Recall that the standard approach gives 𝜌𝐻 = |𝐺||𝐻| ∑
π‘₯∈𝐺 |π‘₯β„Ž⟩⟨π‘₯β„Ž
β„Ž,β„Ž′ ∈𝐻
′
|. Our task is just to
identify 𝐻 from a collection {𝜌𝐻 : 𝐻 ≤ 𝐺}. In general, mixed state identification is known to
have the following bound.
Theorem. There exists a quantum measurement that identifies {πœŒπ‘– : 𝑖 = 1, … , 𝑁} with
probability at least 1 − 𝑁√max 𝐹(πœŒπ‘– , πœŒπ‘— ).
𝑖≠𝑗
Here 𝐹(𝜌, 𝜎) = π‘‘π‘Ÿ √√𝜌𝜎√𝜌 = β€–√𝜌√πœŽβ€–
π‘‘π‘Ÿ
is a measure of how close two mixed states 𝜌
and 𝜎 are. The 𝐹 value is always between 0 and 1, and the closer to 1, then closer the two
states. So to identify 𝜌𝐻 (namely to identify 𝐻), it is enough if 𝜌𝐻 ’s for different 𝐻 are far
from each other. It turns out that this is more or less true.
Theorem. 𝐹(𝜌𝐻 , 𝜌𝐻 ′ ) ≤ 1/√2.
We will not prove this result, but only explain how to use it. It is not hard to verify that
𝐹(𝜌⊗𝑛 , 𝜎 ⊗𝑛 ) = 𝐹(𝜌, 𝜎)𝑛 .
By the above results, we know that if we want the error probability to be πœ–, then we need to
use 𝐾 copies of 𝜌_𝐻, satisfying
𝑁√max 𝐹(πœŒπ‘– , πœŒπ‘— )𝐾 ≤ πœ–.
𝑖≠𝑗
𝑁
Solving this gives 𝐾 ≥ 𝑂 (log πœ– ). How large is 𝑁? It’s just the number of subgroups.
2 |𝐺|
Fact. Any group 𝐺 has at most 2log
subgroups.
Proof. Each subgroup has a generating set of size at most log |𝐺|, because adding one more
generator at least double the size of the subgroup. Therefore, the number of subgroups is at
most (
|𝐺|
2
) ≤ 2log |𝐺| .
log|𝐺|
β–‘
Reference
[CvD10] Andrew M. Childs and Wim van Dam, Quantum algorithms for algebraic problems, Reviews
of Modern Physics, Volume 82, January–March 2010.
[Lom04] Chris Lomont, The hidden subgroup problem - review and open problems,
arXiv:quant-ph/0411037v1
Download