thomas - William Stein

advertisement
Alex Thomas
12 March 2010
Professor William Stein
Math 414
Twelve Steps:
The Proofs for the Prime Number Theorem and Prime Number
Theorem in Arithmetic Progression by Newman(via Zagier) and
Soprounov Respectively
In this project I will explain the proof of the Prime Number Theorem(PNT) by D.J.
Newman, as described by D. Zagier, and the proof by Ivan Soprounov of prove the Prime
Number Theorem in Arithmetic Progression(PNTAP). Once I have explained the
asymptotic estimates of the prime counting function, and the prime counting function for
primes congruent to some number modulo another, I will test the latter estimate by means
of a program I have written in Sage.
We will begin with Newman’s proof for the PNT
Newman’s proof of the PNT begins with establishing Euler’s product formula. First,
let us show the Riemann zeta function. This function was first examined by Euler in a proof
of Euclid’s theorem, as well as by Dirichlet, but Riemann was the first to study in-depth as a
function of a complex variable1, hence its name.
Definition 1: the Riemann zeta function
∞
1
𝜁(𝑠) ≝ ∑ 𝑠 , 𝑠 ∈ β„‚
𝑛
𝑛=1
Let us now examine the expanded series.
Eq. 1
1
1
1
1
1
𝜁(𝑠) = ∑ 𝑠 = 1 + 𝑠 + 𝑠 + 𝑠 + 𝑠 + β‹―
𝑛
2
3
4
5
∞
𝑛=1
Now we multiply both sides by the second term.
Eq. 2
∞
1
1
1
1
1
1
1
1
𝜁(𝑠) = 𝑠 ∑ 𝑠 = 𝑠 + 𝑠 + 𝑠 + 𝑠 + 𝑠 + β‹―
𝑠
2
2
𝑛
2
4
6
8
10
𝑛=1
Next we subtract Eq. 2 from Eq, 1.
1
LeVeque, 154, 155
Eq. 3
1
1
1
1
1
1
𝜁(𝑠) − 𝑠 𝜁(𝑠) = (1 − 𝑠 ) 𝜁(𝑠) = 1 + 𝑠 + 𝑠 + 𝑠 + 𝑠 + β‹―
2
2
3
5
7
9
Then let us multiply both sides by the second term again.
Eq. 4
1
1
1
1
1
1
1
(1 − 𝑠 ) 𝜁(𝑠) = 𝑠 + 𝑠 + 𝑠 + 𝑠 + 𝑠 + β‹―
𝑠
3
2
3
9
15
21
27
As before, let us subtract Eq. 4 from Eq. 3.
Eq. 5
1
1
1
1
1
1
1
(1 − 𝑠 ) (1 − 𝑠 ) 𝜁(𝑠) = 1 + 𝑠 + 𝑠 + 𝑠 + 𝑠 + 𝑠 + β‹―
3
2
5
7
11
13
17
We do these same steps to obtain another factor.
Eq. 6
1
1
1
1
1
1
1
1
(1 − 𝑠 ) (1 − 𝑠 ) (1 − 𝑠 ) 𝜁(𝑠) = 1 + 𝑠 + 𝑠 + 𝑠 + 𝑠 + 𝑠 + β‹―
5
3
2
7
11
13
17
19
If we continue in this way the right-hand side approaches 1, and this gives us
Eq. 7
𝜁(𝑠) ∏(1 − 𝑝−𝑠 ) = 1
𝑝
One can notice the similarity of this proof to the Sieve of Eratosthenes.
Now solving for 𝜁(𝑠), we get the Euler product formula as desired.
Result 1: Euler’s product formula
−1
𝜁(𝑠) = (∏(1 − 𝑝−𝑠 ))
𝑝
= ∏(1 − 𝑝−𝑠 )−1
𝑝
1
Next we will show that 𝜁(𝑠) − 𝑠−1 is analytic (Zagier uses the word ‘holomorphic’, which
means the same in this situation)2 on {𝑠 ∈ β„‚: ℜ(𝑠) > 0}, the right half of the complex plane.
We will transform the expression into something that will let us talk of analyticity. We can
1
treat 𝑠−1 as the result of an improper integral, and then finesse the summand in the
Riemann zeta function underneath the integral. The way this is done becomes obvious if
you transform the rightmost side into the middle one.
2
Zagier, 1
Eq. 8
∞
∞
𝑛+1
1
1
1
1
1
𝜁(𝑠) −
= ∑ 𝑠 − ∫ 𝑠 𝑑π‘₯ = ∑ ∫
− 𝑠 𝑑π‘₯
𝑠
𝑠−1
𝑛
𝑛
π‘₯
1 π‘₯
𝑛
∞
𝑛=1
𝑛=1
Now we examine the absolute value of the summed integral, whence we obtain
1
Result 2: 𝜁(𝑠) − 𝑠−1 is analytic on {𝑠 ∈ β„‚: ℜ(𝑠) > 0}
∞
∞ π‘₯
|𝑠|
1
1
1
𝑠
|∫ 𝑠 − 𝑠 𝑑π‘₯| = |𝑠 ∫ ∫ 𝑠+1 𝑑𝑒 𝑑π‘₯| ≤ max | 𝑠+1 | = ℜ(𝑠)+1
n≤u≤n+1 𝑒
π‘₯
𝑛
1 𝑛
1
𝑛 𝑒
This shows that the summand is less than that of a convergent power series times a
constant, |𝑠|, so by the comparison test, it too must be convergent.
Now we will show ∑𝑝≤π‘₯ log(𝑝) = 𝑂(π‘₯), where 𝑂(π‘₯) is big O notation, but first let us
officially define this series as a function.
Definition 2: πœƒ(π‘₯)
πœƒ(π‘₯) ≝ ∑ log(𝑝)
𝑝≤π‘₯
First, we will try and get a handle on how fast πœƒ(π‘₯) changes with respect to π‘₯.
Eq. 9
𝑛
2𝑛
2
= (1 + 1)
2𝑛
2𝑛
2𝑛
) ≥ ( ) ≥ ∏ 𝑝 = 𝑒 πœƒ(2𝑛)−πœƒ(𝑛)
𝑖
𝑛
= ∑(
𝑖=0
𝑛<𝑝≤2𝑛
This has given us something which we can manipulate by taking the log of both sides.
Eq. 10
2𝑛 log(2) ≥ πœƒ(2𝑛) − πœƒ(𝑛)
This gives us something with πœƒ(π‘₯) on the left side, and something 𝑂(π‘₯) on the right.
Eq. 11
π‘₯
πœƒ(π‘₯) − πœƒ ( ) ≤ 𝐢π‘₯, 𝐢 > log 2
2
Now we will sum these differences for over increasing powers of 2.
Eq. 12
∞
∞
π‘₯
π‘₯
π‘₯
∑ (πœƒ ( 𝑖 ) − πœƒ ( 𝑖+1 )) ≤ 𝐢 ∑ 𝑖
2
2
2
𝑖=0
𝑖=0
Applying term by term cancellation on the left side, and the normal properties for a
geometric series on the right, we obtain
Eq. 13
∞
π‘₯
πœƒ(π‘₯) ≤ 𝐢 ∑ 𝑖 = 2𝐢π‘₯
2
𝑖=0
From this we see that
Result 3
πœƒ(π‘₯) = 𝑂(π‘₯)
Now we will examine Φ(𝑠).
Definition 3: Φ(𝑠)
log(𝑝)
Φ(𝑠) ≝ ∑
𝑝𝑠
𝑝
We want to show that the Riemann zeta function has no zeroes for{𝑠 ∈ β„‚: ℜ(𝑠) ≥ 1}, also
1
we will show that Φ(𝑠) − 𝑠−1 is analytic there as well.
To examine 𝜁(𝑠) for zeroes, we will look at the logarithmic derivative.
Eq. 14
𝑑
𝑑
log(𝜁(𝑠)) = − log (∏(1 − 𝑝−𝑠 ))
𝑑𝑠
𝑑𝑠
𝑝
The rules of logarithms lead us to use the product form of 𝜁(𝑠); these rules will allow us
transform the logarithm of a product into a sum of logarithms.
Eq. 15
−
ζ′ (s)
𝑑
𝑑
= − log (∏(1 − 𝑝−𝑠 )−1 ) = − ∑ log((1 − 𝑝−𝑠 )−1 ) =
ζ(s)
𝑑𝑠
𝑑𝑠
𝑝
𝑝
−𝑠
𝑝 log(p)
log(𝑝)
(1 − 𝑝−𝑠 )2
−∑
=∑ 𝑠
−𝑠
−1
(1 − 𝑝 )
𝑝 −1
𝑝
𝑝
This leaves us with a suggestive form (the negative was appended to the logarithmic
derivative in order to get us here), which we can manipulate to achieve our intended result.
Eq. 16
log(𝑝)
log(𝑝) log(𝑝) log(𝑝)
log(𝑝)
∑ 𝑠
=∑ 𝑠
−
+
=
Φ(𝑠)
+
∑
𝑝 −1
𝑝 −1
𝑝𝑠
𝑝𝑠
𝑝 𝑠 (𝑝 𝑠 − 1)
𝑝
𝑝
𝑝
Now, if we establish analyticity for the rightmost part, we will have completed both of our
log(𝑝)
tasks. So now let us show that the series ∑𝑝 𝑝𝑠 (𝑝𝑠 −1) is convergent by the comparison test.
Eq. 17
|
log(𝑝)
log(𝑝)
| = ℜ(𝑠) 𝑠
− 1)
|(𝑝 − 1)|
𝑝
𝑝 𝑠 (𝑝 𝑠
Now we must get a handle on |(𝑝 𝑠 − 1)|, so let us separate off some of the series.
Eq. 18
𝑛
log(𝑝)
log(𝑝)
log(𝑝)
∑ 𝑠 𝑠
=∑ 𝑠 𝑠
+∑ 𝑠 𝑠
𝑝 (𝑝 − 1)
𝑝 (𝑝 − 1)
𝑝 (𝑝 − 1)
𝑝
𝑝=2
𝑝>𝑛
The term on the right of the plus sign is finite, so let us work on the one on the left, but we
will consider the value of 𝑝 first.
Eq. 19
𝑝ℜ(𝑠)
𝑝 > 𝑛 ⇒ 𝑝ℜ(𝑠) > 𝑛ℜ(𝑠) , ℜ(𝑠) > 1, ℜ(𝑠) > 0
𝑛
This will eventually give us a finite number with which to work.3
Eq. 20
ℜ(𝑠)
𝑝
1
𝑝ℜ(𝑠) − ℜ(𝑠) = 𝑝ℜ(𝑠) (1 − ℜ(𝑠) ) < 𝑝ℜ(𝑠) − 1
𝑛
𝑛
1
1
= 𝐾 > ℜ(𝑠)
1
𝑝
−1
𝑝ℜ(𝑠) (1 − ℜ(𝑠) )
𝑛
This gives us something to replace the troublesome |(𝑝 𝑠 − 1)|.
Eq. 21
log(𝑝)
log(𝑝)
<
ℜ(𝑠)
𝑠
|(𝑝 − 1)| 𝐾𝑝2ℜ(𝑠)
𝑝
1
Which is convergent for2ℜ(𝑠) − 1 > 0, or, ℜ(𝑠) > 2, as desired.
So this, along with Result 2, establishes that the rightmost side is at least meromorphic,
that is to say, analytic with a specific set of poles. We will examine Φ(𝑠) on the boundary of
{𝑠 ∈ β„‚: ℜ(𝑠) ≥ 1} for poles. If there are none, we will have succeeded. Let us examine five
points, and we will assume they are indeed poles of Φ(𝑠).
Eq. 22
𝑠 = 1,1 ± 𝑖𝛼, 1 ± 2𝑖𝛼
Let the complex ones’ poles be of order πœ‡, 𝜈 respectively (so πœ‡, 𝜈 are non-negative integers).
We have that order of the zeroes of a function, are the residues of the logarithmic
derivative, e.g.
3
Thanks to Prof. John Sylvester for this method of showing convergence for
1
ℜ(𝑠) > 2
Eq. 23
𝑓(𝑧) = 𝑧 , log(𝑓(𝑧)) = log(𝑧 𝑛 ) = 𝑛 log(𝑧)
𝑑
𝑛
𝑛 log(𝑧) =
𝑑𝑧
𝑧
𝑛
And since the residue is equal to the coefficient of the first negative-power term of the
Laurent series expansion of 𝑓(𝑧), we have 𝑛, as the residue at 𝑧 = 0, which happens to be
the order of the zero of 𝑓(𝑧) at 𝑧 = 0. Because of this, the residue of Φ(𝑠) at the points
above should equal the negative of the order of the zeroes of 𝜁(𝑠). Since Φ(𝑠) is not under
any power, so its pole at 𝑠 = 1 is a simple one (has an order of 1).
Eq. 24
res(Φ(𝑠), 𝑠 = 1) = πœ–Φ(1 + πœ–) = 1
res(Φ(𝑠), 𝑠 = 1 ± 𝑖𝛼) = πœ–Φ(1 + πœ– ± 𝑖𝛼) = −πœ‡
res(Φ(𝑠), 𝑠 = 1 ± 2𝑖𝛼) = πœ–Φ(1 + πœ– ± 2𝑖𝛼) = −𝜈
𝛼 ∈ ℝ, 𝛼 ≠ 0
Now we will attempt to create a contradiction.
Eq. 25
2
4
∑ (
) Φ(1 + π‘Ÿπ‘–π›Ό) = Φ(1 − 2𝑖𝛼) + 4Φ(1 − 𝑖𝛼) + 6Φ(1) + 4Φ(1 + 𝑖𝛼) + Φ(1 + 2𝑖𝛼)
2+π‘Ÿ
π‘Ÿ=−2
Let us see if we can transform this unwieldy right-hand side into something more useful .
Eq. 26
log(𝑝)
log(𝑝)
log(𝑝)
log(𝑝)
log(𝑝)
= ∑ 1+πœ–−2𝑖𝛼 + 4 ∑ 1+πœ–−𝑖𝛼 + 6 ∑ 1+πœ– + 4 ∑ 1+πœ–+𝑖𝛼 + ∑ 1+πœ–+2𝑖𝛼 =
𝑝
𝑝
𝑝
𝑝
𝑝
𝑝
𝑝
𝑝
𝑝
𝑝
log(𝑝)
log(𝑝)
log(𝑝)
log(𝑝)
log(𝑝)
∑ 1+πœ–−2𝑖𝛼 + 4 1+πœ–−𝑖𝛼 + 6 1+πœ– + 4 1+πœ–+𝑖𝛼 + 1+πœ–+2𝑖𝛼 =
𝑝
𝑝
𝑝
𝑝
𝑝
𝑝
1
1
1
1
1
∑ log(𝑝) ( 1+πœ–−2𝑖𝛼 + 4 1+πœ–−𝑖𝛼 + 6 1+πœ– + 4 1+πœ–+𝑖𝛼 + 1+πœ–+2𝑖𝛼 )
𝑝
𝑝
𝑝
𝑝
𝑝
𝑝
This looks like the expansion of a linear expression raised to the fourth power
Eq. 27
log(𝑝)
1
1
1
1
∑ 1+πœ– ( −2𝑖𝛼 + 4 −𝑖𝛼 + 6 + 4 𝑖𝛼 + 2𝑖𝛼 ) =
𝑝
𝑝
𝑝
𝑝
𝑝
𝑖𝛼 4
log(𝑝)
∑ 1+πœ– ((𝑝 2 )
𝑝
𝑝
𝑝
𝑖𝛼 3
𝑖𝛼 2
𝑖𝛼
𝑖𝛼 2
𝑖𝛼 3
𝑖𝛼
𝑖𝛼 4
+ 4 (𝑝 2 ) (𝑝− 2 ) + 6 (𝑝 2 ) (𝑝− 2 ) + 4 (𝑝 2 ) (𝑝− 2 ) + (𝑝− 2 ) ) =
4
4
𝑖𝛼
log(𝑝) 𝑖𝛼
log(𝑝)
𝛼
∑ 1+πœ– (𝑝 2 + 𝑝− 2 ) = ∑ 1+πœ– (2 cos ) ≥ 0
𝑝
𝑝
2
𝑝
𝑝
Now, multiply the right-hand side of Eq. 25 by πœ–, let πœ– go to zero, and we get
Eq. 28
−𝜈 − 4πœ‡ + 6 − 4πœ‡ + −𝜈 = 6 − 8πœ‡ − 2𝜈 ≥ 0
But here we must remember that πœ‡ is a non-negative integer, so in order for this statement
to be true, it must be zero. This means there is no pole there, so we have a contradiction.
This, with the Result 2, gives us the desired result.
Result 4:
𝜁(𝑠) ≠ 0, ∀𝑠 ∈ β„‚ s. t. ℜ(𝑠) ≥ 1
∞ πœƒ(π‘₯)−π‘₯
Now we will examine ∫1 π‘₯ 2 𝑑π‘₯ for convergence. First we will try to make the integral
that we are concerned with out of something with established properties.
Eq. 29
𝑛
∑q≤xi log(q) − ∑q≤xi−1 log(q)
log(𝑝)
log(𝑝)
Φ(𝑠) = ∑
=
lim
∑
=
lim
∑
, 𝑝 ∈]π‘₯𝑖−1 , π‘₯𝑖 ]
𝑛→∞
𝑛→∞
𝑝𝑠
𝑝𝑠
𝑝𝑠
𝑛
𝑝
𝑝
𝑝
This gives us something with πœƒ(π‘₯), which is promising.
Eq. 30
𝑛
∞
∑q≤xi log(q) − ∑q≤xi−1 log(q)
π‘‘πœƒ(π‘₯)
lim ∑
=
∫
𝑠
𝑛→∞
𝑝
π‘₯𝑠
1
𝑝
We are nearly where we want, let us now change this Stieltjes integral into the more
familiar Riemann integral with integration by parts for Stieltjes integrals.
Eq. 31
∞
∞
π‘‘πœƒ(π‘₯)
πœƒ(π‘Ÿ)
1
∫
=
lim
−
πœƒ(1)
−
∫
πœƒ(π‘₯)𝑑
(
)
π‘Ÿ→∞ π‘Ÿ 𝑠
π‘₯𝑠
π‘₯𝑠
1
1
πœƒ(1) is obviously zero, so let us examine the limit.
Eq. 32
∑
πœƒ(π‘Ÿ)
O(r)
p≤r log(p)
lim 𝑠 = lim
= lim 𝑠 = 0
𝑠
π‘Ÿ→∞ π‘Ÿ
π‘Ÿ→∞
π‘Ÿ→∞ π‘Ÿ
π‘Ÿ
This is obtained with Result 3. So now we are left with a friendlier Stieltjes integral.
Eq. 33
∞
∞
∞
π‘‘πœƒ(π‘₯)
1
πœƒ(π‘₯)
∫
=
−
∫
πœƒ(π‘₯)𝑑
(
)
=
𝑠
∫
𝑑π‘₯
𝑠+1
π‘₯𝑠
π‘₯𝑠
1
1
1 π‘₯
Now we will apply a change of variable to this in order to get the equation that will let us
obtain the desired result.
Eq. 34
∞
∞
πœƒ(π‘₯)
πœƒ(𝑒 𝑑 )
Φ(𝑠) = 𝑠 ∫ 𝑠+1 𝑑π‘₯ = 𝑠 ∫
𝑑𝑑
𝑒 𝑠𝑑
1 π‘₯
0
Next we will make some definitions in order to use the Analytic Theorem.
Definition 3
πœƒ(𝑒 𝑑 ) − 𝑒 𝑑
𝑓(𝑑) ≝
𝑒𝑑
∞
𝑔(𝑧) ≝ ∫ 𝑓(𝑑)𝑒 −𝑧𝑑 𝑑𝑑 , ℜ(𝑧) > 0
0
Now we will transform 𝑔(𝑧) into something from which we can determine whether it is
analytic on ℜ(𝑧) > 0.
Eq. 35
∞
∞
∞
πœƒ(𝑒 𝑑 ) − 𝑒 𝑑 −𝑧𝑑
πœƒ(𝑒 𝑑 ) − 𝑒 𝑑
𝑔(𝑧) = ∫ 𝑓(𝑑)𝑒 −𝑧𝑑 𝑑𝑑 = ∫
𝑒
𝑑𝑑
=
∫
𝑑𝑑 =
𝑒𝑑
𝑒 𝑑(𝑧+1)
0
0
0
∞
∞
πœƒ(𝑒 𝑑 )
𝑒𝑑
Φ(𝑧 + 1) 1
∫ 𝑑(𝑧+1) 𝑑𝑑 − ∫ 𝑑(𝑧+1) 𝑑𝑑 =
−
𝑧+1
𝑧
0 𝑒
0 𝑒
This shows that 𝑔(𝑧) is analytic by Result 3, and by Result 2 we have that 𝑓(𝑑) is bounded,
and by definition it resembles a step function and so is continuous almost everywhere. So
∞
the Analytic Theorem provides us that ∫0 𝑓(𝑑)𝑑𝑑 converges.
Analytic Theorem:
Let 𝑓(𝑑), (𝑑 ≥ 0) be a bounded and locally integrable function and
∞
suppose that the function (𝑧) = ∫0 𝑓(𝑑)𝑒 −𝑧𝑑 𝑑𝑑 , ℜ(𝑧) > 0 extends
∞
holomorphically to ℜ(𝑧) ≥ 0. Then ∫0 𝑓(𝑑)𝑑𝑑 exists (and equals
𝑔(0)).4
Zagier goes onto show the final step here, and then proves the Analytic Theorem; we will
instead not prove the Analytic Theorem as it strays too far afield into complex analysis for
the purposes of this project.
So now we will apply a change of variable to 𝑓(𝑑) to get the integral in which we were first
interested.
Result 5:
𝑑π‘₯
π‘₯ = 𝑒𝑑,
= 𝑑𝑑
π‘₯
∞
∞
∞
πœƒ(𝑒 𝑑 ) − 𝑒 𝑑
πœƒ(π‘₯) − π‘₯
∫ 𝑓(𝑑)𝑑𝑑 = ∫
𝑑𝑑
=
∫
𝑑π‘₯ convergent
𝑒𝑑
π‘₯2
0
0
1
Now that we have established these five results, we can go on to finish the PNT by showing
πœƒ(π‘₯)~π‘₯.
First we will assume that there is some πœ† > 1 such that πœƒ(π‘₯) ≥ πœ†π‘₯, for an arbitrarily large π‘₯,
from this we would have
Eq. 36
4
Zagier, 4
πœ†π‘₯
∫
π‘₯
𝑛−1
πœ†π‘₯
πœƒ(π‘₯) − 𝑑
πœ†π‘₯ − 𝑑
πœ†π‘₯ − 𝑑𝑖∗
(𝑑𝑖+1 − 𝑑𝑖 )
𝑑𝑑 ≤ ∫
𝑑𝑑 = lim ∑
𝑛→∞
𝑑2
𝑑2
𝑑𝑖∗ 2
π‘₯
𝑖=0
𝑑𝑖∗ = πœ†∗𝑖 π‘₯, 𝑑𝑖 = πœ†π‘– π‘₯, 𝑑𝑖+1 = πœ†π‘–+1 π‘₯, 1 ≤ πœ†∗𝑖 , πœ†π‘– , πœ†π‘–+1 ≤ πœ†
𝑛−1
lim ∑
𝑛→∞
𝑖=0
πœ†π‘₯ − 𝑑𝑖∗
𝑑𝑖∗ 2
𝑛−1
(𝑑𝑖+1 − 𝑑𝑖 ) = lim ∑
𝑛→∞
𝑛−1
πœ† − πœ†∗𝑖
lim ∑ ∗ 2 (πœ†π‘–+1
𝑛→∞
πœ†π‘–
𝑖=0
πœ†π‘₯ − πœ†∗𝑖 π‘₯
πœ†∗𝑖 2 π‘₯ 2
𝑖=0
πœ†
− πœ†π‘– ) = ∫
1
(πœ†π‘–+1 π‘₯ − πœ†π‘– π‘₯) =
πœ†−𝑑
𝑑𝑑 > 0
𝑑2
But this presents a problem, if any interval with bounds being some integer to πœ† times that
integer is equal to the integral from 1 to πœ†, then we have
Eq. 37
𝑛
𝑛
∞
πœ†π‘–+1
πœ†
πœ†
πœƒ(π‘₯) − π‘₯
πœ†−𝑑
πœ†−𝑑
πœ†−𝑑
∫
𝑑π‘₯
=
lim
∑
∫
𝑑𝑑
=
lim
∑
∫
𝑑𝑑
=
lim
𝑛
∫
𝑑𝑑 → ∞
𝑛→∞
𝑛→∞
𝑛→∞
π‘₯2
𝑑2
𝑑2
𝑑2
1
πœ†π‘–
1
1
𝑖=0
𝑖=0
This contradicts Result 5. By an equivalent argument, we can show that if instead we have
πœ† < 1 such that πœƒ(π‘₯) ≤ πœ†π‘₯, we run into the same contradiction (except this time with −∞).
Therefore, we get our final result .
Result 6:
πœƒ(π‘₯)~π‘₯
Now all that is left is to relate our findings to the prime counting function πœ‹(π‘₯).
Eq. 38
πœƒ(π‘₯) = ∑ log(𝑝) ≤ ∑ log(π‘₯) = πœ‹(π‘₯) log(π‘₯)
𝑝≤π‘₯
πœƒ(π‘₯) = ∑ log(𝑝) ≥
𝑝≤π‘₯
𝑝≤π‘₯
∑
π‘₯ 1−πœ– ≤𝑝≤π‘₯
log(𝑝) ≥
∑
(1 − πœ–) log(π‘₯) =
π‘₯ 1−πœ– ≤𝑝≤π‘₯
(1 − πœ–) log(π‘₯) (πœ‹(π‘₯) + 𝑂(π‘₯1−πœ– )) → πœ‹(π‘₯) log(π‘₯) , as Ο΅ → 0
So we have the long sought after result.
PNT
πœ‹(π‘₯) log(π‘₯) ~π‘₯
x
πœ‹(π‘₯)~
log(π‘₯)
And so, we have established the PNT by Newman’s Proof.
Soprounov’s proof follows in almost the exact same way, except for a few differences.
Now we will explain Soprounov’s proof of the PNTAP. First though, we must define two
functions: the Dirichlet character, and the Dirichlet L-series.
Defintion 4: Dirichlet Characters5
πœ’π‘˜ mod π‘š : β„€ → β„‚
πœ’π‘˜ (π‘Ž) = 𝑧
πœ’π‘˜ (π‘Ž)πœ’π‘˜ (𝑏) = πœ’π‘˜ (π‘Žπ‘)
π‘Ž ≡ 𝑏 (π‘š) ⇒ πœ’π‘˜ (π‘Ž) = πœ’π‘˜ (𝑏)
0 if gcd(𝑛, π‘š) > 1
πœ’π‘˜ (𝑛) mod π‘š ≝ { 2πœ‹π‘–π‘Ÿ
} , π‘˜ ∈ β„€πœ™(π‘š) , π‘Ÿ ∈ β„€
𝑒 πœ™(π‘š) if gcd(𝑛, π‘š) = 1
πœ’π‘˜ (1) = 1
πœ’0 (𝑛) = πœ’0 = 1 principle character
An important thing to note from the definition is that |πœ’| = 1 or 0 regardless of the integers
used in its construction. Now that we have this, we will define the Dirichlet L-series.
Definition 5: Dirichlet L-series
∞
πœ’π‘˜ (𝑛)
𝐿(𝑠, πœ’π‘ž ) ≝ ∑
𝑛𝑠
𝑛=1
Dirichlet used these two functions along with the Riemann zeta function to show that the
number of primes in an arithmetic progression is infinite.6 We must now begin our work on
the PNTAP. First we will establish Euler’s product formula for the Dirichlet L-series.
As before, let us examine the expanded series
Eq. 39
∞
πœ’π‘˜ (𝑛)
πœ’π‘˜ (𝑛2 ) πœ’π‘˜ (𝑛3 ) πœ’π‘˜ (𝑛4 ) πœ’π‘˜ (𝑛5 )
𝐿(𝑠, πœ’π‘˜ ) = ∑
= 1+
+
+
+
+β‹―
𝑠
𝑛
𝑛2 𝑠
𝑛3 𝑠
𝑛4 𝑠
𝑛5 𝑠
𝑛=1
𝑛𝑖 ∀ π‘š, πœ’π‘˜ mod π‘ž
Now we will attempt the same step as before, we will find the first prime in the list and
obtain
Eq. 40
∞
πœ’π‘˜ (𝑝)
πœ’π‘˜ (𝑝)
πœ’π‘˜ (𝑛)
)
𝐿(𝑠,
πœ’
=
∑
=
π‘˜
𝑝𝑠
𝑝𝑠
𝑛𝑠
𝑛=1
πœ’π‘˜ (𝑝) πœ’π‘˜ (𝑝)πœ’π‘˜ (𝑛2 ) πœ’π‘˜ (𝑛3 )πœ’π‘˜ (𝑝) πœ’π‘˜ (𝑛4 )πœ’π‘˜ (𝑝) πœ’π‘˜ (𝑛5 )πœ’π‘˜ (𝑝)
+
+
+
+
+β‹―=
𝑝𝑠
𝑝 𝑠 𝑛2 𝑠
𝑝 𝑠 𝑛3 𝑠
𝑝 𝑠 𝑛4 𝑠
𝑝 𝑠 𝑛5 𝑠
πœ’π‘˜ (𝑝) πœ’π‘˜ (𝑝𝑛2 ) πœ’π‘˜ (𝑝𝑛3 ) πœ’π‘˜ (𝑝𝑛4 ) πœ’π‘˜ (𝑝𝑛5 )
+ 𝑠 𝑠 + 𝑠 𝑠 + 𝑠 𝑠 + 𝑠 𝑠 +β‹―
𝑝𝑠
𝑝 𝑛2
𝑝 𝑛3
𝑝 𝑛4
𝑝 𝑛5
Now subtracting Eq. 40 from Eq. 39
5
6
LeVeque, 159
LeVeque, 155
Eq. 41
𝐿(𝑠, πœ’π‘˜ ) −
πœ’π‘˜ (𝑝)
πœ’π‘˜ (𝑝)
𝐿(𝑠, πœ’π‘˜ ) = (1 −
) 𝐿(𝑠, πœ’π‘˜ )
𝑠
𝑝
𝑝𝑠
This removes all the multiples of 𝑝 from the right-hand side, we would then pick another
prime and repeat, and in this way we will sieve out all the primes and solve for 𝐿(𝑠, πœ’π‘ž ),
and obtain
Result 7a:
𝐿(𝑠, πœ’π‘˜ ) = ∏(1 − πœ’π‘˜ (𝑝)𝑝−𝑠 )−1
𝑝
Furthermore, in the case of πœ’π‘˜ = πœ’0
Result 7b:
𝐿(𝑠, πœ’0 ) = ∏(1 − 𝑝−𝑠 )−1 = ∏(1 − 𝑝−𝑠 ) ∏(1 − 𝑝−𝑠 )−1 = ∏(1 − 𝑝−𝑠 ) 𝜁(𝑠)
π‘βˆ€π‘š
𝑝|π‘š
𝑝
Now we will show that 𝐿(𝑠, πœ’π‘˜ ), and 𝐿(𝑠, πœ’0 ) −
πœ™(π‘ž) 1
π‘ž
𝑠−1
𝑝|π‘š
is analytic on {𝑠 ∈ β„‚: ℜ(𝑠) > 0}. Let
Soprounov takes a partial sum from the Dirichlet L-series and changes into the following. I
was unable to figure out how he produced this change, so I am forced to take on faith that it
is true.
Eq. 42
π‘₯
π‘₯
πœ’π‘˜ (𝑛) 𝐴(π‘₯)
𝐴(𝑑)
∑
=
+
𝑠
∫
𝑑𝑑
𝑠
𝑠
𝑠+1
𝑛
π‘₯
1 𝑑
𝑛=1
Definition 6:
π‘₯
𝐴(π‘₯) ≝ ∑ πœ’π‘˜ (𝑛)
𝑛=1
If πœ’π‘˜ (𝑛) is not principal then 𝐴(π‘₯) is bounded by a finite number of roots of unity, as the
sum over the mth roots of unity hits zero every πœ™(π‘š) times. So we will let π‘₯ → ∞ to obtain
Eq. 43
∞
∞
𝐴(𝑑)
𝐴(𝑑)
𝐿(𝑠, πœ’π‘˜ ) = 𝑠 ∫ 𝑠+1 𝑑𝑑 = 𝑠 ∫ 𝑠+1 𝑑𝑑
1 𝑑
1 𝑑
This is convergent, and therefore analytic for ℜ(𝑠) > 0. For the case of πœ’π‘˜ (𝑛) being
principal we can look at Result 7b, or even the definition of 𝐿(𝑠, πœ’π‘˜ ), and see that it is
meromorphic, with only a simple pole at 𝑠 = 1 with residue being
Eq. 44
𝑝−1
πœ™(𝑝)
πœ™(π‘š)
∏(1 − 𝑝−1 ) = ∏ (
) = ∏(
)=
𝑝
𝑝
π‘š
𝑝|π‘š
𝑝|π‘š
𝑝|π‘š
This gives us all the information needed to say
Result 8:
πœ™(π‘ž) 1
𝐿(𝑠, πœ’π‘˜ ), 𝐿(𝑠, πœ’0 ) −
are analytic on {𝑠 ∈ β„‚: ℜ(𝑠) > 0}
π‘ž 𝑠−1
Now we will show that πœƒπ‘ž (π‘₯) = 𝑂(π‘₯), but first we must define it
Definiton 7:
πœƒπ‘ž (π‘₯) ≝ πœ™(π‘š) ∑ log(𝑝)
𝑝≤π‘₯
𝑝≡π‘Ž (π‘š)
This will be easy enough to show.
Result 9:
πœƒπ‘ž (π‘₯) ≤ πœ™(π‘š)πœƒ(π‘₯) = 𝑂(π‘₯) ⇒ πœƒπ‘ž (π‘₯) = 𝑂(π‘₯)
Now we will examine
Definition 8:
πœ’π‘˜ (𝑝) log 𝑝
πœ™(𝑠, πœ’π‘˜ ) ≝ ∑
𝑝𝑠
𝑝
Φπ‘š (𝑠) ≝ ∑ πœ™(𝑠, πœ’π‘˜ )
πœ’π‘˜
Φπ‘š,π‘Ž (𝑠) ≝ ∑ Μ…Μ…Μ…Μ…Μ…Μ…Μ…Μ…
πœ’π‘˜ (π‘Ž)πœ™(𝑠, πœ’π‘˜ ) , Μ…Μ…Μ…Μ…Μ…Μ…Μ…Μ…
πœ’π‘˜ (π‘Ž) conjugate of πœ’π‘˜ (π‘Ž)
πœ’π‘˜
We want to show that the Dirichlet L-series has no zeroes for {𝑠 ∈ β„‚: ℜ(𝑠) ≥ 1} for any πœ’π‘˜ ,
1
also we will show that Φπ‘š (𝑠) − 𝑠−1 is analytic there as well.
In order to guarantee our desired result for any πœ’, we will look at the a new function.
Definition 9:
β„’(𝑠) ≝ ∏ 𝐿(𝑠, πœ’π‘˜ )
πœ’
Soprounov takes as given, as will we, that 𝐿(1, πœ’π‘˜ ) ≠ 0 provided that πœ’π‘˜ is not principal.
So let us now again to the logarithmic derivative of β„’(𝑠).
Eq. 45
𝐿′(𝑠, πœ’π‘˜ ) ∏πœ’ 𝐿(𝑠, πœ’π‘˜ )
𝑑
∑
(
)
πœ’
𝐿(𝑠, πœ’π‘˜ )
β„’ ′ (𝑠) 𝑑𝑠 ∏πœ’ 𝐿(𝑠, πœ’π‘˜ )
𝐿′(𝑠, πœ’π‘˜ )
=
=
=
∏πœ’ 𝐿(𝑠, πœ’π‘˜ )
∏πœ’ 𝐿(𝑠, πœ’π‘˜ )
β„’(𝑠)
𝐿(𝑠, πœ’π‘˜ )
Therefore, we need only examine the logarithmic derivative of the Dirichlet L-series.
Eq. 46
𝑑
𝑑
log(𝐿(𝑠, πœ’π‘˜ )) = − log (∏(1 − πœ’π‘˜ (𝑝)𝑝−𝑠 )−1 )
𝑑𝑠
𝑑𝑠
𝑝
We will use the rules of logarithms as before.
Eq. 47
−
𝐿′(𝑠, πœ’π‘˜ )
𝑑
𝑑
= − log (∏(1 − πœ’π‘˜ (𝑝)𝑝−𝑠 )−1 ) = − ∑ log((1 − πœ’π‘˜ (𝑝)𝑝−𝑠 )−1 ) =
𝐿(𝑠, πœ’π‘˜ )
𝑑𝑠
𝑑𝑠
𝑝
𝑝
πœ’π‘˜ (𝑝)𝑝−𝑠 log(p)
πœ’π‘˜ (𝑝) log(𝑝)
(1 − πœ’π‘˜ (𝑝)𝑝−𝑠 )2
−∑
=∑ 𝑠
−𝑠
−1
(1 − πœ’π‘˜ (𝑝)𝑝 )
𝑝 − πœ’π‘˜ (𝑝)
𝑝
𝑝
We now use the exact same methods as in the proof for the PNT.
Eq. 48
πœ’π‘˜ (𝑝) log(𝑝)
πœ’π‘˜ (𝑝) log(𝑝) πœ’π‘˜ (𝑝) log(𝑝) πœ’π‘˜ (𝑝) log(𝑝)
∑ 𝑠
=∑ 𝑠
−
+
=
𝑝 − πœ’π‘˜ (𝑝)
𝑝 − πœ’π‘˜ (𝑝)
𝑝𝑠
𝑝𝑠
𝑝
𝑝
Φm (𝑠) + ∑
𝑝
πœ’π‘˜2 (𝑝) log(𝑝)
𝑝 𝑠 (𝑝 𝑠 − πœ’π‘˜ (𝑝))
The analyticity for the term on the right is essentially the same as before, so like both
authors, we will take it as given.
Combined with Result 8, this establishes that the rightmost side is at least meromorphic, that is
to say, analytic with a specific set of poles. The rest of the proof of this claim follows
indentically to the proof for Result 4.
Result 10:
𝜁(𝑠) ≠ 0, ∀𝑠 ∈ β„‚ s. t. ℜ(𝑠) ≥ 1
Result 8 and Result 10 give
1
Φπ‘š,π‘Ž (𝑠) − 𝑠−1
∞ πœƒπ‘ž (π‘₯)−π‘₯
Now we will show that ∫1
π‘₯2
Result 11:
as analytic on {𝑠 ∈ β„‚: ℜ(𝑠) ≥ 1}.
𝑑π‘₯ is convergent.
Eq. 49
πœ’π‘˜
Μ…Μ…Μ…Μ…Μ…Μ…Μ…Μ…
Φπ‘š,π‘Ž (𝑠) = ∑ Μ…Μ…Μ…Μ…Μ…Μ…Μ…Μ…
πœ’π‘˜ (π‘Ž)πœ™(𝑠, πœ’π‘˜ ) = ∑ (πœ’
π‘˜ (π‘Ž) ∑
πœ’π‘˜
πœ’π‘˜
Μ…Μ…Μ…Μ…Μ…Μ…Μ…Μ…
∑ (πœ’
π‘˜ (π‘Ž) ∑
πœ’π‘˜
𝑝
𝑝
(𝑝) log 𝑝
)=
𝑝𝑠
πœ’π‘˜ (𝑝) log 𝑝
)
𝑝𝑠
The Μ…Μ…Μ…Μ…Μ…Μ…Μ…Μ…
πœ’π‘˜ (π‘Ž) can also be written as πœ’π‘˜ (π‘Ž−1 mod π‘š), by the multiplicative property of πœ’π‘˜ , so it
works like a filter with πœ’π‘˜ (𝑝) leaving us with only the primes congruent to π‘Ž modulo m.
Eq. 50
∑ Μ…Μ…Μ…Μ…Μ…Μ…Μ…Μ…
πœ’π‘˜ (π‘Ž)πœ’π‘˜ (𝑝) =
πœ’π‘˜
∑
1 = πœ™(π‘š)
π‘Ž−1 𝑝≡1(π‘š)
So now we are left with turning Φπ‘š,π‘Ž (𝑠) into something familiar.
Eq. 51
log 𝑝
log 𝑝
∑ πœ™(π‘š) 𝑠 = ∑ πœ™(π‘š) 𝑠
𝑝
𝑝
−1
π‘Ž
𝑝≡π‘Ž(π‘š)
𝑝≡1(π‘š)
This is reminiscent of Φ(𝑠) from the proof of the PNT, and indeed it can be transformed
into an integral in the same manner. So now we have, with a familiar change of variable
Eq. 52
∞ πœƒ (π‘₯)
∞ πœƒ (𝑒 𝑑 )
π‘ž
π‘ž
Φπ‘š,π‘Ž (𝑠) = 𝑠 ∫
𝑑π‘₯
=
𝑠
∫
𝑑π‘₯
𝑠+1
π‘₯
𝑒 𝑠𝑑
1
1
From here we proceed in the exact same way as in the proof for the PNT, using the Analytic
Theorem we get that indeed
Result 12:
∞ πœƒ (π‘₯) − π‘₯
π‘ž
∫
𝑑π‘₯ convergent
π‘₯2
1
From this point on, the Soprounov proof says nothing new from the Newman proof, so we
will cut to the chase and get to the finale.
PNTAP
πœ‹(π‘₯, π‘ž) =
π‘₯
πœ™(π‘ž) log π‘₯
Now I explain the program by which I will test these results.
def PNTAP_test1(x,q,a):
assert gcd(q,a) == 1
finish_q = int(Mod(x,q))
tries = (x-finish_q)/q + 1
hits = 0
for k in range(tries) :
y = q*k+a
if y > x:
break
if is_prime(y):
hits += 1
estimate = N(x/(euler_phi(q)*log(x)))
abs_error = abs(hits-estimate)
rel_error = abs_error/hits
return [hits,estimate.n(digits=5), \
abs_error.n(digits=5), \
rel_error.n(digits=5)]
Step 1: Make sure that π‘Ž and π‘ž are coprime
The range of numbers to be tested will be partitioned into testing grounds by the
arithemetic progression, so we will see how much is cropped off by the cap of the range.
Step 2: Get the distance between the beginning of the final interval and the cap
Step 3: Calculate the number of prime tests that will be conducted
Step 4: Get the value, y of the arithmetic progression for the current test
Step 5: If we have gone too far skip to Step 8
Step 6: If y is prime add one to the hit count
Step 7: Repeat Steps 4-6 until all the tests are done
Step 8: Calculate the estimate according to the PNTAP
Step 9: Calculate the absolute error
Step 10: Calculate the relative error
Return: Number of hits, estimate, absolute error, relative error
def PNTAP_test2(x,q):
phi_q = euler_phi(q)
cong_classes = q.coprime_integers(q)
hit_data = []
estimate = N(x/(phi_q*log(x)))
abs_error_data = []
rel_error_data = []
for k in range(phi_q):
a = cong_classes[k]
data = PNTAP_test1(x,q,a)
hit_data += [data[0]]
abs_error_data += [data[2]]
rel_error_data += [data[3]]
hit_mean = N(sum(hit_data)/phi_q)
abs_error_mean = N(sum(abs_error_data)/phi_q)
rel_error_mean = N(sum(rel_error_data)/phi_q)
hit_stddev = 0
abs_error_stddev = 0
rel_error_stddev = 0
for k in range(phi_q):
hit_stddev += N(sqrt((phi_q-1)^-1*(hit_data[k]-hit_mean)^2))
abs_error_stddev += N(sqrt((phi_q-1)^-1*\
(abs_error_data[k]-abs_error_mean)^2))
rel_error_stddev += N(sqrt((phi_q-1)^-1*\
(rel_error_data[k]-rel_error_mean)^2))
return [[hit_mean.n(digits=5),hit_stddev.n(digits=5)], \
[estimate.n(digits=5)], \
[abs_error_mean.n(digits=5), \
abs_error_stddev.n(digits=5)], \
[rel_error_mean.n(digits=5), \
rel_error_stddev.n(digits=5)]]
Step 1: Calculate the necessary values, πœ™(π‘ž), the coprime integers, and the estimate
Step 2: Pick a congruency class
Step 3: Obtain data on the given arithmetic progression for all the congruency classes
Step 4: Distribute the data among the hit data, absolute errors, and relative errors
Step 5: Calculate the mean for the hit data, absolute errors, and relative errors
Step 6: Calculate the standard deviation for the hit data, absolute errors, and relative errors
Return: hit mean, hit standard deviation, absolute error mean, absolute error standard
deviation, relative error mean, and relative error standard deviation
First, we will test the estimate with a particular congruency class (PNTAP_test1).
I will use the following arithmetic progressions to explore how far out we need to go to get
close results.
TEST 1
4x+1, x=[103, 105, 107]
[hits, estimate, mean, absolute error, relative error]
[80, 72.382, 7.6176, 0.095220] x= 103
[4783, 4342.9, 440.06, 0.092004] x= 105
[332180, 310210., 21970., 0.066138] x= 107
TEST 2
27x+8, cap=[103, 105, 107, 109]
[hits, estimate, mean, absolute error, relative error]
[8, 8.0425, 0.042490, 0.0053113] x= 103
[543, 482.55, 60.451, 0.11133] x= 105
[36912, 34468., 2444.2, 0.066216] x= 107
[2824735, 2.6808e6, 143900., 0.050945] x= 109
TEST 3
90x+37, x=[103, 105, 107, 109]
[hits, estimate, mean, absolute error, relative error]
[8, 6.0319, 1.9681, 0.24602] x= 103
[405, 361.91, 43.088, 0.10639] x= 105
[27734,25851, 1883.1, 0.067900] x= 107
[2119211, 2.0106e6, 108590., 0.051240] x= 109
TEST 4
105x+16, x=[103, 105, 107, 109]
[hits, estimate, mean, absolute error, relative error]
[3, 3.0159, 0.015934, 0.0053113] x= 103
[405, 361.91, 43.088, 0.10639] x= 105
[13835,12925., 909.57, 0.065744] x= 107
[1059463, 1.0053e6, 54152., 0.051112] x= 109
We see that, as expected, the relative error decreases as π‘₯ grows. 27x+8 is odd in that for x
= 103 it is unusually accurate, and unlike 105x+16, it can’t be explained as it being a very
small number.
Now we will try the second fact obtained from the PNTAP (PNTAP_test2) with two
arithmetic progressions.
TEST 5
m = 27, x=[103, 105, 107]
[[hit mean,hit stddev],[estimate],
[abs error mean,abs error stddev],
[rel error mean,rel error stddev]]
x=103 [[9.2778, 5.0932], [8.0425],
[1.4858, 3.9994],
[0.14958, 0.34342]]
x=105 [[532.83, 22.798], [482.55],
[50.284, 22.798],
[0.094220, 0.039111]]
x=107 [[36921., 170.26], [34468.],
[2453.2, 170.26],
[0.066443, 0.0043041]]
TEST 6
m = 42, x=[103, 105, 107]
[[hit mean,hit stddev],[estimate],
[abs error mean,abs error stddev],
[rel error mean,rel error stddev]]
x=103 [[13.750, 4.0704], [12.064],
[1.7181, 3.9359],
[0.11735, 0.25734]]
x=105 [[799.08, 24.473], [723.82],
[75.259, 24.473],
[0.094098, 0.027691]]
x=107 [[55381., 93.870], [51702.],
[3679.6, 93.874],
[0.066441, 0.0015834]]
The results for this test are less satisfactory, but still we see the means behave like we saw
the specific congruency classes from tests 1 through 4.
This result is subtly amazing. According to the PNTAP, the number of primes
congruent to a number modulo another depends only on the modulus, which means that
the primes are evenly distributed among the congruency classes of the multiplicative group
of the modulus. This hints at a structure to what otherwise appears to be a randomly
distributed set of numbers, the primes.
Bibliography
LeVeque, William J. Fundamentals of Number Theory. New York: Dover, 1996.
Soprounov, Ivan. A Short Proof of the Prime Number Theorem for Arithmetic Progressions.
[http://www.math.umass.edu/~isoprou/pdf/primes.pdf]. 3 March 2010
Zagier, D. Newman’s Short Proof of the Prime Number Theorem.
[http://mathdl.maa.org/images/upload_library/22/Chauvenet/Zagier.pdf]. 4 March
2010
Download