Unexpected Irregularities in the Distribution of Prime Numbers

advertisement
Unexpected Irregularities in the Distribution of
Prime Numbers
A N D R E W GRANVILLE*
D e p a r t m e n t of Mathematics
University of Georgia
Athens, Georgia 30602, USA
In 1849 t h e Swiss mathematician E N C K E wrote to G A U S S , asking whether he had
ever considered trying to estimate ir(x), t h e number of primes up to x, by some
sort of "smooth" function. On Christmas Eve 1849, GAUSS replied t h a t "he h a d
pondered this problem as a bof' and h a d come to t h e conclusion t h a t "at around
x, the primes occur with density 1/loga;." Thus, he concluded, ir(x) could b e
approximated by
Li(z)
r—
J2
log*
Ioga:
+ log2x + 0
log 3 a;
Comparing G A U S S ' s guess t o the best d a t a available today (due to D E L E G L I S E
and R I V A T ) , we have:
X
8
10
109
10 l ü
IO 11
IO 12
IO 13
IO 14
IO 15
IO 16
IO 17
IO1«
ir(x)
[U(x)-7r(x)}
5761455
50847534
455052511
4118054813
37607912018
346065536839
3204941750802
29844570422669
279238341033925
2623557157654233
24739954287740860
754
1701
3104
11588
38263
108971
314890
1052619
3214632
7956589
21949555
The number of primes, 7T(:E), u p to x.
This d a t a certainly seems to support G A U S S ' S prediction, because Li(x) — ir(x)
appears t o be no bigger t h a n a small power of TT(X). In 1859, R I E M A N N , in a
now famous memoir, illustrated how t h e question of estimating ir(x) could be
*)
The author is an Alfred P. Sloan Research Fellow; and is also supported, in part, by the
National Science Foundation.
Proceedings of the International Congress
of Mathematicians, Zürich, Switzerland 1994
© Birkhäuser Verlag, Basel, Switzerland 1995
Distribution of Primes
389
turned into a question in analysis: Define Ç(s) := Y2n>i n _ S ^ o r ^ e ( s ) > 1> a n d
then analytically continue Ç(s) to t h e rest of t h e complex plane. We have, for
sufficiently large x,
xb~E < max \w(y) - U(y)\
< xb+E
where b : =
sup
/3.
(1)
The (as yet unproven) Riemann Hypothesis ( R H ) asserts t h a t b = 1/2 (in fact
t h a t /? = 1/2 whenever C(ß+i"y) = 0 with 0 < ß < 1) leading t o t h e sharp estimate
ir(x)
= Li(a0 + O(a: 1 / 2 loga:) .
(2a)
It was not until 1896 t h a t H A D A M A R D and D E L A V A L L é E - P O U S S I N independently
proved t h a t ß < 1 whenever ((ß + ry) = 0, which implies the prime
theorem: t h a t is, G A U S S ' S prediction t h a t
TT(X) ~
Li(x)
number
Ioga:
In 1914, L I T T L E W O O D showed, unconditionally, t h a t
*(*) - u(x) = n± («v» logl^gx),
(2b)
the first proven "irregularities" in t h e distribution of primes. 1
As G A U S S ' S vague "density assertion" was so prescient, CRAA4ÉR [4] decided,
in 1936, to interpret GAUSS's statement more formally in terms of probability
theoiy, t o t r y t o make further predictions about t h e distribution of prime numbers:
let Z2,Zs,...
be a sequence of independent random variables with
Prob(Zn = l) = ^ —
log 11
a n d P r o b ( Z n = 0) = 1 -
Let S be the space of sequences T — z2,23,...
KT(X)
= YI
1
log 77,
and for each x > 2 define
Zn
'
2<n<x
T h e sequence P = ir2,7T3,... , where irn = 1 if and only if n is prime, belongs to S.
C R A M E R wrote: "In many cases it is possible to prove that, with probability 1, a
certain relation R holds for sequences in S ...
Of course we cannot in general
conclude that R holds for the particular sequence P, but results suggested in
this way may sometimes afterwards be rigorously proved by other methods." For
example C R A M E R was able t o show, with probability 1, t h a t
m^M^-LKy)!-^.1^^,
1
f(x-)
f{x) — tì±{9Ìx)) means that there exists a constant c > 0 such t h a t /(œ_j-) > cg{x+) and
< — cg(x-) for certain arbitrarily large values of x±.
390
Andrew Granville
which corresponds well with the estimates in (2); and if true for T — P implies
RH, by (1).
GAUSS'S assertion was really about primes in short intervals, and so is best
applied to tr(x + y) —ir(x), where y is "small" compared to x. The binomial random
variables Zn are, more-or-less, the same for all integers n in such an interval. If we
take y = À log x so that the "expected" number of primes À in the interval is fixed
then we would expect that the number of primes in such intervals should follow
a Poisson distribution. Indeed, we can prove that for any fixed À > 0 and integer
k > 0, we have
+ A log x) — irT(x) = k} ~ e~x—X
(3)
k\
as X —> oo, with probability 1 for T e S. In 1976, GALLAGHER [11] showed that
this holds for the sequence of primes (that is, for P) under the assumption of
a reasonable "uniform" version of HARDY AND LITTLEWOOD'S Prime k-tuplets
conjecture [14]. This conjecture is the case where we take each fj(x) to be a linear
polynomial in SCHINZEL [22]
HYPOTHESIS H. Let F = {fi(x),f2(x),...
, fk(x)} be a set of irreducible polynomials with integer coefficients. Then the number of integers n < x for which
each | fj (n) \ is prime is
#{integers x < X :
*,(,)
= {Cr +
• * -
TTT(X
°(1),iog|/.W|log IAWI-.log IAWI
* , -
n
p prime
0 - ^ ) / K ) ' .
N
*-
/
/
\
r /
and UJF(P) counts the number of integers n, in the range 1 < n < p, for which
fi(n)f2(n)...fk(n)=0
(modp). 2 - 3
Estimates analogous to (2a) should hold for the number of primes in intervals
of various lengths, if we believe that what almost always OCCLUS in S should also
hold for P. Specifically, if 10 log2 x < y < x then
7TT(x + y) - TTT(X) = Li(z + y) - Li(z) + 0(y^2)
(4)
with probability 1 for T G S. In 1943 SELBERG [23] showed that primes do, on the
whole, behave like this by proving, under the assumption of RH, that
ir(x + y) -ir(x)
-
(5)
Ioga;
for "almost all" integers x, provided y /log x —> oo as x —> oo.
MONTGOMERY [17] has shown that one can deduce estimates about primes
in short intervals by understanding local distribution properties of the zeros of
CM:
2
Elementary results on prime ideals guarantee that the product defining Cp converges if
the primes are taken in ascending order.
3
T h e asymptotic formula proposed here for ITF(X) has a "local part" Cp, which has a factor
corresponding to each rational prime p, and an "analytic part" x/Tli l°ë\fi(x)\This reminds
one of formulae that arise when counting points on varieties.
Distribution of P r i m e s
P A I R C O R R E L A T I O N C O N J E C T U R E ( P C ) . Assume
average number of zeros 1/2 + ry' of ((s)
with 7 < 7 ; < 7 + 27rci'/log(|7| + 1), is
391
R H . For any fixed a > 0, the
"close" to a given zero 1/2 + ry, that is
(6a)
Inspired by t h e work of M O N T G O M E R Y and others, G O L D S T O N [12] showed t h a t
(6a) holds for eveiy fixed a > 0 if and only if for every fixed ß > 0,
.. .
TJ
s \ 2 dx
• ^ ) - r )
^
a
log2T
~ ^ ^ ^ '
(6b)
where ^ ( z ) : = ^ „ m ^ l o g p . Because (6b) is predicted by C R A M E R ' S model, thus
so is P C . D Y S O N predicted an analogous density function for t h e correlation of
7i-tuples of zeros of C( s )> 4 which presumably may b c shown t o be equivalent t o
estimates for primes in short intervals, and thus b e predicted b y C R A M E R ' S model.
C R A M E R ' S model does seem to accurately predict what we already believe t o
be true about primes for more substantial reasons. 5 To b e sure, one can find small
discrepancies 6 b u t t h e probabilistic model usually gives one a strong indication of
the t r u t h . C R A M E R made one conjecture, based on his model, t h a t does not seem
to be attackable by other methods: if p\ — 2 < p2 = 3 < p 3 = 5 < . . . is t h e
sequence of prime numbers then
max ( p n + 1 -pn)
~ log 2 x.
pn<X
This statement (or t h e weaker 0(log 2 x)) is known as "Cramer's
is some computational evidence to support it:
conjecture";
Pn
Pn+1 ~Pn
(Pn+l - Pn)/log* Pn
31397
370261
2010733
20831323
25056082087
2614941710599
19581334192423
72
112
148
210
456
652
778
.6715
.6812
.7025
.7394
.7953
.7975
.8177
there
Record-breaking gaps between primes, u p t o 10 14
4
Based somewhat surprisingly on the fact that (6a) also describes the distribution of eigenvalues of random Hermitian matrices, which arise in physics as models for various naturally
occurring quanta. The "Fourier transforms" of these pair correlation and 7i-tuple correlation
conjectures for the zeros of Ç(s) are now known to hold in a natural restricted range, under the
assumption of RH (see [17] and [21]).
5
Though HARDY AND LITTLEWOOD [14] remarked thus on probabilistic models: "Probability is not a notion of pure mathematics,
but of philosophy or physics".
6
As has been independently pointed out to me by SELBERG, MONTGOMERY, and PINTZ:
for example, PINTZ noted that the mean square of \ip(y) — y\ for y < ce, is ^» x2b~E (with b as in
(1)), in fact x x assuming RH, whereas the probabilistic model predicts X x logx.
Andrew Granville
392
In 1985 M A I E R [16] surprisingly proved t h a t , despite S E L B E R G showing (5) holds
"almost all" t h e time when y = l o g 5 x (assuming R H ) for fixed B > 2, it cannot
hold all of t h e time for such y. This not only radically contradicts what is predicted
by t h e probabilistic model, but also what most researchers in t h e field had believed
to b e t r u e , whether or not they had faith in t h e probabilistic model. Specifically,
M A I E R showed t h e existence of a constant 6B > 0 such t h a t for occasional, but
arbitrarily large, values of x+ and z _ ,
-B-l
7 r ( z + + l o g * z + ) - * " ( * + ) > (l + A^Jlog^ - s x+
and
?v(x-+logBX-)-TT(X-)
(7)
< (1 - < 5 ß ) l o g ß - 1 :
Outline of the Proof. There are ~ e~1x/ log z integers < x, all of whose prime
factors are > z, provided z is not too large. Among these we have all but ir(z)
of t h e primes < x, and so the probability t h a t a randomly chosen such integer is
prime is ~ e 7 log zj log x. Thus, in a specific interval (x,x-\-y\ we should "expect"
~ e7<Ë> log z/ log x primes, where <£> is t h e number of integers in t h e interval t h a t are
free of prime factors < z. Now if we can select our interval so t h a t $ ^ ^~1y/ log z
then our new prediction is not t h e same as t h a t in (5).
If x is divisible by P = YiP<z P ^ n e n
<Ê> = $(y, z) := # { 1 <n<y
: p\n => p > z} ~ Lü(U)
log z
for y = zu with u fixed (see [3]), where CJ(U) = 0 if 0 < u < 1 and satisfies
the differential-delay equation uu(u) = 1 + J"" u(t) dt if u > 1. Obviously
lim u _ > 0 0 cj(ii) = e - 7 . IWANIEC showed t h a t LJ(U) — e ~ 7 oscillates, crossing zero
either once or twice in every interval of length 1. Thus, if we fix u > B, chosen
so t h a t u(u) > e Y or < e _ 7 (as befits t h e case of (7)), select y = log x and
z — y1/™, and "adjust" x so t h a t it is divisible by P, then we expect (5) to be
false.
To convert this heuristic into a proof, M A I E R considered a progression of
intervals of t h e form (rP, rP -\- y], with R < r < 2R for a suitable value of R.
Visualizing this as a "matrix", with each such interval represented by a different
row, we see t h a t t h e primes in t h e matrix are all contained in those columns j for
which (j, P) = l.
RP + l
{R+1)P+1
RP + 2
{R+l)P + 2
AP + 3
(Ä + 1)P + 3
{R + 2)P + 2
{i,j)th entry:
(R + i)P + j
•••
RP + y
(R+l)P + y
•••
(2R-l)P
{R + 2)P+1
(R + 3)P+1
(2R - 1)P + 1
The "Maier m a t r i x " for ir(x + y) — TV(X)
{2R - 1)P + 2
+y
Distribution of Primes
393
Now, for any integer q > 1, the primes are roughly equi-distributed amongst those
arithmetic progressions a (mod q) with (a, q) — 1: in fact up to x we expect that
the number of such primes
*(*;*«) ~ -±±.
(8)
If so, then the number of primes in the j t h column, when (j, P) = 1, is
*(2X;P,j)-K{X;P,j)
~ ^ p j ^
where X = RP.
To get the total number of primes in the matrix we sum over all such j , and then
we can deduce that, on average, a row contains
${Viz)
1
RP
E (ß(P) log P P ~
y
P
1
l oi gr :^i^7( Pm) ul o. gDPD
P
t Jv y
Mi
7 / N Ï/
v
~ e7<j(u)'logPP
primes. MAIER'S result follows provided we can prove a suitable estimate in (8).
D
In general, it is desirable to have an estimate like (8) when x is not too large
compared to q. It has been proved that (8) holds uniformly for
(i) All q < log 0 x and all (a, q) = 1, for any fixed B > 0 (SIEGEL-WALFISZ).
(ii) All q < y ^ / l o g 2 + E x and all (a,q) = 1, assuming GRH. 7 In fact (8) then
holds with error term O (y/xlog2(qx)).
(iii) Almost all q < v / ï / l o g 2 + E x and all (a,q) = 1 (BOMBIERI-VINOGRADOV). 8
(iv) Almost all q < a;1/2+o(i) w j t n ^^^ _ ^ for ß x e c j a ^ Q (BOMBIERIFRIEDLANDER-IWANIEC, FOUVRY).
(v) Almost all q < x/ log 2+E x and almost all (a, g) = 1 (BARBAN-DAVENPORTHALBERSTAM, MONTGOMERY, HOOLEY).
Thus, when GRH is true, we get a good enough estimate in (8) with R = P 2
to complete MAIER'S proof. However MAIER, in the spirit of the BOMBIERIVINOGRADOV theorem, showed how to pick a "good" value for P (see [8, Prop.
2]), so that (8) is off by, at worst, an insignificant factor when R is a large, but
fixed, power of P (thus proving his result unconditionally).
In [15], HILDEBRAND AND MAIER extended the range for y in the proof above,
establishing that there are arbitrarily large values of x for which (4) fails to hold for
some y > exp ((loga:) 1 / 3_£ ); and, assuming GRH, for some y > exp ((loga;) 1 / 2- ^).
Moreover, they show that such intervals (x,x -f y] occur within every interval
[X,2X].9
It is plausible that (5) holds uniformly if logy/ log logo; —> oo as x —> oo; and
that (4) holds uniformly for T = P if y > exp ((loga;) 1 / 2 "^) (at least, we can't
7
The Generalized Riemann Hypothesis (GRH) states that if ß-\-ry is a zero of any Dirichlet
L-function then ß < 1/2.
8
This result is often referred to as "GRH on average".
9
A far better localization than those obtained in any proof of (2b).
394
Andrew Granville
disprove these statements as yet). We conjecture, presumably safely, that (4) and
(5) hold uniformly when y > xE.
One can show that there are more than x/exp ((logx) CB ) integers x± < x
satisfying the unexpected inequalities in (7). Although this may not be enough to
upset (6b), it surely guarantees that the error term there will not be as small as
might have been hoped. Thus, we should not expect the pair correlation conjecture
to hold with as much uniformity as might have been believed. Evidence that this is
so may be seen in the computations represented by [19, Figures 2.3.1,2,3]: the pair
correlation function for the nearest 106 zeros to the 10 12 th zero fits well with (6a)
for a < .8 and then has larger amplitude for .8 < a < 3; also, the pair correlation
function for the nearest 8 x 106 zeros to the 10 20 th zero fits well with (6a) for
a < 3 and then has larger amplitude for 3 < a < 5.
M A I E R ' S work suggests that CRAMER'S model should be adjusted to take into
account divisibility of n by "small" primes. 10 It is plausible to define "small" to
mean those primes up to a fixed power of log n. Then we are led to conjecture that
there are infinitely many primes pn with p n +i — pn > 2e _ 7 log 2 p n , contradicting
11
CRAMER'S conjecture!
If we analyze the distribution of primes in arithmetic progressions using a
suitable analogue of CRAMER'S model, then we would expect (8), and even
ir(x;q,a) = 5 4 + o f f - V
m
log(gz)] ,
(8')
to hold uniformly when (a, q) = 1 in the range
q<Q = x/logBx,
(9)
for any fixed B > 2. However the method of M A I E R is easily adapted to show that
neither (8) nor (8') can hold in at least part of the range (9): for any fixed B > 0
there exists a constant 6B > 0 such that for any modulus q, with "not too many
small prime factors", there exist arithmetic progressions a± (mod q) and values
x
± £ [(ß(q) log Qi 20(c) log q] such that
n(x+;q,a+)
> (l + 5 f l ) ^ ± 2
and 7r(^_;g,a_) < ( l _ Ä f l ) ? f e l . (10)
The proof is much as before, though now using a modified "Maier matrix" :
10
O n e has to be careful about the meaning of "small" here, because if we were to take into
account t h e divisibility of n by all primes up to y/n, then we would conclude t h a t there are ~
e *x/\ogx primes up to x.
11
It is unclear what the "correct conjecture" here should be because, to get at it with this
approach, we would need more precise information on "sifting limits" than is currently available.
395
Distribution of P r i m e s
RP
(R+1)P
RP + q
(R+l)P + q
{R + 2)P
(R+2)P
(R + 3)P
':
(2Ä - 1)P
(2R -l)P
RP + 2q
{R+l)P + 2q
•••
RP + yq
(R+l)P + yq
•••
(2R-l)P
+q
{i,j)th entry:
(R + i)P + jq
+q
+ yq
The Maier matrix for n(yq; q, a)
The BOMBIERI-VINOGRADOV theorem is usually stated in a stronger form than
above: for any given A > 0, there exists a value B = B(A) > 0 such that
V^ max max n(y\q,a)
-
TT(Z/)
*{q)
<
log X
(11)
where Q = y ^ / l o g x. It is possible [6] to take the same values of R and P in
the Maier matrix above for many different values of q, and thus deduce that there
exist arbitrarily large values of a and x for which
£>*«-$}
»z;
(12)
Q<q<2Q
(<7,a)=l
thus refuting the conjecture that for any given A > 0, (11) should hold in the
range (9) for some B = B(A) > 0. In [10] we showed that (11) even fails with
Q = a;/exp ((A - e)(loglogz) 2 /(loglogloga;)) .
We also showed that (8;) cannot hold for every integer a, prime to q, for
(i) Any q > xjexp ((logz) 1 / 5 - 6 ).
(ii) Any q > a;/exp ((logx)1/3"8) that has < 1.5 log log log ç distinct prime
factors < logg. 12
(iii) Almost any q E (y,2y], for any y > xj exp ((loga:) 1 / 2 _ £ ).
Moreover, under the assumption of GRH we can improve the values 1/5 and 1/3
in (i) and (ii), respectively, to 1/3 and 1/2.
It seems plausible that (8) holds uniformly if log(a;/ç)/log logg —> oo as
q —> oo; and that (11) holds uniformly for Q < x/exp ((loga:) 1 / 2+E ). At least we
can't disprove these statements as yet, though we might play it safe and conjecture
only that they hold uniformly for q, Q < xl~E.
Notice that in the proof described above, the values of a increase with x,
leaving open the possibility that (8) might hold uniformly for all (a, q) = 1 in the
12
Which, by the TURAN-KUBILIUS inequality, includes "almost all" integers.
396
Andrew Granville
range (9) if we fix a.13 However, in [8] we observed that when a is fixed one can
suitably modify the Maier matrix, by forcing the elements of the second column
to all be divisible by P :
1
1
1 + (RP - 1)
1+ ((Ä+1)P-1)
1
1+
1 + 2(RP - 1)
l + 2((fl+l)P-l)
•••
l +
l+
y(RP-l)
y((R+l)P-l)
{(R+2)P-1)
(i,j)th entry:
l+j((R +
i)P-l)
1
:
1
1 + ((2Ä - 1)P - 1)
•••
l+y{{2R-l)P-l)
The Maier matrix for ir(yq; q,l)
Notice that the jth column here is now part of an arithmetic progression with
a varying modulus, namely 1 — j (mod jP). With this type of Maier matrix we
can deduce that, for almost all 0 < \a\ < x/log x (including all fixed a / 0 ) ,
there exist q G (xj log x, 2x/log x], coprirne to a, for which (8) does not hold.
However (8) cannot be false too often (like in (12)), because this would contradict
the BARBAN-DAVENPORT-HALBERSTAM theorem. So for which a is (8) frequently
false? It turns out that the answer depends on the number of prime factors of a:
In [9], extending the results of [2], we show that for any given A > 1 there exists
a value B = B(A) > 0 such that, for any Q < x / l o g ß x and any integer a that
satisfies 0 < \a\ < x and has <C log log x distinct prime factors, 14 we have
(x;q,a)
Q<q<2Q
(q,a)=l
-
n(x)\
mi
<
log X
(13)
On the other hand, for every given A,B > 0, there exists Q < z / l o g ^ z and
an integer a that satisfies 0 < \a\ < x and has < (loglog^) 6 / 5 + e distinct prime
factors, for which (13) does not hold (and assuming GRH we may replace 6/5 + e
here by 1 + e).
Finding primes in (x, x + y] is equivalent to finding integers n < y for which
f(n) is prime, where f(t) is the polynomial t-\-x. Similarly, finding primes < x that
belong to the arithmetic progression a (mod q) is equivalent to finding integers
n <y := x/q for which f(n) is prime, where f(t) is the polynomial qt + a. Define
the height h(f) of a given polynomial f(t) — Y2i c%k% to be h(f) := > / ^ c 2 . In the
cases above, in which the degree is always 1, we proved that we do not always get
the asymptotically expected number of prime values f(n) with n <y = log 0 h(f),
13
14
Which would be consistent with the BARBAN-DAVENPORT-HALBERSTAM theorem.
Which includes almost all integers a once the inexplicit constant here is > 1 (by a famous
result of H A R D Y AND RAMANUJAN) .
Distribution of Primes
397
for any fixed B > 0. In [7] we showed that this is true for polynomials of arbitrary
degree d, which is somewhat ironic because it is not known that any polynomial of
degree > 2 takes on infinitely many prime values, nor that the prime values are ever
"well-distributed". NAIR AND PERELLI [18] showed that some of the polynomials
d
FR(IT) — fi + RP attain more than, and others attain less than, the number of
prime values expected in such a range, by considering the following Maier matrix:
*Wi(i)
FR(2)
FR+I(2)
FR(3)
*WI(3)
•••
•••
F„(y)
FR+1(y)
•••
F2R-i{y)
FR+2(1)
FR+3(1)
FR+2{2)
F2R-I(1)
(i,j)th entry :
FR+i(j)
The Maier matrix for 71^(7/)
F2R-I(2)
Notice that the j t h column here is part of the arithmetic progression j d (mod P).
Using Maier matrices it is possible to prove "bad equi-distribution" results
for primes in other interesting sequences, such as the values of binary quadratic
forms, and of prime pairs. For example, if we fix B > 0 then, once x is sufficiently
large, there exists a positive integer k < Ioga: such that there are at least 1 + 6B
times as many prime pairs p,p + 2k, with x < p < x + log 0 a:, as we would
expect from assuming that the estimate in Hypothesis H holds uniformly for n <^C
logBh((t + x)(t + (x + 2k))) .
We have now seen that the asymptotic formula in Hypothesis H fails when
x is an arbitrary fixed power of log h(F)(:= ^ ^ log h(fi)), for many different nontrivial examples F. Presumably the asymptotic formula does hold uniformly as
log xj log log h(F) —> 00. However, to be safe, we only make the following prediction:
CONJECTURE. Fix e > 0 and positive integer k. The asymptotic formula in
Hypothesis H holds uniformly for x > h(F)E as h(F) —> 00.
Our work here shows that the "random-like" behavior exhibited by primes in many
situations does not carry over to all situations. It remains to discover a model that
will always accurately predict how primes are distributed, as it seems that minor
modifications of CRAMER'S model will not do. We thus agree that:
"It is evident that the primes are randomly distributed hut, unfortunately,
we don't know what random7 means." — R.C. VÀUGHAN (February 1990).
398
Andrew Granville
Final r e m a r k s
Armed with M A I E R ' S ideas it seems possible to construct incorrect conclusions
from, more or less, any variant of C R A M E R ' S model. This flawed model may still
be used t o make conjectures about t h e distribution of primes, b u t one should b e
very cautious of such predictions!
T h e r e are no more t h a n 0(x2/log3B
x) arithmetic progressions a (mod q),
with 1 < a < q < z / l o g ^ r r a n d (a, q) = 1, for which (8) fails, by t h e BARBAND A V E N P O R T - H A L B E R S T A M theorem. However, our methods here may be used t o
show t h a t (8) does fail for more t h a n a; 2 /exp ((logz) e ) such arithmetic progressions.
Maier's matrix has been used in other problems too: K O N Y A G I N recently used
it t o find unusually large gaps between consecutive primes. M A I E R used it t o find
long sequences of consecutive primes, in which there are longer than average gaps
between each pair. SHIU has used it t o show t h a t every arithmetic progression a
(mod q) with (a, q) = 1 contains arbitrarily long strings of consecutive primes.
BALOG [1] has recently shown t h a t t h e prime fc-tuplets conjecture holds "on
average" (in t h e sense of t h e B O M B I E R I - V I N O G R A D O V Theorem). 1 5
As we saw in t h e table above, Li(x) > TT(X) for all x < 10 1 8 . However, (2b)
implies t h a t this inequality does not persist for ever; indeed, it is reversed for
some x < IO 3 7 0 ( T E R I E L E ) . Recently, however, R U B I N S T E I N AND S A R N A K [20] 16
showed t h a t it does hold more often t h a n not, in t h e sense t h a t there exists a
constant 6 « 1/(4 • 10 6 ) such t h a t t h e (logarithmically scaled) proportion of x
for which 7v(x) >Li(a;) exists a n d equals 6. Such biases m a y also be observed in
arithmetic progressions, in t h a t there are "more" primes belonging t o arithmetic
progressions t h a t are quadratic nonresidues t h a n those t h a t are quadratic residues.
In particular they prove t h a t 7r(:c;4,3) > ir(x;A, 1) for a (logarithmically scaled)
proportion 0.9959 . . . of t h e time.
Delicate questions concerning t h e distribution of prime numbers still seem
to be very mysterious. It may b e t h a t by taking into account divisibility by small
primes we can obtain a very accurate picture; or it m a y b e t h a t there are other
phenomena, disturbing t h e equi-distribution of primes, t h a t await discovery .. .
"Mathematicians
have tried in vain to discover some order in the sequence
of prime numbers but we have every reason to beheve that there are some
mysteries which the human mind will never penetrate. "
— L. E U L E R (1770).
Acknowledgments: I'd like t o t h a n k R e d Alford, Nigel Boston, John Friedlander,
Dan Goldston, K e n Ono, Carl Pomerance, and Trevor Wooley for their helpful
comments on earlier drafts of this paper.
15
Which was improved to the exact analogue of (11) by MiKAWA for k = 2, and by KAWADA
for all k > 1.
16
A11 of their results are proved assuming appropriate conjectures such as RH, GRH, and
that the zeros of t h e relevant L-functions are linearly independent over (Q).
Distribution of Primes
399
References
[I]
[2]
[3]
[4]
[5]
[6]
[7]
[8]
[9]
[10]
[II]
[12]
[13]
[14]
[15]
[16]
[17]
[18]
[19]
[20]
[21]
[22]
[23]
A. Balog, The prime k-tuplets conjecture on average, Analytic Number Theory (B.C.
Berndt, H.G. Diamond, H. Halberstam, A. Hildebrand, eds.), Basel and Birkhäuser,
Boston, 1990, pp. 165-204.
E. Bombieri, J. B. Friedlander, and H. Iwaniec, Primes in arithmetic progressions
to large moduli III, J. Amer. Math. Soc. 2 (1989), 215-224.
A. A. Buchstab, On an asymptotic estimate of the number of numbers of an arithmetic progression which are not divisible by relatively small prime numbers, Mat. Sb.
28/70 (1951), 165-184 (Russian).
H. Cramer, On the order of magnitude of the difference between consecutive prime
numbers, Acta Arith. 2 (1936), 23-46.
H. Davenport, Multiplicative Number Theory (2nd ed.), Springer-Verlag, Berlin and
New York, 1980.
J. B. Friedlander and A. Granville, Limitations to the equi-distribution of primes I,
Ann. of Math. (2) 129 (1989), 363-382.
, Limitations to the equi-distribution of primes IV, Proc. Royal Soc. London
Ser. A 435 (1991), 197-204.
, Limitations to the equi-distribution of primes III, Compositio Math. 81
(1992), 19-32.
, Relevance of the residue class to the abundance of primes, Proc. Amalfi
Conf. on Analytic Number Theory (E. Bombieri, A. Perelli, S. Salerno, U. Zannier,
eds.), Salerno, Italy, 1993, pp. 95-104.
J. B. Friedlander, A. Granville, A. Hildebrand, and H. Maier, Oscillation theorems
for primes in arithmetic progressions and for sifting functions, J. Amer. Math. Soc.
4 (1991), 25-86.
P. X. Gallagher, On the distribution of primes in short intervals, Mathematika 23
(1976), 4-9.
D. A. Goldston, On the pair correlation conjecture for zeros of the Riemann zetafunction, J. Reine Angew. Math. 385 (1988), 24-40.
A. Granville, Harald Cramer and the distribution of prime numbers, Scand. Actuarial
J. 2 (1995), to appear.
G. H. Hardy and J. E. Littlewood, Some problems on partitio numerorum III On
the expression of a number as a sum of primes, Acta Math. 44 (1923), 1-70.
A. Hildebrand and H. Maier, IiTegularities in the distribution of primes in short
intervals, J. Reine Angew. Math. 397 (1989), 162-193.
H. Maier, Primes in short intervals, Michigan Math. J. 32 (1985), 221-225.
H. L. Montgomery, The pair correlation of zeros of the zeta function, Proc. Sympos.
Pure Math., vol. 24, Amer. Math. Soc, Providence, RI, 1973, pp. 181-193.
M. Nair and A. Perelli, On the prime ideal theorem and irregularities in the distribution of primes, Duke Math. J. 77 (1995), 1-20.
A . M . Odlyzko, The 10 th zero of the Riemann zeta function and 70 million of its
neighbors, to appear.
M. Rubinstein and P. Sarnak, Chebyshev's bias, to appear.
Z. Rudnick and P. Sarnak, The n-level correlations of the zeros of the zetafunction,
C.R. Acad. Sci. Paris, to appear.
A. Schinzel and W. Sierpirïski, Sur certaines hypothèses concernant les nombres
premiers, Acta Arith. 4 (1958), 185-208.
A. Selberg, On the noimal density of primes in small intervals and the difference
between consecutive primes, Arch. Math. Naturvid. J. 47 (1943), 87-105.
Download