Introduction to Analytic Number Theory Math 531 Lecture Notes, Fall 2005 A.J. Hildebrand Department of Mathematics University of Illinois http://www.math.uiuc.edu/~hildebr/ant Version 2013.01.07 18 Math 531 Lecture Notes, Fall 2005 Version 2013.01.07 Chapter 1 Arithmetic functions I: Elementary theory 1.1 Introduction and basic examples A simple, but very useful concept in number theory is that of an arithmetic function. An arithmetic function is any real- or complex-valued function defined on the set N of positive integers. (In other words, an arithmetic function is just a sequence of real or complex numbers, though this point of view is not particularly useful.) Examples (1) Constant function: The function defined by f (n) = c for all n, where c is a constant, is denoted by c; in particular, 1 denotes the function that is equal to 1 for all n. (2) Unit function: e(n), defined by e(1) = 1 and e(n) = 0 for n ≥ 2. (3) Identity function: id(n); defined by id(n) = n for all n. (4) Logarithm: log n, the (natural) logarithm, restricted to N and regarded as an arithmetic function. (5) Moebius function: µ(n), defined by µ(1) = 1, µ(n) = 0 if n is not k squarefree (i.e., divisible by the square of a prime), and Qkµ(n) = (−1) if n is composed of k distinct prime factors (i.e., n = i pi ). 19 20 CHAPTER 1. ARITHMETIC FUNCTIONS I (6) Characteristic function of squarefree integers: µ2 (n) or |µ(n)|. From the definition of the Moebius function, it follows that the absolute value (or, equivalently, the square) of µ is the characteristic function of the squarefree integers. (7) Liouville function: λ(n), defined by λ(1) = 1 and λ(n) = (−1)k if n Q is composed of k notQnecessarily distinct prime factors (i.e., if n = ki=1 pαi i then λ(n) = ki=1 (−1)αi ). (8) Euler phi (totient) function: φ(n), the numberP of positive integers m ≤ n that are relatively prime to n; i.e., φ(n) = nm=1,(m,n)=1 1. (9) Divisor function: d(n), the number of positive divisors P of n (including the trivial divisors d = 1 and d = n); i.e., d(n) = d|n 1. (Another common notation for this function is τ (n).) (10) Sum-of-divisorsPfunction: σ(n), the sum over all positive divisors of n; i.e., σ(n) = d|n d. (11) Generalized functions: σα (n), defined by P sum-of-divisors α σα (n) = d|n d . Here α can be any real or complex parameter. This function generalizes the divisor function (α = 0) and the sum-ofdivisors function (α = 1). (12) Number of distinct prime P by ω(1) = 0 and Q factors: ω(n), defined ω(n) = k if n ≥ 2 and n = ki=1 pαi i ; i.e., ω(n) = p|n 1. (13) Total number of prime divisors: Ω(n), defined in the same way as ω(n), except that prime Thus, Qkmultiplicity. Pkdivisors are counted with αi p ; i.e., Ω(n) = α if n ≥ 2 and n = Ω(1) = 0 and Ω(n) = i=1 i i=1 i P pm |n 1. For squarefree integers n, the functions ω(n) and Ω(n) are equal and are related to the Moebius function by µ(n) = (−1)ω(n) . For all integers n, λ(n) = (−1)Ω(n) . (14) Ramanujan sums: Given a positive integer q,P the Ramanujan sum cq is the arithmetic function defined by cq (n) = qa=1,(a,q)=1 e2πian/q . (15) Von Mangoldt function: Λ(n), defined by Λ(n) = 0 if n is not a prime power, and Λ(pm ) = log p for any prime power pm . Math 531 Lecture Notes, Fall 2005 Version 2013.01.07 INTRODUCTION TO ANALYTIC NUMBER THEORY 1.2 21 Additive and multiplicative functions Many important arithmetic functions are multiplicative or additive functions, in the sense of the following definition. Definition. An arithmetic function f is called multiplicative if f 6≡ 0 and (1.1) f (n1 n2 ) = f (n1 )f (n2 ) whenever (n1 , n2 ) = 1; f is called additive if it satisfies (1.2) f (n1 n2 ) = f (n1 ) + f (n2 ) whenever (n1 , n2 ) = 1. If this condition holds without the restriction (n1 , n2 ) = 1, then f is called completely (or totally) multiplicative resp. completely (or totally) additive. The condition (1.1) can be used to prove the multiplicativity of a given function. (There are also other, indirect, methods for establishing multiplicativity, which we will discuss in the following sections.) However, in order to exploit the multiplicativity of a function known to be multiplicative, the criterion of the following theorem is usually more useful. Theorem 1.1 (Characterization of multiplicative functions). An arithmetic function f is multiplicative if and only if f (1) = 1 and, for n ≥ 2, Y (1.3) f (n) = f (pm ). pm ||n The function f is completely multiplicative if and only if the above condition is satisfied and, in addition, f (pm ) = f (p)m for all prime powers pm . Remarks. (i) The result shows that a multiplicative function is uniquely determined by its values on prime powers, and a completely multiplicative function is uniquely determined by its values on primes. (ii) With the convention that an empty product is to be interpreted as 1, the condition f (1) = 1 can be regarded as the special case n = 1 of (1.3). With this interpretation, f is multiplicative if and only if f satisfies (1.3) for all n ∈ N. Proof. Suppose first that f satisfies f (1) = 1 and (1.3) for n ≥ 2. If n1 and n2 are positive integers with (n1 , n2 ) = 1, then the prime factorizations of n1 and n2 involve disjoint sets of prime powers, so expressing each of f (n1 ), Math 531 Lecture Notes, Fall 2005 Version 2013.01.07 22 CHAPTER 1. ARITHMETIC FUNCTIONS I f (n2 ), and f (n1 n2 ) by (1.3) we see that f satisfies (1.1). Moreover, since f (1) = 1, f cannot be identically 0. Hence f is multiplicative. Conversely, suppose that f is multiplicative. Then f is not identically 0, so there exists n ∈ N such that f (n) 6= 0. Applying (1.3) with (n1 , n2 ) = (n, 1), we obtain f (n) = f (1 · n) = f (1)f (n), which yields f (1) = 1, upon dividing by f (n). Q Next, let n ≥ 2 be given with prime factorization n = ki=1 pαi i . “Shaving off” prime powers one at a time, and applying (1.3) inductively, we have αk−1 f pαk k f (n) = f pα1 1 · · · pαk k = f pα1 1 · · · pk−1 = · · · = f (pα1 1 ) · · · f pαk k , so (1.3) holds. If f is completely multiplicative, then for any prime power pm we have f (pm ) = f (pm−1 · p) = f (pm−1 )f (p) = · · · = f (p)m . m ) = f (p)m for all prime Conversely, if f is multiplicative and satisfies f (p Q powers pm , then (1.3) can be written as f (n) = ri=1 f (pi ), where now n = Q r i=1 pi is the factorization of n into single (not necessarily distinct) prime factors pi . Since, for any two positive integers n1 and n2 , the product of the corresponding factorizations is the factorization of the product, it follows that the multiplicativity property f (n1 n2 ) = f (n1 )f (n2 ) holds for any pair (n1 , n2 ) of positive integers. Hence f is completely multiplicative. Theorem 1.2 (Products and quotients of multiplicative functions). Assume f and g are multiplicative function. Then: (i) The (pointwise) product f g defined by (f g)(n) = f (n)g(n) is multiplicative. (ii) If g is non-zero, then the quotient f /g (again defined pointwise) is multiplicative. Proof. The result is immediate from the definition of multiplicativity. Analogous properties hold for P additivemfunctions: an additive function satisfies f (1) = 0 and f (n) = pm ||m f (p ), and the pointwise sums and differences of additive functions are additive. Tables 1.1 and 1.2 below list the most important multiplicative and additive arithmetic functions, along with their values at prime powers, and basic properties. (Properties that are not obvious from the definition will be established in the following sections.) Math 531 Lecture Notes, Fall 2005 Version 2013.01.07 INTRODUCTION TO ANALYTIC NUMBER THEORY 23 Function value at n value at pm properties e(n) 1 if n = 1, 0 else 0 unit element w.r.t. Dirichlet product, e ∗ f =f ∗ e=f id(n) (identity function) n pm s(n) (char. fct. of squares) 1 if n = m2 with m ∈ N, 0 else 1 if m is even, 0 if m is odd µ2 (n) (char. fct. of squarefree integers) 1 if n is squarefree, 0 else 1 if m = 1, 0 if m > 1 µ(n) (Moebius function) 1 if n = 1, k if (−1)Q n = ki=1 pi (pi distinct), 0 otherwise −1 if m = 1, 0 if m > 1 P λ(n) (Liouville function) 1 if nP= 1, k (−1)Qi=1 αi if n = ki=1 pαi i (−1)m P φ(n) (Euler phi function) #{1 ≤ m ≤ n : (m, n) = 1} pm (1 − 1/p) d(n) (= τ (n)) (divisor function) P m+1 d=1∗1 σ(n) (sum of divisor function) P pm+1 − 1 p−1 σ = 1 ∗ id d|n 1 d|n d d|n µ(d) = 0 if n≥2 µ∗1=e d|n λ(d) = s(n) λ∗1=s P d|n φ(d) =n φ ∗ 1 = id Table 1.1: Some important multiplicative functions Math 531 Lecture Notes, Fall 2005 Version 2013.01.07 24 CHAPTER 1. ARITHMETIC FUNCTIONS I Function value at n value at pm properties ω(n) (number of distinct prime factors) 0 if n = 1, k if Q n = ki=1 pαi i 1 additive Ω(n) (total number of prime factors) 0 if n = 1, P k α if i=1 Qki αi n = i=1 pi m completely additive log n (logarithm) log n log pm completely additive Λ(n) (von Mangoldt function) log p if n = pm , 0 if n is not a prime power log p neither additive nor multiplicative log = Λ ∗ 1 Table 1.2: Some other important arithmetic functions 1.3 The Moebius function The fundamental property of the Moebius function is given in the following theorem. P Theorem P 1.3 (Moebius identity). For all n ∈ N, d|n µ(d) = e(n); i.e., the sum d|n µ(d) is zero unless n = 1, in which case it is 1. Proof. There are many ways to prove this important identity; we give here a combinatorial proof that does not require any special tricks or techniques. We will later give alternate proofs, which are simpler and more elegant, but which depend on some results in the theory of arithmetic functions. P If n = 1, then the sum d|n µ(d) reduces to the single term µ(1) = 1, so the formula holds in this case. Next, suppose n ≥ 2 and let Qkasserted αi n = i=1 pi be the canonical prime factorization of n. Since µ(d) = 0 if d is not Q squarefree, the sum over d can be restricted to divisors of the form d = i∈I pi , where I ⊂ {1, 2, . . . , k}, and each such divisor contributes a Math 531 Lecture Notes, Fall 2005 Version 2013.01.07 INTRODUCTION TO ANALYTIC NUMBER THEORY term µ(d) = (−1)|I| . Hence, X X µ(d) = 25 (−1)|I| . I⊂{1,...,k} d|n Now note that, for any r ∈ {0, 1, . . . , k}, there are kr subsets I with |I| = r, and for each such subset the summand (−1)|I| is equal to (−1)r . Hence the above sum reduces to k X r=0 k (−1) = (1 − 1)k = 0, r r by the binomial theorem. (Note that k ≥ 1, since we assumed n ≥ 2.) Hence P we have d|n µ(d) = 0 for n ≥ 2, as claimed. Motivation for the Moebius function. The identity given in this theorem is the main reason for the peculiar definition of the Moebius function, which may seem rather artificial. In particular, the definition of µ(n) as 0 when n is not squarefree appears to be unmotivated. The Liouville function λ(n), which is identical to the Moebius function on squarefree integers, but whose definition extends to non-squarefree integers in a natural way, appears to be a much more natural function to work with. However, this function does not satisfy the identity of the theorem, and it is this identity that underlies most of the applications of the Moebius function. Application: Evaluation of sums involving a coprimality condition. P The identity of the theorem states that d|n µ(d) is the characteristic function of the integer n = 1. This fact can be used to extract specific terms from a series. A typical application is the evaluation of sums over integers n that are relatively prime to a given integer k. By the theorem, the characP teristic function of integers n with (n, k) = 1 is given by d|(n,k) µ(d). Since the condition d|(n, k) is equivalent to Pthe simultaneous conditions d|n and d|k, one can formally rewrite a sum n,(n,k)=1 f (n) as follows: X f (n) = n (n,k)=1 X f (n)e((n, k)) = n = X d|k X f (n) n µ(d) X n d|n Math 531 Lecture Notes, Fall 2005 f (n) = X d|k X µ(n) d|(n,k) µ(d) X f (dm). m Version 2013.01.07 26 CHAPTER 1. ARITHMETIC FUNCTIONS I The latter sum can usually be evaluated, and doing so yields a formula for the original sum. (Of course, one has to make sure that the series involved converge.) The following examples illustrate the general method. Evaluation of the Euler phi P function. By definition, the Euler phi function is given by φ(n) = m≤n,(m,n)=1 1. Eliminating the coprimality condition (m, n) = 1, as indicated above, yields the identity φ(n) = X X m≤n d|(m,n) µ(d) = X d|n µ(d) X m≤n,d|m 1= X µ(d)(n/d). d|n (For an alternative proof of this identity see Section 1.7.) Pq Ramanujan sums. The functions cq (n) = a=1,(a,q)=1 exp(2πian/q), where q is a positive integer, are called Ramanujan sums. By eliminating the P condition (a, q) = 1 using the above method, one can show that cq (n) = d|(q,n) dµ(q/d). When n = 1, this formula reduces to cq (1) = µ(q), Pq and we obtain the remarkable identity a=1,(a,q)=1 exp(2πia/q) = µ(q), which shows that the sum over all “primitive” k-th roots of unity is equal to µ(q). A weighted average of the Moebius function. While the estimation P of the partial sums of the Moebius function n≤x µ(n) is a very deep (and largely unsolved) P problem, remarkably enough a weighted version of this sum, namely n≤x µ(n)/n is easy to bound. In fact, we will prove: Theorem 1.4. For any real x ≥ 1 we have X µ(n) (1.4) ≤ 1. n≤x n Proof. Note first that, without loss of generality, one can assume that P x = N, where N is a positive integer. We then evaluate the sum S(N ) = n≤N e(n) in two different ways. On the one hand, by the Pdefinition of e(n), we have S(N ) = 1; on the other hand, writing e(n) = d|n µ(d) and interchanging P summations, we obtain S(N ) = d≤N µ(d)[N/d], where [t] denotes the greatest integer ≤ t. Now, for d ≤ N − 1, [N/d] differs from N/d by an amount that is bounded by 1 in absolute value, while for d = N , the Math 531 Lecture Notes, Fall 2005 Version 2013.01.07 INTRODUCTION TO ANALYTIC NUMBER THEORY 27 quantities [N/d] and N/d are equal. Replacing [N/d] by N/d and bounding the resulting error, we therefore obtain X X µ(d) X S(N ) − N ≤ |µ(d)| · |[N/d] − (N/d)| ≤ |µ(d)| ≤ N − 1. d d≤N d≤N d≤N −1 Hence, X µ(d) ≤ (N − 1) + |S(N )| = (N − 1) + 1 = N, N d≤N d which proves (1.4). The Moebius function and the Prime Number Theorem. Part of the interest in studying the Moebius function stems from the fact that the behavior of this function is intimately related to the Prime Number Theorem (PNT), which says that the number π(x) of primes below x is asymptotically equal to x/ log x as x → ∞ and which is one of the main results of Analytic Number Theory. We will show later that the PNT is “equivalent” to the P fact that µ(n) has average value (“mean value”) zero, i.e., limx→∞ (1/x) n≤x µ(n) = 0. The latter statement has the following probabilistic interpretation: If a squarefree integer is chosen at random, then it is equally likely to have an even and an odd number of prime factors. Mertens’ conjecture. A famous conjecture, attributed to Mertens (though it apparently was first statedPby Stieltjes who mistakenly believed √ that he had proved it) asserts that | n≤x µ(n)| ≤ x for all x ≥ 1. This conjecture had remained open for more than a century, but was disproved (though only barely!) in 1985 by A. Odlyzko and H. te Riele, who used extensive computer calculations, along with some theoretical arguments, to √ show that the above sum exceeds 1.06 x for infinitely many x. Whether the constant 1.06 can be replaced by any constant c is still an open problem. Heuristic arguments, based on the assumption that the values ±1 of the Moebius function on squarefree integers are distributed like a random sequence of numbers ±1, strongly suggest that this is the case, but a proof has remained elusive so far. Math 531 Lecture Notes, Fall 2005 Version 2013.01.07 28 1.4 CHAPTER 1. ARITHMETIC FUNCTIONS I The Euler phi (totient) function Theorem 1.5. The Euler phi function satisfies: (i) P = n for all n ∈ N. d|n φ(d) (ii) φ(n) = P d|n µ(d)(n/d). (iii) φ is multiplicative. Q Q (iv) φ(n) = pm ||n (pm − pm−1 ) = n p|n (1 − 1/p) for all n ∈ N. Proof. (i) Split the set A = {1, 2, . . . , n} into the pairwise disjoint subsets Ad = {m ∈ A : (m, n) = d}, d|n. Writing an element m ∈ Ad as m = dm0 , we see that Ad = {dm0 :P1 ≤ m0 ≤ n/d, (m0 , n/d) =P1}, and so |Ad | = φ(n/d). Since n = |A| = d|n |Ad |, it follows that n = d|n φ(n/d). Writing d0 = n/d and noting that, as d runs over all positive divisors of n, so does d0 , we obtain the desired identity. (ii) This identity was proved in the last section. Alternatively, as we shall show in Section 1.7, one can derive it from the identity (i). (iii) We defer the proof of the multiplicativity until Section 1.7. (iv) This follows immediately from the multiplicativity of φ and the fact that, at n = pm , φ(pm ) = pm − pm−1 . To see the latter formula, note that an integer is relatively prime to pm if and only if it is not a multiple of p and that of the pm positive integers ≤ pm exactly pm−1 are multiples of p. Formula (iv) of the theorem can be used to quickly compute values of φ. For example, the first 7 values are φ(1) = 1, φ(2) = (2 − 1) = 1, φ(3) = (3 − 1) = 2, φ(4) = (22 − 2) = 2, φ(5) = (5 − 1) = 4, φ(6) = φ(2 · 3) = (2 − 1)(3 − 1) = 2, φ(7) = (7 − 1) = 6. Carmichael’s conjecture. It is easy to see that not every positive integer occurs as a value of φ; for example, φ(n) is never equal to an odd prime. In other words, the range of φ is a proper subset of N. At the beginning of this century, R.D. Carmichael, a professor at the University of Illinois and author of a textbook on number theory, observed that there seems to be no integer that appears exactly once as a value of φ; in other words, for each m ∈ N, the pre-image φ−1 (m) = {n ∈ N : φ(n) = m} has either cardinality 0 or has cardinality ≥ 2. In fact, Carmichael claimed to have a proof of this result and included the “proof” as an exercise in his number theory textbook, but his “proof” was incorrect, and he later retracted the claim; he changed the Math 531 Lecture Notes, Fall 2005 Version 2013.01.07 INTRODUCTION TO ANALYTIC NUMBER THEORY 29 wording of the exercise, calling the result an “empirical theorem.” This “empirical theorem” is still open to this date, and has become a famous conjecture, known as “Carmichael’s conjecture,” The numerical evidence for the conjecture is overwhelming: the conjecture (that the cardinality of 10 φ−1 (m) is never 1) is true for values m up to 1010 . While the conjecture is still open, Kevin Ford, a former UIUC graduate student and now a faculty member here, proved a number of related conjectures. In particular, he showed that for every integer k ≥ 2 there exist infinitely many m such that φ−1 (m) has cardinality k. This was known as “Sierpinski’s conjecture”, and it complements Carmichael’s conjecture which asserts that in the case k = 1, the only case not covered by Sierpinski’s conjecture, the assertion of Sierpinski’s conjecture is not true. 1.5 The von Mangoldt function The definition of the von Mangoldt function may seem strange at first glance. One motivation for this peculiar definition lies in the following identity. Theorem 1.6. We have X Λ(d) = log n (n ∈ N). d|n Proof. For n = 1, the identity holds since Λ(1) = 0 = log 1. For n ≥ 2 we have, by the definition of Λ, X X Λ(d) = log p = log n. d|n pm |n (For the last step note that, for each prime power pα ||n, each of the terms p1 , p2 , . . . , pα contributes a term log p to the sum, so the total contribution arising from powers of p is α(log p)P= log pα . Adding Q up those contributions α α over all prime powers p ||n, gives pα ||n log p = log pα ||n pα = log n.) The main motivation for introducing the von Mangoldt function is that P the partial sums n≤x Λ(n) represent a weighted count of the prime powers pm ≤ x, with the weights being log p, the “correct” weights to offset the density of primes. It is not hard to show that higher prime powers (i.e., those with m ≥ 2) contribute little to the above sum, so the sum is essentially a weighted sum over prime numbers. In fact, studying the asymptotic behavior of the above sum is essentially equivalent to studying the behavior Math 531 Lecture Notes, Fall 2005 Version 2013.01.07 30 CHAPTER 1. ARITHMETIC FUNCTIONS I of the prime counting function π(x); P for example, the PNT is equivalent to the assertion that limx→∞ (1/x) n≤x Λ(n) = 1. In fact, most proofs of the PNT proceed by first showing the latter relation, and then deducing from this the original form of the PNT. The reason for doing this is that, because of the identity in the above theorem (and some similar relations), working with Λ(n) is technically easier than working directly with the characteristic function of primes. 1.6 The divisor and sum-of-divisors functions Theorem 1.7. The divisor function d(n) and the sum-of-divisors function σ(n) are multiplicative. Their values at prime powers are given by d(pm ) = m + 1, σ(pm ) = pm+1 − 1 . p−1 Proof. To prove the multiplicativity of d(n), let n1 and n2 be positive integers with (n1 , n2 ) = 1. Note that if d1 |n1 and d2 |n2 , then d1 d2 |n1 n2 . Conversely, by the coprimality condition and the fundamental theorem of arithmetic, any divisor d of n1 n2 factors uniquely into a product d = d1 d2 , where d1 |n1 and d2 |n2 . Thus, there is a 1 − 1 correspondence between the set of divisors of n1 n2 and the set of pairs (d1 , d2 ) with d1 |n1 and d2 |n2 . Since there are d(n1 )d(n2 ) such pairs and d(n1 n2 ) divisors of n1 n2 , we obtain d(n1 n2 ) = d(n1 )d(n2 ), as required. The multiplicativity of σ(n) can be proved in the same way. (Alternate proofs of the multiplicativity of d and σ will be given in the following section.) The given values for d(pm ) and σ(pm ) are obtained on noting that the divisors of pm are exactly the numbers p0 , p1 , . . . , pm . Since there are m + 1 such divisors, we have d(pm ) = m + 1, and applying the geometric series formula to the sum of these divisors gives the asserted formula for σ(pm ). Perfect numbers. The sum-of-divisors function is important because of its connection to so-called perfect numbers, that is, positive integers n that are equal to the sum of all their proper divisors, i.e., all positive divisors except n itself. Since the divisor d = n is counted in the definition of σ(n), the sum of proper divisors of n is σ(n) − n. Thus, an integer n is perfect if and only if σ(n) = 2n. For example, 6 is perfect, since 6 = 1 + 2 + 3. It is an unsolved problem whether there exist infinitely many perfect numbers. However, a result of Euler states: Math 531 Lecture Notes, Fall 2005 Version 2013.01.07 INTRODUCTION TO ANALYTIC NUMBER THEORY 31 Theorem (Euler). An even integer n is perfect if and only if n is of the form n = 2p−1 (2p − 1) where p is a prime and 2p − 1 is also prime. This result is not hard to prove, using the multiplicity of σ(n). The problem with this characterization is that it is not known whether there exist infinitely many primes p such that 2p − 1 is also prime. (Primes of this form are called Mersenne primes, and whether there exist infinitely many of these is another famous open problem.) There is no analogous characterization of odd perfect numbers; in fact, no single odd perfect number has been found, and it is an open problem whether odd perfect numbers exist. 1.7 The Dirichlet product of arithmetic functions The two obvious operations on the set of arithmetic functions are pointwise addition and multiplication. The constant functions f = 0 and f = 1 are neutral elements with respect to these operations, and the additive and multiplicative inverses of a function f are given by −f and 1/f , respectively. While these operations are sometimes useful, by far the most important operation among arithmetic functions is the so-called Dirichlet product, an operation that, at first glance, appears mysterious and unmotivated, but which has proved to be an extremely useful tool in the theory of arithmetic functions. Definition. Given two arithmetic functions f and g, the Dirichlet product (or Dirichlet convolution) of f and g, denoted by f ∗ g, is the arithmetic function defined by X (f ∗ g)(n) = f (d)g(n/d). d|n In particular, we have (f ∗ g)(1) = f (1)g(1), = f (1)g(p) + Pm(f ∗ g)(p) k )g(pm−k ) for any f (p f (p)g(1) for any prime p, and (f ∗ g)(pm ) = k=0 prime power pm . It is sometimes useful to write the Dirichlet product in the symmetric form X (f ∗ g)(n) = f (a)g(b), ab=n where the summation runs over all pairs (a, b) of positive integers whose product equals n. The equivalence of the two definitions follows immediately Math 531 Lecture Notes, Fall 2005 Version 2013.01.07 32 CHAPTER 1. ARITHMETIC FUNCTIONS I from the fact that the pairs (d, n/d), where d runs over all divisors of n, are exactly the pairs (a, b) of the above form. One motivation for introducing this product is the fact that the definitions of many common arithmetic functions have the form of a Dirichlet product, and that many identities among arithmetic functions can be written concisely as identities involving Dirichlet products. Here are some examples: Examples (1) d(n) = P d|n 1, so d = 1 ∗ 1. P (2) σ(n) = d|n d, so σ = id ∗1. P (3) d|n µ(d) = e(n) (see Theorem 1.3), so µ ∗ 1 = e. P (4) d|n µ(d)(n/d) = φ(n) (one of the applications of the Moebius identity, Theorem 1.3), so µ ∗ id = φ. P (5) d|n φ(d) = n (Theorem 1.5), so φ ∗ 1 = id. P (6) d|n Λ(d) = log n (Theorem 1.6), so Λ ∗ 1 = log. A second motivation for defining the Dirichlet product in the above manner is that this product has nice algebraic properties. Theorem 1.8 (Properties of the Dirichlet product). (i) The function e acts as a unit element for ∗, i.e., f ∗ e = e ∗ f = f for all arithmetic functions f . (ii) The Dirichlet product is commutative, i.e., f ∗ g = g ∗ f for all f and g. (iii) The Dirichlet product is associative, i.e., (f ∗ g) ∗ h = f ∗ (g ∗ h) for all f, g, h. (iv) If f (1) 6= 0, then f has a unique Dirichlet inverse, i.e., there is a unique function g such that f ∗ g = e. Proof. (i) follows immediately from the definition of the Dirichlet product. For the proof of (ii) (commutativity) and (iii) (associativity) it is useful to work Pwith the symmetric version of the Dirichlet product, i.e., (f ∗ g)(n) = ab=n f (a)g(b). The commutativity of ∗ is immediate from Math 531 Lecture Notes, Fall 2005 Version 2013.01.07 INTRODUCTION TO ANALYTIC NUMBER THEORY 33 this representation. To obtain the associativity, we apply this representation twice to get X X X ((f ∗ g) ∗ h)(n) = (f ∗ g)(d)h(c) = f (a)g(b)h(c) dc=n X = dc=n ab=d f (a)g(b)h(c), abc=n where the last sum runs over all triples (a, b, c) of positive integers whose product is equal to n. Replacing (f, g, h) by (g, h, f ) in this formula yields the same final (triple) sum, and we conclude that (f ∗ g) ∗ h = (g ∗ h) ∗ f = f ∗ (g ∗ h), proving that ∗ is associative. It remains to prove (iv). Let f be an arithmetic function with f (1) 6= 0. By definition, a function g is a Dirichlet inverse of f if (f ∗ g)(1) = e(1) = 1 and (f ∗ g)(n) = e(n) = 0 for all n ≥ 2. Writing out the Dirichlet product (f ∗ g)(n), we see that this is equivalent to the infinite system of equations (A1 ) (An ) f (1)g(1) = 1, X g(d)f (n/d) = 0 (n ≥ 2). d|n We need to show that the system (An )∞ n=1 has a unique solution g. We do this by inductively constructing the values g(n) and showing that these values are uniquely determined. For n = 1, equation (A1 ) gives g(1) = 1/f (1), which is well defined since f (1) 6= 0. Hence, g(1) is uniquely defined and (A1 ) holds. Let now n ≥ 2, and suppose we have shown that there exist unique values g(1), . . . , g(n − 1) so that equations (A1 )–(An−1 ) hold. Since f (1) 6= 0, equation (An ) is equivalent to X 1 (1.5) g(n) = − g(d)f (n/d). f (1) d|n,d<n Since the right-hand side involves only values g(d) with d < n, this determines g(n) uniquely, and defining g(n) by (1.5) we see that (An ) (in addition to (A1 )–(An−1 )) holds. This completes the induction argument. Examples (1) Since µ ∗ 1 = e, the Moebius function is the Dirichlet inverse of the function 1. Math 531 Lecture Notes, Fall 2005 Version 2013.01.07 34 CHAPTER 1. ARITHMETIC FUNCTIONS I (2) Multiplying the identity φ = µ ∗ id (obtained in the last section) by 1 gives φ ∗ 1 = 1 ∗ φ = 1 ∗ µ ∗ id = e ∗ id = id, so we get the identity φ ∗ 1 = id stated in Theorem 1.5. The last example is a special case of an important general principle, which we state as a theorem. P Theorem 1.9 (Moebius inversion formula). If g(n) = d|n f (d) for all P n ∈ N, then f (n) = d|n g(d)µ(n/d) for all n. Proof. The given relation can be written as g = f ∗ 1. Taking the Dirichlet product of each side in this relation with the function µ we obtain g ∗ µ = (f ∗ 1) ∗ µ = f ∗ (1 ∗ µ) = f ∗ e = f , which is the asserted relation. Finally, a third motivation for the definition of the Dirichlet product is that it preserves the important property of multiplicativity of a function, as shown in the following theorem. This is, again, by no means obvious. Theorem 1.10 (Dirichlet product and multiplicative functions). (i) If f and g are multiplicative, then so is f ∗ g. (ii) If f is multiplicative, then so is the Dirichlet inverse f −1 . (iii) If f ∗ g = h and if f and h are multiplicative, then so is g. (iv) (Distributivity with pointwise multiplication) If h is completely multiplicative, then h(f ∗ g) = (hf ) ∗ (hg) for any functions f and g. Remarks. (i) The product of two completely multiplicative functions is multiplicative (by the theorem), but not necessarily completely multiplicative. For example, the divisor function d(n) can be expressed as a product 1 ∗ 1 in which each factor 1 is completely multiplicative, but the divisor function itself is only multiplicative in the restricted sense (i.e., with the coprimality condition). The same applies to the Dirichlet inverse: if f is completely multiplicative, then f −1 is multiplicative, but in general not completely multiplicative. (ii) By Theorem 1.8, any function f with f (1) 6= 0 has a Dirichlet inverse. Since a multiplicative function satisfies f (1) = 1, any multiplicative function has a Dirichlet inverse. (iii) Note that the distributivity asserted in property (iv) only holds when the function h is completely multiplicative. (In fact, one can show that this property characterizes completely multiplicative functions: If h is any non-zero function for which the identity in (iv) holds for all functions f and g, then h is necessarily completely multiplicative.) Math 531 Lecture Notes, Fall 2005 Version 2013.01.07 INTRODUCTION TO ANALYTIC NUMBER THEORY 35 Proof. (i) Let f and g be multiplicative and let h = f ∗ g. Given n1 and n2 with (n1 , n2 ) = 1, we need to show that h(n1 n2 ) = h(n1 )h(n2 ). To this end we use the fact (see the proof of Theorem 1.7) that each divisor d|n1 n2 can be factored uniquely as d = d1 d2 with d1 |n1 and d2 |n2 , and that, conversely, given any pair (d1 , d2 ) with d1 |n1 and d2 |n2 , the product d = d1 d2 satisfies d|n1 n2 . Hence X X X f (d1 d2 )g((n1 n2 )/(d1 d2 )). f (d)g(n1 n2 /d) = h(n1 n2 ) = d1 |n1 d2 |n2 d|n1 n2 Since (n1 , n2 ) = 1, any divisors d1 |n1 and d2 |n2 satisfy (d1 , d2 ) = 1 and (n1 /d1 , n2 /d2 ) = 1. Hence, in the above double sum we can apply the multiplicativity of f and g to obtain X X h(n1 n2 ) = f (d1 )g(n/d1 )f (d2 )g(n2 /d2 ) d1 |n1 d2 |n2 = (f ∗ g)(n1 )(f ∗ g)(n2 ) = h(n1 )h(n2 ), which is what we had to prove. (ii) Let f be a multiplicative function and let g be the Dirichlet inverse of f . We prove the multiplicativity property (1.6) g(n1 n2 ) = g(n1 )g(n2 ) if (n1 , n2 ) = 1 by induction on the product n = n1 n2 . If n1 n2 = 1, then n1 = n2 = 1, and (1.6) holds trivially. Let n ≥ 2 be given, and suppose (1.6) holds whenever n1 n2 < n. Let n1 and n2 be given with n1 n2 = n and (n1 , n2 ) = 1. Applying the identity (An ) above, we obtain, on using the multiplicativity of f and that of g for arguments < n, X 0= f (d)g(n1 n2 /d) d|n1 n2 = X X f (d1 )f (d2 )g(n1 /d1 )g(n2 /d2 ) + (g(n1 n2 ) − g(n1 )g(n2 )) d1 |n1 d2 |n2 = (f ∗ g)(n1 )(f ∗ g)(n2 ) + (g(n1 n2 ) − g(n1 )g(n2 )) = e(n1 )e(n2 ) + (g(n1 n2 ) − g(n1 )g(n2 )), = g(n1 n2 ) − g(n1 )g(n2 ), since, by our assumption n = n1 n2 ≥ 2, at least one of n1 and n2 must be ≥ 2, and so e(n1 )e(n2 ) = 0. Hence we have g(n1 n2 ) = g(n1 )g(n2 ). Thus, Math 531 Lecture Notes, Fall 2005 Version 2013.01.07 36 CHAPTER 1. ARITHMETIC FUNCTIONS I (1.6) holds for pairs (n1 , n2 ) of relatively prime integers with n1 n2 = n, and the induction argument is complete. (iii) The identity f ∗ g = h implies g = f −1 ∗ h, where f −1 is the Dirichlet inverse of f . Since f and h are multiplicative functions, so is f −1 (by (ii)) and f −1 ∗ h (by (i)). Hence g is multiplicative as well. (iv) If h is completely multiplicative, then for any divisor d|n we have h(n) = h(d)h(n/d). Hence, for all n, X X h(f ∗ g)(n) = h(n) f (d)g(n/d) = h(d)f (d)h(n/d)g(n/d) d|n d|n = ((hf ) ∗ (hg))(n), proving (iv). Application I: Proving identities for multiplicative arithmetic functions. The above results can be used to provide simple proofs of identities for arithmetic functions, using the multiplicativity of the functions involved. To prove an identity of the form f ∗ g = h in the case when f , g, and h are known to be multiplicative functions, one simply shows, by direct calculation, that (∗) (f ∗ g)(pm ) = h(pm ) holds for every prime power pm . Since, by the above theorem, the multiplicativity of f and g implies that of f ∗ g, and since multiplicative functions are uniquely determined by their values at prime powers, (∗) implies that the identity (f ∗ g)(n) = h(n) holds for all n ∈ N. Examples P (1) Alternate proof of the identity d|n µ(d) = e(n). The identity can be written as µ ∗ 1 = e, and since all three functions involved are multiplicative, it suffices to verify that the identity holds on prime Pm m m powers. Since e(p ) = 0 and (µ ∗ 1)(p ) = k=0 µ(pk ) = 1 − 1 + 0 − 0 · · · = 0, this is indeed the case. P 2 (2) Proof of d|n µ (d)/φ(d) = n/φ(n). This identity is of the form f ∗ 1 = g with f = µ2 /φ and g = id /φ. The functions f and g are both quotients of multiplicative functions and therefore are multiplicative. Hence all three functions in the identity f ∗ 1 = g are multiplicative, and it suffices to verify the identity at prime powers. We have g(pmP ) = pm /φ(pm ) = pm /(pm − pm−1 ) = (1 − 1/p)−1 , and 2 k k −1 (f ∗ 1)(pm ) = m k=0 (µ (p )/φ(p )) = 1 + 1/(p − 1) = (1 − 1/p) , and Math 531 Lecture Notes, Fall 2005 Version 2013.01.07 INTRODUCTION TO ANALYTIC NUMBER THEORY 37 so g(pm ) = (f ∗ 1)(pm ) for every prime power pm . Thus the identity holds at prime powers, and therefore it holds in general. (3) The Dirichlet inverse of λ. Since µ ∗ 1 = e, the function 1 is the Dirichlet inverse of the Moebius function. To find the Dirichlet inverse of λ, i.e., the unique function f such that λ ∗ f = e, note first that since λ and e are both multiplicative, f must be multiplicative as well, and it therefore suffices to evaluate f at prime powers. Now, for any prime power pm , m 0 = e(p ) = m X k m−k f (p )λ(p k=0 )= m X f (pk )(−1)m−k , k=0 P k k so f (pm ) = − m−1 k=0 f (p )(−1) . This implies f (p) = 1, and by inm duction f (p ) = 0 for m ≥ 2. Hence f is the characteristic function of the squarefree numbers, i.e., λ−1 = µ2 . Application II: Evaluating Dirichlet products of multiplicative functions. Since the Dirichlet product of multiplicative functions is multiplicative, and since a multiplicative function is determined by its values on prime powers, to evaluate a product f ∗ g with both f and g multiplicative, it suffices to compute the values of f ∗ g at prime powers. By comparing these values to those of familiar arithmetic functions, one can often identify f ∗ g in terms of familiar arithmetic functions. Examples Pm P k k (1) The function λ∗1. We have (λ∗1)(pm ) = m k=0 (−1) , k=0 λ(p ) = which equals 1 if m is even, and 0 otherwise. However, the latter values are exactly the values at prime powers of the characteristic function of the squares, which is easily seen to be multiplicative. Hence λ ∗ 1 is equal to the characteristic function of the squares. P (2) The function fk (n) = d|n,(d,k)=1 µ(d). Here k is a fixed positive integer, and the summation runs over those divisors of n that are relatively prime to k. We have fk = gk ∗ 1, where gk (n) = µ(n) if (n, k) = 1 and gk (n) = 0 otherwise. It is easily seen that gk is multiplicative, so fk is also multiplicative. On prime powers pm , gk (pm ) = −1 if m = 1 and p - k and gk (pm ) = 0 otherwise, so Math 531 Lecture Notes, Fall 2005 Version 2013.01.07 38 CHAPTER 1. ARITHMETIC FUNCTIONS I P k m fk (pm ) = m i=0 g(p ) = 1 − 1 = 0 if p - k, and fk (p ) = 1 otherwise. By the multiplicativity of fk it follows that fk is the characteristic function of the set Ak = {n ∈ N : p|n ⇒ p|k}. Application III: Proving the multiplicativity of functions, using known identities. This is, in a sense, the previous application in reverse. Suppose we kow that f ∗ g = h and that f and h are multiplicative. Then, by Theorem 1.10, g must be multiplicative as well. Examples (1) Multiplicativity of φ. Since φ ∗ 1 = id (see Theorem 1.5) and the functions 1 and id are (obviously) multiplicative, the function φ must be multiplicative as well. This is the promised proof of the multiplicativity of the Euler function (part (ii) of Theorem 1.5). (2) Multiplicativity of d(n) and σ(n). Since d = 1 ∗ 1, and the function 1 is multiplicative, the function d is multiplicative as well. Similarly, since σ = id ∗1, and 1 and id are multiplicative, σ is multiplicative. Math 531 Lecture Notes, Fall 2005 Version 2013.01.07 INTRODUCTION TO ANALYTIC NUMBER THEORY 1.8 39 Exercises P 1.1 Evaluate the function f (n) = d2 |n µ(d) (where the summation runs over all positive integers d such that d2 |n), in the sense of expressing it in terms of familiar arithmetic functions. 1.2 The unitary divisor function d∗ (n) is defined as the number of representations of n as a product of two coprime positive integers, i.e., d∗ (n) = {(a, b) ∈ N2 : ab = n, (a, b) = 1}. Show that d∗ is multiplicative, and find its values on prime powers. 1.3 Determine an arithmetic function f such that X 1 n 1 = f (n ∈ N). φ(n) d d d|n 1.4 For each of the following arithmetic functions, “evaluate” the function, or express it in terms of familiar arithmetic functions. P (i) gk (n) = d|n,(d,k)=1 µ(d), where k ∈ N is fixed. (Here the summation runs over all d ∈ N that satisfy d|n and (d, k) = 1.) P (ii) hk (n) = d|n,k|d µ(d), where k ∈ N is fixed. 1.5 Show that, for every positive integer n ≥ 2, X n k = φ(n). 2 1≤k≤n−1 (k,n)=1 P 1.6 Let f (n) = d|n µ(d) log d. Find a simple expression for f (n) in terms of familiar arithmetic functions. 1.7 Let f (n) = #{(n1 , n2 ) ∈ N2 : [n1 , n2 ] = n}, where [n1 , n2 ] is the least common multiple of n1 and n2 . Show that f is multiplicative and evaluate f at prime powers. 1.8 Let f be a multiplicative function. We know that the Dirichlet inverse f −1 is then also multiplicative. Show that f −1 is completely multiplicative if and only if f (pm ) = 0 for all prime powers pm with m ≥ 2 (i.e., if and only if f is supported by the squarefree numbers). Math 531 Lecture Notes, Fall 2005 Version 2013.01.07 40 CHAPTER 1. ARITHMETIC FUNCTIONS I 1.9 Given an arithmetic function f , a “Dirichlet square root” of f is an arithmetic function g such that g ∗ g = f . Prove by elementary techniques that the constant function 1 has two Dirichlet square roots, of the form ±g, where g is a multiplicative function, and find the values of g at prime powers. 1.10 Let f (n) = φ(n)/n, and let {nk }∞ k=1 be the sequence of values n at which f attains a “record low”; i.e., n1 = 1 and, for k ≥ 2, nk is defined as the smallest integer > nk−1 with f (nk ) < f (n) for all n < nk . (For example, since the first few values of the sequence f (n) are 1, 1/2, 2/3, 1/2, 4/5, 1/3, . . ., we have n1 = 1, n2 = 2, and n3 = 6, and the corresponding values of f at these arguments are 1, 1/2 and 1/3.) Find (with proof) a general formula for nk and f (nk ). 1.11 Let f be a multiplicative function satisfying limpm →∞ f (pm ) = 0. Show that limn→∞ f (n) = 0. 1.12 An arithmetic function f is called periodic if there exists a positive integer k such that f (n + k) = f (n) for every n ∈ N; the integer k is called a period for f . Show that if f is completely multiplicative and periodic with period k, then the values of f are either 0 or roots of unity. (An root of unity is a complex number z such that z n = 1 for some n ∈ N.) Math 531 Lecture Notes, Fall 2005 Version 2013.01.07