Programming Interest Group http://www.comp.hkbu.edu.hk/~chxw/pig/index.htm Tutorial Five Combinatorics & Number Theory 1 Outline Combinatorics (组合) Basic Counting Techniques Recurrence Relations Binomial Coefficients Number Theory Prime Number Congruences Fermat’s Theorem Euler’s Theorem 2 Part I: Combinatorics Combinatorics is the mathematics of counting. It appears to be a collection of unrelated puzzles chosen at random. E.g. 1: Given n letters and n addressed envelopes, in how many ways can the letters be placed in the envelopes so that no letter is in the correct envelope? E.g. 2: (a Google interview problem) There are n stages. Every time you can go up either 1 stage or 2 stages. How many different ways can you reach the last stage? 3 Basic Counting Techniques Product Rule If there are |A| possibilities from set A and |B| possibilities from set B, then there are |A|x|B| ways to combine one from A and one from B. E.g., suppose you own 5 shirts and 4 pants, then there are 5x4 =20 different ways you can get dressed. Sum Rule If there are |A| possibilities from set A and |B| possibilities from set B, then there are |A|+|B| ways for either A or B to occur (assuming the elements of A and B are distinct). 4 Basic Counting Techniques Inclusion-Exclusion Formula: |AUB| = |A| + |B| - |A∩B| |AUBUC| = |A| + |B| + |C| - |A∩B| - |A∩C| - |B∩C| + |A∩B ∩C| B A B A C 5 Basic Counting Techniques Permutations A permutation is an arrangement of n items, where every item appears exactly once. There are n! different permutations. 2n-1 ≤ n! ≤ nn-1 =====> n! is a very large number. 232 = 4,294,967,296 10! = 3,628,800 11! = 39,916,800 12! = 479,001,600 13! = 6,227,020,800 > 232 6 Basic Counting Techniques Number of subsets A subset is a selection of elements from n possible items. (including the empty set) There are 2n distinct subsets of n things. E.g., set {a, b, c} has 8 (=23) subsets: Ф, {a}, {b}, {c}, {a, b}, {a, c}, {b, c}, {a, b ,c} 7 Recurrence Relations Recurrence relation is an equation which is defined in terms of itself. E.g. 1: an = an-1 + 1, a1 =1 E.g. 2: an = 2an-1, a1 =2 an = n an = 2n E.g. 3: an = nan-1, a1 =1 an = n! 8 Recurrence Relations The interview question from Google To reach the nth stage, there are two possibilities: First reach the (n-2)th stage, then go to the nth stage directly First reach the (n-1)th stage, then go to the nth stage We can have the following recurrence relation: an = an-1 + an-2 a1 = 1, a2 = 2 9 Fibonacci Numbers Fn = Fn-1 + Fn-2; F0 = 0; F1 = 1 0, 1, 1, 2, 3, 5, 8, 13, 21, 34, 55, … Closed form: = -0.61803… 1 5 n 1 5 n Fn (( ) ( ) )/ 5 2 2 It can easily proved by mathematical induction. Golden ratio: 1 5 2 1.61803 Fibonacci numbers grow exponentially. 10 Binomial Coefficients How many ways are there to choose k things out of n possibilities? n n! k ( n k )! k ! Coefficients of (a+b)n n n n n1 ( a b) a a b 0 1 n n nk k a b k n n b n 11 How to calculate binomial coefficients? It’s difficult to calculate the value of binomial coefficients by using the previous equation directly: arithmetic overflow will happen for n > 12! Pascal’s triangle (1654), or 杨辉三角 (1261) 1 1 1 1 2 1 1 1 3 4 3 6 1 4 1 n n 1 n 1 k k 1 k 1 5 10 10 5 1 1 6 15 20 15 6 1 12 How to calculate binomial coefficients? (Cont.) #define MAXN 100 /* compute n choose m */ long binomial_coef(int n, int m) { int i, j; long bc[MAXN][MAXN}; /* table of binomial coef */ for(i = 0; i <= n; i++) bc[i][0] = 1; for(j = 0; j <= n; j++) bc[j][j] = 1; for(i = 1; i <= n; i++) for(j = 1; j < i; j++) bc[i][j] = bc[i-1][j-1] + bc[i-1][j]; return (bc[n][m]); } 13 Catalan Numbers n 1 C0 1, Cn Ck Cn1k k 0 1 2n n 1 n How many ways are there to build a balanced formula from n pairs of parentheses? Divide the formula into two parts: (……)(……) The first part contains k pairs, the second part has n1-k pairs. So the first part has Ck possibilities, while the second part has Cn-1-k possibilities. 14 Practice 1: How many pieces of land? http://acm.uva.es/p/v102/10213.html You are given an elliptical shaped land and you are asked to choose n arbitrary points on its boundary. Then you connect all these points with one another with straight lines (that’s n*(n-1)/2 connections for n points). What is the maximum number of pieces of land you will get by choosing the points on the boundary carefully? 15 How many pieces of land? Input The first line of the input file contains one integer S (0 < S < 3500), which indicates how many sets of input are there. The next S lines contain S sets of input. Each input contains one integer N (0<=N<2^31). Output For each set of input you should output in a single line the maximum number pieces of land possible to get for the value of N. Sample Input: 4 1 2 3 4 Sample Output: 1 2 4 8 16 Analysis Denote the solution by f(n), so f(1) = 1, f(2) = 2, etc. What’s the relationship between f(n) and f(n-1)? Assume we have a maximum number of pieces for n-1 points, which is f(n-1). Now we add the nth point. We can connect this point to other n-1 points. So we have n-1 new lines. Each line will lead to more pieces. 17 Analysis (Cont.) i nodes on the left k Consider adding a new line between node n and k. n-2-i nodes on the right The new line will intersect with i(n-2-i) number of lines. So, the new line will be divided into i(n-2-i)+1 segments. Each segment means a new piece! n n 1 f (n) f (n 1) [i(n 2 i) 1] i 0 18 Practice 2: Counting http://acm.uva.es/p/v101/10198.html Gustavo knows how to count, but he is now learning how write numbers. As he is a very good student, he already learned 1, 2, 3 and 4. But he didn't realize yet that 4 is different than 1, so he thinks that 4 is another way to write 1. Besides that, he is having fun with a little game he created himself: he make numbers (with those four digits) and sum their values. For instance: 132 = 1 + 3 + 2 = 6 112314 = 1 + 1 + 2 + 3 + 1 + 1 = 9 (remember that Gustavo thinks that 4 = 1) After making a lot of numbers in this way, Gustavo now wants to know how much numbers he can create such that their sum is a number n. For instance, for n = 2 he noticed that he can make 5 numbers: 11, 14, 41, 44 and 2 (he knows how to count them up, but he doesn't know how to write five). However, he can't figure it out for n greater than 2. So, he asked you to help him. 19 Counting (Cont.) The Input The Output Input will consist on an arbitrary number of sets. Each set will consist on an integer n such that 1 <= n <= 1000. You must read until you reach the end of file. For each number read, you must output another number (on a line alone) stating how much numbers Gustavo can make such that the sum of their digits is equal to the given number. Sample Input 2 3 Sample Output 5 13 20 Analysis From the problem, we know f(1) = 2, f(2) = 5 From the sample Input/Output, we know f(3) = 13 A number can only start from 1, 2, 3, or 4 If it starts from 1, then the remaining n-1 digits should have a summation of n-1 If it starts from 2, then the remaining n-1 digits should have a summation of n-2 If it starts from 3, then the remaining n-1 digits should have a summation of n-3 If it starts from 4, then the remaining n-1 digits should have a summation of n-1 (because 4 is another way to write 1) So, f(n) = 2f(n-1) + f(n-2) + f(n-3) For this problem we can build a table for all f(n), 1 <= n <= 1000 Remember to use big number library! 21 Part II: Number Theory Number theory is concerned with properties of integers. It is perhaps the most interesting and beautiful area of mathematics. Number theory plays an important role in modern cryptography. Some outstanding unsolved problems: Goldbach’s Conjecture: every even integer n > 2 is the sum of 2 primes. Twin Prime Conjecture: There are infinitely many twin primes. (If p and p+2 are primes, we say p and p+2 are twin primes.) Are there infinitely many primes of the form n2 + 1? How about the form 2n -1? 22 Numbers Natural numbers: N = {1, 2, 3,…} Integers: Z = {…, -2, -1, 0, 1, 2, … } Rational numbers: Q={n/m | m≠0} Real numbers: R 23 Number Base Conversion How to convert a decimal number to another base b? #include <iostream> #include <vector> #include <algorithm> using namespace std; int main() { int d, b, t, r; vector<int> v; cin >> d >> b; while (d > 0) { r = d % b; v.push_back(r); d = d/b; } reverse(v.begin(), v.end()); for (int i = 0; i < v.size(); ++i) cout << v[i]; cout << endl; return 1; } 24 Big Modular How to calculate A1A2…An mod p? The product may be very large and overflow! It can be calculated by: (A1 mod p) x (A2 mod p) x … (An mod p) Another example: how to calculate bp mod m? long bigmod(long b, long p, long m) { if (p == 0) return 1; else if (p%2 == 0) return square(bigmod(b, p/2, m)) % m; // square(x) = x * x else return ((b % m) * bigmod(b, p-1, m)) % m; } 25 Exponentiation How to calculate bp? Assume no overflow long square(long n) { return n*n; } long fastexp(long base, long power) { if (power == 0) return 1; else if (power%2 == 0) return square(fastexp(base, power/2)); else return base * (fastexp(base, power-1)); } 26 Proof by Induction Principle of Mathematical Induction: Let P(n) be a statement concerning the integer variable n. Let n0 be any fixed integer. P(n) is true for all integers n ≥ n0 if one can establish both of the following statements: (a) P(n) is true if n = n0 (b) Whenever P(n) is true for n0 ≤ n ≤ k, then P(n) is true for n = k+1 27 Divisibility d | n means there is an integer k such that n = d·k. d | n has several readings: d divides n d is a divisor of n d is a factor of n n is a multiple of d 28 Divisibility Properties n | n, and 1 | n d | n and n | m d | m d | n and d | m d | an + bm for all a and b d | n ad | an d|0 If d and n are positive and d | n, then d ≤ n. If d | n, then n/d | n, and small(d, n/d) <= sqrt(n) 29 The Division Algorithm If a and b are integers and b > 0, then there exist unique integers q and r satisfying the two conditions: a = bq + r, and 0≤r<b For b > 0, define a mod b = r where r is the remainder given by the division algorithm when a is divided by b, that is, a = bq + r and 0 ≤ r < b. 30 Prime Numbers A prime number is an integer >1 and only has divisors of 1 and itself It cannot be written as a product of other numbers E.g. 2,3,5,7 are prime, 4,6,8,9,10 are not prime numbers are central to number theory list of prime number less than 200 is: 2 3 5 7 11 13 17 19 23 29 31 37 41 43 47 53 59 61 67 71 73 79 83 89 97 101 103 107 109 113 127 131 137 139 149 151 157 163 167 173 179 181 191 193 197 199 31 Primality Testing We often need to find large prime numbers Traditionally we can use trial division i.e. divide by all numbers (primes) in turn, less than the square root of the number only works for small numbers Alternatively, we can use statistical primality tests based on properties of primes for which all primes numbers satisfy property but some composite numbers, called pseudo-primes, also satisfy the property 32 Trial division int is_prime(int n) { if (n <= 1) return 0; if (n == 2) return 1; // 2 is a prime if (n%2 == 0) return 0; // NO prime is EVEN, except 2 for (int i = 3; i * i <= n; i += 2) // start from 3, jump 2 numbers if (n%i == 0) // no need to check even numbers return 0; return 1; } 33 Divisibility check using smaller primes below sqrt(N) number N is a prime if and only if no primes below sqrt(N) can divide N Generate a prime table Suppose max primes generated will not be greater than 2^31-1 (2,147,483,647). We need smaller primes below sqrt(N), so we need to store primes no greater that sqrt(2^31-1) sqrt(2^31) = 46340.95 There will be at most 4792 primes in the range 1 to 46340.95 We only need about array of size (roughly) 4800 elements check whether a big number N is a prime or not by dividing it with small primes up to sqrt(N) If you can find at least one small primes that can divide N, then N is not prime, otherwise N is prime 34 Prime Table Generation (I) bool isPrime[MAXN]; int i, tot, prime[]; memset(isPrime, true, sizeof(isPrime)); tot = 0; prime[tot++] = 2; for ( i = 3; i < MAXN; i+=2) { if (isPrime[i]) prime[tot++] = i; for(j = i + i; j < MAXN; j+=i) isPrime[j] = false; } } 35 Prime Table Generation (II) bool isPrime[MAXN]; int i, tot, prime[]; memset(isPrime, true, sizeof(isPrime)); tot = 0; for ( i = 2; i <= n; i++) { if (isPrime[i]) prime[tot++] = i; for (j = 0; j < tot && i * prime[j] <= n; j++) { isPrime[i*prime[j]] = false; if (i % prime[j] == 0) break; } } 36 Prime Distribution There are an infinite number of primes. Let π(x) denote the number of primes p such that p ≤ x. Prime number theorem states that primes occur roughly every ln(n) integers: Can you prove it? π(x) ~ x/ln(x) for all x > 0. Since we can immediately ignore evens and multiples of 5, in practice we only need to test 0.4ln(n) numbers of size n before finding a prime 37 Prime Factorization to factor a number n is to write it as a product of other numbers: n=a × b × c note that factoring a number is relatively hard compared to multiplying the factors together to generate the number the prime factorisation of a number n is when its written as a product of primes E.g. 91=7×13 ; 3600=24×32×52 38 Prime Factorization #include<iostream> #include<vector> using namespace std; int is_prime(int n) { … } /* another C function for prime factorization */ primefactors(long x) { long I; long c = x; vector<int> primefactors(int a) { vector<int> primefactors; for (int i=1;i<=a;++i) if(a%i==0&&is_prime(i)) { primefactors.push_back(i); a=a/i; i=1; } return primefactors; } if ( c == 1) { printf(“1\n”); return; } while( (c%2) == 0) { printf(“%ld\n”, 2); c = c / 2; } i = 3; while ( i <= (sqrt(c) + 1) ) { if ( (c%i) == 0 ) { printf(“%ld\n”, i); c = c / I; } else i = i + 2; } int main() { int b=2500; if(!is_prime(b)) cout<<"not prime"<<endl; vector <int> p = primefactors(b); for(int i=0;i<p.size();++i) cout<<p[i]<<endl; return 0; } if (c > 1) printf(“%ld\n”, c); } 39 Number of divisors If the prime factorisation of a number n is Then the number of positive divisors of n is 40 A related problem http://acm.uva.es/p/v101/10110.html The Problem: Light, More Light There is man named "mabu" for switching on-off light in our University. He switches on-off the lights in a corridor. Every bulb has its own toggle switch. That is, if it is pressed then the bulb turns on. Another press will turn it off. To save power consumption (or may be he is mad or something else) he does a peculiar thing. If in a corridor there is n bulbs, he walks along the corridor back and forth n times and in ith walk he toggles only the switches whose position is divisable by i. He does not press any switch when coming back to his initial position. The ith walk is defined as going down the corridor (while doing the peculiar thing) and coming back again. Now you have to determine what is the final state of the last bulb. Is it on or off? 1 2 3 4 …. n-1 n 41 Light, More Light The question asks about the final state of the last bulb There are n rounds In the 1st round, because 1|n, press In the 2nd round, if 2|n, press; In the 3rd round, if 3|n, press; … In the nth round, because n|n, press 42 Light, More Light A simple algorithm: long i; long press = 0; for(i = 1; i <= n; i++) { if(n % i == 0) press++; } if( press % 2 == 0) printf(“no\n”); else printf(“yes\n”); A better algorithm: long i; long press = 0; for(i = 1; i <= n/2; i++) { if(n % i == 0) press++; } if( press % 2 == 0) printf(“yes\n”); else printf(“no\n”); 43 Light, More Light But the previous algorithm is O(n)! Is there a better way? We can make some observations: Define the number of factors of n as f(n). If f(n) is even, then the final state of the nth bulb is “off”; if f(n) is odd, then the final state of the nth bulb is “on”. f(1) = 1; f(2) = 2; f(3) = 2; f(4) = 3; f(5) = 2; f(6) = 4; f(7) = 2; f(8) = 4; f(9) = 3; f(10) = 4; f(11) = 2; f(12) = 6; f(13) = 2; f(14) = 4; f(15) = 4; f(16) = 5 44 Light, More Light It seems that, only if n is a square of another number, f(n) is odd ! How to prove? Prime factorization: Assume n p1a1 p2a2 pkak , where pi are all primes, then f ( n) (1 a1 )(1 a2 ) (1 ak ). If f ( n) is odd, then all 1 ai are odd, and then all ai are even. Therefore n ( p1a1 / 2 p2a2 / 2 pkak / 2 )2 . 45 Relatively Prime Numbers & GCD two numbers a, b are relatively prime if they have no common divisors apart from 1 E.g. 8 & 15 are relatively prime since factors of 8 are 1,2,4,8 and of 15 are 1,3,5,15 and 1 is the only common factor We can determine the greatest common divisor by comparing their prime factorizations and using least powers E.g. 300=21×31×52 18=21×32 hence GCD(18,300)=21×31×50=6 46 GCD: Euclid’s algorithm Two observations: If b|a, then gcd(a,b) = b If a = bt + r, then gcd(a,b) = gcd(b, r) Euclid’s algorithm is recursive, repeated replacing the bigger integer by its remainder mod the smaller integer. E.g.: a = 34398, b = 2132 gcd(34398, 2132) = gcd(34398 mod 2132, 2132) = gcd(2132, 286) = gcd(2132 mod 286, 286) = gcd(286, 130) = gcd(286 mod 130, 130) = gcd(130, 26) = gcd(130 mod 26, 26) = gcd(26, 0) = 26 47 Sample Code //Greatest Common Divisor int gcd(int a, int b) { while (b > 0) { a = a % b; a ^= b; b ^= a; a ^= b; } return a; } int t = b; b = a; a = t; // Lowest Common Multiple int lcm(int a, int b) { return a*b/gcd(a, b); } 48 Extended Euclid’s Algorithm gcd(a,b) can be expressed as ax+by Given a and b, find out not only gcd(a,b), but also x and y The previous function only gives gcd(a,b) int ext_gcd(int a, int b, int& x, int& y) { int t, ret; if (!b){ x=1; y=0; return a; } ret=ext_gcd(b, a%b, x, y); t=x; x=y; y=t-a/b*y; return ret; } 49 Congruences We say that a≡b (mod m) if m|(a-b). Congruences are an alternate notation for modular arithmetic. Addition: Suppose a≡b (mod n) and c≡d (mod n), then a+c≡b+d (mod n) Multiplication: If a≡b (mod n), then a·d≡b·d (mod n). If a≡b (mod n) and c≡d (mod n), then ac≡bd (mod n) Exponentiation: If a≡b (mod n), then am≡bm (mod n) Division??? 6·2≡6·1 (mod 3), but … 50 Congruences Theorem 1: Let m ≥ 2. If a and m are relatively prime, there exists a unique integer a* such that aa* ≡ 1 (mod m) and 0 < a* < m. We call a* the inverse of a modulo m. Theorem 2: Let m > 0. If ab ≡ 1 (mod m), then both a and b are relatively prime to m. Theorem 3: Let m > 0 and assume gcd(c,m)=1. Then ca ≡ cb (mod m) a ≡ b (mod m). 51 Fermat's Little Theorem If p is prime and gcd(a, p) = 1, then ap-1 mod p = 1 E.g., how to calculate 12347865435 mod 11? 1234 ≡ 2 (mod 11) 12347865435 ≡ 2 7865435 (mod 11) 210 ≡ 1 (mod 11) (according to Fermat’s litter theorem) 2 7865435 ≡ 2786543x10+5 (mod 11) ≡ (2^10)786543 x 25 (mod 11) ≡ 1786543x 25 (mod 11) ≡ 25 (mod 11) ≡ 10 (mod 11) So 12347865435 mod 11 = 10. 52 Euler Totient Function ø(n) when doing arithmetic modulo n complete set of residues is: 0..n-1 reduced set of residues is those numbers (residues) which are relatively prime to n E.g. for n=10, complete set of residues is {0,1,2,3,4,5,6,7,8,9} reduced set of residues is {1,3,7,9} number of elements in reduced set of residues is called the Euler Totient Function ø(n) 53 Euler Totient Function ø(n) To compute ø(n), we need to count number of elements to be excluded in general, it needs prime factorization, but for p (p prime) ø(p) = p-1 for p.q (p,q prime) ø(p·q) = (p-1)(q-1) E.g. ø(37) = 36 ø(21) = (3–1)×(7–1) = 2×6 = 12 54 Euler's Theorem aø(n) mod n = 1 where gcd(a, n)=1 It is a generalisation of Fermat's Theorem E.g. a=3;n=10; ø(10)=4; hence 34 = 81 = 1 mod 10 a=2;n=11; ø(11)=10; hence 210 = 1024 = 1 mod 11 55 More Practices http://acm.uva.es/p/v101/10104.html http://acm.uva.es/p/v101/10139.html http://acm.uva.es/p/v101/10168.html http://acm.uva.es/p/v100/10042.html 56