Statistics and Probability Quarter 3 – Module 7: Percentiles and t-distribution Statistics and Probability Alternative Delivery Mode Quarter 3 – Module 7: Percentile and T-Distribution First Edition, 2020 Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties. Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over them. Published by the Department of Education Secretary: Leonor Magtolis Briones Undersecretary: Diosdado M. San Antonio SENIOR HS MODULE DEVELOPMENT TEAM AUTHOR Co-Author – Language Editor Co-Author – Content Evaluator Co-Author – Illustrator Co-Author – Layout Artist : John William S. Bustamante : Gio B. Santisima : Rommel O. Magcalas : John William S. Bustamante : John William S. Bustamante TEAM LEADERS: School Head LRMDS Coordinator : Reycor E. Sacdalan, PhD : Pearly V. Villagracia SDO-BATAAN MANAGEMENT TEAM: Schools Division Superintendent OIC- Asst. Schools Division Superintendent Chief Education Supervisor, CID Education Program Supervisor, LRMDS Education Program Supervisor, AP/ADM Education Program Supervisor, Senior HS Project Development Officer II, LRMDS Division Librarian II, LRMDS : Romeo M. Alip, PhD, CESO V : William Roderick R. Fallorin, CESE : Milagros M. Peñaflor, PhD : Edgar E. Garcia, MITE : Romeo M. Layug : Danilo C. Caysido : Joan T. Briz : Rosita P. Serrano REGIONAL OFFICE 3 MANAGEMENT TEAM: Regional Director Chief Education Supervisor, CLMD Education Program Supervisor, LRMS Education Program Supervisor, ADM : May B. Eclar, PhD, CESO III : Librada M. Rubio, PhD : Ma. Editha R. Caparas, EdD : Nestor P. Nuesca, EdD Statistics and Probability Quarter 3 – Module 7: Percentiles and T-Distribution Introductory Message This Self-Learning Module (SLM) is prepared so that you, our dear learners, can continue your studies and learn while at home. Activities, questions, directions, exercises, and discussions are carefully stated for you to understand each lesson. Each SLM is composed of different parts. Each part shall guide you step-bystep as you discover and understand the lesson prepared for you. Pre-tests are provided to measure your prior knowledge on lessons in each SLM. This will tell you if you need to proceed on completing this module or if you need to ask your facilitator or your teacher’s assistance for better understanding of the lesson. At the end of each module, you need to answer the post-test to self-check your learning. Answer keys are provided for each activity and test. We trust that you will be honest in using these. In addition to the material in the main text, Notes to the Teacher are also provided to our facilitators and parents for strategies and reminders on how they can best help you on your home-based learning. Please use this module with care. Do not put unnecessary marks on any part of this SLM. Use a separate sheet of paper in answering the exercises and tests. And read the instructions carefully before performing each task. If you have any questions in using this SLM or any difficulty in answering the tasks in this module, do not hesitate to consult your teacher or facilitator. Thank you. What I Need to Know After going through this module, you are expected to: 1. Illustrate the t-distribution (M11-12SP-IIIg2) 2. Identify the percentile using the t-table (M11-12SP-IIIg5) 1 What I Know A. Read the statements carefully and choose the letter of the best answer. Write your answer on a separate sheet of paper. 1. What is the other term for t-distribution? A. Z-distribution B. Probability distribution C. Percentile distribution D. Student’s t-distribution 2. This refers to the maximum number of logically independent values which varies in the data sample. A. Level of significance B. Percentiles C. Degree of freedom D. Probability 3. This refers to the measure of position whose data is divided into 100 parts. A. Percentiles B. Quartiles C. Deciles D. Median 4. What is the level of significance if the confidence interval is 90% ? A. 0.05 B. 0.10 C. 0.005 D. 0.25 5. What will be the degree of freedom of the data with a sample size of 20? A. 17 B. 18 C. 19 D. 20 For number 6 to 8, use the given data below to answer the following questions. “There are 9 students who will be taking up the test, the researcher sets the level of significance to 0.05. The following are the scores of nine students in the test: 23, 25, 25, 26, 27, 28, 30, 40, 45 2 6. What will be the t-value of the given problem? A. 1.729 B. 1.456 C. 1.612 D. 1.551 7. Using the t-table, give the value of 25th percentile. A. 25 B. 26 C. 27 D. 28 8. What will be the t-value of 95th percentile? A. 1.729 B. 1.456 C. 1.612 D. 1.551 9. What will be the degree of freedom with a t-value of 1 and a sample size of 2? A. 1 B. 2 C. 3 D. 4 10. Find the t-value of 2.5th percentile on the t-table with a degree of freedom of 6. A. 2.570 B. 4.030 C. -2.570 D. -4.030 II. For numbers 11-15, complete the table below. Write your answer on a separate sheet of paper. Percentile 40th percentile 75th percentile 90th percentile 95th percentile 97.5th percentile n 23 20 25 15 10 3 t (11) (12) (13) (14) (15) Lesson 1 Percentile and the T-Distribution Percentiles have been discussed on your Grade 10 and their uses. These were also discussed that percentiles are identifier of a student’s performance in comparison to the performance of the whole class. In this lesson, you will explore and understand the student’s t-distribution and identify the percentiles using the t-tables. Before we discuss percentiles and t-distribution, it is necessary to study first the concept of percentiles. Let us have a recall about percentiles. Answer all the activities provided for you. Good luck and have a good time answering the module. What’s In A. Using the given data below, find the following percentiles. 40, 45, 50, 50, 56, 60, 65, 80, 90 th 1. 10 percentile 2. 25th percentile 3. 50th percentile 4. 75th percentile 5. 95th percentile B. Answer the following questions. 1. If Peter is higher than the 75 % of the class, what is his percentile rank? 2. If the 50th percentile of the scores of the class is 80, what will be the median? 3. If the scores of the 3 students are 45, 50, and 60, what will be the 25th percentile? 4. If the scores are 20, 30, 50, 60, 70, 75, and 80, what will be 75th percentile? 5. The percentile rank of Rose is 50, what percent of the class is her score belongs? 4 Notes to the Teacher This part aims to assess if the students have prior knowledge about the topic. Also, it prepares the students to absorb the lesson. What’s New Find 5 words on the puzzle below. For each word you find, make a brief definition or expectation about the word. P E R C E N T I L E A S C E R T T G Y D D S F F C B D S G H J K L F C B B L E E C D V R R H I C M N O D R R R H B Y J G F C B G H T S V D S E R T G H T N J U R R R H T K Y T T A B L E F T K Y B Y J E G H T G J J R J J B Y J G G J J G H J E T K Y H U K I U K N J U V H U K L U E O G J J B Y A B J J B Y J V S D F G H J F H U K N J S U E F G H J K Q S A T F B F K Y Q M K D T V A L U E Q C V Y N M L R I T A R L F I Q W E R T Y U I O O O J E O L E V E L O F S I G N I F I C A N C E R W D A L B N F R F K Y H U K E R N F D T F E J J B Y C E G J J B Y A D T Y R O L Q W U K N J E E G U K N J S C W E R M P W M I M I T E D Y Q M K G J Y L D F H 1. Using the words that you have found, what are the things that come into your mind? 5 2. What are the possible things or applications of these words in statistics? 3. Recalling your topic in normal distribution, how will it be related to the normal curve? The answer in this question requires an understanding of t-distribution. What is It The t-distribution (also called Student’s t-distribution) is a family of distributions that look almost identical to the normal distribution curve, only a bit shorter and stouter. The t-distribution is used instead of the normal distribution when you have small samples. The larger the sample size, the more the t distribution looks like the normal distribution. In fact, for sample sizes larger than 20 (e.g. more degrees of freedom), the distribution is almost exactly like the normal distribution. The t-distribution is similar to a normal distribution. It has a precise mathematical definition. Instead of diving into complex math, let’s look at the useful properties of the t-distribution and why it is important in analysis of any set of statistical quantitative data. Like the normal distribution, the t-distribution has a smooth shape. Like the normal distribution, the t-distribution is symmetric. If you think about folding it in half at the mean, each side will be the same. Like a standard normal distribution (or z-distribution), the t-distribution has a mean of zero. The normal distribution assumes that the population standard deviation is known. The t-distribution does not make this assumption. The t-distribution is defined by the degrees of freedom. These are related to the sample size. The t-distribution is most useful for small sample sizes, when the population standard deviation is not known, or both. As the sample size increases, the t-distribution becomes more similar to a normal distribution. Using the t-table, shown on the next page we can be able to identify the t value of a certain data. 6 Let us be familiar first with the following words: 1. Degree of freedom - This refers to the maximum number of logically independent values which vary in the data sample. 2. Percentile – This is a measure of position with data divided into 100 parts. Below is the t-table which you can use to find the t-value df/v 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 40 60 120 0.40 0.325 0.289 0.277 0.271 0.267 0.265 0.263 0.262 0.261 0.260 0.260 0.259 0.259 0.258 0.258 0.258 0.257 0.257 0.257 0.257 0.257 0.256 0.256 0.256 0.256 0.256 0.256 0.256 0.256 0.256 0.255 0.254 0.254 0.25 1.000 0.816 0.765 0.741 0.727 0.718 0.711 0.706 0.703 0.700 0.697 0.695 0.694 0.692 0.691 0.690 0.689 0.688 0.688 0.687 0.686 0.686 0.685 0.685 0.684 0.684 0.684 0.683 0.683 0.683 0.681 0.679 0.677 0.10 3.078 1.886 1.638 1.533 1.476 1.440 1.415 1.397 1.383 1.372 1.363 1.356 1.350 1.345 1.341 1.337 1.333 1.330 1.328 1.325 1.323 1.321 1.319 1.318 1.316 1.315 1.314 1.313 1.311 1.310 1.303 1.296 1.289 0.05 6.314 2.920 2.353 2.132 2.015 1.943 1.895 1.860 1.833 1.812 1.796 1.782 1.771 1.761 1.753 1.746 1.740 1.734 1.729 1.725 1.721 1.717 1.714 1.711 1.708 1.706 1.703 1.701 1.699 1.697 1.684 1.671 1.658 right tailed area ( α ) 0.025 0.01 0.005 12.706 31.821 63.657 4.303 6.965 9.925 3.182 4.541 5.841 2.776 3.747 4.604 2.571 3.365 4.032 2.447 3.143 3.707 2.365 2.998 3.499 2.306 2.896 3.355 2.262 2.821 3.250 2.228 2.764 3.169 2.201 2.718 3.106 2.179 2.681 3.055 2.160 2.650 3.012 2.145 2.624 2.977 2.131 2.602 2.947 2.120 2.583 2.921 2.110 2.567 2.898 2.101 2.552 2.878 2.093 2.539 2.861 2.086 2.528 2.845 2.080 2.518 2.831 2.074 2.508 2.819 2.069 2.500 2.807 2.064 2.492 2.797 2.060 2.485 2.787 2.056 2.479 2.779 2.052 2.473 2.771 2.048 2.467 2.763 2.045 2.462 2.756 2.042 2.457 2.750 2.021 2.423 2.704 2.000 2.390 2.660 1.980 2.358 2.617 7 0.0025 127.321 14.089 7.453 5.598 4.773 4.317 4.029 3.833 3.690 3.581 3.497 3.428 3.372 3.326 3.286 3.252 3.222 3.197 3.174 3.153 3.135 3.119 3.104 3.091 3.078 3.067 3.057 3.047 3.038 3.030 2.971 2.915 2.860 0.0010 318.309 22.327 10.215 7.173 5.893 5.208 4.785 4.501 4.297 4.144 4.025 3.930 3.852 3.787 3.733 3.686 3.646 3.610 3.579 3.552 3.527 3.505 3.485 3.467 3.450 3.435 3.421 3.408 3.396 3.385 3.307 3.232 3.160 0.0005 636.619 31.599 12.924 8.610 6.869 5.959 5.408 5.041 4.781 4.587 4.437 4.318 4.221 4.140 4.073 4.015 3.965 3.922 3.883 3.850 3.819 3.792 3.768 3.745 3.725 3.707 3.690 3.674 3.659 3.646 3.551 3.460 3.373 Illustrative example: 1. Identify the t-value whose number of samples n = 7 and has an area (𝛼) equal to 0.05. 2. Find the t-value whose degree of freedom is 20 and has 𝛼 = 0.01. 3. Identify the t-value of the following percentiles given the number of samples. a. 95th percentile , N= 30 b. 90th percentile, N=20 Solution: 1. To identify the t-value, identify first the degree of freedom using the formula df= n-1 where n is the sample size. df = n-1 df = 7-1 df = 6 Locate the t-value on the t-table using the degree of freedom and the area (𝛼) df/v 1 2 3 4 5 6 7 0.40 0.325 0.289 0.277 0.271 0.267 0.265 0.263 0.25 1.000 0.816 0.765 0.741 0.727 0.718 0.711 0.10 0.05 3.078 6.314 1.886 2.920 1.638 2.353 1.533 2.132 1.476 2.015 1.440 1.943 1.415 1.895 right tailed area ( α ) 0.025 0.01 0.005 12.706 31.821 63.657 4.303 6.965 9.925 3.182 4.541 5.841 2.776 3.747 4.604 2.571 3.365 4.032 2.447 3.143 3.707 2.365 2.998 3.499 0.0025 127.321 14.089 7.453 5.598 4.773 4.317 4.029 0.0010 318.309 22.327 10.215 7.173 5.893 5.208 4.785 0.0005 636.619 31.599 12.924 8.610 6.869 5.959 5.408 Therefore, the t-value with sample size of 7 and has 𝛼 = 0.05 is 1.943. 2. Since the degree of freedom and 𝛼 are already given locate the t-value on the t-table. Therefore the t-value is 2.528. 8 3. a. The 95th percentile is the number where 95% of the values lie below it and 5% lie above it, so you want the right-tail area to be 0.05. Move across the row, find the column for 0.05, and then locate the t-value using n=30 or df = 29. So therefore, the t-value of P95 with n= 30 is 1.699. b. The 90th percentile is the number where 90% of the values lie below it and 10% lie above it, so you want the right-tail area to be 0.01. Move across the row, find the column for 0.05, and then locate the t-value using n=20 or df = 19. 9 Therefore, the t-value of 90th percentile with n= 20 is 1.328 What’s More Complete the table below by identifying the degree of freedom, right-tailed area and the t-value for each percentiles and the sample size. Percentile n 1. 60th percentile 2 2. 75th percentile 5 3. 25th percentile 15 4. 90th percentile 7 5. 95th percentile 20 df 10 Right tailed area t What I Have Learned With steps closer on closing this module, answer the following questions with at most 3 sentences. Write your answer on a separate sheet of paper. 1. What are the characteristics of a t-distribution? (Give at least 3 characteristics). 2. Enumerate the steps on identifying the t-value using the number of samples and the right tailed area. 3. How to identify the t-value of a certain percentile using the t-table? 11 What I Can Do ACTIVITY 1: Triple Matching Type: Match column A (the t-value) to Column b (the degree of freedom) and then to column C (the percentile). A B 1. 0.277 2. 1.734 3. 1.000 4. 1.345 5. 2.447 C A. 1 a. 97.5th percentile B. 6 b. 90th percentile C. 18 c. 75th percentile D. 3 d. 95th percentile E. 14 e. 60th percentile ACTIVITY 2: Rowena wants to test a randomly selected 20 students for her study. Identify the t-value of the following percentiles. 1. 75th percentile 2. 5th percentile 3. 60th percentile 4. 40th percentile 5. 90th percentile Assessment A. Multiple Choice. Choose the letter of the best answer. Write the chosen letter on a separate sheet of paper. 1. Which of the following is NOT a characteristics of t-distribution? A. Like the normal distribution, the t-distribution has a smooth shape. B. Like the normal distribution, the t-distribution is symmetric. If you think about folding it in half at the mean, each side will be the same. C. Like a standard normal distribution (or z-distribution), the tdistribution has a mean of one. D. The normal distribution assumes that the population standard deviation is known. The t-distribution does not make this assumption. 12 2. This refers to the measure of position with data divided into 100 parts. A. Percentiles B. Quartiles C. Deciles D. Median 3. This refers to the maximum number of logically independent values, which vary in the data sample. A. Level of significance B. Percentiles C. Degree of freedom D. Probability 4. What is the right-tailed area if the confidence interval is 75%? A. 0.05 B. 0.10 C. 0.005 D. 0.25 5. What will be the degree of freedom of the data whose sample size is 18? A. 17 B. 18 C. 19 D. 20 For number 6 to 8, use the given data below to answer the following questions. There are 9 students who will be taking up the test, the researcher sets the level of significance to 0.05. The following are the scores of the nine students: 23, 25, 25, 26, 27, 28, 30, 40, 45 6. What will be the t-value of the given problem? A. 1.729 B. 1.456 C. 1.612 D. 1.551 7. What will be the value of 25th percentile? A. 25 B. 26 C. 27 D. 28 13 8. Using the t-table, give the t-value of 95th percentile? A. 1.729 B. 1.456 C. 1.612 D. 1.551 9. What will be the degree of freedom whose t-value is 0.689? A. 1 B. 2 C. 3 D. 4 10. Find the t-value of 2.5th percentile using the t-table with a degree of freedom of 6. A. 2.570 B. 4.030 C. -2.570 D. -4.030 II. For numbers 11-15, fill out the table below. Percentile 40th percentile 75th percentile 90th percentile 95th percentile 97.5th percentile n 2 10 15 20 25 t Additional Activities A researcher conducts a diagnostic examination to 20 students. The following are the interpretation of the diagnostic examination. 1. James got a score which is 60% higher than those who took the diagnostic test. 2. Rowena has a percentile rank of 90. 3. Jimuel got a score which is only 5% higher than those who took the diagnostic test. 4. Joanna has a percentile rank of 75. 5. Lester is 90% lower than those who took the diagnostic test. 14 , identify the t-value of the percentiles of the following by filling up the table: Student 1. Joanna 2. James 3. Jimuel 4. Lester 5. Rowena t-value Question: 1. Who do you think got the highest score? 2. Who do you think passed the diagnostic examination? 15 What’s New 16 What I Can Do What's More A. 1. D e 2. C d 3. A c 4. E b 5. B a B. 1. 0.688 2. -1.729 3. 0.257 4. -0.257 5. 0.328 What I Know A. 1. D 2. C 3. A 4. B 5. C What’s In A. 6. A 7. A 8. A 9. A 1. 40 2. 50 3. 56 4. 65 5. 90 10.C B. B. 1. 75 2. 80 3. 45 4. 75 5. 50% P P40 P75 P90 P95 P97.5 n 23 20 25 15 10 t 0.256 0.688 1.318 1.761 2.262 Answer Key 17 Assessment A. 1. C 2. B 3. C 4. D 5. A 6. A 7. A 8. C 9. C 10. C B. 1. -0.325 2. 0.703 3. 1.345 4. 1.729 5. 2.064 Additional Activities 1. 0.688 2. 0.257 3. -1.729 4. -1.328 5. 1.328 Questions : 1. Rowena 2. Cannot be determined References Calaca.N, Uy, C. ,Noble, N.M.& Manalo. R. A. (2016). Statistics and Probability. VIBAL BOOKSTORE. https://www.dummies.com/education/math/statistics/how-to-find-percentiles-fora-t-distribution/courses.lumenlearning.com. https://www.jbstatistics.com/r-basics-finding-percentiles-and-areas-for-the-tdistribution/ 18 For inquiries or feedback, please write or call: Department of Education – Region III, Schools Division of Bataan - Curriculum Implementation Division Learning Resources Management and Development Section (LRMDS) Provincial Capitol Compound, Balanga City, Bataan Telefax: (047) 237-2102 Email Address: bataan@deped.gov.ph