Writing Intensive Statistics Course Dr. Renan Sezer LaGuardia Community College Why Write? Write to learn Improve communication skills Develop analytical reading ability Develop ability to interpret result Frequent feedback Writing Exercises Low stake writing assignments Lead Up or Follow Up “Staged” high stake writing assignments Outcomes Better understanding of the concepts Higher success rate on exams Increased teacher-student interaction Improved writing Reduced fear of writing What did the students say? 77% - helpful in learning the subject better 85% - helped better prepare for the exam 69% - recommend such course for future students 15% - not sure 15% - not recommend Writing Assignment I (Week 1) A small company pays its president $200,000. The vice-president gets $150,000. The foreman earns $30,000. There are three workers in the company, earning $25,000, $22,000, 22,000 respectively. The secretary earns $19,000. 1. 2. 3. 4. 5. Find the mode, median, mean income in this company. The union representative wants to negotiate new wages. In this negotiation would he emphasize mode, median or mean? Explain why in a paragraph. Which of the three measures would the president of the company stress in the negotiations? Explain why in a paragraph. Which do you think best represents this sample? Explain why in a paragraph. Can you say that the measure you chose in #4 is always best to use? Explain why or why not in a paragraph. A Mediocre One: 1) Mode = $22,000 (appears twice) 19,000, 22,000, 22,000, 25,000, 30,000, 150,000, 200,000 Mean: 468,000 / 7 = $ 66,857 Median: 19t, 22t, 22t, 25t, 30t, 150t, 200t 2)I think that the union representative would want to base his petition on the mode salary. By doing this, using the mode, he can argue that the three workers and the secretary are underpaid. 3) ? (I have no idea) but as a president of a company I would be primarily concerned in the company’s profits and not increasing my employees’ wages. I would stress the mean since it a higher number than the median or mode. 4) Median is best choice. I think that the mean ($66,857) wage is not a true representation of the individual wages since there are such large discrepancies among them, but then neither is the mode or the median. There is no best representation. 5) Since I feel that neither mean, mode or median are of any use to best represent this sample: the best choice would depend on job descriptions and their respective hourly wages. My answer probably makes no sense but neither does this problem since union and management are usually at odds when it comes to coming to common ground when the problem is about employee demands and especially wages. The union wants better wages and working conditions while management is concerned about production and profit margin. I do not feel that this approach of using mean, mode and median will help to solve the problem. A Bad One: 1) a) Mode is $22,000 b) Mean is $3319714.2 The sam of all wages divited by 7 23238000 / 7 = 3319714.2 c) Median is $25000 2) The union representative who wants to negotiate new wages has to emphazize the median of that company. He has to compare it with the standarts given by the graph. By doing so, he can show that the median of the company is far less than the standarst. 3) The president of the company would stress in the negotiations on the Mode because it is close to the standarts given in the graph. 4) Median represents best this sample. 5) The measure that I chosed in this sample is Median But, that doesn’t mean that it is always the best to use because it is not going to be fair in other circumstances. Some times is better to use Mean. Writing Assignment II (Week 2) Describe in your own words what mean and standard deviation indicate. Give examples. (Note: You are not asked to describe how mean and standard deviation are calculated.) Writing Assignment III (Week 3) Drug X has standard deviation of 3 days recovery time. Drug Y has standard deviation of 7 days recovery time. Based solely on this information which of the following statements is true? a) Drug X is a better drug than drug Y. b) Drug Y is a better drug than drug X. c) Drug X and drug Y are equally good. d) No decision can be made. Given the standard deviation above, suppose drug X has mean recovery time of 30 days and drug Y has mean recovery time of 15 days. Explain which drug is better. Given the standard deviation above, suppose drug X has mean recovery time of 8 days and drug Y has mean recovery time of 20 days. Explain which drug is better. Looking back on your answers to previous questions, write a paragraph about reaching a decision based solely on standard deviation. Writing Assignment IV (Week 4) Explain in your own words what it means for two events to be independent. (You are not asked to give a formula.) Give an example. Explain in your own words what it means for two events to be dependent. (You are not asked to give a formula.) Give an example. Explain in your own words what it means for two events to be mutually exclusive. Be sure to use the concepts of dependence and independence, which you have used in part A, and B, in explaining mutually exclusive events. Give an example. Writing Assignment V (Week 7) It is impossible to set up a table giving areas of regions lying beneath normal curves for each different pair of means, , and standard deviations, . Clearly one way to get around this problem is to standardize the distribution so that we can work with a normal curve having mean 0, and standard deviation 1. Then, a single table will be sufficient to estimate areas. Suppose that Tom and Diane are enrolled in two different sections of a statistics class. Assume a sufficiently large class size, so that the scores on the first exam follow a normal distribution. In Tom’s section, the average is 60 and his score is 72. In Diane’s section, the average is 71 and her score is 83. Both are happy because their scores are 12 points above the average of each respective section. Who did better and why? Suppose that the standard deviations in Diane’s section was =6.4 and in Tom’s section was =6.5. Now who did better and why? Find Diane’s percentile and Tom’s percentile. Writing Assignment VI (Week 8) Often in real life it is almost impossible to find the mean of a large population. A) If you were a statistician and needed to find the mean of a large population, what could you do to estimate the population mean? B) Does the method that you describe in part A take into consideration the possible occurrence of great numbers of outliers? C) How might you modify or extend the method you describe in part A to reduce the influence of outliers on your estimate of the mean. Writing Assignment VII (Week 9) Describe in your own words the (celebrated) Central Limit Theorem. What assumptions must be met in order to use its conclusions? Give an example of a situation where the Central Limit theorem may fail because of unmet assumptions Writing Assignment VIII (Week 10) Compare the graph of the normal distribution of x (whole population) to the graph of the normal distribution of (means of the samples that are taken, that is, sample means). In your discussion be sure to include comparison of the means, standard deviations. Could the two curves coincide exactly? Writing Assignment IX (Week 11) Cancer patients using a certain treatment have an average survival time of 2 years with a standard deviation of 4 months. A physician claims that his new method of treatment increases the average survival time of patients, while keeping the standard deviation the same. 100 randomly chosen patients who received this new treatment had an average survival time of 26 months. 1) What is the probability that 100 randomly chosen patients had an average survival of 26 months or more? 2)Interpret the result of problem 1 in a paragraph, defending your reasoning whether the claim that the new treatment is more effective or is bogus (not more effective). Steps For Completing The Final Project: (1) Use the Internet to obtain sample data. There are many, many possibilities here, depending mainly on you own interests. The only restriction is that the data that you choose must be part of a larger whole. For example, data collected over the 50 states on teenage pregnancy would be a natural sample of “national” teenage pregnancy. Data collected over various countries could be a reasonable sample of “global” data, but you would have to be careful about country selection. Task assigned: week 1. Due date: Beginning of week 3. Feedback: week 4. (2) Compute the descriptive statistics for your data (mean, mode, median, standard deviation, minimum, maximum, quartiles), and prepare a histogram and box plot. Task assigned: beginning of week 4. Due date: beginning of week 5. Feedback: end of week 5. (3) Describe in your own words the data you have chosen for your project, and why you have selected it. What goals do you hope to achieve in carrying out your project? Please include in this writing a discussion of any formulas you will use in the computation section to follow. Define the variables used in the formulas. Task assigned: beginning of week 4. Due date: beginning of week 7. Feedback: beginning of week 9. (4) Carry out the computation at a confidence level of 95%. Task assigned: end of week 9. Due date: end of week 11. Feedback: week 12. (5) Write a cogent conclusion for your project and submit the completed work to your teacher. Task assigned: end of week 9. Due date: end of week 11. Feedback: week 12. Questions ? Workload? Time? Class size? Other? YOU CAN FIND THIS INFORMATION IN OUR COURSE WEB SITE: http://faculty.lagcc.cuny.edu/rsezer/