Assignment 2 ECON 306 Total Point: 30 Due date: Saturday May 18, 2024 Time: 11:59PM Submission: Online (via Canvas) Format: PDF and Excel Instruction. - This is an individual assignment. Identical works will not be graded. - Submit PDF file and Excel file that you use to calculate Textbook: Applied Statistics in Business and Economics, 5th Edition, Doane, David P., and Lori E. Seward., Mcgraw-Hill, 2016. Exercise 1. A. Below are five questions from a survey of MBA students. Answers were written in the blank at the left of each question. For each question, state the data type (categorical, discrete numerical, or continuous numerical) and measurement level (nominal, ordinal, interval, ratio). Explain your reasoning. If there is doubt, discuss the alternatives. What is your gender? (Male = 0, Female = 1) What is your approximate undergraduate college GPA? (1.0 to 4.0) About how many hours per week do you expect to work at an outside job this semester? What do you think is the ideal number of children for a married couple? On a 1 to 5 scale, which best describes your parents? 1 = Mother clearly dominant ↔ 5 = Father clearly dominant _____ Q1 _____ Q2 _____ Q3 _____ Q4 _____ Q5 B. (a) Give three original examples of discrete data. (b) Give three original examples of continuous data. In each case, explain and identify any ambiguities that might exist. Hint: Do not restrict yourself to published data. Consider data describing your own life (e.g., your sports performance, financial data, or academic data). You need not list all the data, merely describe them and show a few typical data values. Exercise 2. Which measurement level (nominal, ordinal, interval, ratio) is each of the following variables? Explain. a. b. c. d. e. f. Number of passengers on Delta Flight 833. Waiting time (minutes) after gate push back before Delta Flight 833 takes off. Brand of cell phone owned by a cabin attendant on Delta Flight 833. Ticket class (first, business, or economy) of a randomly chosen passenger on Delta Flight 833. Outside air temperature (Celsius) when Delta Flight 833 reaches 35,000 feet. Passenger rating (on 5-point Likert scale) of Delta’s in-flight food choices. 36 Applied Statistics in Business and Economics probability of being selected as the items we chose at the beginning of the sampling process. This could lead to a bias (a tendency to overestimate or underestimate the parameter we are trying to measure) in our sample results. Sampling with replacement does not lead to bias. When should we worry about sampling without replacement? Only when the population is finite and the sample size is close to the population size. Consider the Russell 3000® Index, which has 3000 stocks. If you sample 100 stocks, without replacement, you have “used” only Exercise 3. about 3 percent of the population. The sample size n 5 100 is considered small relative to the population size N 5 3000. A common criterion is that a finite population is effectively infinite The target population is all stocks in the S&P Is each the following a parameter statistic? if the sample is less500 thanindex. 5 percent of theofpopulation (i.e., if nyN # .05).or InaChapter 8, you will learn how to adjust for the effect of population size when you make a sample estimate. For now, you ratio only need to recognize thatinsuch a. The average price/earnings for all 500 stocks the adjustments S&P index.are of little consequence when the population is large. b. TheproportionofallstocksintheS&P500indexthathadnegativeearningslastyear. c. The proportion of energy-related stocks in a random sample of 50 stocks. nite for Population? d. The average rateInfi of return 20 stocks recommended by a broker. Chapter 2 Final PDF Data Collection PDF When the sample is less than 5 percent of the population (i.e., when nyN # Final .05), then the to printer Exercise 4. See Section 2.4. Sampling Methods of the Textbook. 2.20 Suppose you want to studyis the numberinfi of nite. e-mail by students your statistics population effectively An accounts equivalentowned statement is that a in population is effectively whenisititisifatyou least times each as large as thewho sample whenID Nyn $ 20).endclass. What kindinfi ofnite sample (a)20survey student has (i.e., a student number Below is a 6 x 8ing array of all moviegoers. a population. random in ancontaining odd number.the (b)ages survey the students Treat sitting this in theasfront row, and (c)Select survey aevery fifth sample of 10 moviegoers’ ages by using (a) simple random sampling with Excel’s = RANDBETWEEN( ) function student who arrives at the classroom? Because computers are easier, we rarely use random number tables. Table 2.6 shows a (see Option A in 2.6 below), (b) simple between random sampling with Excel’s = 2.21 Below is a 6 Table 3few 8 array containing thetoages of moviegoers file 1 and Hunger Games). Treaton this alternative ways choose 10 integers(see 875. All are based a software INT(1+max(Data)*RAND()) function (see Option B in Table 2.6 below), (c) systematic sampling, (d) as a population.algorithm Select a random sample of 10 moviegoers’ ages by using (a) simple random sampling that creates uniform decimal numbers between 0 and 1. Excel’s function =RAND( ) sampling, (e)does convenience sampling. Explain your methods. with a and random number simple random sampling with Excel’sfunction. =RANDBETWEEN( (c) this,table, and (b) many pocket calculators have a similar We call) function, these pseudorandom s in Business andjudgment Economics systematic sampling, (d) judgment sampling, and algorithms (e) convenience sampling. Explain your methods. generators because even the best eventually repeat themselves (after a cycle of millions of numbers). Thus, random data encryption scheme could conceiv32 34 33 12 a software-based 57 13 58 16 the number of days on the market35for Intel the 36 recent home sales20in the city of ably and hardware-based ISES 3.4 (a) The table shows 23 23 be broken. 62 To enhance 65 data security, 15 other firms 17 are examining methods (e.g., based on thermal noise or radioactive decay) to prevent patterns or repetition. Sonando Hills. Construct a frequency and31histogram, nice (round) 14 11 51 distribution 33 13 using 11 58 bin limits. Fortunately, most applications don’t require that degree of randomness. For example, the iPod (b) Describe the 23 distribution features. 10 and note 63 any unusual 34 12 15Homes62 13 Shuffle’s song choices are not strictly random because its random numbers are generated by 40 11 18 62 64 30 42 20 repeat period is so an algorithm from a “seed number” that eventually repeats. However, the 18 70 86 notice. 12149 and MINITAB’s 86 random 21 5652 that an 11 51 never 38 15 21 3numbers66 great iPod17 user would Excel’s are good enough for most purposes. In the previous what was the 26 proportion of all2848 moviegoers6 who were55under age21 96 2.22 (a)41 50 population,176 30? (b) For each of the samples of size n 5 10 that you took, what was the proportion of moviegoers TABLE 43 2.6 under 20 age 30? (c)Excel—Option 56 16 20 into 1030spreadsheet 31 cells. Was each sample closefunction to the population proportion? Enter the57 Excel =RANDBETWEEN(1,875) A71 proportion 3.5 Pressfor F9 to new friends sample. into cells B1:B10. Choose three Some Ways In 44 Excel, type a92 list containing 10 of ayour 44 2.23to Get 179 names 80get 98 44 66 15 10 Random Integers Enter the function =INT(1+875*RAND()) 10A1:A10, spreadsheet cells. Excel—Option B names at random by randomizing this list. To do this, enter =RAND( ) intointo cells copy thePress F9 between 1 and 875 to get a new sample. random column and paste it using Paste Special > Values to fix the random numbers, and then sort The website Internet www.random.org give you many kinds of excellent in (a) The table shows oncolumn. a stateThe civil exam bywill24sample. applicants for positions the list raw by thescores random firstservice three names aretaken the random random numbers (integers, decimals, etc). law enforcement. Construct a frequency distribution and histogram, using nice (round) bin limits. MINITAB Use MINITAB’s Random Data menu with the Integer option. (b) Describe distribution and note any unusual features. Civil 2.5theDATA SOURCES Pocket Calculator 83 Press the RAND key to get a random number in the interval [0,1], multiply by 875, then round up to the next integer. 93 74 98 85 82 79 78 LO One goal of a statistics course is to help you learn where to find data that might be needed. 82 Fortunately, 68 many excellent 67 82 are widely78available, either 83 in libraries 70 or through 99 private sources Find everyday prin Randomizing aofList purchase. Table 2.10 summarizes a few them. electronic data sou 18 96Section 3.2. 93 of the textbook. 62 64 93 27 58 Exercise 5.The See U.S. Census Bureau and the U.S. Bureau of Labor Statistics are rich sources of data on To randomize a list (assuming it is in a spreadsheet) we can insert the Excel function many different aspects of life in the United States. The publications library supported by the =RAND( ) beside each row. This creates a column of random decimal numbers between 3.6 (a) MakeCensus a frequency and histogram (using appropriate bins) for these 28reports observaBureau distribution can 0beand found at www.census.gov. annual 1. Copy the random numbersThe and monthly, paste themquarterly, in the sameand column using Paste Special > by spent the Bureau of Labor statistics be found at www.bls.gov. Itsort should noted tions on published the amount for dinner four in can downtown Chicago on Then Friday night. (b) Repeat Values to “fi x” for them (otherwise they will keep changing). all thebecolumns by the that until 2012, the Statistical Abstract of the United theWhy? largest,The most random number and voilà—the list was is now random! fiDinner rstgeneral, n items and on the ranthe exercise, using a different number ofcolumn, bins. Which isStates preferred? domized list can now be used random sample. This methodAis2012 especially useful freely available annual compendium of facts andasfiagures from public sources. review of when the by list the is very long (perhaps of lines). The fiarst n items113 are a random sample 95 publications 103sponsored 109 170millions 114 107of the U.S. Census Bureau concluded with decision to quit publishing entire list, for they as likely as any previous others. years’ publications at the U.S. youarecan still access 124 the Statistical 105 Abstract. However, 80 104 84 176 115 Census website. 69 95 and monthly134 108 61 Economic Report 160 of the Presi128 For annual time series economic data, try the 68 dent (ERP),95which is published 61 every February. 150 The tables52in the ERP can87be downloaded 136 for free in Excel format. Data on cities, counties, and states can be found in the State and Metropolitan Area Data Book, published every few years by the Bureau of the Census and 3.7 (a) Makeavailable a frequency distribution histogram for the monthly off-campus rent paid by on CD-ROM in many and libraries. doa37304_ch02_022-055.indd 36 11/21/14 5:07 PM 30 students.Annual (b) Repeat the from exercise, using a different ofmost bins.bookstores. Which is These preferred? Why? almanacs several major publishersnumber are sold at include data reprinted from the above sources, but also information on recent events, sports, stock marRents ket, elections, Congress, world nations, states, and higher education. One of these almanacs 730 should be on every 730 informed citizen’s 730 shelf. 930 700 570 690 1,030 Type of Data 740 Examples 620 720 670 TABLE 2 learn how Excel fits a trend line, how to interpret it, and when such a line is meaningful. But since almost every student discovers this option the first time he or she makes a scatter plot, we must mention Excel’s fitted trend line here purely as a descriptive tool that may help you find patterns in (X, Y) data. Exercise 6. See Section 3.7. of the textbook. 3.21 (a) Use Excel to make a scatter plot of the data for bottled water sales for 10 weeks, placing Price on the X-axis and Units Sold on the Y-axis. Add titles and modify the default colors, fonts, etc., as you judge appropriate to make the scatter plot effective. (b) Describe the relationship (if any) between X and Y. Weak? Strong? Negative? Positive? Linear? Nonlinear? WaterSold Unit Price Units Sold 1.15 0.94 1.04 1.05 1.08 1.33 0.99 1.25 1.16 1.11 186 216 173 182 183 150 190 165 190 201 3.22 (a) Use Excel to make a scatter plot of these vehicle data, placing Weight on the X-axis and City MPG on the Y-axis. Add titles and modify the default colors, fonts, etc., as you judge appropriate to make the scatter plot effective. (b) Describe the relationship (if any) between X and Y. Weak? Strong? Negative? Positive? Linear? Nonlinear? Weight and MPG for 20 Randomly Selected Vehicles Vehicle Acura TSX BMW 3-Series Chevrolet Corvette Chevrolet Silverado 1500 Chrysler Pacifica Dodge Caravan Ford Focus Infiniti FX Jaguar XJ8 Lexus IS300 Lincoln Aviator Mazda 6 Land Rover Freelander Mercedes-Benz S-Class Nissan 350Z Nissan Xterra Pontiac Vibe Pontiac Grand Am Toyota Sienna Volvo C70 Random sample of 2003 vehicles. CityMPG City MPG Weight (lbs.) 23 19 19 14 17 18 26 16 18 18 13 19 17 17 20 16 28 25 19 20 3,320 3,390 3,255 4,935 4,660 4,210 2,760 4,295 3,805 3,390 5,000 3,355 3,640 4,195 3,345 4,315 2,805 3,095 4,120 3,690 SECTION E