Chapter 1: Data Collection • Why Statistics? A Manager Needs to Know Statistics in order to: – Properly present and describe information – Draw conclusions about populations based on sample information – Understand Statistical relationship (causality) – Improve processes – Obtain reliable forecasts • www.unlv.edu/faculty/nasser Key Concepts • A population (universe) is the collection of all items or things under consideration – A parameter is a summary measure that describes a characteristic of the entire population • A sample is a portion of the population selected for analysis – A statistic is a summary measure computed from a sample to describe a characteristic of the population Key Concepts, Continued • Descriptive statistics (art)-- Collecting, summarizing, and describing (presenting) data from a sample or a population • Inferential statistics – The process of using sample statistics to draw conclusion about the population parameters Example: Descriptive Statistics • Collect data – e.g., Survey • Present data – e.g., Tables and graphs • Characterize data – e.g., Sample mean = X n i Example: Inferential Statistics • Estimation – e.g., Estimate the population mean weight using the sample mean weight • Hypothesis testing – e.g., Test the claim that the population mean weight is 120 pounds Sources of data • Before collection of data , a decision maker needs to: – Prepare a clear and concise statement of purpose – Develop a set of meaningful measurable specific objective – Determine the type of analyses needed – Determine what data is required Sources of Data, Continued • Primary Data Collection – Experimental Design – Conduct Survey – Observation (focus group) • Secondary Data Compilation/Collection – Mostly governmental or industrial, but also individual sources Types of Data • Random Variable – Values obtained are not controlled by the researcher (theoretically values differ from item to item) • Data from a RV are either: – Quantitative • Continuous (measuring) • Discrete (Counting) – Qualitative (categorical) • Nominal • Ordinal Types of Sampling Methods • Non-Probability Sampling -- Items included are chosen without regard to their probability of occurrence. i. Judgment ii. Quota iii. Chunk iv. Convenience • Probability Sampling – Items are chosen based on a known probability. Let N=size of the population and n=desired sample size i. ii. With replacement -- Prob. of each item and any round =(1/N) Without replacement -- Prob. of each item =(1/N), 1/(N-1), …1/[N(n-1)] Types of Probability Sampling • Items in the sample are chosen based on known probabilities Probability Samples Simple Random Systematic Stratified Cluster Types of Probability Samples, Con’t • Simple Random Sample -- Every individual or item from the frame has an equal chance of being selected. In addition, any selected sample has the same chance of being selected as any other. – Samples obtained from table of random numbers or computer random number generators • Systematic Samples -- Divide frame of N individuals into groups of k individuals: k=N/n. Randomly select one individual from the 1st group. Then Select every kth individual thereafter Types of Probability Samples, Con’t • Stratified samples -- Divide population into subgroups (called strata) according to some common characteristic. A simple random sample is selected from each subgroup. Samples from subgroups are combined into one • Cluster Samples -- Population is divided into several “clusters,” each representative of the population. Then, a simple random sample of clusters is selected – All items in the selected clusters can be used, or items can be chosen from a cluster using another probability sampling technique Evaluation of a Survey • • • • • What is the purpose of the survey? Is the survey based on a probability sample? Coverage error – appropriate frame? Nonresponse error – follow up Measurement error – good questions elicit good responses • Sampling error – always exists