Using Excel: Descriptive Statistics

advertisement
Using EXCEL 2010:
Descriptive Statistics
Department of Operations
Weatherhead School of Management
1
Installing Data Analysis
 To do statistics in Excel, first install the Data Analysis Toolpak
add-in as follows:
 For a Mac, go to the following web site to download and install
StatPlus LE: www.analystsoft.com/en/products/statplusmacle/
 For a PC, use the following instructions:
 Click the File button on the left corner.
2
Installing Data Analysis (Cont.)
 Then you will see the following menu. Click on Options.
Click
3
Installing Data Analysis (Cont.)
 In the Excel Options dialog, click on Add-Ins.
Click
4
Installing Data Analysis Cont.)
 Then you will see the following window.
 There is a Manage section at the bottom. Click on Go.
Click
5
Installing Data Analysis (Cont.)
 In the Add-Ins menu, select Analysis ToolPak.
 Then click on OK.
Click
6
Installing Data Analysis (Cont.)
 It will take some time to load the package.
 When finished, click on the Data tab in the main menu.
 You should see the Data Analysis module on right.
7
The Frequency Table
 Open the file SoftDrink.xls to create the frequency table.
8
The Frequency Table (Cont.)
 Note: In this example, the data is given in TEXT form.
 The frequency table needs to be created manually.
 Type the titles “Item” and “Frequency” in the columns you want
for the frequency table.
9
The Frequency Table (Cont.)
 Manually type all values in the “Item” column. Be sure not to
miss.
Coke Classic
Diet Coke
Pepsi-Cola
Dr. Pepper
Sprite
10
The Frequency Table (Cont.)
 The function COUNTIF is used here to calculate the
frequencies.
 This function needs two values: the range and the criteria.
 Range is the range of cells with the values you want to count.
 Criteria describes which values in that range are to be counted.
11
The Frequency Table (Cont.)
 In cell D2 type “=COUNTIF(”
 Then select the data range. (A2:A51, in this example.)
12
The Frequency Table (Cont.)
 Type a “,”
 For the criteria, simply select the cell in the “Item” column whose
whose values you want to count. (C2, for example)
 Press Enter to perform the function in this cell
13
The Frequency Table (Cont.)
 Type (or copy) the same function for the rest of the items.
 You will get the following frequency table.
=COUNTIF(A$2:A$51,C3)
=COUNTIF(A$2:A$51,C2)
=COUNTIF(A$2:A$51,C4)
=COUNTIF(A$2:A$51,C6)
=COUNTIF(A$2:A$51,C5)
14
The Bar Graph for SoftDrink.xls
 Based on the frequency table you just created, you can get the
bar graph.
 Note: You CANNOT create the bar graph using the sample data
directly.
 How to create the bar graph for the following data is shown on
the next slide.
Item
Coke Classic
Frequency
19
Diet Coke
8
Pepsi-Cola
13
Dr. Pepper
5
Sprite
5
15
The Bar Graph (Cont.)
 First select the data in the frequency table. (C2:D6, in this
example.)
16
The Bar Graph (Cont.)
Click “Insert” tab in Menu.
Then click “Bar”.
Then click “Clustered
Bar”.
17
The Bar Graph (Cont.)
 Then you will see the following bar graph.
18
The Bar Graph (Cont.)
The column option in
Excel creates a vertical
bar graph.
Click “Insert” tab in Menu.
Then click “Bar”.
Then click “Clustered
Bar”.
19
The Bar Graph (Cont.)
 Then you will see the following bar graph.
20
The Pie Chart for SoftDrink.xls
 Creating a pie chart is similar to the bar graph.
 Note: You CANNOT create the bar graph using the sample data
directly.
 How to create the pie chart for the following data is shown on
the next slide.
Item
Coke Classic
Frequency
19
Diet Coke
8
Pepsi-Cola
13
Dr. Pepper
5
Sprite
5
21
The Pie Chart (Cont.)
 First select the data in the frequency table. (C2:D6, in this
example.)
22
The Pie Chart (Cont.)
Click the “Insert” tab in Menu.
Then click “Pie”.
Then click “Pie”.
23
The Pie Chart (Cont.)
 Then you will see the following pie chart.
24
The Pie Chart (Cont.)
Click the pie chart to active Chart Tools.
Click
To show the percentages, change
the layout, as follows.
Click “Chart Tools” tab in Menu.
Then click “Quick Layout”.
Then click “Layout 1”.
25
The Pie Chart (Cont.)
 Then you will see the following pie chart.
26
The Histogram
 Open the file Audit.xls.
Click “Data” tab in Menu.
Then click “Data Analysis”.
27
The Histogram (Cont.)
 In the Data Analysis menu, select Histogram.
 Then click on OK
28
The Histogram (Cont.)
 You will see the Histogram menu.
 In the input section, select the range with your data in the Input
Range. (A2:A21, in this example.)
 Then in the output section, select the cell where you want to
save the result. [Here, the result is put in the same worksheet.]
 Select Chart Output.
 Then click on OK
29
The Histogram (Cont.)
 You will see the following histogram.
30
The Histogram (Cont.)
 In the histogram menu, the bin range was left blank.
 However, you can create your own bin ranges, as follows.
 Type in the bin range first. In this example, the bin ranges are
set to the following (< 10, 10 – 15, 15 – 20, and so on).
31
The Histogram (Cont.)
 In the input section of the histogram menu, enter the bin range
you just created. (B2:B7, in this example.)
 All other settings are the same as before.
32
The Histogram (Cont.)
 You will see the following histogram.
33
Arithmetic Mean for Salary.xls
 AVERAGE(number1, [number2], ...) computes the average
(arithmetic mean) of the numbers.
 Number1 is the first number, cell reference, or range for which
you want the average.
=AVERAGE(B2:B13)
34
Median for Salary.xls
 MEDIAN(number1, [number2], ...) computes the median of the
numbers.
 Number1 is the first number, cell reference, or range for which
you want the average.
=MEDIAN(B2:B13)
35
Percentiles for Salary.xls
 PERCENTILE.INC(array,k) computes the k-th percentile of
values in a range.
 Array is the array or range of data. K is the percentile value.
=PERCENTILE.INC(B2:B13,0.25)
=PERCENTILE.INC(B2:B13,0.5)
=PERCENTILE.INC(B2:B13,0.75)
36
Mode for Salary.xls
 MODE(number1,[number2],...]) computes the mode of the
numbers.
 Number1 is the first number, cell reference, or range for which
you want the average.
=MODE(B2:B13)
37
Variance for Salary.xls
 VAR.P(number1,[number2],...]) calculates the population
variance.
 VAR.S(number1,[number2],...]) calculates the sample
variance.
=VAR.P(B2:B13)
=VAR.S(B2:B13)
38
Standard Deviation for Salary.xls
 STDEV.P(number1,[number2],...]) calculates the population
standard deviation.
 STDEV.S(number1,[number2],...]) calculates the sample
standard deviation.
=STDEV.P(B2:B13)
=STDEV.S(B2:B13)
39
Descriptive Statistics
 Open the file Salary.xls.
Click “Data” tab in Menu.
Then click “Data Analysis”.
40
Descriptive Statistics (Cont.)
 In the Data Analysis menu, select Descriptive Statistics.
 Then click on OK
41
Descriptive Statistics (Cont.)
 You will see the Descriptive




Statistics menu.
In the input section, enter
the data range in Input
Range. (B2:B13 here.)
In the output section, enter
the cells where you want to
save the result. [Here the
results are put in the same
worksheet.]
Select Summary Statistics.
Then click on OK
42
Descriptive Statistics (Cont.)
 You will see the following results.
43
Scatter Diagrams
 Open file Stereo.xls.
44
Scatter Diagrams (Cont.)
 First select the data
range consisting of
two columns.
(B3:C12, here.)
 One column contains
values for X and the
other column for Y.
 You can also include
the row with the
column labels.
45
Scatter Diagrams (Cont.)
Click the “Insert” tab in the Menu.
Then click “Scatter”.
Then click “Scatter with Only Markers”.46
Scatter Diagrams (Cont.)
 Then you will see the following scatter diagram.
47
Correlation and Covariance
 COVARIANCE.P(array1,array2) computes the population





covariance XY.
Array1 is the cell range of values for the X variable.
Array2 is the cell range of values for the Y variable.
COVARIANCE.S(array1,array2) computes the simple
covariance SXY.
CORREL(array1,array2) computes the
sample correlation = population correlation.
Open file Stereo.xls as an example (see the next slide).
48
Correlation and Covariance (Cont.)
=COVARIANCE.P(B3:B12,C3:C12)
=COVARIANCE.S(B3:B12,C3:C12)
=CORREL(B3:B12,C3:C12)
49
Download