Uploaded by ansh garg

stats f sheet

advertisement
By CA Anand V Kaku
STATISTICS FORMULAE
No. of class interval × class lengths = Range
LCB = LCL –D/2
UCB=UCL +D/2
Mid-point =LCB+UCB/2 or LCL+UCL/2
Frequency density of a class interval = frequency of that class interval/corresponding class length
Relative frequency and percentage frequency of a class interval=class frequency/ total frequency
ARITHMETIC MEAN
…
Discrete Series
GEOMETRIC MEAN
.
…..
GM=
HARMONIC MEAN
Logarithm of G for a set of
observations is the AM of the
logarithm of the observations; i.e.
∑ log
log
∑
.
Frequency
Distribution
…
…
GM=
∑
1
MODE
the value that occurs the maximum
number of times
∑ log
…..
.
∑
2
X
where,
= LCB of the modal class. i.e. the
class containing mode.
= frequency of the modal class
= frequency of the pre – modal
class
= frequency of the post modal class
C = class length of the modal class
By CA Anand V Kaku
STATISTICS FORMULAE
GEOMETRIC MEAN
ARITHMETIC MEAN
Relationship
variables
HARMONIC MEAN
if z = xy, then
GM of z =(GM of x)×(GM of y)
MODE
ymo=a+bxmo
if z = x/y then
GM of z =(GM of x)/(GM of y)
Weighted Mean
Weighted A.M
∑
.
∑
∑
.
log
∑
Combined Mean
Important points
-
.
∑
-
∑
.
-
the algebraic sum of deviations of a
set of observations from their AM is
zero ∑
0
-
-
-
AM is affected due to a change of
origin and/or scale which implies
that if the original variable x is
changed to another variable y by
effecting a change of origin,
-
-
-
Relation among
Averages
For Given two positive numbers (A.M) x (H.M) = (G.M)2
AM
GM
HM
The equality sign occurs, as we have already seen, when all the observations are equal.
Mode = 3 Median – 2 Mean
STATISTICS FORMULAE
MEDIAN
Discrete Series
/Unclassified Data
Median
1
Size of
2
QUARTILES (Q1,Q2 & Q3)
DECILES (D1,D2 ,D3,…… D9)
PERCENTILES (P1,P2,P3,….,
P99 )
Q1 quartile is given by the
1
value
the D1 Decile is given by the
1
value
the P1 Percentile is given by
the
1
value
Dn Decile is given by the
1
value
Pn Percentile is given by the
1
value
the Qn quartile is given by the
1
value
Group Frequency
Distribution
2
By CA Anand V Kaku
.
X
= lower class boundary of the
median class i.e. the class
containing median.
N = total frequency.
= less than cumulative frequency
corresponding to l1. (Pre median
class)
= less than cumulative frequency
corresponding to l2. (Post median
class)
= being the upper class boundary
of the median class.
C = - = length of the median
class.
yme = a + b xme
∑
is minimum if we choose A
as the median.
.
X
= lower class boundary of the
Quartile class i.e. the class
containing Quartile.
N = total frequency.
p= , ,
for Q1, Q2 ,Q3
respectively
= less than cumulative
frequency corresponding to l1.
(Pre Quartile class)
= less than cumulative
frequency corresponding to l2.
(Post Quartile class)
= being the upper class
boundary of the Quartile class.
C = - = length of the Quartile
class.
X
= lower class boundary of
the Decile class i.e. the class
containing Decile.
N = total frequency.
p= , , , … . ,
for D1, D2
,D3 ,…., D9 respectively
= less than cumulative
frequency corresponding to
. (Pre Decile class)
= less than cumulative
frequency corresponding to
. (Post Decile class)
= being the upper class
boundary of the Decile
class.
C = - = length of the
Decile class.
.
X
= lower class boundary of
the Percentile class i.e. the
class containing Percentile.
N = total frequency.
p=
,
,
,…,
for P1,
P2 ,P3,…, P99 respectively
= less than cumulative
frequency corresponding to
l1 . (Pre Percentile class)
= less than cumulative
frequency corresponding to
. (Post Percentile class)
= being the upper class
boundary of the Percentile
class.
C = - = length of the
Percentile class.
By CA Anand V Kaku
STATISTICS FORMULAE
DISPERSIONS
RANGE (R)
Absolute
Range= Laregest (L) – Smallest (S)
MEAN DEVIATION (M.D)
about A
M.DA= ∑|
MEAN DEVIATION (M.D)
about A.M ( )
M.D about Mean = ∑|
MEAN DEVIATION (M.D)
about Median
M.D about Median = ∑|
Relative
100
Co efficient of Range =
Co efficient of M.D from A =
.D
100
Co efficient of M.D from A.M =
|
STANDARD DEVIATION (σ)
σ
σ
∑ x
|
.D
|
X
|
| Deviation for First n Natural
Standard
numbers ,
QUARTILE DEVIATION
(Qd)
VARIANCE (σ2)
100
Co efficient of M.D from Median
.D
100
=
Co efficient of Variation =
Qd=
Combined Standard Deviation,
Where d1=
σ12=
, d2=
100
Co-efficient of Q.D =
Or
Co-efficient of Q.D =
Variance means Square of Standard Deviation
M.Dy = | | M.Dx
.
M.Dy = | | M.Dx
100 σy = | | σx
X
Standard Deviation for Two numbers,
M.Dy = | | M.Dx
X
n
∑x
n
If
y = a+bx
Ry = | | R x
100
STATISTICS FORMULAE
By CA Anand V Kaku
CORRELATION
KARL PEARSON’S PRODUCT MOMENT
CORRELATION COEFFICIENT
&
Where,
1
For tied ranking,
∑
| || |
1
∑
∑
∑
where
1
where c is concurrent deviation,
∑
X
∑
∑
∑
COEFFICIENT OF CONCURRENT
DEVIATIONS
6∑
∑
&
∑
SPEARMAN’S RANK CORRELATION
COEFFICIENT
∑
6 ∑
∑
12
1
m is one less than number of pairs of
observations
By CA Anand V Kaku
STATISTICS FORMULAE
REGRESSION ANALYSIS
Y depends on X
y= a + byx x
Simple Regression
Equation
Normal Equations
b
x
x
Regression Co
efficient
X depends on Y
x = a + bxy y
b
b
x
b
y
,
b
b
b
y
y
,
b
Intersection
point of these
two lines is ,
b
∑
∑
∑
∑
b
∑
b
∑
∑
∑
∑
∑
b
where
1
By CA Anand V Kaku
STATISTICS FORMULAE
NAME
Binomial
Distribution
CONDITION
Trials are independent
and each trail has only
two outcomes Success
& Failure.
PROBABILITY MASS FUNCTION
n
x
P(X=x) = Cx p q
n-x
Notation
X ∼ B (n,p)
MEAN
VARINACE
μ=np
σ2=npq
Trials are independent
and probability of
occurrence is very small
in given time.
P(X = x) =
X ∼ P (m)
.
If (n+1)p is integer
Mode, μ0 = (n+1)p-1
then
p+q=1
If (n+1)p is non-integer then
Mode,μ0 =Highest Integer in
(n+1)p
For x = 0,1,2,..,n
Poisson
Distribution
Remarks
MODE
μ=m
σ2=m
If m is Integer Mode,μ0 =m-1
e = 2.71828
If m is non- integer Mode,μ0
= Highest Integer in m
!
For x = 0,1,2,..,n
Normal or
Gaussian
Distribution
When distribution is
symmetric
X ∼N (μ,σ2)
/
σ√
For
∞
∞
z=
Mean
=
Median
=
Mode
=μ
σ2
μ
Mean Deviation =
0.8 σ
First Quartile
= μ - 0.675σ
Third Quartile
= μ + 0.675σ
Quartile
Deviation=0.675σ
Point of Inflexion
x=μ - σ and x=μ + σ
Download