10/29/12& sta$s$cs&for&designers &

advertisement
10/29/12&
how&can&I&find&out&what&%&are&red?&
sta$s$cs&for&designers&
dr.&carman&neustaedter&
what&if&there&are&too&many&to&count?&
how&can&I&find&out&the&%&red?&
count&a&sample&and&generalize&
how&can&we&improve&the&sample?&
take&more&samples&
1&
10/29/12&
variability:&the&difference&between&mul8ple&
measurements&from&a&popula8on&
is&the&%&red&in&one&container&the&same&as&another?&
sta$s$cs&is&about&the&rela8onship&between&samples&
and&a&popula8on&
two&types&of&sta$s$cs&
descrip$ve&stats:&describe&data&
e.g.,&graphs,&variability,&central&tendency&
&
inferen$al&stats:&make&inferences&(conclusions)&
about&the&data&
e.g.,&a&sample&generalizes&to&the&popula8on&
&
&
descrip$ve&stat&examples&
central&tendency&
frequency&distribu$on:&plot&of&how&frequently&
each&value&appears&in&the&data&for&each&level&of&
the&IV&
mean:&add&up&all&measurements,&divide&by&total&
number&of&measurements&
&
median:&the&measurement&with&half&of&the&
measurements&above&it&and&half&below&it&
&
mode:&the&most&frequent&measurement&
2&
10/29/12&
mode?&
median?&
mean?&
mode?&
median?&
mean?&
Frequency&Distribu8on&
6&
5&
4&
3&
2&
1&
0&
7.6&
mode?&
median?&
mean?&
Frequency&Distribu8on&
6&
7.7&
7.8&
7.9&
8&
8.1&
8.2&
8.3&
8.4&
mode?&
median?&
mean?&
5&
4&
3&
2&
1&
0&
7.6&
7.7&
7.8&
7.9&
8&
8.1&
8.2&
8.3&
8.4&
mode&is&the&8.0P8.9&group&
mode?&
median?&
mean?&
mode?&
median?&
mean?&
8.03&would&give&10&above&
it&and&10&below&it&
3&
10/29/12&
is&central&tendency&enough?&
mode?&
median?&
mean?&
=&160.46&/&20&=&8.02&&
no,&these&data&sets&have&the&same&mean&but&different&variability&
&
we&also&need&to&describe&what&the&variability&is&
measures&of&variability&
measures&of&variability&
range:&the&smallest&score&subtracted&
from&the&largest&score&
&
e.g.,&8.40&–&7.64&=&0.76&
“median&of&8.03&with&range&of&0.76”&
&
&
variance:&average&of&the&squared&difference&
between&the&scores&and&the&mean&
&
standard&devia$on:&the&square&root&of&the&
variance&
standard&devia$on&
e.g.,&or&the&average&distance&of&each&data&point&from&the&mean&
&
standard&devia$on&
probability&distribu8on&graph&
probability&
probability&distribu8on&graph&
68.2%&of&scores&fall&within&
one&standard&devia8on&
from&the&mean&
probability&
1& mean& 1&
stddev&
stddev&
1& mean& 1&
stddev&
stddev&
4&
10/29/12&
standard&devia$on&
standard&devia$on&
probability&distribu8on&graph&
probability&distribu8on&graph&
95.45%&of&scores&fall&
within&two&standard&
devia8ons&from&the&mean&
probability&
99.73%&of&scores&fall&
within&three&standard&
devia8ons&from&the&mean&
probability&
1& mean& 1&
stddev&
stddev&
graphing&mean&and&standard&devia$on&
1& mean& 1&
stddev&
stddev&
graphing&mean&and&standard&devia$on&
Mean&Time&Per&Task&
9&
8&
7&
Time&(s)& 6&
68.2%&of&scores&fall&within&
one&standard&devia8on&
from&the&mean&
5&
4&
3&
2&
1&
0&
Task&1&
Task&2&
Task&3&
Task&4&
differences&between&means&
differences&between&means&
condi$on&one:&
3,&4,&4,&4,&5,&5,&5,&6&
&
&
&
&
&
condi$on&two:&
4,&4,&5,&5,&6,&6,&7,&7&
condi$on&one:&
3,&4,&4,&4,&5,&5,&5,&6&
&
&
&
&
&
condi$on&two:&
4,&4,&5,&5,&6,&6,&7,&7&
Task&5&
Time&for&Task&One&
8&
7&
6&
5&
4&
3&
2&
1&
0&
condi8on&1&
mean&=&4.5&+/&0.9&&
condi8on&2&
mean&=&5.5&+/&1.2&&
is&there&a&sta$s$cally&significant&difference&
between&the&means?&
5&
10/29/12&
tOtest&
different&types&of&distribu$ons&
a&simple&sta$s$cal&test:&allows&one&to&say&
something&about&differences&between&means&at&
a&certain&confidence&level&
&
tPtests&work&on&normal&distribu8ons&
tOtest&
levels&of&significance&
null&hypothesis&of&the&tOtest:&
no&difference&exists&between&the&means&of&two&sets&
of&collected&data&
&
possible&results:&
&
1)&I&am&95%&sure&that&the&null&hypothesis&is&rejected&
e.g.,&there&is&probably&a&true&difference&between&
the&means&
&
2)&I&cannot&reject&the&null&hypothesis&
e.g.,&the&means&are&likely&the&same&
how&certain&do&we&want&to&be?&
&
95%&=&p&<&0.05&(there&is&a&5%&chance&we&are&wrong)&
5&out&of&100&8mes,&this&result&would&occur&by&chance&
&
99%&=&p&<&0.01&(there&is&a&1%&chance&we&are&wrong)&
1&out&of&100&8mes,&this&result&would&occur&by&chance&
&
&
in&HCI,&we&typically&pick&p&<&0.05&or&p&<&0.01&
decide&on&the&p&level&before&tes8ng&
calcula$ng&the&tOtest&
calcula$ng&the&tOtest&
compute&a&tPtest&using&a&
stats&package&
e.g.,&Excel,&SPSS,&JMP&
&
significance:&if&p&<&0.05,&there&is&a&sta8s8cally&
significant&difference&between&the&two&
samples.&&&
&
if&your&nullPhypothesis&said&there&was&no&
difference&(no&effect),&then&reject&it&
1.&select&a&p&value&you&want&to&test&
against,&e.g.,&0.05&
&&
&
2.&compare&each&condi8on’s&column&of&
measurements&in&the&test&
&
3.&tPtest&gives&you&a&pPvalue&
&
&
&
&
6&
10/29/12&
calcula$ng&the&tOtest&
demo&in&Excel&
significance:&if&p&<&0.05,&there&is&a&sta8s8cally&
significant&difference&between&the&two&
samples.&&&
&
if&your&nullPhypothesis&said&there&was&no&
difference&(no&effect),&then&reject&it&
&
no&significance:&if&p&>=&0.05,&there&is&not&a&
sta8s8cally&significant&difference&between&the&
two&samples.&&But&we&cannot&say&they&are&the&
same.&
&
if&your&nullPhypothesis&said&there&was&no&
difference&(no&effect),&then&you&cannot&reject&
it.&&you&also&cannot&accept&it.&
&&
&
different&types&of&tOtests&
different&types&of&tOtests&
unpaired:&comparing&two&sets&of&independent&
observa8ons&
direc$onal&(oneOtailed):&tests&if&data&set&A&is&
greater&than&data&set&B,&but&not&the&other&way&
&
nonOdirec$onal&(two&tailed):&tests&both&
direc8ons&
e.g.,&different&subjects&in&each&group&(between&subjects)&
&
paired:&comparing&two&sets&of&dependent&
observa8ons&
e.g.,&same&subjects&in&each&group&(within&subjects)&
&
most&commonly&we&use&a&twoPtailed&test&because&our&
distribu8ons&are&symmetric&around&the&mean&
types&of&errors&
types&of&errors&
type&1&–&false&posi$ve:&we&see&a&difference&but&
there&isn’t&really&one&
&
type&2&–&false&nega$ve:&we&don’t&see&a&
difference&but&there&really&is&one&
e.g.,&our&samples&happen&to&be&different&because&of&random&sampling.&if&we&
selected&different&samples,&we’d&see&a&difference&
&
&
low&confidence&level&(e.g.,&p&<&0.1):&greater&chance&of&Type&1&errors&
&
&
e.g.,&our&samples&happen&to&be&the&same&because&of&random&sampling.&if&we&
selected&different&samples,&the&samples&would&be&different&
&
high&confidence&level&(e.g.,&p&<&0.0001):&greater&chance&of&Type&2&errors&
&
7&
10/29/12&
single&factor&analysis&of&variance&(ANOVA)&
what&if&we&want&to&compare&more&than&two&sets&
of&data?&
compare&three&or&more&means&
e.g.,&comparing&mousePtyping&on&three&keyboards&
&
qwerty&
&
S1P10&
&
&
&
possible&results:&
alphabe8c&
S11P20&
dvorak&
S21P30&
mouse&typing&is&fastest&on&qwerty&keyboard&
the&same&on&alphabe8c&&&dvorak&
analysis&of&variance&(ANOVA)&
compares&rela8onships&between&many&factors&
&
qwerty&
alphabe8c&
dvorak&
cannot&touch&
type&
S1P10&
S11P20&
S21P30&
can&touch&type&
S31P40&
S41P50&
S51P60&
when&can&we&use&different&descrip$ve&and&
inferen$al&sta$s$cs?&
&
what&type&of&data&do&we&need?&
scales&of&measurements&
scales&of&measurements&
nominal&scale:&numbers&are&used&as&a&name&or&
iden8fier;&no&real&quan8ta8ve&proper8es&
&
ordinal&scale:&numbers&can&be&ordered&or&ranked,&
but&we&don’t&know&the&difference&between&ranks&
&
allowable&stats:&
median&&&percen8les&
&
can&only&count&the&frequencies,&no&allowable&stats&
&
&
&
&
e.g.,&first&place&runner&finished&ahead&of&second&place&finisher&but&we&don’t&know&
by&how&much&they&won&
&
e.g.,&children&rate&their&preference&of&a&new&mouse&compared&to&their&old&one:&
1&–&worst&
2&–&not&as&good&
3&–&neutral&
4&–&bejer&
5&–&best&
&
8&
10/29/12&
scales&of&measurements&
scales&of&measurements&
interval&scale:&numbers&can&be&ordered&and&
we&know&the&difference&between&ranks;&&
zero&is&by&conven8on&
&
allowable&stats:&
mean,&standard&devia8on,&variance,&range,&
tPtest,&ANOVA&
ra$o&scale:&interval&scale&with&an&absolute,&nonP
arbitrary&zero&
&
allowable&stats:&
all&those&for&interval,&coefficient&of&varia8on&
&
&
e.g.,&temperature&in&degrees&C&or&F&
&
e.g.,&Likert&scale&(where&there&is&no&real&“zero”&value)&
1&–&strongly&disagree&
2&–&disagree&
3&–&neutral&
4&–&agree&
5&–&strongly&agree&
e.g.,&temperature&in&degrees&K,&length,&weight,&8me&periods&
e.g.,&how&fast&did&people&perform&on&interface&A&vs.&interface&B&
e.g.,&accuracy&on&interface&A&vs.&interface&B&
&
&
&
&
wrapping&up&
9&
Download