Relevance.ppt

advertisement
Relevance
in information science
© Tefko Saracevic
1
Preface
1975 & 1976

Relevance: A Review of the Literature
and a Framework for Thinking on the
Notion in Information Science
2006

© Tefko Saracevic
Relevance: A Review of the Literature
and a Framework for Thinking on the
Notion in Information Science. Part II
Relevance
2
Purpose
trace the evolution of thinking on
relevance in information science for
the past three decades
provide an updated framework within
which the still widely dissonant ideas
on relevance might be interpreted
and related to one another
© Tefko Saracevic
Relevance
3
Relevance & information
technology
Relevance around forever
Computers ~ 50 years

internet ~ 30; web ~ 15
IT changed information activities
Systems designed to provide
relevant information
Invisible hand of relevance always
present
© Tefko Saracevic
Relevance
4
… in information science
Information retrieval (IR) systems
offer their version of what may be
relevant
People go their way & asses relevance
The two worlds interact
Concern here:


© Tefko Saracevic
human world of relevance
how IR deals with relevance NOT covered
Relevance
5
Significance of relevance
research
Contribution of relevance
scholarship:

knowledge for knowledge sake
 better understanding of the notion of
relevance significant in itself

potential for making information systems
better – more human oriented
 technological advances often dependent on
better understanding of underlying notions,
phenomena
© Tefko Saracevic
Relevance
6
Historical note
Relevance came into IR unannounced
From very start after WWII IR was
about retrieval of relevant information
First concerns: about “false drops” –
non relevant retrievals
First direct recognition in 1955 with
proposal for precision & recall as
evaluation measures based on relevance
© Tefko Saracevic
Relevance
7
Organization of
presentation
1. What is the NATURE of relevance?
 1.1 Meaning?
 1.2 Theories?
 1.3 Models?
2. What are MANIFESTATIONS of
relevance?
3. What is the BEHAVIOR of
relevance?
4. What are the EFFECTS of
relevance?
© Tefko Saracevic
Relevance
8
1.1 Meaning of relevance
Intuitively well understood


same perception globally – “y’know”
a “to” and context always present
Relevance:


a relation between objects P & Q along
property R
may also include a measure S of the strength
of connection
© Tefko Saracevic
Relevance
9
… in information science
Relation between inf. or inf. objects
(Ps) & contexts (Qs) based on some
property R & measure of intensity S
But: relevance is not given – it is
established
Big questions & challenges


© Tefko Saracevic
How does relevance happen?
Who does it, under what circumstances,
and how?
Relevance
10
Inference:
created - derived
Systems & automatons create
relevance
Users derive relevance


but could be more complex & a continuum
derive could involve more than topicality
 greater expertise = greater derivative powers


create could involve “intelligence”
also involves
 intent
 selection & interaction processes
© Tefko Saracevic
Relevance
11
Summary:
Meaning of relevance
Relevance attributes involve:



relation
intention
context
 internal; external




© Tefko Saracevic
inference
selection
interaction
measurement
Relevance
12
1.2 Theories of relevance
Theories suggested in several fields


logic – in deduction of inferences, to reject
fallacies – relevance logics
philosophy – in phenomenology structure &
functioning of the “life-world” (Alfred Schutz)
 it is stratified & relevance is the principle for
stratification
 no single relevance but an interdependent
system of relevances (plural)
 thematic (topical), interpretational, &
motivational relevance
© Tefko Saracevic
Relevance
13
… theories in
communication
Sperber & Wilson: Relevance Theory




based on inferential model of communication
what must be relevant and why to an
individual with a single cognitive intention?
posited a cognitive & a communicative
principle of relevance
assessed in terms of cognitive effects &
processing effort
 individuals pick up the most relevant stimuli &
process them to maximize their relevance
© Tefko Saracevic
Relevance
14
Summary:
Theories of relevance
IS did not develop indigenous theory
but a few “theories-on-loan” attempts



Logic theories not applied, yet
Schutz’s life-world theory used to some
extend in specifying manifestations &
models
Sperber & Wilson’s theory used few
times as explanation & guide
 not tested
© Tefko Saracevic
Relevance
15
1.3 Models of relevance
Reviews – also produced models
Syracuse school of relevance

dynamic & situational model
& Nilan, 1990)
(Schamber, Eisenberg
 connection with human information behavior
“Whole history of relevance”

(Mizzaro, 1997)
duality in modeling & studying
 documents & queries, 1959-1976
 dynamics & multidimensionality, 1977-present
© Tefko Saracevic
Relevance
16
Split between system &
user models
Opposing views of IR: systems & users
 IR traditional model does not deal with users
Battle royal started by Dervin & Nilan
(1986)
 criticism of system viewpoint
 call for user orientation
Several user & interaction models
proposed – to reconcile, bridge

still relevance has two basic models &
cultures & they map like Australia
© Tefko Saracevic
Relevance
17
Stratified model
Situational
Affective
intent; motivation ...
Cognitive
knowledge; structure...
Query
USE
R
Ada
pta
t io n
characteristics …
hardware; connections...
Processing
software; algorithms …
Content
inf. objects; representations...
© Tefko Saracevic
n
o f in
form
atio
Engineering
MP
UTE
R
Ada
ptat
ion
INTERFACE
CO
INT
ER
AC
T
ION
&
Surface level
Use
RE
LE
VA
NC
E
ST
RA
TA
tasks; context...
Relevance
18
User vs system models
“Informing systems design”

became mantra of all relevance studies
“Tell us what to do and we will do it.”

response from systems side
But “telling” is not that simple
Issue is not a conflict but:

how can we make user & system side work
together for benefit of both?
© Tefko Saracevic
Relevance
19
Summary:
relevance models
All IR & inf. seeking models have
relevance at their base
Traditional IR model has most
simplified –“weak”- version of relevance

but with the weak model IR is successful
Variety of integrative models have
been proposed

more complex models = increased challenge
to incorporate in practice
© Tefko Saracevic
Relevance
20
© Tefko Saracevic
Relevance
21
2. Manifestations of
relevance
“How many relevances in IR?”
(Mizzaro, 1998)
 Several manifestations recognized since 1950s
 Issue: What given objects (Ps & Qs) are related
by what given property (Rs) as relation?
 [adjective] relevance or different name
Duality strikes again


subject (topic, system) relevance
vs user (psychological, cognitive) relevance
objective vs subjective relevance
© Tefko Saracevic
Relevance
22
Issue of primacy:
weak and strong relevance
Does topical relevance underlie all
others?
 predictably two answers: yes and no


in a strict correspondence between
query & answer topical is basic – weak
relevance
if derivation is involved topical may not
be basic – strong relevance
 weak relevance is more associated with
systems
 strong more with people
© Tefko Saracevic
Relevance
23
Beyond duality
Numerous other kinds of relevance
were identified

for user relevances:
 psychological, cognitive, affective, situational,
socio-cognitive, pertinence, utility …

for topical relevances:
 logical, systems, algorithmic, documentary,
bibliographic …

each indicates different relations
© Tefko Saracevic
Relevance
24
Summary:
Manifestations of relevance
There are a limited number of
manifestations – could be grouped:





System or algorithmic relevance
Topical or subject relevance
Cognitive relevance or pertinence
Situational relevance or utility
Affective relevance
These are interdependent – they
feed on each other interactively
© Tefko Saracevic
Relevance
25
3. Behavior of relevance
Relevance does not behave, people do


how humans determine relevance of
information or information objects?
reviewed only studies that have data
 some 30 studies – started in 1991
 related to information seeking & use studies &
implicit relevance studies – not reviewed
Pattern for review:
 [author] used [subjects] to do [tasks] in order to
© Tefko Saracevic
study [object of research]
Relevance
26
Relevance clues
What makes information or information
objects relevant? What do people look
for in order to infer relevance?
 two approaches: topic & clues analysis
Clues research:
 uncover & classify attributes or criteria that
users concentrate on while making relevance
inferences
 usually on documents but also other objects
© Tefko Saracevic
Relevance
27
Examples
Schamber (1991) interviewed 30 users of weather
information using different sources, from oral
reports to documents and maps in order to derive
and categorize their relevance criteria. She
identified 22 categories in 10 groups.
Barry (1994) interviewed 18 academic users (not
specified as being students or faculty) who had
requested an information search for documents
related to their work in order to categorize their
relevance criteria. She identified 23 categories in
7 groups
© Tefko Saracevic
Relevance
28
Relevance dynamics
Do relevance inferences and criteria
change over time for the same user
and task, and if so, how?
As a user progresses through various
stages of a task



© Tefko Saracevic
the user’s cognitive state changes
the task changes as well
thus, something about relevance also is
changing.
Relevance
29
Relevance feedback
What factors affect the process of
relevance feedback?

What types of feedback? How much is
feedback used?
Dealing here with manual not
automatic feedback

© Tefko Saracevic
behavior of people when involved in
manual feedback
Relevance
30
Summary on behavior
Caveats abound – nothing standardized

still refreshing to see data
Some generalizations on clues:

criteria finite in number, similar, but
different weights assigned
 Different users, tasks, progress in tasks, classes
of users = similar criteria = different weights
 Different ratings of relevance = similar criteria
= different weights.
© Tefko Saracevic
Relevance
31
…summary clues
Clues criteria:







© Tefko Saracevic
content
object
validity
use or situational match
cognitive match
affective match
belief match
Relevance
32
… summary clues
Criteria are not independent; people
apply multiple criteria; they interact


content (topic) criteria very important
but not sole – interact with others
for search outputs value of results as a
whole critical
Visual information = faster inference
than textual information
© Tefko Saracevic
Relevance
33
… summary dynamics
Inferences dependent on task stage




© Tefko Saracevic
criteria stable, selection changes
Different stages = differing selections
but different stages = similar criteria =
different weights
Increased focus = increased
discrimination = more stringent
relevance inferences
What is topical changes with progress in
time and task
Relevance
34
… summary feedback
Several kinds

search term, content, magnitude, tactics
Use of relevance feedback = increase
in performance

however, used rarely in practice
Searching behavior different when
using feedback
© Tefko Saracevic
Relevance
35
5. Effects of relevance
Works both ways: relevance affected by
and affects host of factors
Relevance judges


What factors inherent in relevance judges make
a difference in relevance inferences?
How large are and what affects individual
differences in relevance inferences?
 similar question asked for a number of information
activities – indexing, searching …
 most often studied: domain knowledge
© Tefko Saracevic
Relevance
36
Relevance judgments
What factors affect relevance
judgments?


short answer: a lot of them
approach: classify into tables e.g.
 Schamber (1994) 80 factors, 6 categories
 Harter (1996) 24 factors, 4 categories

different approach here:
 classify studies along basic assumptions in IR
evaluations
© Tefko Saracevic
Relevance
37
Central assumptions
Relevance is:






topical
binary
independent
stable
consistent
if pooling: complete
Not to prove or disprove these assumptions
but to organize studies along questions
© Tefko Saracevic
Relevance
38
Beyond topicality
Do people infer relevance based on
topicality only?
Other factors enter & interact
Only a few studies directly addressed
 Wang & Soergel (1998) 11 criteria for
selection, with topicality being top
 Xu & Chen (2006) in web searching: topicality &
novelty most significant, then reliability &
understandability
© Tefko Saracevic
Relevance
39
Beyond binary
Are relevance inferences binary i.e.
relevant – not relevant? If not, what
gradation do people use in inferences
about relevance of information or
information objects?
Number of studies addressed this


studied distributions of relevance
inferences
regions of relevance
© Tefko Saracevic
Relevance
40
Beyond independence
Are information objects assessed
independently of each other? Does the
order or size of the presentation affect
relevance judgments?


Only a few studies on the questions
Includes presentation of different
representations
© Tefko Saracevic
Relevance
41
Beyond stability
Are relevance judgments stable as
tasks and other aspects change? Do
relevance inferences and criteria
change over time for the same user and
task, and if so how?

mentioned already under dynamics
 judgments not completely stable, criteria are
 Plato: “Everything is flux.”
© Tefko Saracevic
Relevance
42
Beyond consistency
Are relevance judgments consistent
among judges or group of judges?
 human judgments about anything informational
are not consistent, relevance included
Gull (1956) opened the Pandora box
 classic example of law of unintended
consequences
Some 6 studies addressed consistency

© Tefko Saracevic
subjects: experts, students
Relevance
43
But does it matter?
How does inconsistency in human
relevance judgments affect results of
IR evaluation?

main contention by critics
Five studies 1968 – 2000 addressed
the question

© Tefko Saracevic
four also showed magnitude of agreement
Relevance
44
Summary: judges
Subject expertise accounts strongly
 higher expertise = higher agreement, less
differences
 lesser expertise = more leniency in judgment

large variability in relevance inferences
by individuals
 same range as in other cognitive processes
© Tefko Saracevic
Relevance
45
Summary: judgments
Relevance is measurable!
None of the 5 postulates hold

but by simplifying relevance for labs IR
made significant advances
Relevance judgments are not binary
but are bimodal

regions of low, middle and high relevance
 high peaks at both end
Order affect relevance judgment
© Tefko Saracevic
Relevance
46
… summary: judgments
Consistency:
 Higher expertise = higher consistency =
more stringent. Lower expertise = lower
consistency= more encompassing
 overlap using different populations hovers
around 30%
 higher expertise up to 80%
 when 3rd, 4th … judge added overlap falls
 Higher expertise =larger overlap. Lower
expertise =smaller overlap. More judges =
less overlap.
© Tefko Saracevic
Relevance
47
Summary: does it matter?
In lab conditions disagreement among
judges does not affect evaluation

rank order of different IR systems changes
minimally
 Different judges = same relative performance (on
the average)


swaps in ranking do occur = low probability
but performance for individual topics differs
significantly
 law of averages kicks in
© Tefko Saracevic
Relevance
48
Summary: measures
Users can use a variety of scales


© Tefko Saracevic
there is no “best” scale
magnitude scales very appropriate but
hard to explain & analyze
Relevance
49
Epilogue
Many things changed in IR &
information science but goals the same
As to nature of relevance:
 marked progress in understanding
 little in theory
 diversification in models in models
As to manifestations:
 consensus there are several kinds of
relevance, grouped in a half dozen or so well
distinguished classes - interdependent
© Tefko Saracevic
Relevance
50
… epilogue
As to behavior & effects:

seen a number of experimental &
observational studies
 lifted the discourse beyond debate,
anecdotes to data interpretation
 but generalizations difficult – findings
should be treated as hypotheses
© Tefko Saracevic
Relevance
51
… epilogue: reflections
Relevance is poor

no funding for relevance research
 of studies with data less than 17% mentioned outside
funding, half from outside the US
 scholarship progressed sporadically & all over the
place
Globalization of IR – globalization of
relevance

© Tefko Saracevic
as relevance went global & to the masses many
& different research questions emerged
Relevance
52
… epilogue: reflections…
Proprietary IR – proprietary relevance

major search engines proprietary =
relevance proprietary
 for innovation must study users & use, but
findings kept private
 relevance research into public & private branch
 paradox of the internet
© Tefko Saracevic
Relevance
53
… epilogue: research
agenda - beyonds
Beyond behaviorism & black box


© Tefko Saracevic
many studies stimulus-response & no
diagnostics
need to go use/adapt other approaches
Relevance
54
… epilogue: research
agenda - beyonds
Beyond mantra

“implication for system design”
 incorporating user concerns &
characteristics not that simple
 integration between user/cognitive &
systems approaches needed
 relevance research and IR research should
at least get engaged, if not married
 interactive research on the right track
© Tefko Saracevic
Relevance
55
… epilogue: research
agenda - beyonds
Beyond students

~70% of behavior & effect studies used
students as population
 not surprising – they are affordable
 we know a lot about student relevance
 does it generalize to other populations?
© Tefko Saracevic
Relevance
56
In conclusion
Information technology & systems
will change dramatically


even in the short run
and in unforeseeable directions
But relevance is here to stay!
© Tefko Saracevic
Relevance
57
© Tefko Saracevic
Relevance
58
© Tefko Saracevic
Relevance
59
Acknowledgments
Ying Zhang & Yuelin Li for searches &
organization
Sandra Lanman for editing
Students in Spring 2006 class Human
Information Behavior for comments

Jeanette de Richemond for corrections
SCILS for research & collegial
atmosphere. BEST COLLEAGUES!
Gus Friedrich for understanding & time
© Tefko Saracevic
Relevance
60
Download