Classification Design Creating subject languages INF 384 C, Spring 2009

advertisement
Classification Design
Creating subject languages
INF 384 C, Spring 2009
Outline
• What is being represented? Defining the
domain of representation.
• What overall structure best fits the domain?
Determining the general form of the subject
language.
• How should concepts be arranged within the
structure? Ordering concepts at each level of
the classification.
INF 384 C, Spring 2009
Defining a subject domain
A subject language represents a domain of knowledge
as a system of related concepts.
Last week, for example, you decided that “lawn
maintenance” and “sustainable urban gardening in
Austin” were represented by a particular group of
concepts.
But how do you determine what “lawn maintenance” is?
INF 384 C, Spring 2009
Defining domains as discourse
communities
Hjorland and Albrechtsen argue that information
systems should be based on discourse
communities of academic disciplines.
Psychology = what psychology researchers say,
because this is the closest we can get to the
“reality” of psychology.
INF 384 C, Spring 2009
Representing competing ideas
But a behaviorist psychologist and a psychologist who
adopts a psychoanalytic perspective may have widely
differing views, even on basic concepts.
Hjorland and Albrechtsen contend that a domain
analysis should uncover these schools of thought that
exist within a discipline, in order to represent them
within an information system.
INF 384 C, Spring 2009
Understanding the significance
of domain analysis
The domain analytic approach acknowledges problems
with the semantic validity of information systems and
asserts the need for systematic and rigorous justification
for knowledge representations.
The domain analytic approach clarifies that a subject
may have many interpretations and provides a means
for validating some of those interpretations.
INF 384 C, Spring 2009
Understanding the limitations
of domain analysis
The domain analytic approach ignores concerns
related to an information system’s audience and
purpose.
The domain analytic approach doesn’t explain
how a designer of subject languages should
decide between competing approaches to the
subject.
INF 384 C, Spring 2009
Performing a domain analysis
Find documents that help you determine ontology
(important things), epistemology (important problems,
theories, methods and practices), and social structures in
the selected domain.
For example:
–
–
–
–
–
FAQs, beginners’ guides, and textbooks.
Bibliographies.
Professional organizations.
Online user forums.
Actual people!
INF 384 C, Spring 2009
Using a domain analysis to make
decisions
A domain analysis can help you determine:
• Which sources of information are more
important or representative than others.
• Which concepts to include in a subject
language.
• How those concepts might be defined, labeled,
and related to other concepts.
INF 384 C, Spring 2009
Understanding your responsibility
as a classificationist
A domain analysis won’t actually make decisions for
you; it will just provide information for you to use, in
concert with your audience description, classification
purpose, and your own goals and editorial judgment.
Every classification is a type of argument for a
particular interpretation of a subject. You need to own
that!
INF 384 C, Spring 2009
Types of classificatory structures
Kwasnik identifies three types of structures:
• Hierarchies.
• Trees.
• Facets.
INF 384 C, Spring 2009
Hierarchies
A hierarchy groups concepts from most general
to most specific: each branch inherits
characteristics from its parents.
Dogs exhibit all of the characteristics of
mammals (and thus all characteritics of primates,
and of vertebrates) plus additional
characteristics.
INF 384 C, Spring 2009
Trees
According to Kwasnik, a tree is a looser form
that shows a consistent principle of organization,
but does not have the strict rules of inheritance
and so forth that hierarchies have.
Example:
Grandparents
Parents
Children
INF 384 C, Spring 2009
Facets
Facets are fundamental, ideally orthogonal, categories
of concepts.
Concepts from different facets can be combined to form
more complex concepts without needing to enumerate
all possible combinations in advance.
Example: By keeping processes and materials separate,
we can describe roasting, braising, or frying any kind of
food item, without enumerating each type of food for
every process.
INF 384 C, Spring 2009
Example facets
From Aitchison, possible facets:
• Abstract entities.
• Attributes.
• Materials (constituent substances).
•
Naturally occurring entities.
•
Living entities/organisms.
•
Artifacts (man-made).
•
•
•
•
•
Parts/components.
Agents/actors.
Equipment/apparatus.
Patients (recipients of actions).
Products (of actions).
INF 384 C, Spring 2009
•
Processes (intransitive actions).
•
•
•
Operations (transitive actions).
Place.
Time.
Arrangement within hierarchies
Two forms:
• Showing multiple principles of division that
relate children to their parent node (subfacets).
• Ordering via appropriate principles at each
array (Ranganathan’s canons and such).
INF 384 C, Spring 2009
Showing principles of division
(subfacets)
shoes
high heels
hiking boots
mary-janes
pumps
running shoes
sandals
slingbacks
stilettos
wedges
winter boots
INF 384 C, Spring 2009
shoes
(by season)
winter
spring
(by function)
hiking
running
(by style)
boots
pumps
sandals
(by feature)
slingbacks
mary-janes
(by heel type)
stilettos
wedges
(by heel height)
high heels
Ordering concepts at each level
An “array” is a group of siblings (descriptors at the
same level of hierarchy). These need to be ordered, even
if you don’t need subfacets! Possible orders:
• General to specific.
• Chronological.
• Close to far away.
• Order of a process.
INF 384 C, Spring 2009
Example: music tempos
Allegro
Andante
Largo
Moderato
Presto
Vivace
Largo
Andante
Moderato
Allegro
Vivace
Presto
(alphabetical)
(slowest to fastest)
INF 384 C, Spring 2009
Your mission
• Continue doing research to define the subject.
• Continue compiling a list of potential concepts
to include in your classification.
• For each potential concept, why should it be
there? What is your basis for making this
decision?
• Start thinking about how your concepts might
be structured.
INF 384 C, Spring 2009
Download