Bibliometric research methods
Faculty Brown Bag
IUPUI
Cassidy R. Sugimoto
Overview
Vocabularly
Citation analysis
Citation indices
Bibliometric laws
Impact factor
Applications
Vocabulary
Scholarly Communications
Formal and information
Scientometrics
Scientific communication
Infometrics
Thinking beyond scholarly “texts”
Webometrics
web
Bibliometrics
Application of statistical and mathematical methods
(formal channels)
Citation analysis
Citing
document
Cited
document
A
B
A references B
B is cited by A
Why do people cite?
Why are some articles not cited?
What does a citation mean?
Who’s on first?
Embedded citation index
from ` En mishpat:
Babylonian Talmud (1546)
(Weinberg, 1997)
Shepard’s Citation Index (1873)
Shapiro (1992)
Institute for Scientific Information (ISI)
Scopus
GoogleScholar
Comparison
Scopus
n=7,333 (86%)
Web of Science
n=6,108 (71%)
Scopus
29%
(2,441)
Overlap
57%
(4,892)
Web of
Science
14%
(1,216)
Distribution of unique and overlapping citations in Scopus and Web of
Science (n=8,549)
Are you a citation index?
Bibliometric research OR
“Why I love good indexes”
Citation analysis
Citing
document
Cited
document
A
B
A references B
B is cited by A
Citation analysis: methods
Not just articles…
Variable:PRODUCERS
Variable:PRODUCERS
Variable:ARTIFACTS
Variable:CONCEPTS
Hybrid approaches
Chaomei Chen: http://www.pages.drexel.edu/~cc345/citespace/figures/terrorism1990-2003-300dpi.png
h-index
Hirsch (2005)
A scientist has index h if h of [his/her] Np
papers have at least h citations each, and
the other (Np − h) papers have at most h
citations each.
Bibliometric laws
Lotka’s Law (1926)
the number (of authors) making n
contributions is about 1/n² of those
making one; and the proportion of
all contributors, that make a single
contribution, is about 60 percent
(60,15,7…6>10)
Not statistically exact
May be changing with the current model of scholarship
Bibliometric laws
Bradford’s law (1934)
Journals in a field can be divided into three parts:
1) Core: relatively few # of journals producing 1/3 of all articles
2) Zone 2: same # of articles, but > # of journals
3) Zone 3: same # of articles, but > # of journals
The mathematical relationship of the number of journals in the core to the
first zone is a constant n and to the second zone the relationship is n².
1:n:n²
Not statistically exact
General power law distribution (akin to Pareto’s law in economics)
Bibliometric laws
Zipf’s Law (1935)
listing
wordsUlysses
occurring within that text in order of decreasing
Jamesthe
Joyce's
frequency,
the rank of
a word
on that list multiplied by its
10th most frequent:
2,653
times
frequency
equal a265
constant.
100th mostwill
frequent:
times The equation for this
relationship
is: r x f = k133
where
r is the rank of the word, f is the
200th most frequent:
times
frequency,
and
k is multiplied
the constant
rank of the
word
by the frequency of the word
equals a constant that is approximately 26,500
Not statistically exact
General power law probability distribution
Bibliometric laws
Other power law probability
distributions
Pareto’s law (economics)
80-20 rule
Law of the vital few
Principle of factor sparsity
PageRank (google)
The Long Tail (markets)
Journal impact factors
As a research method…
Reliability?
Validity?
Limitations?
Applications?
Finding and use
Collection development
Reference services
Collection evaluation
Use studies
Information retrieval algorithms
Diffusion of ideas
Domain areas and interdisciplinarity
Mapping science
Writing your paper…