Book Collections under the Impact of the Cloud
The Impact of Mass Digitization Project on

黃鴻珠
Tamkang University Library
2011-11-10
1
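Outline
Introduction
Problems in storing print book collections
How to address print book storage
Print book collections in the cloud era
Recommendations for print book collections in Taiwan
Conclusion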
2
Introduction
What to withdraw? Print collection
management in the wake of digitization
3
What to Withdraw? (1/2)
1. Digitized to high standards of quality?
2. Errors are actively being corrected?
3. Digital copies are reliably preserved?
4. Image-intensive?
5. Terms of provision are reliable?
http://www.ithaka.org/ithaka-s-r/research/what-towithdraw/What%20to%20Withdraw%20%20Print%20Collections%20Management%20in%20the%20Wake%20of%20Digitization.pdf
4
What to Withdraw? (2/2)
6. How should individual libraries handle these
materials?
7. Time horizon for ensuring that some print
copies are retained across the community?
8. Level of assurance that at least one copy
remains after the stated time horizon?
5
The Crisis of the Book
• Codex in crisis
• RLG Partnership Annual Meeting 2010, "When the
Books Leave the Building: The Future of Research
Libraries, Collections and Services"
• "6 Reasons We're In Another 'Book-Burning' Period in
History," S. Peter Davis, October 11, 2011
6
6 Reasons We're In Another 'Book-Burning'
Period in History
6. A Library Near You Is Doing It Right Now
5. It's Cheaper Than Giving Them Away
4. It Has to Be Done in Secret
3. The Economy Is Killing the Library
2. Libraries Can't Grow Fast Enough
1. The Books Are Going Digital
http://www.cracked.com/article_19453_6-reasons-were-inanother-book-burning-period-in-history.html
7
UCSD Weeds Books on a Large Scale
• As Library Footprint Shrinks, University of
California, San Diego, Weeds 150,000 Volumes
• Library at Scripps Institution of Oceanography
scheduled to close in 2012
By Bob Warburton Oct 18, 2011
http://www.libraryjournal.com/lj/home/892450264/as_library_footprint_shrinks_university.html.csp
8
Constance Malpas, "Change in Emphasis: Shared Print Environment," in
RLG Partnership symposium, 2010
http://www.slideshare.net/oclcr/books-leave-the-building-malpas-final
An end to magical thinking
The books have already left
the building.
>70 million volumes off-site
• 30% of Columbia’s collection
• 40% of UC Berkeley’s
• 50% of UCLA’s
• +50% of Harvard’s, etc.
6 editions held by 1,116 libraries
No evidence that loss of browsing
has adversely affected scholarship
or institutional reputation
9
Problems Facing Book Collections
Print and e-book holdings are both growing rapidly
Demand for user space
10
Issues in E-book Collections
Single library vs. consortium
Individual titles vs. packages
Time-limited vs. perpetual access
Local vs. cloud
11
Print and E-book Volumes Are Both Growing Rapidly
5 Myths about the "Information Age"
1. The book is dead
2. We have entered the information age
3. All information is now available online
4. Libraries are obsolete
5. The future is digital
By Robert Darnton ,5 Myths About the ‘Information Age’, The Chronicle of
Higher Education, April 17, 2011
http://chronicle.com/article/5-Myths-About-the-Information/127105/
12
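Print Book Publishing Output
2011: 1 million titles published worldwide in the year
2010-10-01: "Super Thursday" in the UK,
with 800 titles published in a single day
United States, 2009: 288,355 titles published
United States, 2009: 764,448 titles published through informal (non-traditional) channels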
By Robert Darnton ,5 Myths About the 'Information Age'
The Chronicle of Higher Education, April 17, 2011
http://chronicle.com/article/5-Myths-About-the-Information/127105/
13
ARL 2008-2009 Annual Growth in Book Volumes
http://www.arl.org/bm~doc/arlstat09.pdf
14
Annual Growth at Major U.S. Libraries
• Book collections in U.S. research libraries grow by 2% per year
Reshaping the Research Library: Some Observations on the Future
of Academic Collections
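To make the 2% figure concrete, the sketch below computes how long a collection takes to double at that rate. It assumes constant compound growth, which the slide does not state, and the function name is illustrative only.

```python
import math


def doubling_time_years(annual_growth_rate: float) -> float:
    """Years for a collection to double under constant compound growth.

    Assumption: growth compounds annually at a fixed rate.
    """
    return math.log(2) / math.log(1 + annual_growth_rate)


years = doubling_time_years(0.02)
print(f"Doubling time at 2% per year: {years:.1f} years")
```

Under these assumptions a collection growing 2% a year needs about 35 years to double, so shelf space planned today is a multi-decade commitment.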
15
Demand for User Space
Library as a Place
Place as a Library
Information Commons
Learning Commons
16
Ohio State University
http://chronicle.com/blogPost/Library-Renovation-at Ohio/4700
17
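Solutions for Print Book Storage
• Weeding
• Relocating collections
Off-site storage
Shared storage
High-density warehouse storage
• Digitizing books
• New acquisition policies
PDA, DDA, UDA
• New services
POD, Espresso Book Machine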
18
The rationale for weeding (1/2)
1. Electronic resources are the dominant
information format
2. Print use is low and declining
3. Library stacks and storage facilities are
crowded
4. Library space is wanted for other purposes
http://sampleandholdr2.blogspot.com/2011_10_01_archive.html
19
The rationale for weeding (2/2)
5. Keeping print books on the shelves is expensive
6. Many copies of the same titles exist in many
libraries
7. Secure digital versions exist for millions of titles
8. The infrastructure to support sharing exists and is
growing
9. Savings from adopting shared print can support
other library services
20
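Book Use Study –
The University of Pittsburgh
36,892 volumes acquired in 1969
Tracked for 6 years to see which were borrowed and which were not
About 40% of the volumes were not borrowed within 6 years
A book not borrowed within 2 years has a 1/4 chance of ever being borrowed
A book not borrowed within 6 years has a 1/50 chance of ever being borrowed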
Use of Library Materials: The University of Pittsburgh Study, by Allen Kent,
New York: Marcel Dekker, 1979
http://sampleandhold-r2.blogspot.com/
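The Pittsburgh figures can be turned into volume counts with a few lines of arithmetic. This is only a sketch: it treats "about 40%" as exactly 0.40 and applies the 1/50 probability uniformly to every unused volume, both of which are simplifying assumptions rather than claims from the study.

```python
PURCHASED_1969 = 36_892            # volumes acquired in 1969 (from the study)
SHARE_UNUSED_6_YEARS = 0.40        # "about 40%", treated as exact (assumption)
P_USE_AFTER_6_IDLE_YEARS = 1 / 50  # chance an unused volume is ever borrowed

# Volumes that did not circulate in their first six years.
unused_after_6 = round(PURCHASED_1969 * SHARE_UNUSED_6_YEARS)

# Expected number of those volumes that will ever circulate later.
expected_later_use = unused_after_6 * P_USE_AFTER_6_IDLE_YEARS

print(f"Unused after 6 years: {unused_after_6:,}")
print(f"Expected to circulate later: about {expected_later_use:.0f}")
```

On these assumptions roughly 14,757 volumes sat unused for six years, and only about 295 of them would be expected to circulate afterwards.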
21
OhioLINK-OCLC Collection and
Circulation Analysis Project 2011
Encompasses 89 libraries and tracks collection
overlap and circulation activity across 30
million items
The circulation activity is drawn from the
Spring of 2007 and 2008
The analysis of a year's statewide circulation
statistics indicates that 80% of the circulation is
driven by just 6% of the collection.
22
Weeding Principles – Ohio State University
1. Their content is available in electronic format and digital
preservation standards and best practices are assured.
2. A minimum of two copies of the same print resources are
available in OhioLINK’s depository system, or when an
electronic resource supersedes or renders a printed
resource obsolete.
3. Consult with appropriate teaching faculty and other
interested users about the proposed withdrawals.
23
Book De-selection Principles –
Grand Valley State University
Withdrawal candidate criteria
No circulations since 1998
Publication year before 2000
More than 100 US holdings
More than 10 in-state holdings (Michigan)
Not listed in Resources for College Libraries
Never reviewed in CHOICE
http://sampleandhold-r2.blogspot.com/2011_03_01_archive.html
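The withdrawal criteria above read naturally as a rule-based filter over catalog records. The sketch below is one possible encoding, not Grand Valley State's actual tooling: the `Title` record and its field names are hypothetical, all six criteria are assumed to apply together, and "no circulations since 1998" is read as "last circulated before 1998 or never".

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Title:
    """Hypothetical catalog record; field names are illustrative."""
    last_circulation_year: Optional[int]  # None means never circulated
    publication_year: int
    us_holdings: int
    in_state_holdings: int
    in_rcl: bool               # listed in Resources for College Libraries
    reviewed_in_choice: bool   # ever reviewed in CHOICE


def is_withdrawal_candidate(t: Title) -> bool:
    """True only if every listed criterion holds (assumed to be ANDed)."""
    no_circ_since_1998 = (
        t.last_circulation_year is None or t.last_circulation_year < 1998
    )
    return (
        no_circ_since_1998
        and t.publication_year < 2000
        and t.us_holdings > 100
        and t.in_state_holdings > 10
        and not t.in_rcl
        and not t.reviewed_in_choice
    )


widely_held = Title(1995, 1987, 250, 14, False, False)
scarce = Title(None, 1987, 8, 1, False, False)
print(is_withdrawal_candidate(widely_held), is_withdrawal_candidate(scarce))
```

The scarce title fails the holdings thresholds even though it never circulated, which matches the intent of protecting rarely held copies.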
24
Book Retention Principles –
Grand Valley State University
Fewer than 10 US holdings
Not represented in Hathi Trust
No circulations since 1998
Publication year before 2000
http://sampleandhold-r2.blogspot.com/2011_03_01_archive.html
25
Sample Withdrawal Candidate List
http://sustainablecollections.com/storage/sample_reports/sample-withdrawallist.png
26
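Shared Storage
Sharing space
Proposed by Harvard president Eliot as early as 1902
Sharing space and collections
Center for Research Libraries
1949: the Midwest Inter-Library Center (MILC)
Mid-1960s: renamed to its present name
Grew from regional to national and then global scope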
27
High-Density Storage Systems
70 facilities currently in North America
Management and cooperation models vary
28
CIC/Committee on Institutional Cooperation
Shared print storage project
Short term goal: Create immediate
opportunities to relieve space pressures at CIC
libraries
Medium term goal: Systematic plan for
managing lesser used print collections
Long term goal: Advance nationally
coordinated efforts in print preservation
29
WEST
Collection Development and Management
Licensed Resources
Shared print
Mass Digitization
Digital Special Collections
Shared Cataloging
30
UKRR (UK Research Reserve)
• UKRR is a club
• A partnership between 29 UK universities and the
British Library
• Shared responsibility for the distributed UKRR
collection
• Encourages culture change:
Owning physical collections is no longer the only sign of a
good library
http://www.bl.uk/blpac/pdf/dareshorley.pdf
31
How are UKRR members rewarded for taking
part?
• For every metre of material offered for checking
UKRR pays the library £26
• Guaranteed 24-hour desktop document delivery from
BLDSC
http://www.bl.uk/blpac/pdf/dareshorley.pdf
32
Mass Digitization of Books
Google Book Search
HathiTrust
Internet Archive
Open Book Alliance
Project Gutenberg
The Digital Public Library of America
The European Library
China-US Million Book Digital Library
33
Google Books Library Project
• The Library Project's aim is simple: make it easier
for people to find relevant books – specifically,
books they wouldn't find any other way such as
those that are out of print – while carefully
respecting authors' and publishers' copyrights.
Our ultimate goal is to work with publishers and
libraries to create a comprehensive, searchable,
virtual card catalog of all books in all languages
that helps users discover new books and
publishers discover new readers.
http://books.google.com/googlebooks/library.html
34
http://www.hathitrust.org/
An outgrowth of the Google Books Project,
the HathiTrust:
• Includes digital versions of the books from many US
library collections which are in Google Book Search
• Provides preservation quality, digital, library owned
content
• Will be certified as a Trusted Digital Repository
35
HathiTrust Holdings
As of 2011-11-07
9,728,814 total volumes
5,164,518 book titles
256,880 serial titles
3,405,084,900 pages
436 terabytes
115 miles
7,905 tons
2,654,933 volumes (~27% of total)
in the public domain
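A quick check of the figures above shows how they relate to one another. The sketch uses only the numbers on the slide; the conversion of terabytes to megabytes with decimal units is an assumption.

```python
total_volumes = 9_728_814
total_pages = 3_405_084_900
public_domain_volumes = 2_654_933
terabytes = 436

# Average pages per volume implied by the two totals.
pages_per_volume = total_pages / total_volumes

# Share of the corpus in the public domain.
public_domain_share = public_domain_volumes / total_volumes

# Average storage per volume, assuming 1 TB = 1,000,000 MB (decimal units).
mb_per_volume = terabytes * 1_000_000 / total_volumes

print(f"Pages per volume: {pages_per_volume:.1f}")
print(f"Public domain share: {public_domain_share:.1%}")
print(f"Storage per volume: {mb_per_volume:.1f} MB")
```

The page total works out to exactly 350 pages per volume, which suggests (an inference, not something the slide states) that it is an estimate derived from the volume count rather than a measured figure. The public-domain share of about 27% matches the slide.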
36
HathiTrust
2013: expected to match the Harvard University Library in size;
30 million volumes within 10 years
2013: to open to non-partner libraries
37
http://openlibrary.org/
38
Goals of mass digitization –
University of California
• Expand the UC Libraries' ability to give faculty,
students and the public access to information and
support our exploration of new service models. These
projects are designed to:
• Enhance student and faculty research
• Enable scholars to trace the evolution of ideas and
perform other sophisticated textual analysis more easily
• Fulfill its public service mission -- can now be read by
anyone, anywhere, anytime.
• Preserve and protect our collections
http://www.cdlib.org/services/collections/massdig/faq.html
39
New questions and service models -- after
mass digitization
Enhanced Discovery and Access
Collection Management
New Services to Users
Curating through Collaboration
Funding Reallocation
http://www.cdlib.org/services/collections/massdig/faq.html
40
Print Book Collections in the Cloud Era
Cloud-sourcing Research Collections:
Managing Print in the Mass-digitized
Library Environment
Constance Malpas
Program Officer OCLC Research, 2011
http://www.oclc.org/research/publications/library/2011/2011-01.pdf
41
The Cloud Library project
The objective of the project was to examine
the feasibility of outsourcing management of
low-use print books held in academic libraries
to shared service providers, including large-scale print and digital repositories.
http://www.oclc.org/research/publications/library/2011/2011-01.pdf
42
Cloud-sourcing Research Collections
Study subjects
NYU
HathiTrust
ReCAP
Research Collections Access & Preservation
Findings
• HathiTrust and ReCAP overlap: 20%
• Held in common by HathiTrust, ReCAP, and NYU: 200,000 volumes
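Overlap figures like these come down to set intersection over title-level identifiers. The sketch below illustrates the idea with toy data; the identifiers are made up, and which collection serves as the denominator of the 20% figure is an assumption, since the slide does not say.

```python
def overlap_share(collection: set[int], reference: set[int]) -> float:
    """Fraction of `collection` that also appears in `reference`."""
    if not collection:
        return 0.0
    return len(collection & reference) / len(collection)


# Toy identifiers standing in for title-level record numbers (hypothetical).
hathitrust = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10}
recap = {2, 4, 11, 12}
nyu = {2, 3, 4, 13}

# Titles present in all three collections.
held_by_all_three = hathitrust & recap & nyu

print(f"Share of HathiTrust also in ReCAP: {overlap_share(hathitrust, recap):.0%}")
print(f"Held by all three: {sorted(held_by_all_three)}")
```

In practice the hard part is matching records across catalogs before the intersection is taken; the set arithmetic itself is simple.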
43
Cloud-sourcing Research Collections: Key
findings (1/4)
Mass digitized monographic corpus already
substantially duplicates academic print
collection
30% or more of titles in local collection have been
digitized
44
Cloud-sourcing Research Collections: Key
findings (2/4)
Extant inventory in large-scale shared print
repositories substantially mirrors digitized
corpus
~75% of mass-digitized titles already ‘backed up’
in one or more preservation repositories (ReCAP,
UC Regional Facilities, CRL, LC)
45
Cloud-sourcing Research Collections: Key
findings (3/4)
Opportunity to benefit from externalization
is widely distributed; every academic library
is affected
Potential market for service is broad; aggregate
savings significant
46
Cloud-sourcing Research Collections: Key
findings (4/4)
Maximum benefit will be achieved when
distribution network for in-copyright content
is available
Public domain content inadequate to mobilize
collective resources
47
Cloud-Sourcing Research Collections: Managing
Print in the Mass-Digitized Library Environment
• "...we estimate that the median space savings that
could be achieved at an ARL library if a robust
shared print offer were in place today to be
approximately 36,000 linear feet or the equivalent of
more than 45,000 ASF [assignable square feet]..." "In
economic terms, the total annual cost avoidance --
assuming all of these books are currently managed
on-site -- exceeds $2 million per library."
48
Cloud-Sourcing Research Collections:
Issues to Examine
1. Document types
2. Subject Distribution
3. Rights Status
4. Distribution of System-wide Print Holdings
49
Main Issues in Cloud-Based Book Sharing
50
New Strategies for Book Acquisition
PDA (Patron-Driven Acquisitions)
DDA (Demand-Driven Acquisitions)
UDA (User-Driven Acquisitions)
51
University of Denver Use Data
(Titles Cataloged 2000-2004)
Uses   All               Univ. Press
4+     23,854 (18.8%)    4,029 (19.9%)
3      10,461 (8.2%)     1,954 (9.6%)
2      16,257 (12.8%)    3,134 (15.5%)
1      26,155 (20.6%)    4,882 (24.1%)
0      50,266 (39.6%)    6,278 (31.0%)
“Rethinking Library Acquisition: Demand-Driven Purchasing for Scholarly
Books“
http://www.slideshare.net/MichaelLevineClark/aaup-demand-driven-616
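The percentages in the Denver data for titles cataloged 2000-2004 can be recomputed from the raw counts, which also gives the combined share of titles used once or never. A minimal sketch using only the counts from the slide; the function names are illustrative.

```python
USE_COUNTS_ALL = {"4+": 23_854, "3": 10_461, "2": 16_257, "1": 26_155, "0": 50_266}
USE_COUNTS_UP = {"4+": 4_029, "3": 1_954, "2": 3_134, "1": 4_882, "0": 6_278}


def shares(counts: dict[str, int]) -> dict[str, float]:
    """Percentage of titles in each use bucket, rounded to one decimal."""
    total = sum(counts.values())
    return {uses: round(100 * n / total, 1) for uses, n in counts.items()}


def low_use_share(counts: dict[str, int]) -> float:
    """Percentage of titles used once or never."""
    total = sum(counts.values())
    return round(100 * (counts["0"] + counts["1"]) / total, 1)


print("All titles:", shares(USE_COUNTS_ALL))
print("Used once or never, all titles:", low_use_share(USE_COUNTS_ALL))
print("Used once or never, university press:", low_use_share(USE_COUNTS_UP))
```

By this count about 60% of all titles, and about 55% of university press titles, were used at most once, which is the case the presentation makes for demand-driven purchasing.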
52
University of Denver Use Data (U.P. Titles
Cataloged in 2000)
Uses   Ever Used         Used 2005 or Later
4+     932 (22.1%)       882 (20.1%)
3      424 (10.0%)       349 (8.3%)
2      682 (16.1%)       439 (10.4%)
1      968 (22.9%)       475 (11.2%)
0      1,217 (28.8%)     2,078 (49.2%)
“Rethinking Library Acquisition: Demand-Driven Purchasing for
Scholarly Books“
http://www.slideshare.net/MichaelLevineClark/aaup-demand-driven-616
53
Patron-Driven Acquisitions
In a typical workflow, the library imports
bibliographic records into its catalogue at no
cost. When a patron finds a patron-driven
record in the course of research, a short-term
loan can allow him to borrow the book, and the
transaction charge to the library will be a small
percentage of the list price. Typically, a library
will automatically buy a book on a third or
fourth use.
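The workflow described above can be sketched as a simple cost model: each early use is a short-term loan charged at a fraction of list price, and a later use triggers an automatic purchase. The 10% loan rate and the fourth-use trigger are hypothetical parameters chosen for illustration; the slide says only "a small percentage" and "a third or fourth use".

```python
def pda_cost(
    list_price: float,
    uses: int,
    loan_rate: float = 0.10,   # hypothetical short-term loan fee (share of list price)
    purchase_on_use: int = 4,  # hypothetical trigger: buy on the fourth use
) -> float:
    """Total library spend on one title under a loan-then-buy model."""
    # Uses before the trigger are billed as short-term loans.
    loans = min(uses, purchase_on_use - 1)
    cost = loans * (list_price * loan_rate)
    # The triggering use converts to a purchase at list price.
    if uses >= purchase_on_use:
        cost += list_price
    return cost


for n in (0, 2, 5):
    print(f"{n} uses of a $100 title cost ${pda_cost(100.0, n):.2f}")
```

Under these assumed parameters an unused title costs nothing, a lightly used title costs a fraction of its price, and a heavily used title costs more than buying it outright, which is the trade-off the model accepts in exchange for not paying for the unused titles.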
54
Patron-Driven Acquisitions
Infrastructure
MARC records prior to purchase
Rush order and delivery
Enhanced metadata
Lost revenue and margin must be recouped
Possible role in print-on-demand
Delivery options: print, POD, eBook
55
New Book Services – Print on Demand
Insourcing
Espresso Book Machine
http://www.youtube.com/watch?v=Q946sfGLxm4
Outsourcing
Lightning Source
56
Let Them Eat... Everything by Rick Anderson, Univ. of Utah
http://www.slideshare.net/CharlestonConference/let-them-eateverything-by-rick-anderson-university-of-utah
57
New Ideas for Book Services
The User-Driven Purchase Give Away
Library : A Thought Experiment
David W. Lewis
July 2010
58
Embedded librarian
David W. Lewis 2005-2025
59
HathiTrust
Possible Impact on Chinese-Language Book Collections
60
HathiTrust
Sample of works by Taiwan's library science faculty held in HathiTrust
61
Chinese-Language Holdings of East Asian Libraries in the United States, 2010
CEAL statistics
62
University Library: A bleak future for physical
collections?
• Acid paper
• Disintegrating bindings
• Inappropriate storage conditions
• Rough handling (and book drops!)
• Leaking roofs
63
University Library: A collection typology
• Heritage: significant & distinctive collections which
continue to be developed
• Legacy: significant & distinctive collections; historic
strengths but no longer added to
• Self-renewing: supporting current research & teaching
• Finite: no longer relevant; can be considered for
withdrawal
64
No Brief Candle: Re-conceiving Research Libraries
for the 21st Century.
Thinking deeply about what it is we want to
conserve,
identifying those areas where libraries have the
competitive advantage (e.g. preservation and
standards), and
finding new ways to engage and expand our
traditional position at the center of campus and
at the crossroads of disciplines.
65
Change in Academic Collections
• Shift to licensed electronic content is accelerating
Research journals – a well established trend
Scholarly monographs – in progress
• Print collections delivering less (and less) value at great
(and growing) cost
Est. $4.25 US per volume per year for on-site collections
Library purchasing power decreasing as per-unit cost rises
• Special collections marginal to educational mandate at
many institutions
Costly to manage, not (always) integral to teaching, learning
Source: "Reshaping the research library"
66
Print vs. E-books
67
Rationales for retaining some copies of
the print version
• the need to fix scanning errors;
• insufficient reliability of the digital provider;
• inadequate preservation of the digitized versions;
• the presence of significant quantities of
important non-textual material that may be
poorly represented in digital form;
• campus political considerations.
Roger C. Schonfeld Presenter (2011): What to Withdraw? Print Collection
Management in the Wake of Digitization, The Serials Librarian, 60:1-4, 141-145
68
http://www.news.com.au/technology/internet-pioneers-new-project-to-preserve-a-copy-of-every-book-everpublished/story-e6frfro0-1226106431988
69
http://www.teleread.com/library/why-preserve-books-the-new-physical-archive-of-the-internetarchive-by-brewster-kahle/
70
A Changing World
Toward Greater Sanity in Scholarly
Communication
Less sane
• Interlibrary loan
• Big Deals
• Subscriptions
• Approval plans
• Reference/Bib instruction
• Redundant cataloging
• Print runs
More sane
• Article purchases (document delivery)
• Wikipedia
• Shared cataloging
• Ease of use
• PDA (for books)
• Print on demand
Rick Anderson, Let them eat … everything: embracing a Patron-Driven Future
J. Willard Marriott Library
71
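Recommendations for Print Book Collections in Taiwan (1/2)
1. A changed concept of the collection
-- Diffuse and hard to define
-- Less selective, more inclusive
-- Global + Local
2. The status of print books in the cloud era
3. Storing print books in the cloud era
-- Elsevier journals 2008-2010:
held by the National Central Library, National Chung Hsing University, and National Cheng Kung University
-- Shared, system-wide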
72
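Recommendations for Print Book Collections in Taiwan (2/2)
4. Establish a university digital library
-- Technology, policy, organization
5. The future of the book
-- Animated "Along the River During the Qingming Festival" (清明上河圖) + commentary by 蔣勳 (Chiang Hsun)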
73
Conclusion
It is not enough to do your best; you must
know what to do, and then do your best.
–W. Edwards Deming
74
Thank you for listening
Comments and suggestions are welcome!
75