Consuls Presentation - Harvard Wiki

advertisement
HARVARD DISCOVERY DAY
Discovery Trends and Issues
Marshall Breeding
Independent Consult, Author,
Founder and Publisher, Library Technology Guides
http://www.librarytechnology.org/
http://twitter.com/mbreeding
November 13, 2013
Harvard Discovery Day
Twitter hashtag:
#discoveryday
Discovery Resources
Discovery on LTG
The Evolution of Library Resource
Discovery
Discovery in ARL Libraries
http://www.librarytechnology.org/arl-discovery.pl
Online Catalog
ILS Data
Search:
Scope of Search
Search Results


Books, Journals, and
Media at the Title
Level
Not in scope:
 Articles
 Book
Chapters
 Digital objects
Next-gen Catalogs or Discovery
Interface


Single search box
Query tools
Did you mean
 Type-ahead




Relevance ranked results
Faceted navigation
Enhanced visual displays
Cover art
 Summaries, reviews,


Recommendation services




Scope of Search
Books, Journals, and
Media at the Title Level
Other local and open
access content
Not in scope:
Articles
 Book Chapters
 Digital objects

Discovery Interface search model
Search:
Local
Index
ILS Data
Digital
Collections
ProQuest
Search Results
MetaSearch
Engine
EBSCOhost
…
MLA
Bibliography
ABC-CLIO
Real-time query and
responses
Discovery from Local to Web-scale

Initial products focused on interface improvements
AquaBrowser, Endeca, Primo, Encore, VuFind,
 Civica Sorcer, Axiell Arena
 Mostly locally-installed software


Current phase is focused on pre-populated indexes that
aim to deliver Web-scale discovery
Primo Central (Ex Libris)
 Summon (Serials Solutions)
 WorldCat Local (OCLC)
 EBSCO Discovery Service (EBSCO)
 Encore ES (+EDS Index)

Public Library Information Portal
ILS Data
Digital
Collections
Search:
Usagegenerated
Data
Customer
Profile
Consolidated Index
Search Results
Web Site
Content
Community
Information
Aggregated
Content
packages
…
Customerprovided
content
Reference
Sources
Archives
Pre-built harvesting and indexing
Discovery services as Website
Replacement



Portal environment that includes customized content
management service that can fulfill typical offerings
on library Web sites
Full integration between Web site and resource
discovery (ideally)
Examples:
 Axiell
Arena
 Infor Iguana
Web-scale Index-based Discovery
(2009- present)
Digital
Collections
Search:
Customer
Profile
Consolidated Index
Search Results
Usagegenerated
Data
ILS Data
Web Site
Content
Institutional
Repositories
Aggregated
Content
packages
…
Open Access
E-Journals
Reference
Sources
Pre-built harvesting and
indexing
Bento Box Discovery Model
Aggregated
Content
packages
ILS Data
VuFind / Blacklight
Search Results
Web Site
Content
Digital
Collections
Institutional
Repositories
Consolidated Index
Search:
Open Access
E-Journals
Pre-built harvesting and
indexing
Web-scale Search Problem
ILS Data
Digital
Collections
Search Results
Consolidated
Index
Search:
Web Site
Content
Institutional
Repositories
Aggregated
Content
packages
…
E-Journals
???
Problem in how to deal with resources not
provided to ingest into consolidated index
Pre-built harvesting and
indexing
Non
Participating
Content Sources
Citations / Metadata > Full Text



Citations or structured metadata provide key data
to power search & retrieval and faceted navigation
Indexing Full-text of content amplifies access
Important to understand depth indexing
 Currency,
dates covered, full-text or citation
 Many other factors
Challenge for Relevancy





Technically feasible to index hundreds of millions or
billions of records through Lucene or SOLR
Difficult to order records in ways that make sense
Many fairly equivalent candidates returned for any
given query
Must rely on use-based and social factors to
improve relevancy rankings
Objectivity: Does relevancy reflect bias or publisher
preferences
Challenges for Collection Coverage





To work effectively, discovery services need to
cover comprehensively the body of content
represented in library collections
What about publishers that do not participate?
Is content indexed at the citation or full-text level?
What are the restrictions for non-authenticated
users?
How can libraries understand the differences in
coverage among competing services?
Evaluating the Coverage of Indexbased Discovery Services





Intense competition: how well the index covers the body
of scholarly content stands as a key differentiator
Difficult to evaluate based on numbers of items indexed
alone.
Important to ascertain now your library’s content
packages are represented by the discovery service.
Important to know what items are indexed by citation
and which are full text
Important to know whether the discovery service favors
the content of any given publisher
Non-Cooperative Scenarios


Many A&I and specialized resources do not contribute
to Discovery services
Two major players are both publishers and discovery
service providers




EBSCO – ProQuest
ProQuest does not provide content to other discovery
services
EBSCO does not provide content to other discovery
services
Issue currently being pressed by Orbis Cascade
Alliance.
Open Discovery Initiative




NISO Work Group to Develop Standards and
Recommended Practices for Library Discovery
Services Based on Indexed Search
Informal meeting called at ALA Annual 2011
Co-Chaired by Marshall Breeding and Jenny
Walker
Term: Dec 2011 – Dec 2013
Balance of Constituents
21
Libraries
Marshall Breeding, Vanderbilt University
Jamene Brooks-Kieffer, Kansas State University
Laura Morse, Harvard University
Ken Varnum, University of Michigan
Sara Brownmiller, University of Oregon
Lucy Harrison, College Center for Library Automation
(D2D liaison/observer)
Michele Newberry
Publishers
Lettie Conrad, SAGE Publications
Roger Schonfeld, ITHAKA/JSTOR/Portico
Jeff Lang, Thomson Reuters
Linda Beebe, American Psychological Assoc
Aaron Wood, Alexander Street Press
Service Providers
Jenny Walker, Ex Libris Group
John Law, Serials Solutions
Michael Gorrell, EBSCO Information Services
David Lindahl, University of Rochester (XC)
Jeff Penka, OCLC (D2D liaison/observer)
ODI Project Goals:



Identify … needs and requirements of the three
stakeholder groups in this area of work.
Create recommendations and tools to streamline the
process by which information providers, discovery
service providers, and librarians work together to
better serve libraries and their users.
Provide effective means for librarians to assess the level
of participation by information providers in discovery
services, to evaluate the breadth and depth of content
indexed and the degree to which this content is made
available to the user.
ODI Timeline
23
Milestone
Target Date
Appointment of working group
Dec 2011
Approval of charge and initial work plan
Mar 2012
Agreement on process and tools
Jun 2012
Completion of information gathering
Jan 2013
Completion of initial draft
Jun 2013
Completion of final draft
Sep 2013
Public Review Period commences
Sep 2013
Status
Social Discovery
A more social user experience





Ratings, rankings, reviews
Enhanced content
Connections with the library
Connections with other users
Challenge: Must have critical mass of engagement
to have an impact
Social Strategies

Inherent
 BiblioCommons:
design and infrastructure created with
social flavor and features

Layered or integrated
 ChiliFresh:
integrate a third party social platform into
any given library catalog or discovery service
Socially-powered discovery




Leverage use data to increase effectiveness of
discovery
Usage data can identify important or popular
materials to inform relevancy engines
Identify related materials that may not otherwise
be uncovered through keyword matching
Be careful to avoid introducing bias loops
E-Book Integration
Critical concern for public libraries




Most libraries offer e-book lending programs
Strong demand: increasing use statistics
Print lending remains vigorous
Academic libraries will benefit at a later phase by
e-book lending models developed in the public
library sector
Commercial library e-book lending
services




OverDrive
3M Cloud Library
Baker & Taylor: Axis 360
“Douglas County Model”
 Locally
curated e-book collections and lending platform
E-book Lending Models



Phase I: Link out to e-book lending service
Phase II: Load MARC records in local catalog, then
link out on individual titles
Phase III: Discovery and lending operations
performed fully within the library’s catalog or
discovery environment
Full e-book lending


Discovery of print and e-book titles and copies
simultaneously
E-book transactions represented within patron’s
library account
 List
of charged items, due dates
 Service options: renew, return, etc.

Ability to check-out and download e-books into ereader
Library Interfaces with e-book
Integration






Polaris PowerPAC
BiblioCommons
SirsiDynix eResource Central
Innovative Interfaces: Encore ES
TLC: LS2 PAC
OdiloTK from OdiloTID
The e-book integration ecosystem




E-book lending services must expose APIs
Online catalog or discovery services must consume
APIs and adjust interface design and business logic
to accommodate discovery and lending operations
Challenge: each e-book service provider’s APIs are
different
Response: Work toward consistent or standard suite
of APIs
Library Technology Reports


The Current State of Library Resource Discovery
Products: Context, Library Perspectives, and Vendor
Positions
In press for Publication January 2014
LTR Components




Vender questionnaire
Library Survey
Industry announcements
Other articles and publications
Library Discovery Survey



Survey executed to
gather data from
libraries regarding their
experiences with
discovery services
Responses received by
396 Libraries:
29 Countries
represented, 252
responses from United
States
Academic
247
Consortium
Government Agency
Law
Medical
Museum
National
Other
Public
Special
State
Theology
15
2
7
5
1
1
1
96
14
4
3
Overall Satisfaction
Overall Effectiveness
Comprehensiveness: Academic Libraries
Relevancy Effectiveness
Objectivity in Discovery
Objectivity in Discovery: Academics
Example Product rating chart
Google Scholar




Default starting point for most researchers
Mostly outside of library control
Libraries can register link resolvers to facilitate
access to subscribed documents from registered IP
addresses
Recent cooperation between Thompson Reuters Web
of Science and Google Scholar
 Signals
deeper interest by Google in academic
resource discovery arena

Core service for Google?
http://www.google.com/intl/en/scholar/libraries.html
Discovery Trends
Discovery Service Installations
Product
2007 2008 2009 2010 2011 2012
EBSCO EDS
5,000
Primo
12
37
AquaBrowser
55
339
64
69
74
Encore
72
72
109
56
72
46
77
58
88
LS2 PAC
Summon
Enterprise
Civica Sorcer
Axiell Arena
Installed
53 506
50 164
16
111 101
58
1900
750
365
73
305
214 158
704
75
100
102
328
7
12
22
3
42
61
57
33
76
Trend
Increased uptake in academic libraries of
Web-scale discovery services


Almost a “must-have” product as one component of
overall resource discovery and dissemination
strategy
Yet, the number of discovery service installations
remains a fraction of library management systems.
Counter trend

Some libraries resist investing in a discovery service
citing:
 Google
Scholar performs adequate discovery
 High cost of discovery services
 Concerns for bias
Trend
Tendency toward re-alignment with
management systems




Alma + Primo / Primo Central
Sierra + Encore
WorldCat Local + WorldShare Management
Services
Intota + Summon
Library Services Platform


Library-specific software. Designed to help libraries
automate their internal operations, manage collections,
fulfillment requests, and deliver services
Services




Service oriented architecture
Exposes Web services and other API’s
Facilitates the services libraries offer to their users
Platform



General infrastructure for library automation
Consistent with the concept of Platform as a Service
Library programmers address the APIs of the platform to extend
functionality, create connections with other systems, dynamically
interact with data
Library Services Platform
Characteristics

Highly Shared data models



Delivered through software as a service



Multi-tenant
Unified workflows across formats and media
Flexible metadata management




Knowledgebase architecture
Some may take hybrid approach to accommodate local data
stores
MARC – Dublin Core – VRA – MODS – ONIX
BIBFRAME
New structures not yet invented
Open APIs for extensibility and interoperability
Library Services Platforms
Category
WorldShare
Alma
Management
Services
OCLC.
Ex Libris
Intota
Key precepts
Global
network-level
approach to
management
and discovery.
Consolidate
workflows,
unified
management:
print,
electronic,
digital;
Hybrid data
model
Knowledgeba
se driven.
Pure multitenant SaaS
Software model
Proprietary
Proprietary
Proprietary
Responsible
Organization
Serials
Solutions
Sierra
Services
Platform
Innovative
Interfaces, Inc
Kuali OLE
Service-oriented
architecture
Technology
uplift for
Millennium ILS.
More open
source
components,
consolidated
modules and
workflows
Proprietary
Manage library
resources in a format
agnostic approach.
Integration into the
broader academic
enterprise
infrastructure
Kuali Foundation
Open Source
New Library Management Model
Unified Presentation Layer
Search:
Library Services
Platform
API Layer
`
Digital
Coll
Consolidated index
Self-Check /
Automated
Return
ProQuest
EBSCO
…
JSTOR
Stock
Management
Enterprise
Resource
Planning
Learning
Management
Other
Resources
Smart Cad /
Payment
systems
Authentication
Service
Development / Deployment
perspective



Beginning of a new cycle of transition
Over the course of the next decade, academic
libraries will replace their current legacy products
with new platforms
Not just a change of technology but a substantial
change in the ways that libraries manage their
resources and deliver their services
Convergence

Discovery and Management solutions will
increasingly be implemented as matched sets
 Ex
Libris: Primo / Alma
 Serials Solutions: Summon / Intota
 OCLC: WorldCat Local / WorldShare Platform
 Except: Kuali OLE, EBSCO Discovery Service


Both depend on an ecosystem of interrelated
knowledge bases
API’s exposed to mix and match, but efficiencies
and synergies are lost
Why integrate resource management
and discovery?






Obviates the need to constantly synchronize local
inventory metadata representations into derivative
discovery environment
Transactional efficiency for real-time status and userinitiated requests and services
Strategic Knowledge base: resource management,
linking, scoping of discovery
Conceptual and technical cohesion
Possible cost savings relative to purchasing separately
Simplified support and business relationships
Counter trend
Many libraries continue separate
discovery strategies


Open source discovery + licensed Web-scale index
EBSCO Discovery Service: strategy to integrate with
any back-end ILS or LSP
Why not integrate management and
discovery?



Less dependence on single supplier
Best of breed in each product category
Increased customizability and local integration
Trend
Demise of the local catalog



Many library services platforms do not include the
concept of an online catalog dedicated to local
physical inventory
Designed for discovery services as public-facing
interface
Implication: Discovery service must incorporate
detailed functionality for local materials and
related services
Trend
Content providers cooperate with
discovery service providers for indexing
in Web-scale services


New content partnerships continue to be announced
Web-scale discovery service providers assert that
most scholarship in English now well covered and
are now focusing on international and specialized
resources
Trend
Evolving ecosystems of content discovery,
resource management and discovery

Single organization involved in the provision of
content resources
 resource management services

 Focus
on usage analytics
 discovery
services
Can discovery be used to influence content
acquisition?
Example:

ProQuest:
 ProQuest
content products
 Summon
 Intota
 Initial

module: Intota Assessment
EBSCO Information Services
 Subscription
Services
 EBSCOENT: ERM Essentials
 EBSCOhost content platform
 EBSCO Discovery Service
Future of A&E versus Discovery
Services?




Will discovery services eventually become powerful
enough to displace need for abstracting and
indexing services?
How much does specialized vocabulary and other
metadata contribute to the discovery process?
Specialized and scoped search methodologies
Researchers value this tools
Counter Trend
A&I providers and aggregators less likely
to participate


Competitive issues
Need to preserve the value of maintaining
subscriptions
Trend
Public libraries moving back to discovery
interface associated with their ILS


ILS Online Catalog modules now offer most discovery
interface characteristics
Examples:
Most libraries using Polaris ILS use Polaris PowerPAC catalog
interface. Displacing AquaBrowser or Endeca
 TLC libraries have largely abandoned AquaBrowser for LS2
PAC

Counter Trend
BiblioCommons

BiblioCommons is the primary discovery layer
widely implemented in public libraries that continues
to replace ILS online catalog modules
Trend
Strategic cooperation for management and
discovery infrastructure

New combinations of libraries forming partnerships
 Examples:

2CUL, Orbis Cascade Alliance
Model of single-institution infrastructure increasingly
challenged
Trend
Increased universe of discovery through
highly-shared infrastructure

Built-in resource sharing by many libraries
participating in shared automation environment
Open Access Discovery Services


Will cooperative open access / open source
discovery services emerge?
Cons:
 Very
high investment costs: platform, publisher relations,
content acquisition for indexing, synchronization
 Multi-million dollar investment and operational support

Pros:
 More
control and transparency for libraries
Major Products
Serials Solutions: Summon

Launched in June 2009
 First
“web-scale” discovery service
 Unified search results, facets, etc

Summon 2.0 released in 2013
 Emphasis
on tools to provide research assistance
beyond search results
 Topic explorer, scholar profiles, database
recommender, content spotlighting, etc
Ex Libris: Primo / Primo Central

Primo (discovery interface) launched in 2005
 Deployed

Primo Central: article-level index introduced in
2009
 Index

locally or cloud
maintained by Ex Libris, cloud hosted
Scholar Rank: technology designed to order search
results according to scholarly importance
EBSCO Discovery Service





Extends EBSCOhost platform with non-EBSCO
content
Users comfortable with EBSCOhost interface will
easily adapt to EDS
Platform Blending
Direct delivery of full-text from EBSCO sources
Linking to full text for non-EBSCO content
http://www.ebscohost.com/discovery
WorldCat Local

Statistics from OCLC web site:
 952+
million articles with one-click access to full text
 38+ million digital items from trusted sources like
Google Books, OAIster and HathiTrust
 14+ million eBooks from leading aggregators and
publishers
 48+ million pieces of evaluative content (Tables of
Contents, cover art, summaries, etc.) included at no
additional charge
 232+ million books in libraries worldwide
http://www.oclc.org/worldcat-local.en.html
Innovative Interfaces: Encore



Initial version: discovery interface only with local
index
Encore Synergy: XML Web services interfaces to
resource targets for articles
Encore / EDS integration: agreement with EBSCO to
integrate EDS for mutual subscribers
BiblioCommons: BiblioCore





Discovery service oriented to public libraries
Social features – share reading lists, etc
E-book discovery and lending integration
Full replacement for online catalog
Pooling of patrons across participating library
organizations
Blacklight



Open source discovery interface
Originated at the University of Virginia
Increasing interest by academic libraries
 Stanford,

Columbia, Cornell, etc
No open access article-level index
VuFind





Open source discovery interface
Originally developed at Villanova University
Widely deployed
Web-scale indexes integrated by subscribers
through APIs
No open access article-level index
Axiell: Arena





Comprehensive library portal
Discovery + Web site features
Positioned as discovery interface for Axiell’s several
ILS products
Discovery layer for Archival products: CALM and
Adlib
Can front both Library and Archival or museum
products simultaneously: CultereNet
Infor: Iguana





Comprehensive library portal
Discovery + Web site features
Widget based architecture
Positioned as marketing and communications portal
Replaces both online catalog and Web site
Catalog 2.0


Edited by Sally
Chambers
Chapter: “Nextgeneration discovery:
an overview of the
European scene”
Library Technology Report: 2007



Introduction to nextgeneration library
catalogs or discovery
interfaces
Trends
Profiles of major
products
Next-Gen Library Catalogs
Marshall Breeding
Neal-Schuman Publishers
March 2010
Volume 1 of The Tech Set
Questions and discussion
Download