HARVARD DISCOVERY DAY Discovery Trends and Issues Marshall Breeding Independent Consult, Author, Founder and Publisher, Library Technology Guides http://www.librarytechnology.org/ http://twitter.com/mbreeding November 13, 2013 Harvard Discovery Day Twitter hashtag: #discoveryday Discovery Resources Discovery on LTG The Evolution of Library Resource Discovery Discovery in ARL Libraries http://www.librarytechnology.org/arl-discovery.pl Online Catalog ILS Data Search: Scope of Search Search Results Books, Journals, and Media at the Title Level Not in scope: Articles Book Chapters Digital objects Next-gen Catalogs or Discovery Interface Single search box Query tools Did you mean Type-ahead Relevance ranked results Faceted navigation Enhanced visual displays Cover art Summaries, reviews, Recommendation services Scope of Search Books, Journals, and Media at the Title Level Other local and open access content Not in scope: Articles Book Chapters Digital objects Discovery Interface search model Search: Local Index ILS Data Digital Collections ProQuest Search Results MetaSearch Engine EBSCOhost … MLA Bibliography ABC-CLIO Real-time query and responses Discovery from Local to Web-scale Initial products focused on interface improvements AquaBrowser, Endeca, Primo, Encore, VuFind, Civica Sorcer, Axiell Arena Mostly locally-installed software Current phase is focused on pre-populated indexes that aim to deliver Web-scale discovery Primo Central (Ex Libris) Summon (Serials Solutions) WorldCat Local (OCLC) EBSCO Discovery Service (EBSCO) Encore ES (+EDS Index) Public Library Information Portal ILS Data Digital Collections Search: Usagegenerated Data Customer Profile Consolidated Index Search Results Web Site Content Community Information Aggregated Content packages … Customerprovided content Reference Sources Archives Pre-built harvesting and indexing Discovery services as Website Replacement Portal environment that includes customized content management service that can fulfill typical offerings on library Web sites Full integration between Web site and resource discovery (ideally) Examples: Axiell Arena Infor Iguana Web-scale Index-based Discovery (2009- present) Digital Collections Search: Customer Profile Consolidated Index Search Results Usagegenerated Data ILS Data Web Site Content Institutional Repositories Aggregated Content packages … Open Access E-Journals Reference Sources Pre-built harvesting and indexing Bento Box Discovery Model Aggregated Content packages ILS Data VuFind / Blacklight Search Results Web Site Content Digital Collections Institutional Repositories Consolidated Index Search: Open Access E-Journals Pre-built harvesting and indexing Web-scale Search Problem ILS Data Digital Collections Search Results Consolidated Index Search: Web Site Content Institutional Repositories Aggregated Content packages … E-Journals ??? Problem in how to deal with resources not provided to ingest into consolidated index Pre-built harvesting and indexing Non Participating Content Sources Citations / Metadata > Full Text Citations or structured metadata provide key data to power search & retrieval and faceted navigation Indexing Full-text of content amplifies access Important to understand depth indexing Currency, dates covered, full-text or citation Many other factors Challenge for Relevancy Technically feasible to index hundreds of millions or billions of records through Lucene or SOLR Difficult to order records in ways that make sense Many fairly equivalent candidates returned for any given query Must rely on use-based and social factors to improve relevancy rankings Objectivity: Does relevancy reflect bias or publisher preferences Challenges for Collection Coverage To work effectively, discovery services need to cover comprehensively the body of content represented in library collections What about publishers that do not participate? Is content indexed at the citation or full-text level? What are the restrictions for non-authenticated users? How can libraries understand the differences in coverage among competing services? Evaluating the Coverage of Indexbased Discovery Services Intense competition: how well the index covers the body of scholarly content stands as a key differentiator Difficult to evaluate based on numbers of items indexed alone. Important to ascertain now your library’s content packages are represented by the discovery service. Important to know what items are indexed by citation and which are full text Important to know whether the discovery service favors the content of any given publisher Non-Cooperative Scenarios Many A&I and specialized resources do not contribute to Discovery services Two major players are both publishers and discovery service providers EBSCO – ProQuest ProQuest does not provide content to other discovery services EBSCO does not provide content to other discovery services Issue currently being pressed by Orbis Cascade Alliance. Open Discovery Initiative NISO Work Group to Develop Standards and Recommended Practices for Library Discovery Services Based on Indexed Search Informal meeting called at ALA Annual 2011 Co-Chaired by Marshall Breeding and Jenny Walker Term: Dec 2011 – Dec 2013 Balance of Constituents 21 Libraries Marshall Breeding, Vanderbilt University Jamene Brooks-Kieffer, Kansas State University Laura Morse, Harvard University Ken Varnum, University of Michigan Sara Brownmiller, University of Oregon Lucy Harrison, College Center for Library Automation (D2D liaison/observer) Michele Newberry Publishers Lettie Conrad, SAGE Publications Roger Schonfeld, ITHAKA/JSTOR/Portico Jeff Lang, Thomson Reuters Linda Beebe, American Psychological Assoc Aaron Wood, Alexander Street Press Service Providers Jenny Walker, Ex Libris Group John Law, Serials Solutions Michael Gorrell, EBSCO Information Services David Lindahl, University of Rochester (XC) Jeff Penka, OCLC (D2D liaison/observer) ODI Project Goals: Identify … needs and requirements of the three stakeholder groups in this area of work. Create recommendations and tools to streamline the process by which information providers, discovery service providers, and librarians work together to better serve libraries and their users. Provide effective means for librarians to assess the level of participation by information providers in discovery services, to evaluate the breadth and depth of content indexed and the degree to which this content is made available to the user. ODI Timeline 23 Milestone Target Date Appointment of working group Dec 2011 Approval of charge and initial work plan Mar 2012 Agreement on process and tools Jun 2012 Completion of information gathering Jan 2013 Completion of initial draft Jun 2013 Completion of final draft Sep 2013 Public Review Period commences Sep 2013 Status Social Discovery A more social user experience Ratings, rankings, reviews Enhanced content Connections with the library Connections with other users Challenge: Must have critical mass of engagement to have an impact Social Strategies Inherent BiblioCommons: design and infrastructure created with social flavor and features Layered or integrated ChiliFresh: integrate a third party social platform into any given library catalog or discovery service Socially-powered discovery Leverage use data to increase effectiveness of discovery Usage data can identify important or popular materials to inform relevancy engines Identify related materials that may not otherwise be uncovered through keyword matching Be careful to avoid introducing bias loops E-Book Integration Critical concern for public libraries Most libraries offer e-book lending programs Strong demand: increasing use statistics Print lending remains vigorous Academic libraries will benefit at a later phase by e-book lending models developed in the public library sector Commercial library e-book lending services OverDrive 3M Cloud Library Baker & Taylor: Axis 360 “Douglas County Model” Locally curated e-book collections and lending platform E-book Lending Models Phase I: Link out to e-book lending service Phase II: Load MARC records in local catalog, then link out on individual titles Phase III: Discovery and lending operations performed fully within the library’s catalog or discovery environment Full e-book lending Discovery of print and e-book titles and copies simultaneously E-book transactions represented within patron’s library account List of charged items, due dates Service options: renew, return, etc. Ability to check-out and download e-books into ereader Library Interfaces with e-book Integration Polaris PowerPAC BiblioCommons SirsiDynix eResource Central Innovative Interfaces: Encore ES TLC: LS2 PAC OdiloTK from OdiloTID The e-book integration ecosystem E-book lending services must expose APIs Online catalog or discovery services must consume APIs and adjust interface design and business logic to accommodate discovery and lending operations Challenge: each e-book service provider’s APIs are different Response: Work toward consistent or standard suite of APIs Library Technology Reports The Current State of Library Resource Discovery Products: Context, Library Perspectives, and Vendor Positions In press for Publication January 2014 LTR Components Vender questionnaire Library Survey Industry announcements Other articles and publications Library Discovery Survey Survey executed to gather data from libraries regarding their experiences with discovery services Responses received by 396 Libraries: 29 Countries represented, 252 responses from United States Academic 247 Consortium Government Agency Law Medical Museum National Other Public Special State Theology 15 2 7 5 1 1 1 96 14 4 3 Overall Satisfaction Overall Effectiveness Comprehensiveness: Academic Libraries Relevancy Effectiveness Objectivity in Discovery Objectivity in Discovery: Academics Example Product rating chart Google Scholar Default starting point for most researchers Mostly outside of library control Libraries can register link resolvers to facilitate access to subscribed documents from registered IP addresses Recent cooperation between Thompson Reuters Web of Science and Google Scholar Signals deeper interest by Google in academic resource discovery arena Core service for Google? http://www.google.com/intl/en/scholar/libraries.html Discovery Trends Discovery Service Installations Product 2007 2008 2009 2010 2011 2012 EBSCO EDS 5,000 Primo 12 37 AquaBrowser 55 339 64 69 74 Encore 72 72 109 56 72 46 77 58 88 LS2 PAC Summon Enterprise Civica Sorcer Axiell Arena Installed 53 506 50 164 16 111 101 58 1900 750 365 73 305 214 158 704 75 100 102 328 7 12 22 3 42 61 57 33 76 Trend Increased uptake in academic libraries of Web-scale discovery services Almost a “must-have” product as one component of overall resource discovery and dissemination strategy Yet, the number of discovery service installations remains a fraction of library management systems. Counter trend Some libraries resist investing in a discovery service citing: Google Scholar performs adequate discovery High cost of discovery services Concerns for bias Trend Tendency toward re-alignment with management systems Alma + Primo / Primo Central Sierra + Encore WorldCat Local + WorldShare Management Services Intota + Summon Library Services Platform Library-specific software. Designed to help libraries automate their internal operations, manage collections, fulfillment requests, and deliver services Services Service oriented architecture Exposes Web services and other API’s Facilitates the services libraries offer to their users Platform General infrastructure for library automation Consistent with the concept of Platform as a Service Library programmers address the APIs of the platform to extend functionality, create connections with other systems, dynamically interact with data Library Services Platform Characteristics Highly Shared data models Delivered through software as a service Multi-tenant Unified workflows across formats and media Flexible metadata management Knowledgebase architecture Some may take hybrid approach to accommodate local data stores MARC – Dublin Core – VRA – MODS – ONIX BIBFRAME New structures not yet invented Open APIs for extensibility and interoperability Library Services Platforms Category WorldShare Alma Management Services OCLC. Ex Libris Intota Key precepts Global network-level approach to management and discovery. Consolidate workflows, unified management: print, electronic, digital; Hybrid data model Knowledgeba se driven. Pure multitenant SaaS Software model Proprietary Proprietary Proprietary Responsible Organization Serials Solutions Sierra Services Platform Innovative Interfaces, Inc Kuali OLE Service-oriented architecture Technology uplift for Millennium ILS. More open source components, consolidated modules and workflows Proprietary Manage library resources in a format agnostic approach. Integration into the broader academic enterprise infrastructure Kuali Foundation Open Source New Library Management Model Unified Presentation Layer Search: Library Services Platform API Layer ` Digital Coll Consolidated index Self-Check / Automated Return ProQuest EBSCO … JSTOR Stock Management Enterprise Resource Planning Learning Management Other Resources Smart Cad / Payment systems Authentication Service Development / Deployment perspective Beginning of a new cycle of transition Over the course of the next decade, academic libraries will replace their current legacy products with new platforms Not just a change of technology but a substantial change in the ways that libraries manage their resources and deliver their services Convergence Discovery and Management solutions will increasingly be implemented as matched sets Ex Libris: Primo / Alma Serials Solutions: Summon / Intota OCLC: WorldCat Local / WorldShare Platform Except: Kuali OLE, EBSCO Discovery Service Both depend on an ecosystem of interrelated knowledge bases API’s exposed to mix and match, but efficiencies and synergies are lost Why integrate resource management and discovery? Obviates the need to constantly synchronize local inventory metadata representations into derivative discovery environment Transactional efficiency for real-time status and userinitiated requests and services Strategic Knowledge base: resource management, linking, scoping of discovery Conceptual and technical cohesion Possible cost savings relative to purchasing separately Simplified support and business relationships Counter trend Many libraries continue separate discovery strategies Open source discovery + licensed Web-scale index EBSCO Discovery Service: strategy to integrate with any back-end ILS or LSP Why not integrate management and discovery? Less dependence on single supplier Best of breed in each product category Increased customizability and local integration Trend Demise of the local catalog Many library services platforms do not include the concept of an online catalog dedicated to local physical inventory Designed for discovery services as public-facing interface Implication: Discovery service must incorporate detailed functionality for local materials and related services Trend Content providers cooperate with discovery service providers for indexing in Web-scale services New content partnerships continue to be announced Web-scale discovery service providers assert that most scholarship in English now well covered and are now focusing on international and specialized resources Trend Evolving ecosystems of content discovery, resource management and discovery Single organization involved in the provision of content resources resource management services Focus on usage analytics discovery services Can discovery be used to influence content acquisition? Example: ProQuest: ProQuest content products Summon Intota Initial module: Intota Assessment EBSCO Information Services Subscription Services EBSCOENT: ERM Essentials EBSCOhost content platform EBSCO Discovery Service Future of A&E versus Discovery Services? Will discovery services eventually become powerful enough to displace need for abstracting and indexing services? How much does specialized vocabulary and other metadata contribute to the discovery process? Specialized and scoped search methodologies Researchers value this tools Counter Trend A&I providers and aggregators less likely to participate Competitive issues Need to preserve the value of maintaining subscriptions Trend Public libraries moving back to discovery interface associated with their ILS ILS Online Catalog modules now offer most discovery interface characteristics Examples: Most libraries using Polaris ILS use Polaris PowerPAC catalog interface. Displacing AquaBrowser or Endeca TLC libraries have largely abandoned AquaBrowser for LS2 PAC Counter Trend BiblioCommons BiblioCommons is the primary discovery layer widely implemented in public libraries that continues to replace ILS online catalog modules Trend Strategic cooperation for management and discovery infrastructure New combinations of libraries forming partnerships Examples: 2CUL, Orbis Cascade Alliance Model of single-institution infrastructure increasingly challenged Trend Increased universe of discovery through highly-shared infrastructure Built-in resource sharing by many libraries participating in shared automation environment Open Access Discovery Services Will cooperative open access / open source discovery services emerge? Cons: Very high investment costs: platform, publisher relations, content acquisition for indexing, synchronization Multi-million dollar investment and operational support Pros: More control and transparency for libraries Major Products Serials Solutions: Summon Launched in June 2009 First “web-scale” discovery service Unified search results, facets, etc Summon 2.0 released in 2013 Emphasis on tools to provide research assistance beyond search results Topic explorer, scholar profiles, database recommender, content spotlighting, etc Ex Libris: Primo / Primo Central Primo (discovery interface) launched in 2005 Deployed Primo Central: article-level index introduced in 2009 Index locally or cloud maintained by Ex Libris, cloud hosted Scholar Rank: technology designed to order search results according to scholarly importance EBSCO Discovery Service Extends EBSCOhost platform with non-EBSCO content Users comfortable with EBSCOhost interface will easily adapt to EDS Platform Blending Direct delivery of full-text from EBSCO sources Linking to full text for non-EBSCO content http://www.ebscohost.com/discovery WorldCat Local Statistics from OCLC web site: 952+ million articles with one-click access to full text 38+ million digital items from trusted sources like Google Books, OAIster and HathiTrust 14+ million eBooks from leading aggregators and publishers 48+ million pieces of evaluative content (Tables of Contents, cover art, summaries, etc.) included at no additional charge 232+ million books in libraries worldwide http://www.oclc.org/worldcat-local.en.html Innovative Interfaces: Encore Initial version: discovery interface only with local index Encore Synergy: XML Web services interfaces to resource targets for articles Encore / EDS integration: agreement with EBSCO to integrate EDS for mutual subscribers BiblioCommons: BiblioCore Discovery service oriented to public libraries Social features – share reading lists, etc E-book discovery and lending integration Full replacement for online catalog Pooling of patrons across participating library organizations Blacklight Open source discovery interface Originated at the University of Virginia Increasing interest by academic libraries Stanford, Columbia, Cornell, etc No open access article-level index VuFind Open source discovery interface Originally developed at Villanova University Widely deployed Web-scale indexes integrated by subscribers through APIs No open access article-level index Axiell: Arena Comprehensive library portal Discovery + Web site features Positioned as discovery interface for Axiell’s several ILS products Discovery layer for Archival products: CALM and Adlib Can front both Library and Archival or museum products simultaneously: CultereNet Infor: Iguana Comprehensive library portal Discovery + Web site features Widget based architecture Positioned as marketing and communications portal Replaces both online catalog and Web site Catalog 2.0 Edited by Sally Chambers Chapter: “Nextgeneration discovery: an overview of the European scene” Library Technology Report: 2007 Introduction to nextgeneration library catalogs or discovery interfaces Trends Profiles of major products Next-Gen Library Catalogs Marshall Breeding Neal-Schuman Publishers March 2010 Volume 1 of The Tech Set Questions and discussion