NDLTD Resources & Projects
ETD 2006 U.S. Regional Conference
St. Louis, Missouri
October 27-28, 2006
Edward A. Fox, Executive Director, NDLTD
fox@vt.edu
http://fox.cs.vt.edu

Outline
• Acknowledgements
• Info Life Cycle, DLs, 5S, DL Curric, OAI
• ETDs and NDLTD
• Partnerships, Union Catalog, Access, Preservation, Research
• Summary and Conclusions

Acknowledgements
• All those working with ETDs
• NDLTD, including Board, Committees, and Members
• ETD 2006 US Regional Conference Team
• Sponsors
• Presenters, Attendees

Acknowledgements: ETD Mtgs
• 1987 mtg in Ann Arbor: UMI, VT, …
• 1992 mtg in Washington, DC: CNI, CGS, UMI, VT and 10 universities with 3 reps each
• 1993 mtg in Atlanta to start Monticello Electronic Library (regional, US Southeast): SURA, SOLINET
• 1994 mtg at VT: std: PDF + SGML + multimedia objects
• 1996 funding by SURA, US Dept. of Education (FIPSE)
• 1997 meetings in UK, Germany, ...
• 1998 – 1st symposium – Memphis (20)
• 1999 – 2nd symposium – Blacksburg (70)
• 2000 – 3rd symposium – St. Petersburg (225)
• 2001 – 4th symposium – Caltech (200)
• 2002 – 5th symposium – BYU, Provo, Utah
• 2003 – 6th symposium – Berlin (215)
• 2004 – 7th symposium – U. Kentucky
• 2005 – 8th symposium – Sydney, Australia
• 2006 – 9th symposium – Quebec City, Canada

Acknowledgements: Future ETD Conferences
• 2007 – 10th symposium – Uppsala University, Sweden – 13-16 June
• 2008 – 11th symposium – Dartington College of Arts, Devon, UK – 29 June – 2 July (tentative)

Information Life Cycle
Borgman et al.: Workshop Report on Social Aspects of Digital Libraries: http://www-lis.gseis.ucla.edu/DL/
[Figure: information life cycle diagram. Labels: Creating, Authoring, Modifying, Organizing, Indexing, Storing, Retrieving, Distributing, Networking, Accessing, Filtering, Using, Retention / Mining]

Quality and the Information Life Cycle
[Figure: life cycle diagram annotated with quality dimensions.
 Phases: Active, Semi-Active, Inactive
 Quality dimensions: Accuracy, Completeness, Conformance, Timeliness, Similarity, Preservability, Pertinence, Significance, Accessibility, Relevance
 Activities: Creation, Authoring, Modifying, Describing, Organizing, Indexing, Storing, Archiving, Distribution, Networking, Accessing, Filtering, Seeking, Searching, Browsing, Recommending, Utilization, Retention, Mining, Discard
 Legend: E: Ellis’ model (E1: starting); K: Kuhlthau’s model]

Digital Libraries (DLs) -- Objectives
• World Lit.: 24hr / 7day / from desktop
• Ubiquitous
• Integrated “super” information systems
• Usable, Useful
• Higher Quality, Lower Cost
• Education, Knowledge Sharing, Discovery
• Disintermediation -> Collaboration
• Universities Reclaim Property
• Interactive Courseware, Student Works

Informal 5S & DL Definitions
DLs are complex systems that
• help satisfy info needs of users (societies)
• provide info services (scenarios)
• organize info in usable ways (structures)
• present info in usable ways (spaces)
• communicate info with users (streams)

A Minimal DL in the 5S Framework
[Figure: 5S concept diagram. Labels: Streams, Structures, Spaces, Scenarios, Societies; Structured Stream; Structural Metadata Specification; Descriptive Metadata Specification; services; indexing, browsing, searching; hypertext; Digital Object; Collection; Metadata Catalog; Repository; Minimal DL]

Infrastructure Services
• Repository-Building
  – Creational: Acquiring, Cataloging, Crawling (focused), Describing, Digitizing, Federating, Harvesting, Purchasing, Submitting
  – Preservational: Conserving, Converting, Copying/Replicating, Emulating, Renewing, Translating (format)
• Add Value: Annotating, Classifying, Clustering, Evaluating, Extracting, Indexing, Measuring, Publicizing, Rating, Reviewing (peer), Surveying, Translating (language)

Information Satisfaction Services
• Browsing, Collaborating, Customizing, Filtering, Providing access, Recommending, Requesting, Searching, Visualizing

DL Curriculum Development Project
• Collaborative Research launched by:
  – Department of Computer Science, Virginia Tech
  – School of Information and Library Science, University of North Carolina, Chapel Hill
• Three year (2006 - 2008) funded project

DL Topics in 19 Modules (original)

OAI = Technical Umbrella for Practical Interoperability…
…that can be exploited by different communities
• Reference Libraries
• Museums
• Publishers
• E-Print Archives

OAI – Repository Perspective
[Figure: repository diagram with boxes labeled MDO and DO; label "Required: Protocol"]

OAI – Black Box Perspective
[Figure: seven boxes labeled OA 1 through OA 7]

The World According to OAI
[Figure labels: Service Providers; Discovery; Current Awareness; Preservation; Data Providers]

ETDs: History of Rationales
• EPub: SGML, Electronic Manuscript Project
• Graduate Education: Reach next generation of researchers, educators, leaders
• DL: Testbed, demonstration, case study
• Institutional Repository: Good place to start since it is easy, inexpensive, and beneficial, and can be extended to lead to other beneficial activities

The Networked Digital Library of Theses and Dissertations
www.NDLTD.org
• Training Authors
• Expanding Access
• Preserving Knowledge
• Improving Graduate Education
• Enhancing Scholarly Communication
• Empowering Students & Universities
Leader of the Worldwide ETD (Electronic Thesis and Dissertation) Initiative
http://scholar.lib.vt.edu/theses/available/etd-2227102539751141/

NDLTD: How can a university get involved?
• Select planning/implementation team
  – Graduate School
  – Library
  – Computing / Information Technology
  – Institutional Research / Educ. Tech.
• Join as a member
• Adapt a proven approach
  – Build interest and consensus
  – Start trial / allow optional submission

NDLTD Goals
• For Students:
  – Gain knowledge and skills for the Information Age, especially about Digital Libraries
  – Richer communication (digital info, multimedia, …)
• For Universities:
  – Easy way to enter the digital library field and benefit
• For the World:
  – Global digital library – large, useful, many services
• Generally:
  – Save time and money
  – Increased visibility for all associated with university research results

Some Countries
• Argentina
• Australia
• Belgium
• Brazil
• Canada
• Chile
• China, Hong Kong
• Colombia
• Finland
• France
• Germany
• Greece
• India
• Italy
• Jamaica
• Korea
• Lithuania
• Malaysia
• Mexico
• Namibia
• Netherlands
• Norway
• Peru
• Poland
• Russia
• Singapore
• S. Africa
• S. Korea
• Spain
• Sudan
• Sweden
• Switzerland
• Taiwan
• Thailand
• Turkey
• UK
• Ukraine
• United Arab Emirates
• USA
• Venezuela
• Yugoslavia

Selected Projects / Sponsors
• Australia (ADT)
• Brazil (BDT, IBICT)
• Canada
• Catalunya
• Chile (Cybertesis)
• China (CALIS)
• Germany
• India (Vidyanidhi)
• Korea
• OhioLINK: 79 colleges/univs
• Portugal (National Library)
• South Africa
• Texas Digital Library
• UK (British Library, JISC, Edinburgh, …)
• UNESCO (especially Latin America, Eastern Europe, Africa)
• …

NDLTD Members - 1
Association of Research Libraries
Ball State University
Brigham Young University
California Institute of Tech.
Consorci de Biblioteques Universitàries de Catalunya
Georg August Universität Göttingen
George Washington University
Georgetown University
Georgia Institute of Technology
Georgia Southern University
Georgia State University
Government of Canada
Griffith University
Johns Hopkins University
Kauno Technologijos Universitetas
Louisiana State University
L'Université du Québec à Rimouski
McGill University
New Jersey Institute of Technology
Ohio University
Oregon State U. Library

NDLTD Members - 2
Penn State University
Pontifícia U. Católica do Rio de Janeiro
Portugal National Library
Rhodes University
Rita Chu (individual)
Simon Fraser University
State of Kansas
Texas Tech University
Triangle Research Lib. Net.
U. de las Américas, Puebla
Universität St. Gallen
U. Alabama at Birmingham
U. Arizona
U. Glasgow
U. Hong Kong
U. Kentucky
U. Maine
U. Missouri
U. New Orleans
U. North Texas
U. Pittsburgh
U. Pretoria
U. South Florida
U. Tennessee
U. Waterloo
Uppsala Universitet
Utah Academic Library Assn.
Virginia Commonwealth U.
Virginia Tech
West Virginia U. Libraries
Worcester Polytechnic Inst.
Yale University

NDLTD Member Support
• Annual conference (…, Germany, …, Sweden, UK)
• ETD-L – listserv for discussion
• Union catalog
• Services for access: VT, OCLC, VTLS, Scirus, Google Scholar, …
• Information for ETD projects
  – Standards, documentation (Guide, Marcel Dekker book)
• Advocacy for ETD activities worldwide
• …

NDLTD Incorporation
• Networked Digital Library of Theses and Dissertations incorporated May 20, 2003 in Virginia, USA
• Charitable and educational purposes (501(c)(3))
• Officers
  – Executive Director (Ed Fox)
  – Secretary (Gail McMillan)
  – Treasurer (Scott Eldredge)

Board of Directors
• Suzie Allard (ETD2004, U. Kentucky)
• Denise A. D. Bedford (World Bank)
• Julia C. Blixrud (ARL, SPARC)
• José Luis Borbinha (Natl Lib Portugal)
• Alex Byrne (ETD2005, ADT: Australia)
• Tony Cargnelutti (ETD2005, Australia)
• Vinod Chachra (VTLS)
• William Clark (Ohio State U.)
• Susan Copeland (RGU, UK)
• Jude Edminster (Bowling Green St. U.)
• Scott Eldredge (Treasurer, ETD2002, BYU)
• Edward A. Fox (Exec Director, Virginia Tech)
• John H. Hagen (West Virginia U.)
• Thomas B. Hickey (OCLC)
• Christine Jewell (U. Waterloo, Canada)
• Joan K. Lippincott (CNI)
• Austin McLean (ProQuest)
• Gail McMillan (Secretary, Virginia Tech)
• Joseph Moxley (ETD2000, USF)
• Eva Müller (U. Uppsala, Sweden)
• Ana Pavani (PUC Rio, Brazil)
• Sharon Reeves (Nat’l Library Canada)
• Janice Rickards (chair of ADT)
• Peter Schirmbacher (ETD2003, Humboldt)
• Samson Soong (Hong Kong U. Science & Technology)
• Hussein Suleman (U. Cape Town, S. Africa)
• Shalini R. Urs (U. Mysore, India)
• Eric F. Van de Velde (ETD2001, Caltech)
• Ellen Wagner (Adobe)

NDLTD Committees (Chairs)
• Awards (John Hagen)
• Conferences (Sharon Reeves)
• Development (Peter Schirmbacher)
• Executive (Edward Fox)
• Finance (Scott Eldredge)
• Implementation (Ana Pavani)
• Membership (Eric F. Van de Velde)
• Nominating (Joan Lippincott)
• Standards (Thomas B. Hickey)
• Union Catalog (Vinod Chachra)

NDLTD Committee Activities
• Implementation
  – How to apply standards?
  – How to coordinate a national program?
  – How to launch a pilot project?
  – How to train students regarding copyright, digital libraries, electronic publishing, preservation, …?
• Membership, Member Support, Public Relations
  – How to clarify and publicize member services?
  – How to double membership in 2 years?
  – Sub-committees for regional member support?
• Please join / update your Member Info!

Standards
• PDF -> PDF/A
• SGML, XML, XML DTDs, XML Schema
• Multimedia
• References -> Reference List for CrossRef
• Packaging -> METS -> Ease Role of Search Engines

Partnerships
• UMI/ProQuest
• Adobe, IBM, Microsoft, …
• UNESCO
• OCLC
• VTLS
• Ex Libris
• Scirus
• Google

UNESCO and ETDs (by Axel Plathe at ETD2003)
• Promoting the use of the Internet as a tool for disseminating scientific knowledge
• Facilitating the transfer of ETD expertise from developed to developing countries
• 1998: Member of the NDLTD Steering Committee
• 1999: First UNESCO ETD meeting on ETD internationalisation
• 2002: “UNESCO Guide to Electronic Theses and Dissertations”
• 2003: Model training programmes and training courses
• 2003: Sponsor pilot projects
• 2003: Pilot projects (Africa, Europe, Latin-America)

Union catalog: OCLC
• OCLC runs an OAI data provider on TDs.
• It gets data from WorldCat (so, from many sites!).
• It harvests from all others who contact them (see Thom Hickey).
• Sites need DC and either ETD-MS or MARC.
• Sites need a set for ETDs, or a separate data provider.

OCLC SRU Interface

ETD Union Search

Mirror Site in China (CALIS)
(http://ndltd.calis.edu.cn – popular site!)

VTLS
• VTLS offers its free VALET system to manage ETDs at institutions, building upon Fedora, as well as VTLS software.
• VTLS runs a service provider atop the Union Catalog. It supports multilingual access to metadata through the interface.

VTLS Content Languages
The VTLS service has data in 6 different languages. These are:
• English
• German
• Greek
• Korean
• Portuguese
• Spanish
Examples follow
[Screenshot: Language = German; hits = 137]
[Screenshot: Full record display]

Expansion of Full-text Services
• Running since Sept 2005: Scirus
• In beta test: Google Scholar
• Next: Microsoft ?
• Challenges:
  – Broadening the coverage since OAI use has not spread as widely as we would like
  – Understanding use, throughout life cycle
  – Data and DL services quality problems
  – Inconsistency in way to get from metadata to the fulltext file(s)
  – Cross-language information retrieval

Google Co-op, Custom Search Engines

Preservation Information
• Henry M. Gladney, Ph.D. (408)867-5454 http://home.pacbell.net/hgladney
• A book, "Preserving Digital Information", is to be available approximately February 2007. See
• http://www.springer.com/east/home?SGWID=5102-22-1736779190&changeHeader=true&SHORTCUT=www.springer.com/3-540-37886-3
• http://home.pacbell.net/hgladney/PDI_front.pdf

LOCKSS for ETDs
• Lots of copies keep stuff safe
• Stanford (Vicky Reich)
• Initial content: journals
• Experiments, studies of Int’l ETD service
  – Humboldt, PUC Rio, U. Cape Town, VT, …
• Production service?

User Expertise
[Chart: Users' Expertise in Years. X axis: Years (0 to 50); Y axis: Users (0 to 200)]

Date Stamp of ETD
[Chart: Date Stamp of ETD. X axis: Year (17xx through 2006, plus "error"); Y axis: 0 to 60,000]

Supply-Demand Comparison
[Chart: ETD Resources and User Demands (Number of Queries) in NDLTD. Series: ETDs, Demands; X axis: Academic Categories 1 to 8; Y axis: 0% to 50%]
Academic Categories:
1 Architecture and Design
2 Law
3 Medicine, Nursing and Veterinary Medicine
4 Arts and Science
5 Engineering and Applied Science
6 Business and Commerce
7 Education
8 Others (unclassifiable)

NDLTD cross-language problem
Language      Number
English       123,696
Portuguese     11,434
German          4,131
French          3,868
Spanish         1,561
Chinese         1,463
Catalan           804
Others         19,962 (most unclassified)
Total         166,919 (summer ’05)

Example concept map

Ryan Richardson solution to NDLTD cross-language problem
The Concept Map: From learning tool to cross-language knowledge discovery tool

4. More advanced techniques for concept map creation
• Hussein Suleman’s ETD, Chapter 5
• Concept map with 65 nodes
• Same concept map pruned down to 15 nodes
• Detail of ToC maps for ETD by Suleman
• Detail of Relex map showing relationship between ‘OAI’ and ‘bandwidth’
• Detail of ToC maps showing 1st sentence of section 2.8

Summary and Conclusions
(Editorial: ETDs & NDLTD progressing well!)
• Summary Words and Phrases
• Crossing the Chasm
• Steps
• Spirit of NDLTD
• Selected Links

Conference Summary Words - 1
accessibility, aggregation, alert, annotate, archive, arts, attitudes, authentication, authoring, authorization, automation, browse, catalog, collaboration, community, components, context, conversion, customer, decentralized, digitize, discourse, discovery, dissemination, DSpace, economic, federated, Fedora, global, grid, harvesting, ingest, innovation, institutional, integrity, interaction

Conference Summary Words - 2
interchange, interoperability, knowledge, LOCKSS, management, metadata, national, OCR, organization, partnership, PDF (/A), podcasting, portal, preservation, provider, regional, repository, retrieval, scalability, Scirus, search, server, service, sharing, standardization, strategic, student, summarization, sustainable, testimonial, toolkit, training, tutorial, Unicode, usable, VALET, workflow, XML, XSLT

Conference Summary Phrases - 1
alumni development, always on, business model, concept map, content management, copyright compliance, cost effective, Creative Commons, creative material, cross language, dark archive, developing country, digital library, digital rights management, digital signature, disruptive technology, document model, Dublin Core

Conference Summary Phrases - 2
e-knowledge, e-publishing, e-research, e-science, full text, Google Scholar, institutional repository, LDAP server, learning object, mandatory deposit, Million Book Project, national initiative, Net Gen, OAI-PMH, online digital studio, open access, Open Archives Initiative, open source

Conference Summary Phrases - 3
persistent identifiers, postgraduate research, public domain, restricted access, retrospective conversion, scholarly communication, server log, service oriented architecture, social network, stepping stone, subject gateway, survey data, union catalog, unlocking IP, user centered, value added, voluntary participation, walking the talk, web based, web services

Journal?
• Joseph Moxley [moxley@cas.usf.edu]
• JET / JED: Journal of Electronic Theses and Dissertations
• JODLAR: Journal of Digital Libraries and Repositories
• Publisher: ?
• Columns / Sections
  – Project Case Studies
  – Standards, Technologies, Best Practices
  – Software, Systems, Services
  – Statistics, Surveys, Analytical Studies

Steps
• Join NDLTD
• Launch initiative, dialog, encourage
• Pilot -> requirement
• OAI data provider
• Log, survey, analyze, improve
• Attend ETD xx
• Help other sites
• Serve on NDLTD committees
• Extend services: preservation, inst. rep., …

Spirit of NDLTD
• Help make a better (smaller) world
• Win-win-win (everyone can benefit)
• Have fun helping others
• Helpers/teachers learn more than those they work with
• Build on standards
• ETDs are preservable, popular, expressive, “better”
• Doable, feasible, learnable, affordable, sharable
• Please join/support NDLTD!

Selected Links - http://fox.cs.vt.edu
• NDLTD (electronic theses and dissertations worldwide)
  – www.ndltd.org and etdguide.org
  – http://fox.cs.vt.edu/etd-search.htm
• DL curriculum - http://curric.dlib.vt.edu/wiki, http://curric.dlib.vt.edu/DLcurric.html
• Virginia Tech Digital Library Research Laboratory (DLRL, www.dlib.vt.edu)
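
Appendix sketch: harvesting ETD metadata over OAI-PMH
The "OAI data provider" step and the union catalog requirements (Dublin Core, a set for ETDs) can be illustrated with a minimal harvesting sketch. It only builds a ListRecords request URL and parses a response; it is not the harvester used by OCLC, VTLS, or NDLTD. The base URL, the set name "etd", and the sample record are hypothetical, made up for illustration. The namespaces and the "oai_dc" metadata prefix come from the OAI-PMH 2.0 specification.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespaces defined by OAI-PMH 2.0 and unqualified Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "oai_dc": "http://www.openarchives.org/OAI/2.0/oai_dc/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None):
    """Build a ListRecords request URL for an OAI data provider."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        params["set"] = set_spec  # restrict the harvest to one set, e.g. ETDs
    return base_url + "?" + urlencode(params)


def parse_records(xml_text):
    """Extract a few Dublin Core fields from a ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for rec in root.iterfind(".//oai:record", NS):
        dc = rec.find("oai:metadata/oai_dc:dc", NS)
        if dc is None:
            continue  # deleted records carry a header but no metadata
        records.append({
            "identifier": rec.findtext(
                "oai:header/oai:identifier", default="", namespaces=NS),
            "title": dc.findtext("dc:title", default="", namespaces=NS),
            "creator": dc.findtext("dc:creator", default="", namespaces=NS),
            "language": dc.findtext("dc:language", default="", namespaces=NS),
        })
    return records


# Hypothetical response body; identifiers and metadata are invented.
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header>
        <identifier>oai:etd.example.org:0001</identifier>
        <datestamp>2006-10-27</datestamp>
        <setSpec>etd</setSpec>
      </header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>A Made-Up Thesis Title</dc:title>
          <dc:creator>Doe, Jane</dc:creator>
          <dc:language>en</dc:language>
        </oai_dc:dc>
      </metadata>
    </record>
    <record>
      <header status="deleted">
        <identifier>oai:etd.example.org:0002</identifier>
        <datestamp>2006-10-28</datestamp>
      </header>
    </record>
  </ListRecords>
</OAI-PMH>"""
```

Usage: list_records_url("http://etd.example.org/oai", set_spec="etd") gives the request URL, and parse_records(SAMPLE_RESPONSE) returns one dictionary for the live record while skipping the deleted one. A real harvester would also fetch the URL over HTTP and follow resumption tokens, which this sketch leaves out. The language field is the kind of value that the cross-language counts depend on.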