Digital Assets Matter: Demystifying digital assets management IMAS forum presented by University Digitisation Service, Library and Research Services, ITS 24 November 2011 SESSION OVERVIEW • Anna Shadbolt – Welcome & Introductions • Silvia Paparozzi - The why and how of Optical Character • • • Recognition (OCR) Ben Kreunen - Principles and best practices in the management of digital assets – Including demonstration of Cumulus™ Andy Tseng - Implementation of Mediaflux™ for Research Data – Use cases – Peter Cross & Jason Lohrey will lead demonstration of Mediaflux Joe Arthur – Q & A The why and how of Optical Character Recognition (OCR) Silvia Paparozzi, Senior Digitisation Officer University Digitisation Service (UDS) Scanned Image Microsoft Office Tools Microsoft Office Tools OCRed Word Document OCRed Word Document Adobe Acrobat Professional Adobe Acrobat Professional OmniPage OmniPage White on black background Adobe Acrobat Pro OmniPage Audio Wave File More information University Digitisation Service www.digitisation.unimelb.edu.au/ocr Principles and best practices in the management of digital assets Ben Kreunen, Digitisation Technical Support Officer University Digitisation Service (UDS) Digital Asset Management System DAMS: Introduction Digital • • • • • Files on a computer/ computer storage device Images, Video, Audio Word, PDF, PowerPoint, Excel Remote files (URLs) Any file containing information/ data • Not necessarily a single file! DAMS: Introduction Asset • Something of value – – – – – Expensive to acquire Expensive to create Time consuming to create Difficult to replace/ irreplaceable Useful for creating other assets DAMS: Introduction Digital Asset • Digital file(s) + • Metadata – – – – – Descriptive Provenance Copyright Management etc… Click to edit Master title style Attached metadata Analog Asset Orientation ID Orientation Reference Title: Figure Author Click to edit Master title style Single image file Digital Asset Click to edit Master title style Collection of files Digital Asset DAMS: Introduction Management • Perform actions • Make decisions: – based on information – guided by standard processes, guidelines, rules DAMS: Introduction System • A set of tools to perform actions • – Access – Creating derivative files – Cataloguing – Archiving etc… A database consisting of: – Metadata • Physical, descriptive, administrative – Predefined, customisable actions – Predefined, customisable workflows – Access control etc… DAMS: Introduction System • A set of tools to perform actions • – Access – Creating derivative files – Cataloguing – Archiving etc… A database consisting of: – Metadata • Physical, descriptive, administrative – Predefined, customisable actions – Predefined, customisable workflows – Access control etc… DAMS: Introduction DAM Builders • Integration with external services • Integration with external data sources • Access DAM from within applications – e.g. InDesign, PowerPoint Watched folders Folder synchronisation • • • Asset editing/ annotation Hugo.arg : http://commons.wikimedia.org/wiki/File:Beaver_dam_on_Smilga.JPG DAMS: Introduction DAM Busters • Not just a catalogue/ • repository Search is not the only function • The GIGO principle applies to metadata • Must be integrated into workflows in order to be practical • Systems don’t manage assets, people do. Bzuk : http://commons.wikimedia.org/wiki/File:Lancaster_VR-A.jpg Edit Annotate Workflow Access Comment Share Search Link Publications Catalogue Presentations Metadata Video Documents Export Images Archive Import Click to edit Master title style Feature Flick (including add-ons) Canto Cumulus Canto Cumulus Clients Server • Desktop (Java) • • • • – Windows – Mac OSX – Linux • Web • Client/Server • Single User • “Cross” Client Basic Workgroup Enterprise Feature set activated as purchased • Catalogue sizes up to 1Tb Canto Cumulus Cataloguing • Automatically catalogue linked files in PowerPoint, Indesign, images in PDF documents Index text in documents for searching • • Customise metadata field mapping on import • Catalogue pages of documents as separate items – PDF, InDesign, PowerPoint, OpenDoc, iWork, Keynote, Outlook • Shot boundary determination for videos • Catalogue URLs e.g. images on other websites Canto Cumulus Metadata • • • • • IPTC, EXIF, XMP Customisable fields Write metadata back to assets Templates for bulk application Prefilling Canto Cumulus Connectivity Authentication/ Roles • • • • • Built-in • LDAP • AD (Enterprise) XML SOAP WSDL Oracle Canto Cumulus Asset processing Workflow • Collect and package • Add comments on asset • assets for specific purposes ZIP, Email, CD preparation • previews without requiring creator application Live rules-based access controls e.g. approvals, embargo dates • Restrict searches, interface display to specific categories Canto Cumulus 3rd Party Add-ons • eCommerce • ImageMagick • Automated workflows Click to edit Master title style Let’s play! Canto Cumulus Q&A Joe Arthur Manager, University Digitisation Service Contacts • Dr Andy Tseng Research Services, ITS – Phone: 903 53315 – Email: tsenga@unimelb.edu.au • University Digitisation Service – Phone: 834 46161 – Email: digitisation-enquiries@unimelb.edu.au • Anna Shadbolt – Phone: 834 43139 – Email: annams@unimelb.edu.au More information USEFUL LINKS University Digitisation Service (UDS) http://digitisation.unimelb.edu.au/ Contact: http://digitisation.unimelb.edu.au/about/contact Research Services – ITS http://www.its.unimelb.edu.au/research Contact: http://www.its.unimelb.edu.au/research/contact Information Management Advisory Service (IMAS) http://www.imas.unimelb.edu.au/ Contact: imas-info@unimelb.edu.au Mediaflux™ Arcitecta http://www.arcitecta.com/Products Cumulus™, Canto http://www.canto.com/ © Copyright The University of Melbourne 2011 All images sourced through Microsoft Office