Making metadata for Manga and applying Topic Maps to it Shoichiro HARA Center for Integrated Area Studies, Kyoto University Motomu NAITO Knowledge Synergy Inc. Outline 1. 2. 3. 4. Background and Purpose Functions of Manga Metadata Components of Manga Metadata Apply Topic Maps to the metadata - Bibliographical Metadata - Structural Metadata 5. Web Applications on topic maps 6. Conclusion 7. Challenge and Future work 1. Background and Purposes Manga has already established itself a genre of culture – One of foremost Japanese pop culture – – Media diversity/derivation: Weekly periodicals, books, animations, movies, dramas, theaters, parodies etc. Problems of Manga – World wide market Attractive to Japanese culture and language Media characteristics – Sales in 2011: 271.7 Bill. / 1112.3 Bill. Easy to be disappeared Easy to be out of print and/or cease publication Easy to be discarded Easy to be destroyed Importance of Metadata, digitization and preservation But there are no well established metadata for Manga database THEN, our trial to define new metadata of Manga Bookstore in Bangkok 2. Functions of Manga Metadata 1. Ability to describe media diversity and derivation – From weekly periodicals to books and animations etc. 2. Ability to keep all aspects of Manga – Bibliographic, texts, images, movies etc. 3. Importance of participatory data creation – – No special librarians/catalogers of Manga Need grass roots supports (expressly for foreign materials) 4. Importance of effective use of Web information – – Much information about bibliographies, explanatory notes, authors, publishers etc. are available from Web Metadata will be used to organize existing information Thus Our Metadata should be easy to create and use should be compatible with other metadata 3. Components of Manga Metadata Bibliographic Metadata – – Structural Metadata – – DC based simple structure Describes elementary information about general concept of work, each work/media (such as book, novel, TV drama, movie, etc.), each resource, etc. Describes structural component of Manga book such as Page, Coma(Frame), Picture, Text, Symbol, etc. Will try to apply TEI Topic Maps and Web application – – Express metadata as network Link each component with other Web data Materials Title:花より男子 (Hana yori dango) / Boys over Flowers Author:神尾葉子/ Kamio, Yoko Outlines – Original: Serial Manga published inマーガレット (Marguerite), a biweekly girls’ Manga magazine published by 集英社 (Shuei-sha), from 1992 to 2004. – Book: 37 volumes. The best selling girls’ Manga in Japan (58000000 copies until Sep. 2006) – Multimedia: animation, drama, movie. – Translated versions: USA, Taiwan, Thailand, France, Spain etc. Reasons for Research Material – Various medias: good sample for metadata – Various translated versions: good for multilingual text sample Material : Book version (Vol. 1) – Japanese (original) (Shuei-sha) – English (Viz Media) – Thai (Siam Inter Comics) Image Data Image scanning – – 300dpi Gray scale(8bit), Only cover page is color scanning Preprocessing – – Remove the cover Then cut the binding of the book (Bind the book again after scanning) 8 Composition of Manga Example of Structure Metadata 10 Example of Bibliographic Metadata element content Title BOYS over FLOWERS Creator Description Yoko Kamio, PSI element Content Love story, http://id.ndl.go.jp/auth/ndlsh/00569390 (PSI) print, PSI Format Manga, book, URL Publisher SHUEISHA Inc., PSI Contributor Agent, PSI, Role Date 1992 Type StillImage, PSI Subject Identifier Source URL of resource Language Ja, PSI Relation isFormatOf, URL of resource Coverage Heisei, PSI place, PSI Rights Yoko Kamio, PSI 4. Apply Topic Maps to the Metadata What and Why Topic Maps? Simple, Intuitive and Human-friendly model for organizing information – Models target domain as concept/subject network (topics and associations between topics) – Links concepts and related information resources (occurrences) Consists of types (which corresponding Ontology) and instances – Same vocabularies and syntax are used for Types and Instances Uses Subject Identifier (IRI or URI) to identify subjects (topics) and link/merge subjects Has very powerful Topic Maps query language: tolog Has remote access protocol: TMRAP to access remote topic maps Topic Maps is an ISO standard (ISO/IEC 13250) Can use matured open source tools, e.g. Ontopia http://code.google.com/p/ontopia/ The Basic Model of Topic Maps • Some pool of information or data – any type, format, or location • A knowledge layer, consisting of: • Topics – • representing relationships between subjects Occurrences – • draw a set of topics representing the key subjects of the domain in question draw Associations – • P.S. Topics, associations and occurrences have types, and all types are also topics… Cat Street BOYS over FLOWERS Yoko Kamio born in links to information that is somehow relevant to a given subject = The TAO of Topic Maps (Source: Steve Pepper, “Towards Seamless Knowledge” ) TOKYO knowledge information 13 General View and target of Manga Metadata Three types targets – Work(著作), Expression(表現形), Manifestation(体現形) Metadata is made for each instance of the types Describing relationships (e.g. derivation) between instances Bibliographic Metadata Metadata is described by DCMES’s (Dublin Core Metadata Element Set) 15 elements and relationship between them Attaching identifier(IRI or URI) to the elements as far as possible Refer to: Steve Pepper, Expressing Dublin Core Metadata using Topic Maps, http://www.jtc1sc34.org/repository/0906.htm Topic map of Bibliographic Metadata Topic Types (8): Work, Expression, Manifestation, Agent, Coverage class, Language class, Subject class, Type class Occurrence Types (5): Description, Date, Format, Identifier, Rights Association Types (9): createdBy, hasSubject, publishedBy, contributedBy, typeAs, sourceOf, expressedIn, relatedWith, coveredWith Association role Types (18): creator, subject, publisher, contributor, type, source, language, relation, coverage, resourcerole-for-creator, resource-role-for-subject, resource-role-forpublisher, resource-role-for-contributor, resource-role-for-type, resource-role-for-type, resource-role-for-language, resource-rolefor-relation, resource-role-for-coverage title is used as topic name of resources (Work, Expression, Manifestation) * Red characters represent DCMES Structural Metadata Structural Metadata is described by Composition elements of Manga (volume, scene, page coma, script/onomatopoeia, person) and relationship between them IRIs (URIs) are attached to all components Topic map of Structural Metadata Topic Types (6):Volume, Scene, Page, Coma, Script/Onomatopoeia, Person Occurrence Types (3):Attribute, String, image Association Types (8):part-of, appear, depict, speak, beforeafter, coma-before-after, script-before-after, scene-before-after Association Role Types (16):whole, part, coma-appearingperson, person-in-coma, coma-depicting-script, script-in-coma, speaker, script-of-speaker, before, after, coma-before, coma-after, script-before, script-after, scene-before, scene-after Scope : Japanese(ja), English(en), Thai(th) Topic map of Structural Metadata Instance topics – volume: 1 – scene: 29 – page: 158 – come: 550 – script/onomatopoeia: 1,239 – person (character): 18 Instance associations – part-of: 780 – appear: 520 – depict: 1239 – before-after: 157 – coma-before-after: 547 – Script-before-after: 1238 5. Web Applications on topic maps Make Bibliographic topic map and Structural topic map separately – Transform CSV format to Topic Maps syntax – Using “DB2TM” in Ontopia as a tool Make Web application separately – Using “Ontopia Navigator Framework” in Ontopia as a tool Link bibliographic topic map with structural topic map – One way link at the moment Functions of the Web applications Instance list for each topic type Instance details – Topic characteristics (topic name, internal occurrence, external occurrence, association role) Graphic representation Keyword search Tolog query Navigating topics based on associations Web service interface by TMRAP Web Application for Bibliographic Metadata Web Application for Structural Metadata Top page Key word search Structural Metadata Web application Script/Onomatopoeia list Script/Onomatopoeia detail Coma detail 6. Conclusion Manga and related resources should be preserved and inherited Metadata plays very important roles to realize it By our method metadata changes from mere list of elements to structured network of concepts (including derivation) Necessity for introduction of semantic technology – This research uses Topic Maps technology – Computer can identify concepts using IRI or URI – Queries can be done from various point of views and based on association In result It becomes easy to manage, find and access the metadata and information resources (target of metadata) It makes new way to create and add new value to domains 7. Challenge and Future work Increase examples of Manga Metadata Conform the Bibliographic Metadata to FRBR Develop environment for metadata input and maintenance Make metadata guideline for animation, drama, movie, etc. Develop environment for register, manage and use of Subject Identifier (IRI or URI) Use authority data and ID of author and organization – E.g. Web NDL Authorities Yoko Kamio: http://id.ndl.go.jp/auth/ndlna/00320527 Ensure enough terms for subject (NDLSH are not enough) Make multilingual metadata And all that Challenge and Future work (cont.) Utilization of the Metadata Send messages about Manga related information resources to all over the world and pass them down the generations Try to apply the metadata to the learning materials, user guides and maintenance manuals for the instruments, etc. In order to increase usefulness of the Metadata, we would like to make sure some ambiguity requests can also be fulfilled. Such as: I'd like to find “that manga” which I read when I was a child. I'd like to use “appropriate manga” to check “Trend, Culture, Fashion, etc.” of “some old days”. Thank you! Any suggestion?