Taxonomies and Semantic Enrichment

advertisement

Using Taxonomies to

Create People

Directories and Author

Networks

Bert Carelli, Vice President Business

Development www.accessinn.com

Goals and Agenda

 Why is it important right now that we develop better resources about people?

 What roles can taxonomies play in this effort?

 What opportunities are being created for knowledge discovery and collaboration?

 What broad initiatives and technologies should we be aware of?

Access Innovations Background

 Founded 1978: more taxonomies built than any other company worldwide

Thought leaders and educators in taxonomy standards

Data Harmony Software: field tested, battle hardened, award winning software

"7-fold productivity increase"

ProQuest CSA

"Cut search time 50%."

The Weather Channel

The Semantic Roadmap:

K

nowledge Organization Systems

Complex

High value

Semantic network

Ontology

Linked Entities

Contextual Specificity

Thesaurus

Taxonomy

Controlled vocabulary

Synonym set/ring

Name authority file

 Uncontrolled list

Simple

Low Value

Unrelated Entities

Ambiguity

The Web Transformed…

Mobile Social

 Web 1.0 = Getting Content Online (over

120 million websites and growing)

 Web 2.0 = Getting People Online

Facebook: 700 Million Users Projected for 2011 (Open-First)

Twitter: Over 9 Billion Updates per Month

Linkedin: Over 200 Million People

Searches per Month

A highly trusted source

Social technologies will boost the utility of this long-trusted source

Business Applications of Social Media

Sharing ideas for innovation

Co-developing new products or services

Enabling customers to get help from other customers

Generating brand awareness and visibility

Peer review

Assisting the selling process by sharing contacts in a community

Company or Member Profiles

User pastes or uploads CV

Button to autoextract taxonomy attributes

User Reviews tagging for accuracy

One Person, Many Representations

VIAF : Virtual International

Authority File http://viaf.org/viaf/95216565/

Creating an Author Authority Database

 Tag all articles in the repository with standard subjects

 Export author names, subjects, institutions, locations, etc.

 Disambiguate authors with the same or similar names

Author Data: View by Connections…

… or by location…

…Or in the Document itself:

http://dx.doi.org/10.1371/journal.pntd.0000228.x001

Many Repositories for Names

VIAF : Virtual International

Authority File http://viaf.org

Project VIVO

Designed around linked data standards

Resource Description Framework (RDF)

VIVO’s ontology integrates data from human resource systems, grants databases, faculty annual reporting systems, and publication databases

 Free open-source software download: http://vivo.sourceforge.net

Detailed Profiles of Medical/BioMedical Researchers

Explore a research area

Focus the results desired expertise and research activity

Information stored as Resource Description Framework (RDF)

Data is structured in the form of “triples” as subjectpredicate-object.

 Concepts and their relationships use a shared ontology to facilitate the harvesting of data from multiple sources.

Jane

Smith is member of has affiliations with author of

Book chapter

Dept. of

Genetic s

Genetics

Institute

College of

Medicine

Journal article

Book

Subject Predicate Object

Detailed Data Relationships

Connections among scientists illustrated

David Nelson

Biomedical Informatics organization with position for is research area of has research area has position in featured in

Mike Conlon

Clinical Translational

Science Institute (CTSI) organization with position for

Ed Tech Magazine

Gene Anderson features has author author of author of

Development of an Observational Instrument to

Measure Mother-Infant Separation Post Birth

Current and accurate data revealed has author

Visualizations: co-author networks

ORCID

125 Participant Organizations

25

ORCID Profile

Exchange

Testing Possible Matching Algorithms:

VIAF matching technology from OCLC

• Author Resolver from ProQuest

Matching capability from OKKAM

Access Innovations Author Authority

ORCID

F67572010

26

Integrating Identity into Publisher

Systems

Manuscript Submission

• Confirm background and authenticity of submitting authors

Peer review

• Confirm background and authenticity of prospective reviewers and identify potential conflicts of interest

Connection to publications and other works

• Linking ORCID profiles with scholarly output enables disambiguation and improves publisher decision-making.

Social networks

• Integrating profile information into publishers social network systems enhances value and stickiness of these services (e.g. Nature Networks)

Questions?

Bert Carelli

Vice President, Business Development

Access Innovations / Data Harmony

Bert_carelli@accessinn.com

510-551-3784

Corporate Number: 505-998-0800

Download