NSTL-SCOAP3_1 (Read-Only) - Indico

advertisement
工作交流
国家科技图书文献中心(NSTL)
与高能物理开放出版联盟
(SCOAP3)
NSTL-SCOAP3
Work Communication
Bai Haiyan
Zhang Xiaodan
Institute of Scientific and Technical Information of China
CONTENTS
Background
NSTL-SCOAP3 Web Service
NSTL-SCOAP3 Decision-support
Conclusion
1 Background
NSTL ,ISTIC & NSTL-SCOAP3 Working
Group
Objective and Web Service of NSTL
Background
NSTL
National Science and Technology Library (NSTL), founded in
2000, is a virtual sci-tech library on national level. NSTL provides
a nonprofit information service for the public national wide
through network service.
ISTIC
Institute of Scientific and Technical Information of China (ISTIC) ,
founded in 1956, is one of the NSTL nine members.
Information Technology Support Center (ITSC),as a division of
ISTIC, it is also the network center of NSTL and the co-chair
member of NSTL’s IT workforce.
NSTL-SCOAP3 Working Group
Founded in March 2014.
IT support team responsible for SCOAP3 web service in China
and provide IT support for decision-making.
Organization Structure of NSTL
NSTL DigitalOperationPlatform
Running mode
三、NSTL技术支撑平台
Objective and Web Service of NSTL
Objective:
Aims to build a sci-tech collection and secure its service
functionality in a strategic way at the national level in this digital
era
Resource:
More than 150 millions records of documentary data up to now
About 70 millions of abstract data rom more than 27,000 titles of
foreign journals and conference proceedings
About 85 millions of science citation records processed from about
3000 international core journals.
NSTL owns the intellectual property of the data.
Web Service:
Free information retrieval and Document Delivery Service
APIs for third-party institutional users
2
3
NSTL-SCOAP
Web Service in
building
Goals and Objectives
Workflow
Main functions, Bottlenecks & Solutions
Information retrieval
Information analysis
Linkage with research data
Goals and Objectives
NSTL level-Coping with OA trend
Collects, preserves, organizes OA articles and provides service
Transfers document service model from abstract-based to full textbased
Enriches service forms for institutional users and third-party users
User level-explore new service model
HEP personal users are used to visit INSPIRE, ArXiv.org
Why they use NSTL?
Become a document-centered linkage hub providing
information retrieval, information analysis and linkage with
research data etc.
Play roles as a tool built-in user’s IR, paper submission system
and other most familiar information environment
Expand from personal users to institutional users by broadly
cooperation with research institutes, journal publishers,
information platform etc. in HEP
Workflow
Retrieval
Analysis
Harvest from SCOAP3
Repository
Data
linkage
Decision-making
support
Web Service
(in building)
Feedback
No
❶ DataHarvesting
Ø Harvestdaily
Ø HarvestChina-Set
irregularly scheduled
Ø Toinsureharvest
completely
Data
Quality
Control
Yes
NSTL
Storage
❷ DataQualityControl
Ø Forerrorwithaffiliations,
authorsname,title
etc.,contactpublisher.
Ø ForerrorslikemissingDOI,
contactSCOAP3
❸ NSTLStorage
Ø Dataareparsedinto
metadataandfulltext
Ø BuildindexbySolr.
Ø build figures URLto
journal website
Functions (1) - Information Retrieval
(in building)
Objective:
Attempt to realize new information retrieval functionality based on
full text in XML or PDF format
Functions
Full text retrieval: pure text exclusive to formula, table, figure
Segment retrieval: table, figure, reference
Structure retrieval: background, conclusion etc. for research article
Bottlenecks
Data problems caused by data quality and standard degree:
XML file, JATS
User’ needs :formula retrieval is useful?
Solutions:
Begin from journals whose format and metadata are standard and
normative
Give close attention to related quality control projects
Homepage
Figure Retrieval
Table retrieval
Structure retrieval
Research
article’s
background
Research
article’s
conclusion
Functions(2)-Information Analysis
(in design)
Services:
Methods:
Hotspots Analysis, Trends Analysis, Subject structure analysis…
Word frequency statistics, co-word analysis, association
analysis…
Bottlenecks
Absence of keywords, 42% SCOAP3 articles
How to process keywords contain number or formula?
Solutions
Automatic indexing +manual indexing
STKOS (Science and Technology Knowledge Organization System)
Established by NSTL
Integrates 257 vocabularies and classifications in science and
technology
Includes more than 600,000 concepts from more than 2 million
source terms
Automatic indexing system developed by NSTL based on STKOS
has finished indexing for main body of resource.
CategoryClass=HEP
http://172.18.20.2:7001/nstl/stkos_list.jsp?fl=linktoCCUInotation_A022222867%E2%89%A1121705.05_%E9%AB%98%E8%83%BD%E7%89%A
9%E7%90%86%E5%AD%A6_%E8%8C%83%E7%95%B4%40&
HEP in STKOS
834 concepts
Integrated from more than 4000 terms
From 98 vocabularies
Functions(3)-linkage with research data
(in planning)
Goals and objectives:
Link paper with open data, as the response to user’s information
retrieval
Play role as linkage hub built in user’s information environment
through broad cooperation and reuse of open information
platforms
Help authors to provide publishable research data to open
data repository driven by related policy
Help authors to build linkage with research data deepened into
internal paper (References linkage, Table and figure linkage,
Article refer-to linkage)
Help users to submit paper, research data to ORCID’s
academy CV
Help user simplify the submission to build linkage once and
summit to all
Estimate personal or institutional impact
How to realize?
Three necessary conditions: open data, resolvable full text and
linkage builder
Cooperative and available targets:
IR(Knowledge Repository of Institute Of IHEP,CAS)
Journal publisher (Chinese Physics )
Open data repository (HEPData)
Information platform (INSPRIE)
Author
Open Data
linkage
• Preprint system
• IR system
• Paper submit system
•….
e.g. HEPData,
Resolvable
Full text
e.g. SCOAP3
Article
Figshare
Information Service,
e.g.INSPIRE
Professional
3 NSTL-SCOAP3 Decision-support
China contribution statistics
3 NSTL-SCOAP3 Decision-support
China contribution statistics
Conclusion
Great thankfulness for technical support from
SCOAP3
To be an active participator in SCOAP3 technical
work
Learn from CERN and follow CERN’s good
examples
Further Cooperation with CERN
Thanks!
Download