China’s Scientific Data Sharing Initiatives and Future Perspective Pro. Peng, Jie (pengj@istic.ac.cn) Dr. Liu, Runda (liurd@istic.ac.cn) 5 March 2012, Paris, Delivering data to science Institute of Scientific & Technical Information of China 1 Agenda 1. The Progress of Scientific Data Sharing Projects in China 2. Our Work in Scientific data Sharing 3. Conclusion & Outlook 2 1 The Progress of SDSP in China 3 Scientific Data Sharing • The Sharing of Scientific Data is important. • two types of sharing: sharing activities among resource holding bodies vs. sharing service between resource holding bodies and users. • Public Domain Data, Data the with the feature of Grey (greyData), Commercial data China Scientific Data Sharing Project • In 1982,Chinese Academy of sciences(CAS) Proposed the project of “Scientific database and information system” • In 1988,together with relating agencies and research institutes, CAS built World Data Center China Centers, and formed China Committee of Codata. • In 2001,Ministry of Science and Technology (MOST) conducted series of investigation and released series of reports. In the same year, meteorology data sharing pilot project was launched. 5 China Scientific Data Sharing Project(Cont.) • In 2003, Ministry of Finance start allocate special funding for MOST to construct China Scientific Infrastructure, Scientific Data Sharing Projects (SDSP) are among it. • Under the framework of SDSP, a comprehensive scientific data sharing activities was started. 24 government agencies involved in the building of 8 platforms in the field of Agriculture, Earth System Science, population and health etc. in the first stage. 6 Development stage of SDSP Comprehensive Operation and construction stage service Experiment stage Optimization and service National Scientific Data Sharing Network Pilots National Scientific Data center pilots Spread of scientific data sharing network Law and regulation Spread of national data center Preliminary research 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 The whole view of SDSP • SDSP was developed under comprehensive plan at the national level. (regulations and management structure, 263 standards and criteria) • Data of public good in different government sectors were put into a common sharing framework. (10 data centers or service network, 100 branches and nodes) The whole view of SDSP(Cont.) • SDSP make all these data accessible to all interested users at an affordable cost or free if possible(3000 databases for basic research and public welfare, 200 institutions, 140 TB data) • SDSP form a multi-tiled, cross agency, cross geographic location, cross discipline distributed scientific data sharing system that bridge the gap between different data holding agencies and institutes of public good and users.(A statistics of 2009, 170, 000 registered users, 62 M visits, 430 TB download) http://www.sciencedata.cn9 More results • An open mindset is formed in scientific data field – Regional SDSP was set up – Seeking a lot of joint Scientific Data Exchange programs, like HKH program with ICSITC – China actively participated in Codata – China also take part in WDC(now world data system) All WDC in China took part in SDSP since 2002. 10 2 Works relating Scientific data Sharing in ISTIC 11 Main duty • Institute of Scientific and Technical Information of China(ISTIC) is the only research institute affiliated with the Ministry of Science & Technology of China (MOST) conducting S&T Information research & service. • We collecting S&T literature. we also collecting other type of S&T information • Resource Sharing and Promotion center (RSPC) conducting research and practice including(not limited): – S&T resource management Theory studies (start from 2006, investigations, regulations etc.) – S&T information resource sharing technical solutions 12 2.1 Scientific data DOI in China 13 DOI Registration Agency in China • ISTIC in conjunction with WANGFANG Data Group became China’s only DOI® Registration Agency in March, 2007. • The agency focus on the development of Chinese language platform and gateway for DOI name use and are trying to attract metadata registration by building relating infrastructure. • The project start from Chinese journal article and scientific data, expanding to books and thesis. 14 The Progress of Scientific Data DOI in China • DOI name Coding Regulation • Metadata Description Standards • Service Platform Construction for Scientific data in Chinese Language, Provide DOI resolution and retrieval services • Provide linking between data and journal articles. • Build up Service Alliance, Registration of 15K natural S&T resources plus other more. 15 2.2 Scientific Data Classification and navigation system 16 Scientific Data Classification and navigation system • platform for scientific data resource information on the internet (metadata). • The system accelerate DOI registration and application of Scientific Data. • classify distributed scientific data resource on the internet effectively • Help improve the standard of scientific data resources in China and provide fast navigation and link. Multi-facet Keyword and Classification Connection mechanism • organize scientific data resource catalogue – Dynamic multi-facet classification and keyword connection indexing method – designs ranking scheme based on the weight of classification and keyword connection. 19 3 Conclusion & Outlook 20 Conclusion • SDSP is a government effort to promote the sharing of Scientific Data: Big budget, new Drive? • Theoretical foundation is important: Scientific Data Sharing is the transfer of certain rights of Scientific Data. • Technical endeavor: DOI registration, Linking, classification will help the management of Scientific Data resources. 21 Future focus • Data publish (a long way to go) – Datacenter view – Publisher view – Funding agency view • how to evaluate the result of data sharing infrastructures? It is important to build a third party evaluation mechanism. – The evaluation of data resource construction – Portal Information Architecture evaluation – Database function evaluation • Pro. Peng, Jie (pengj@istic.ac.cn) • Dr. Liu, Runda (liurd@istic.ac.cn) • 5 March 2012, Paris, Delivering data to science • Institute of Scientific & Technical Information of China 23