Data Issues in China
CODATA-China ever since has been making efforts on scientific data sharing in China through national and CAS programs.
National Data sharing – Scientific Data Sharing Program
Promoted by scientist, Ministry of Science and Technology (MOST) gives top priority to scientific data sharing and includes the implementation of Scientific Data Sharing Program (SDSP) into the construction of national science and technology infrastructure platform. This program is a national level program launched by MOST in 2002.
- Pilot project, 2002-2005
- Phase I: 2006-2010, infrastructure construction and data integration
More than 18 scientific data sharing projects were funded by MOST after 2002, including Resource and Environment, Agriculture, Population and Health, Basic and Frontier Sciences, Engineering and Technology, Regional Development fields in 24 departments. And it also established the data sharing environment, including policy, standards, data products, data sharing platform, etc.
For Research Data Sharing Polices, it added scientific data sharing section to China
Scientific and Technological Progress Law, and 40 rules and regulations on data sharing made by relevant departments in various domains.
This program has its Clearinghouse and Service System;
Chinese Science and Technology Resource Sharing Network is an open access portal provided service in September, 2009
A national level research data clearinghouse be opened to public
By the end of 2010, it built a data management and sharing service system with a three-tier structure of 40 scientific data centers or networks covering the 6 disciplines of natural science and environment, agriculture, population and health, basic and frontier sciences, engineering and technology and comprehensive regional science.
It built Chinese research data catalogue as follows,
– Population and HealthBio medical, clinical, public health, traditional Chinese medicine ,pharmacy, population and reproductive health, ect
– Earth system
Geographical, Natural Resources, Ecological and Environment, Polar Research, Space Science, earth Observation, etc.
Forest resource, forest protection, forest cultivation, wood science
– Meteorological and atmospheric science
– Agriculture Science
– Earthquake Science
– Basic Science
Physics, chemistry, astronomy, material science, biology, etc.
- Phase II: 2010-, open access service, evaluation and authority
Six research data system be authorized as national science and technology infrastructure and it make service assessment annually
– Forestry Science Data System
– Data Sharing Network of Earth System Science
– Data Sharing Network of Population and Health
– Agriculture Science Data system
– Meteorological Science Data System
– Earthquake Science data system
Scientific Data Base and its Application Environment of CAS
The data infrastructure includes massive storage system, data-intensive computing facility, high speed network and scientific databases. There are more than 20 large scientific facilities producing huge data and still 20 facilities are under construction, more than 100 field observation stations collecting long term monitoring data including Ecology, Environment, Space, more than 100 institutes has archived, managed and shared long-term research data.
- Scientific Databases (SDB)
Scientific Databases is a long-term mission with many institutes involved started in1986 which funded by CAS. It collecting multi-discipline research data and promoting data sharing with 337 research databases by 61 institutes and 500TB data available to open access and download, thus it’s a long-term, large-scale collaboration.
The SDB covering a range of disciplines, including Physics & Chemistry, Geosciences, Biosciences, Atmospheric & Ocean Science, Energy Science, Material Science, Astronomy & Space Science. From 2008-2010, it focused on data integration and improving research database to be resource database and even reference database. Now it has 8 Resource databases, 2 Reference databases, 4 application-Oriented databases. All databases can be accessed online, and most over 70% data are free and open. By the end of 2013, the scientific database access visitors added up to 62 million, and the data download was more than 660TB.
- Massive Storage System and Data Storage Service
The internet-based storage service system is data backup and storage system for large scientific facilities. It is the archiving and curation of research data and databases in CAS, supporting on-line data accessing, analyzing, and data-intensive applications. It has 10PB online disk storage, 25PB tape storage, 5000 CPU Core computing facility, and 2.5Gbps network connected with CSTNet, CNGI, GLORIAD.
Advanced research cyber-infrastructure based on the Next Generation Internet of China (CNGI), funded by National Development and Reform Commission, is a very comprehensive program started in 2010 with 500 Million RMB funding. Through this program, more than 100 institutes, 100 field stations, some large scientific facilities and data centers will be linked together by very high speed network via IPV6, so as to set up big data transmission and sharing environment supporting large-scale data intensive discovery. Supported by this program, Chinese Scientific Data Cloud constitutes 12 data centers and one data archive center, and provides big data online storage, data backup, data archive and data intensive analysis services.
Find out more
Highlights since 2012
Membership (coming soon)