Select Page

Data Repositories and Stewardship: Virtual SciDataCon 2021 Strand

Virtual SciDataCon 2021 is organised around a number of thematic strands.  This is the fifth of a series of announcements presenting these strands to the global data community. Please note that registration is free, but participants must register for each session they wish to attend.

The World Data System of the International Science Council is the sister organisation to CODATA and the co-organiser of Virtual SciDataCon 2021. WDS is a membership organisation that promotes ‘long-term stewardship of, and universal and equitable access to, quality-assured scientific data and data services, products, and information across all disciplines in the Natural and Social Sciences, and the Humanities.’ The WDS International Programme Office has recently moved to Oakridge, Tennessee, USA.  Three sessions on Tuesday 20 October look at various issues of concern for data repositories and for data stewardship worldwide.

Cross-border Collaboration in Developing Data Stewardship for Regional and Global Outreach of INFN Open Access Repositories, Tuesday 19 October, 11:00-12:30 UTC: REGISTER FOR THIS SESSION

The Italian National Institute of Nuclear Physics (INFN) has designed a prototype of Open Access Repository (www.openaccessrepository.it), which reuses the standard-based Invenio (back-end) and Zenodo (front-end) digital asset management systems (DAMS) with some extensions regarding the deployment model and the authentication. This prototype has been deployed in Ethiopia, Somalia and is in the process of being installed in Palestine, in Côte d’Ivoire and in further Middle East and African countries. Due to the same structure of the repositories and the similar academic culture in the countries, Ethiopia, Somalia, Côte d’Ivoire and Palestine are facing similar needs. Cross-border collaboration in the further development and operation of those repositories as well as on training activities for repository managers and users are key to develop the national repositories to regional and global open science repositories that empower and enable societies and act as interfaces between research and innovation. The centre piece of data that improves our world is the data steward or the data stewardship. Due to the fact that Open Access Repositories in Ethiopia, Côte d’Ivoire, Somalia and Palestine have the same starting point, the National Research and Education Networks (NREN) will initiate a discussion in order to find a way, how they could cooperate in developing a harmonised training for data stewards.

The State of Documenting and Reporting Data and Information Quality for Supporting Open Science, Tuesday 19 October, 13:00-14:30 UTC: REGISTER FOR THIS SESSION

Knowledge about the quality of data and information is important to support informed decisions on the (re)use of individual datasets and is an essential part of the ecosystem that supports open science. Data and information quality is essential for appropriate and effective use of data in research and/or applications. Especially with the emphasis on open data, as well as availability of data at very little or no cost, the data, metadata, and software are accessible to a much broader community of users, potentially across disciplines and international boundaries. Also, given the big data era, where abundant data products and services are available from a large number of sources for data-intensive applications, data quality information is especially important to ensure successful data reuse. We will explore activities and processes that contribute to planning for, improving, documenting and reporting, providing access to, and using data and information quality from various disciplines, ranging from Earth and space sciences to social sciences, covering a variety of data such as observational, experimental, model and citizen science data. The purpose of this session is to promote such information exchange and the resulting cross-discipline collaborations.

Making Your Data Center and Services Ready for AI: Case Studies, Tuesday 19 October, 16:00-17:30 UTC: REGISTER FOR THIS SESSION
Scientific data centers across the world are expected to adapt and modernize their data management to support rapidly evolving AI and machine learning-based cross domain research. Data centers and repositories are currently evaluating various strategies such as effective use of AI/ML for their data management operations and also provide AI-friendly data access services to the user community. Data operations such as data quality analysis, synthesis data product generations, data reduction where the data is generated, on-the-fly conditional queries are currently getting upgraded with the use of AI and machine learning. In addition, data centers that are successfully serving data to the broader user community are looking at improving data sharing and interoperability using AI. Although datasets may technically be interoperable, integration or communication among these datasets often fails because of a lack of cross-domain ontologies and standards, significantly impacting data sharing with inter-domain AI-based research activities. In this session, data center leaders will share their recent experience in adapting AI to improve the data management components and enable AI-friendly data services to user communities.

Virtual SciDataCon 2021 is organised by CODATA and the World Data System, the two data organisations of the International Science Council – PROGRAMME AT A GLANCE – FULL PROGRAMME – please note that registration is free, but participants must register for each session they wish to attend.