We will be holding a live panel discussion on the session at 15:00 UTC on Friday 16 October. Please register here: https://attendee.gotowebinar.
Please watch the recording of the session in advance of the discussion event. The recording for the session will be made available in the UN Data Forum Attendify on 12 October and you may also view it here and below.
Register for the UN Data Forum at: https://unstats.un.org/unsd/
- Virginia Murray, Public Health England and CODATA, ‘UNDRR / ISC Sendai Hazard Definition and Classification Review’
- Remo Lalli, United Nations, ‘Digital Document Management in the United Nations: a new paradigm’
- Cecilia Elizalde, United Nations, ‘Machine-readable resolutions’
- Chifundo Kanjala, London School of Hygiene and Tropical Medicine, ‘Infectious diseases cross-site data harmonisation in Africa’
- Alejandra Gonzalez Beltran, Science and Technology Facilities Council, ‘Data bridges for scientific and official data integration’
- Arofan Gregory, a Metadata Consultant working with the Data Documentation Initiative and with CODATA, ‘DDI-CDI: Data Documentation Initiative-Cross Domain Integration’ and ‘Summary of the Session Themes’
Recording of the Session
Solving today’s global grand challenges demands that both scientific and policy research embrace new innovations and synergies as well as multi-stakeholder partnerships. In order to help build the bridges needed between different data ecosystem domains and communities, the International Science Council, acting as the global voice for science, and working with the UN, has determined the need to launch the CODATA Decadal Programme to build the international communities of practice needed to increase large-scale usability of data across the scientific and policy spheres.
Analyses can benefit from cross-cutting techniques not restricted to those of a single domain. International data used to monitor policy are a fundamental ingredient in many research projects, both in academia and for policy research. They facilitate and improve the quality of research on the ground and can also be improved through the incorporation of data from the scientific community and elsewhere. Creating a two-way bridge between scientific research data and official data is vital.
New technologies leverage huge amounts of data, relying on semantic integration, openness, and the use of machine-learning. They require a massively increased use of data from both traditional and new sources. The plethora of formats and semantics, and the paucity of metadata create barriers to employing the new data-intensive techniques. We do not need new standards, new classifications, and new technology approaches, so much as we need to make those we already have work together effectively through multi-stakeholder partnerships on a broad scale.
Initial work has started through a series of workshops, projects and case studies, bringing together experts from diverse communities. The effort is use-case driven, covering urban resilience, disaster risk reduction, and infectious disease. These efforts are producing early draft guidelines – grounded in the FAIR data principles – to help build the communities of practice for the use of data across domain boundaries. The FAIR data principles are gaining traction across the global data ecosystem and being embraced by multiple stakeholders, including international organizations, public sector, civil society, academia and the business community.
Building on existing technical standards such as DDI and SDMX, on the wealth of semantic technologies from the W3C and elsewhere, and on machine-learning and data science techniques, the Decadal Programme will bring a cross-disciplinary, trans-community approach to how we manage, combine and use data. This session will provide an overview of these efforts, covering data documentation, technology, semantic integration, dissemination, and other key aspects.