{"id":3228,"date":"2025-12-12T07:48:31","date_gmt":"2025-12-12T07:48:31","guid":{"rendered":"https:\/\/codata.org\/blog\/?p=3228"},"modified":"2025-12-12T07:48:31","modified_gmt":"2025-12-12T07:48:31","slug":"bridging-two-worlds-reflections-from-the-idw2025-panel-on-research-data-and-data-science","status":"publish","type":"post","link":"https:\/\/codata.org\/blog\/2025\/12\/12\/bridging-two-worlds-reflections-from-the-idw2025-panel-on-research-data-and-data-science\/","title":{"rendered":"Bridging Two Worlds: Reflections from the IDW2025 Panel on Research Data and Data Science"},"content":{"rendered":"<p><span style=\"font-weight: 400;\"><a href=\"https:\/\/codata.org\/blog\/wp-content\/uploads\/2025\/12\/IDW_2025_14th_Oct_DAY_TWO-161.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignright wp-image-3230 size-full\" src=\"https:\/\/codata.org\/blog\/wp-content\/uploads\/2025\/12\/IDW_2025_14th_Oct_DAY_TWO-161.jpg\" alt=\"\" width=\"1024\" height=\"683\" srcset=\"https:\/\/codata.org\/blog\/wp-content\/uploads\/2025\/12\/IDW_2025_14th_Oct_DAY_TWO-161.jpg 1024w, https:\/\/codata.org\/blog\/wp-content\/uploads\/2025\/12\/IDW_2025_14th_Oct_DAY_TWO-161-300x200.jpg 300w, https:\/\/codata.org\/blog\/wp-content\/uploads\/2025\/12\/IDW_2025_14th_Oct_DAY_TWO-161-768x512.jpg 768w, https:\/\/codata.org\/blog\/wp-content\/uploads\/2025\/12\/IDW_2025_14th_Oct_DAY_TWO-161-624x416.jpg 624w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/a>At IDW2025, a group of speakers from around the globe gathered to address a long-standing problem: <\/span><a href=\"https:\/\/scidatacon.org\/event\/9\/contributions\/47\/\"><span style=\"font-weight: 400;\">although <\/span><i><span style=\"font-weight: 400;\">data<\/span><\/i><span style=\"font-weight: 400;\"> is the common currency of both research data management and data science, the two communities often work in parallel worlds<\/span><\/a><span style=\"font-weight: 400;\">\u2014each with its own conferences, training 
pipelines, infrastructures, and priorities. As Christine Kirkpatrick noted in her opening remarks, this separation persists despite converging challenges around stewardship, reproducibility, education, and ethics. She framed the session as an invitation to rethink how these domains might come together.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">What followed was a set of short talks revealing just how interdependent, yet disconnected, these communities have become, and how much potential lies in more intentional collaboration.<\/span><\/p>\n<p><b>From Observation to Interpretation: A Research Lifecycle View (Leo Lahti)<\/b><\/p>\n<p><span style=\"font-weight: 400;\">Leo Lahti opened with a fundamental question: <\/span><i><span style=\"font-weight: 400;\">How do we move from raw observation to meaningful interpretation in modern research?<\/span><\/i><span style=\"font-weight: 400;\"> His answer traced the entire research lifecycle, positioning openness, interoperability, and transparency as essential ingredients. Drawing on studies that show how different choices in data preparation lead to drastically different results, Lahti made a compelling case for shared standards and methodological clarity.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">His overarching argument: bridging data science and research data management is not merely technical; it is epistemic. It requires both communities to adopt shared infrastructures, shared educational foundations, and shared norms that elevate transparency as a scientific value.<\/span><\/p>\n<p><b>The Human Infrastructure of Data (Daphne Raban)<\/b><\/p>\n<p><span style=\"font-weight: 400;\">Daphne Raban shifted the lens to data stewardship, which she called a \u201cbridge profession\u201d sitting at the intersection of technology, governance, and human judgment. 
As data volumes grow and automated tools proliferate, she reminded us that <\/span><i><span style=\"font-weight: 400;\">stewardship is what keeps data meaningful, contextualized, and ethically sound.<\/span><\/i><\/p>\n<p><span style=\"font-weight: 400;\">Raban illustrated the diverse impact of stewards across healthcare, finance, government, and research institutions, grounding her argument in the data cycle perspective advanced through the Israeli national initiative on data science education. In her framing, stewardship is not just about compliance; it&#8217;s about building trustworthy, reusable data ecosystems sustained by communication, documentation, and collaboration.<\/span><\/p>\n<p><b>Parallel Universes: Awareness Gaps in Data Education (Phil Bourne)<\/b><\/p>\n<p><span style=\"font-weight: 400;\">Phil Bourne then highlighted a striking and often overlooked fact: students in data science programs worldwide typically have no exposure to organizations like CODATA, RDA, or WDS. Meanwhile, those global data organizations often operate with limited awareness of the educational and research priorities of academic data science. These are, Bourne argued, <\/span><a href=\"https:\/\/pebourne.wordpress.com\/2023\/04\/16\/deans-blog-parallel-universes\/\"><i><span style=\"font-weight: 400;\">parallel universes<\/span><\/i><\/a><span style=\"font-weight: 400;\"> that urgently need bridges.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">His proposed actions were concrete: connect student groups, align leadership networks, embed governance into data science curricula, and convene joint thematic workshops on AI, synthetic data, and data ethics. 
He framed data as a continuum &#8211; from production to engineering to analysis to societal impact &#8211; and argued that without collaboration across these steps, sustainability and trustworthiness will remain elusive.<\/span><\/p>\n<p><b>Data Literacy for Everyone: K\u201312 and Community College Pathways (Padmanabhan Seshaiyer)<\/b><\/p>\n<p><span style=\"font-weight: 400;\">Padmanabhan Seshaiyer expanded the conversation into the educational pipeline, urging the community not to wait until university to introduce data literacy. He showcased innovative K\u201312 and community college bridge programs that pair culturally relevant pedagogy with inquiry-based learning grounded in the Data Cycle.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Students move from no-code tools to higher-code environments, tackling authentic problems\u2014from geometry-based triangulation tasks to investigations of social issues such as bullying and community safety. These programs embed ethics, design thinking, social justice, and civic reasoning alongside technical skills.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Seshaiyer\u2019s message was clear: building an equitable data future requires early, inclusive, and context-aware data education.<\/span><\/p>\n<p><b>Embedding FAIR into Australia\u2019s Climate Modelling Software (Kelsey Druken)<\/b><\/p>\n<p><span style=\"font-weight: 400;\">Finally, Kelsey Druken offered a concrete case study of integrating data stewardship directly into infrastructure within Australia\u2019s national climate modelling system, ACCESS. Climate modelling, she reminded the audience, is intensely data-rich, but native model outputs often lack documentation, standards, and consistent metadata. 
FAIR compliance tends to happen afterward, manually, and inconsistently.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">ACCESS-NRI is now working to embed FAIR principles <\/span><i><span style=\"font-weight: 400;\">inside<\/span><\/i><span style=\"font-weight: 400;\"> the software workflows themselves. By developing versioned data specifications, harmonized naming conventions, controlled vocabularies, and comprehensive metadata at the point of production, they aim to ensure that FAIR becomes the default, not the afterthought.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Druken\u2019s work powerfully illustrates what it looks like when practice and infrastructure finally align\u2014a challenge raised repeatedly throughout the panel.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Across these five talks, several themes emerged:<\/span><\/p>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>Interoperability and transparency<\/b><span style=\"font-weight: 400;\"> must be built into workflows\u2014not bolted on later.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>Education is the shared foundation<\/b><span style=\"font-weight: 400;\">, from K\u201312 to graduate programs to professional development.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>Stewardship is central<\/b><span style=\"font-weight: 400;\">, not peripheral, to both science and data science.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>Organizational silos hinder progress<\/b><span style=\"font-weight: 400;\"> across global and academic communities.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>A roadmap is needed<\/b><span style=\"font-weight: 400;\">, and the audience\u2019s input will help shape one for future CODATA, RDA, and ADSA collaborations.<\/span><\/li>\n<\/ul>\n<p><span style=\"font-weight: 400;\">Possible next steps include forming a CODATA Task Group or RDA Interest Group, 
coordinating ecosystem tools and shared training resources, and proposing a companion session for the next ADSA meeting. Although there was no broad support for creating a new interest or task group, attendees were interested in further opportunities to continue the conversation.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Amy Nurnberger (MIT), who attended the session, has already taken action. She and others have proposed a follow-on session at the upcoming Research Data Access and Preservation (RDAP) virtual conference to ensure that the information science and library communities weigh in on bridging this divide.<\/span><\/p>\n<p><b>The Future of Data Depends on Us Learning to Work Together<\/b><\/p>\n<p><span style=\"font-weight: 400;\">If there was one message that resonated across the session, it was this: <\/span><b>No single community can build the data ecosystem and the community needed; it requires data stewards, data scientists, educators, and infrastructure providers working together.<\/b><\/p>\n<p><span style=\"font-weight: 400;\">Bridging the gaps between these worlds is not simply a matter of efficiency or coordination. 
It is a matter of <\/span><i><span style=\"font-weight: 400;\">scientific integrity<\/span><\/i><span style=\"font-weight: 400;\">, <\/span><i><span style=\"font-weight: 400;\">ethical responsibility<\/span><\/i><span style=\"font-weight: 400;\">, and <\/span><i><span style=\"font-weight: 400;\">global impact<\/span><\/i><span style=\"font-weight: 400;\">.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The panel made clear that the future of data &#8211; open, FAIR, ethical, and societally meaningful &#8211; will be built only when we stop treating research data and data science as parallel tracks and instead recognize them as parts of a shared, interdependent community.<\/span><\/p>\n<p><a href=\"https:\/\/codata.org\/blog\/wp-content\/uploads\/2025\/12\/IDW_2025_14th_Oct_DAY_TWO-228.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignright size-full wp-image-3229\" src=\"https:\/\/codata.org\/blog\/wp-content\/uploads\/2025\/12\/IDW_2025_14th_Oct_DAY_TWO-228.jpg\" alt=\"\" width=\"1024\" height=\"683\" srcset=\"https:\/\/codata.org\/blog\/wp-content\/uploads\/2025\/12\/IDW_2025_14th_Oct_DAY_TWO-228.jpg 1024w, https:\/\/codata.org\/blog\/wp-content\/uploads\/2025\/12\/IDW_2025_14th_Oct_DAY_TWO-228-300x200.jpg 300w, https:\/\/codata.org\/blog\/wp-content\/uploads\/2025\/12\/IDW_2025_14th_Oct_DAY_TWO-228-768x512.jpg 768w, https:\/\/codata.org\/blog\/wp-content\/uploads\/2025\/12\/IDW_2025_14th_Oct_DAY_TWO-228-624x416.jpg 624w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>At IDW2025, a group of speakers from around the globe gathered to address a long-standing problem: although data is the common currency of both research data management and data science, the two communities often work in parallel worlds\u2014each with its own conferences, training pipelines, infrastructures, and priorities. 
As Christine Kirkpatrick noted in her opening remarks, [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[47],"tags":[],"class_list":["post-3228","post","type-post","status-publish","format-standard","hentry","category-idw2025"],"_links":{"self":[{"href":"https:\/\/codata.org\/blog\/wp-json\/wp\/v2\/posts\/3228","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/codata.org\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/codata.org\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/codata.org\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/codata.org\/blog\/wp-json\/wp\/v2\/comments?post=3228"}],"version-history":[{"count":1,"href":"https:\/\/codata.org\/blog\/wp-json\/wp\/v2\/posts\/3228\/revisions"}],"predecessor-version":[{"id":3231,"href":"https:\/\/codata.org\/blog\/wp-json\/wp\/v2\/posts\/3228\/revisions\/3231"}],"wp:attachment":[{"href":"https:\/\/codata.org\/blog\/wp-json\/wp\/v2\/media?parent=3228"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/codata.org\/blog\/wp-json\/wp\/v2\/categories?post=3228"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/codata.org\/blog\/wp-json\/wp\/v2\/tags?post=3228"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}