Category Archives: Data Science Journal

Posts relating to the data science journal

April 2020: Publications in the Data Science Journal


Title:
Correction: ‘Developing a Research Data Policy Framework for All Journals and Publishers
Author
: Iain Hrynaszkiewicz, Natasha Simons, Azhar Hussain, Rebecca Grant, Simon Goudie
URL: 
http://doi.org/10.5334/dsj-2020-017

Title:
Developing an Open Data Portal for the ESA Climate Change Initiative
Author: Philip Kershaw, Kevin Halsall, Bryan N. Lawrence, Victoria Bennett, Steve Donegan, Alan Iwi, Martin Juckes, Eduardo Pechorro, Ruth Petrie, Joe Singleton, Ag Stephens, Alison Waterfall, Antony Wilson, Alexander Wood
URL: http://doi.org/10.5334/dsj-2020-016
Title: Digital Objects – FAIR Digital Objects: Which Services Are Required?
Author
: Ulrich Schwardmann
URL: http://doi.org/10.5334/dsj-2020-015

March 2020: Publications in the Data Science Journal


Title:
Dataset after Seven Years Simulating Hybrid Energy Systems with Homer Legacy
Author
: Alexandre Beluco, Frederico A. During F°, Lúcia M. R. Silva, Jones S. Silva, Lúis E. Teixeira, Gabriel Vasco, Fausto A. Canales, Elton G. Rossini, José de Souza, Giuliano C. Daronco, Alfonso Risso
URL: 
http://doi.org/10.5334/dsj-2020-014

Title:
GIS Project ROSA: FAIR Principles in the Petroleum Industry
Author: Anastasia Odintsova , Alena Rybkina, Julia Nikolova, Anna Korolkova
URL: http://doi.org/10.5334/dsj-2020-013
Title: MASER: A Science Ready Toolbox for Low Frequency Radio Astronomy
Author
: Baptiste Cecconi, Alan Loh, Pierre Le Sidaner, Renaud Savalle, Xavier Bonnin, Quynh Nhu Nguyen, Sonny Lion, Albert Shih, Stéphane Aicardi, Philippe Zarka, Corentin Louis, Andrée Coffre, Laurent Lamy, Laurent Denis, Jean-Mathias Grießmeier, Jeremy Faden, Chris Piker, Nicolas André, Vincent Génot, Stéphane Erard, Joseph N. Mafi, Todd A. King, Jim Sky, Markus Demleitner
URL: http://doi.org/10.5334/dsj-2020-012

Title:
Experimental Data of Muon Hodoscope URAGAN for Investigations of Geoffective Processes in the Heliosphere
Author
: Anna Kovylyaeva , Ivan Astapov, Anna Dmitrieva, Vladimir Borog, Natalia Osetrova, Igor Yashin
URL: http://doi.org/10.5334/dsj-2020-011

Title:
Risk Assessment for Scientific Data
Author
: Matthew S. Mayernik , Kelsey Breseman, Robert R. Downs, Ruth Duerr, Alexis Garretson, Chung-Yi (Sophie) Hou, Environmental Data Governance Initiative (EDGI) and Earth Science Information Partners (ESIP) Data Stewardship Committee
URL: http://doi.org/10.5334/dsj-2020-010

Title:
How Do People Make Relevance Judgment of Scientific Data?
Author
: Jianping Liu , Jian Wang, Guomin Zhou, Mo Wang, Lei Shi
URL: http://doi.org/10.5334/dsj-2020-009

Title:
Who Bears the Burden of Long-Lived Molecular Biology Databases?
Author
: Heidi J. Imker
URL: http://doi.org/10.5334/dsj-2020-008

 

February 2020: Publications in the Data Science Journal

February 2020:  Publications in the Data Science Journal


Title:
Impacts and Challenges of ICT Based Scale-up Campaigns: Lessons Learnt from the Use of SMS to Support Maize Farmers in the UPTAKE Project, Tanzania
Author
: Lucy Karanja, Stephanie Gakuo, Monica Kansiime, Dannie Romney, Henry Mibei, James Watiti, Leonard Sabula, Daniel Karanja
URL: 
http://doi.org/10.5334/dsj-2020-007

Title:
Automatic Data Standardization for the Global Cryosphere Watch Data Portal
Author:Mathias Bavay , Joel Fiddes, Øystein Godøy
URL: http://doi.org/10.5334/dsj-2020-006

Title:
Developing a Research Data Policy Framework for All Journals and Publishers
Author
: Iain Hrynaszkiewicz , Natasha Simons, Azhar Hussain, Rebecca Grant, Simon Goudie
URL: http://doi.org/10.5334/dsj-2020-005

January 2019: Publications in the Data Science Journal

January 2019:  Publications in the Data Science Journal


Title: 
Data Curation Profiling to Assess Data Management Training Needs and Practices to Inform a Toolkit
Author
: Bradley Bishop, Hannah Gunderman, Rowena Davis, Tina Lee, Rebecca Howard, Robert Samors, Fiona Murphy, Judit Ungvari
URL: 
http://doi.org/10.5334/dsj-2020-004

Title:
Data Without Software Are Just Numbers
Author: James Harold Davenport , James Grant, Catherine Mary Jones
URL: http://doi.org/10.5334/dsj-2020-003

Title: 
Knowledge Grid: An Intelligent System for Collaboration and Knowledge Management in Nigerian Universities
Author
: Boluwaji A. Akinnuwesi , Adedoyin Odumabo, Benjamin S. Aribisala
URL: http://doi.org/10.5334/dsj-2020-002

Title: 
A Discussion of Value Metrics for Data Repositories in Earth and Environmental Sciences
Author
: Cynthia Parr, Corinna Gries, Margaret O’Brien, Robert R. Downs, Ruth Duerr, Rebecca Koskela, Philip Tarrant, Keith E. Maull, Nancy Hoebelheinrich, Shelley Stall
URL: http://doi.org/10.5334/dsj-2020-001

December 2019: Publications in the Data Science Journal

December 2019:  Publications in the Data Science Journal

Title: The Norwegian National Ground Segment; Preservation, Distribution and Exploitation of Sentinel Data
Author
: Trygve Halsne, Lara Ferrighi, Bard Saadatnejad, Nico Budewitz, Frode Dinessen, Lars-Anders Breivik, Øystein Godøy
URL: 
http://doi.org/10.5334/dsj-2019-061
Title: “Data Stewardship Wizard”: A Tool Bringing Together Researchers, Data Stewards, and Data Experts around Data Management Planning
Author:Robert Pergl, Rob Hooft, Marek Suchánek, Vojtěch Knaisl, Jan Slifka
URL: http://doi.org/10.5334/dsj-2019-059
Title: A Discussion of Value Metrics for Data Repositories in Earth and Environmental Sciences
Author
: Cynthia Parr, Corinna Gries, Margaret O’Brien, Robert R. Downs, Ruth Duerr, Rebecca Koskela, Philip Tarrant, Keith E. Maull, Nancy Hoebelheinrich, Shelley Stall
URL: http://doi.org/10.5334/dsj-2019-058

November 2019: Publications in the Data Science Journal

November 2019:  Publications in the Data Science Journal

Title: Reviving an Old and Valuable Collection of Microscope Slides Through the Use of Citizen Science
Author
: John Pring, Lesley Wyborn, Neal Evans
URL: 
http://doi.org/10.5334/dsj-2019-057
Title: Efficient Stratified Sampling Graphing Method for Mass Data
Author: Jianjun Wang, Yingang Zhao, Jun Chen, Suqing Zhang, Xudong Zhao, Yufei He
URL: http://doi.org/10.5334/dsj-2019-056
Title: A Comprehensive Video Dataset for Multi-Modal Recognition Systems
Author
: Anand Handa, Rashi Agarwal, Narendra Kohli
URL: http://doi.org/10.5334/dsj-2019-055
Title: Proper Attribution for Curation and Maintenance of Research Collections: Metadata Recommendations of the RDA/TDWG Working Group
Author
: Anne E. Thessen , Matt Woodburn, Dimitrios Koureas, Deborah Paul, Michael Conlon, David P. Shorthouse, Sarah Ramdeen
URL: 
http://doi.org/10.5334/dsj-2019-054
Title: Intelligent Electronic Management of Library by Radio Frequency Identification Technology
Author
: Qinglan Huang, Hongyi Huang
URL: 
http://doi.org/10.5334/dsj-2019-053
Title: The History and Future of Data Citation in Practice
Author
: Mark A. Parsons, Ruth E. Duerr, Matthew B. Jones
URL: 
http://doi.org/10.5334/dsj-2019-052

 

October 2019: Publications in the Data Science Journal

October 2019:  Publications in the Data Science Journal

Title: Different Preservation Levels: The Case of Scholarly Digital Editions
Author
: Elias Oltmanns, Tim Hasler, Wolfgang Peters-Kottig, Heinz-Günter Kuper
URL: 
http://doi.org/10.5334/dsj-2019-051
Title: A Method for Extending Ontologies with Application to the Materials Science Domain
Author: Huanyu Li, Rickard Armiento, Patrick Lambrix
URL: http://doi.org/10.5334/dsj-2019-050
Title: Analysis of Several Years of DI Magnetometer Comparison Results by the Geomagnetic Network of China and IAGA
Author
: ufei He, Xudong Zhao , Dongmei Yang, Fuxi Yang, Na Deng, Xijing Li
URL: http://doi.org/10.5334/dsj-2019-049

 

September 2019: Publications in the Data Science Journal

September 2019:  Publications in the Data Science Journal

Title: Data Sharing at Scale: A Heuristic for Affirming Data Cultures
Author
: Lindsay Poirier, Brandon Costelloe-Kuehn
URL: 
http://doi.org/10.5334/dsj-2019-048
Title: Building Infrastructure for African Human Genomic Data Management
Author:Ziyaad Parker , Suresh Maslamoney, Ayton Meintjes, Gerrit Botha, Sumir Panji, Scott Hazelhurst, Nicola Mulder
URL: http://doi.org/10.5334/dsj-2019-047
Title: Analysis of Rainfall and Temperature Data Using Ensemble Empirical Mode Decomposition
Author
: Willard Zvarevashe, Symala Krishnannair, Venkataraman Sivakumar
URL: http://doi.org/10.5334/dsj-2019-046
Title: Policy Needs to Go Hand in Hand with Practice: The Learning and Listening Approach to Data Management
Author
: Maria Cruz, Nicolas Dintzner, Alastair Dunning, Annemiek van der Kuil, Esther Plomp, Marta Teperek, Yasemin Turkyilmaz-van der Velden, Anke Versteeg
URL: 
http://doi.org/10.5334/dsj-2019-045
Title: The Australian Research Data Commons 
Author
: Michelle Barker, Ross Wilkinson, Andrew Treloar
URL: 
http://doi.org/10.5334/dsj-2019-044
Title: The Impact of Targeted Data Management Training for Field Research Projects – A Case Study
Author
: Jonathan L. Petters , George C. Brooks, Jennifer A. Smith, Carola A. Haas
URL: 
http://doi.org/10.5334/dsj-2019-043

July 2019: Publications in the Data Science Journal

July 2019:  Publications in the Data Science Journal

Title: Real Estate Evaluation Model Based on Genetic Algorithm Optimized Neural Network
Author
: Yan Sun
URL: 
http://doi.org/10.5334/dsj-2019-036
Title: Abnormal Pattern Prediction: Detecting Fraudulent Insurance Property Claims with Semi-Supervised Machine-Learning
Author: Sebastián M. Palacio
URL: http://doi.org/10.5334/dsj-2019-035
Title: A Regional Project in Support of the SADC Cyber-Infrastructure Framework Implementation: Weather and Climate
Author
: Mary-Jane Morongwa Bopape , Happy Marumo Sithole, Tshiamo Motshegwa, Edward Rakate, Francois Engelbrecht, Emma Archer, Anneline Morgan, Lwando Ndimeni, Joel Botai
URL: http://doi.org/10.5334/dsj-2019-034
Title: Designing Transnational Hydroclimatological Observation Networks and Data Sharing Policies in West Africa
Author
: Seyni Salack , Aymar Bossa, Jan Bliefernicht, Sina Berger, Yacouba Yira, Kamil A. Sanoussi, Samuel Guug, Dominicus Heinzeller, Adolphe S. Avocanh, Barro Hamadou, Symphorien Meda, Belko A. Diallo, Igor B. Bado, Inoussa A. Saley, Elidaa K. Daku, Namo Z. Lawson, Aida Ganaba, Safiétou Sanfo, Koufanou Hien, Arone Aduna, Gero Steup, Bernd Diekkrüger, Moussa Waongo, Antonio Rogmann, Ralf Kunkel, John P. A. Lamers, Mouhamadou B. Sylla, Harald Kunstmann, Boubacar Barry, Laurent G. Sedogo, Christian Jaminon, Paul Vlek, Jimmy Adegoke, Moumini Savadogo
URL: 
http://doi.org/10.5334/dsj-2019-033
Title: An Automated Machine Learning Based Decision Support System to Predict Hotel Booking Cancellations
Author
: Nuno Antonio, Ana de Almeida, Luis Nunes
URL: 
http://doi.org/10.5334/dsj-2019-032
Title: Indigenous Data Governance: Strategies from United States Native Nations 
Author
: Stephanie Russo Carroll, Desi Rodriguez-Lonebear, Andrew Martinez
URL: 
http://doi.org/10.5334/dsj-2019-031
Title: Building Open Access to Research (OAR) Data Infrastructure at NIST
Author
: Gretchen Greene , Raymond Plante, Robert Hanisch
URL: 
http://doi.org/10.5334/dsj-2019-030
Title: The Landscape of Rights and Licensing Initiatives for Data Sharing
Author
: Sam Grabus, Jane Greenberg
URL: 
http://doi.org/10.5334/dsj-2019-029
Title: Data Sharing Practices among Researchers at South African Universities
Author
: Siviwe Bangani, Mathew Moyo
URL: 
http://doi.org/10.5334/dsj-2019-028

Publishing an article in CODATA Data Science Journal

This article was first published by Ms. Neema Mduma https://neylicious.github.io/ml/2019/05/11/paper.html – Neema is an alumni of the CODATA-RDA School of Research Data Science.

In early 2017, I was privileged to work as a researcher in the Dropwall project (by Rose Funja) which was among the winning project of the Data for Local Impact Innovation Challenge (DLIIC). The main focus of the project was to develop a tool that will help fighting dropout among secondary school girls. The findings from this project show a high rate of dropout among secondary school students particularly girls, and coincide with reports from other studies which show that school dropout is a big challenge in developing countries. On addressing this problem, machine learning techniques has gained much attention in recent years. However, most of the work has been carried out in developed countries, there are only a handful of studies conducted in developing countries on school dropout using machine learning techniques with the consideration of local context and data imbalance problem. This motivated me to continue working (in my PhD) on school dropout using machine learning.

In August 2018, I attended a CODATA-RDA Research Data Science Summer School which was held at the Abdus Salam International Centre of Theoretical Physics (ICTP) in Trieste, Italy. The aim was on building competence in data analysis and security for participants from all disciplines and backgrounds from Sciences to Humanities. The level of engagements and interactions between participants and instructors was outstanding. We were introduced to various opportunities (by The Executive Director of CODATA, Dr. Simon Hodson) such as CODATA Data Science Journal where I later managed to publish the breathtaking findings from the Dropwall project titled A Survey of Machine Learning Approaches and Techniques for Student Dropout Prediction.