Category Archives: Data Science Journal

Posts relating to the data science journal

March 2020: Publications in the Data Science Journal

	Title: Dataset after Seven Years Simulating Hybrid Energy Systems with Homer Legacy Author: Alexandre Beluco, Frederico A. During F°, Lúcia M. R. Silva, Jones S. Silva, Lúis E. Teixeira, Gabriel Vasco, Fausto A. Canales, Elton G. Rossini, José de Souza, Giuliano C. Daronco, Alfonso Risso URL: http://doi.org/10.5334/dsj-2020-014
	Title: GIS Project ROSA: FAIR Principles in the Petroleum Industry Author: Anastasia Odintsova , Alena Rybkina, Julia Nikolova, Anna Korolkova URL: http://doi.org/10.5334/dsj-2020-013
	Title: MASER: A Science Ready Toolbox for Low Frequency Radio Astronomy Author: Baptiste Cecconi, Alan Loh, Pierre Le Sidaner, Renaud Savalle, Xavier Bonnin, Quynh Nhu Nguyen, Sonny Lion, Albert Shih, Stéphane Aicardi, Philippe Zarka, Corentin Louis, Andrée Coffre, Laurent Lamy, Laurent Denis, Jean-Mathias Grießmeier, Jeremy Faden, Chris Piker, Nicolas André, Vincent Génot, Stéphane Erard, Joseph N. Mafi, Todd A. King, Jim Sky, Markus Demleitner URL: http://doi.org/10.5334/dsj-2020-012
	Title: Experimental Data of Muon Hodoscope URAGAN for Investigations of Geoffective Processes in the Heliosphere Author: Anna Kovylyaeva , Ivan Astapov, Anna Dmitrieva, Vladimir Borog, Natalia Osetrova, Igor Yashin URL: http://doi.org/10.5334/dsj-2020-011
	Title: Risk Assessment for Scientific Data Author: Matthew S. Mayernik , Kelsey Breseman, Robert R. Downs, Ruth Duerr, Alexis Garretson, Chung-Yi (Sophie) Hou, Environmental Data Governance Initiative (EDGI) and Earth Science Information Partners (ESIP) Data Stewardship Committee URL: http://doi.org/10.5334/dsj-2020-010
	Title: How Do People Make Relevance Judgment of Scientific Data? Author: Jianping Liu , Jian Wang, Guomin Zhou, Mo Wang, Lei Shi URL: http://doi.org/10.5334/dsj-2020-009
	Title: Who Bears the Burden of Long-Lived Molecular Biology Databases? Author: Heidi J. Imker URL: http://doi.org/10.5334/dsj-2020-008

February 2020: Publications in the Data Science Journal

February 2020: Publications in the Data Science Journal

	Title: Impacts and Challenges of ICT Based Scale-up Campaigns: Lessons Learnt from the Use of SMS to Support Maize Farmers in the UPTAKE Project, Tanzania Author: Lucy Karanja, Stephanie Gakuo, Monica Kansiime, Dannie Romney, Henry Mibei, James Watiti, Leonard Sabula, Daniel Karanja URL: http://doi.org/10.5334/dsj-2020-007
	Title: Automatic Data Standardization for the Global Cryosphere Watch Data Portal Author:Mathias Bavay , Joel Fiddes, Øystein Godøy URL: http://doi.org/10.5334/dsj-2020-006
	Title: Developing a Research Data Policy Framework for All Journals and Publishers Author: Iain Hrynaszkiewicz , Natasha Simons, Azhar Hussain, Rebecca Grant, Simon Goudie URL: http://doi.org/10.5334/dsj-2020-005

January 2019: Publications in the Data Science Journal

January 2019: Publications in the Data Science Journal

	Title: Data Curation Profiling to Assess Data Management Training Needs and Practices to Inform a Toolkit Author: Bradley Bishop, Hannah Gunderman, Rowena Davis, Tina Lee, Rebecca Howard, Robert Samors, Fiona Murphy, Judit Ungvari URL: http://doi.org/10.5334/dsj-2020-004
	Title: Data Without Software Are Just Numbers Author: James Harold Davenport , James Grant, Catherine Mary Jones URL: http://doi.org/10.5334/dsj-2020-003
	Title: Knowledge Grid: An Intelligent System for Collaboration and Knowledge Management in Nigerian Universities Author: Boluwaji A. Akinnuwesi , Adedoyin Odumabo, Benjamin S. Aribisala URL: http://doi.org/10.5334/dsj-2020-002
	Title: A Discussion of Value Metrics for Data Repositories in Earth and Environmental Sciences Author: Cynthia Parr, Corinna Gries, Margaret O’Brien, Robert R. Downs, Ruth Duerr, Rebecca Koskela, Philip Tarrant, Keith E. Maull, Nancy Hoebelheinrich, Shelley Stall URL: http://doi.org/10.5334/dsj-2020-001

December 2019: Publications in the Data Science Journal

December 2019: Publications in the Data Science Journal

	Title: The Norwegian National Ground Segment; Preservation, Distribution and Exploitation of Sentinel Data Author: Trygve Halsne, Lara Ferrighi, Bard Saadatnejad, Nico Budewitz, Frode Dinessen, Lars-Anders Breivik, Øystein Godøy URL: http://doi.org/10.5334/dsj-2019-061
	Title: “Data Stewardship Wizard”: A Tool Bringing Together Researchers, Data Stewards, and Data Experts around Data Management Planning Author:Robert Pergl, Rob Hooft, Marek Suchánek, Vojtěch Knaisl, Jan Slifka URL: http://doi.org/10.5334/dsj-2019-059
	Title: A Discussion of Value Metrics for Data Repositories in Earth and Environmental Sciences Author: Cynthia Parr, Corinna Gries, Margaret O’Brien, Robert R. Downs, Ruth Duerr, Rebecca Koskela, Philip Tarrant, Keith E. Maull, Nancy Hoebelheinrich, Shelley Stall URL: http://doi.org/10.5334/dsj-2019-058

November 2019: Publications in the Data Science Journal

November 2019: Publications in the Data Science Journal

	Title: Reviving an Old and Valuable Collection of Microscope Slides Through the Use of Citizen Science Author: John Pring, Lesley Wyborn, Neal Evans URL: http://doi.org/10.5334/dsj-2019-057
	Title: Efficient Stratified Sampling Graphing Method for Mass Data Author: Jianjun Wang, Yingang Zhao, Jun Chen, Suqing Zhang, Xudong Zhao, Yufei He URL: http://doi.org/10.5334/dsj-2019-056
	Title: A Comprehensive Video Dataset for Multi-Modal Recognition Systems Author: Anand Handa, Rashi Agarwal, Narendra Kohli URL: http://doi.org/10.5334/dsj-2019-055
	Title: Proper Attribution for Curation and Maintenance of Research Collections: Metadata Recommendations of the RDA/TDWG Working Group Author: Anne E. Thessen , Matt Woodburn, Dimitrios Koureas, Deborah Paul, Michael Conlon, David P. Shorthouse, Sarah Ramdeen URL: http://doi.org/10.5334/dsj-2019-054
	Title: Intelligent Electronic Management of Library by Radio Frequency Identification Technology Author: Qinglan Huang, Hongyi Huang URL: http://doi.org/10.5334/dsj-2019-053
	Title: The History and Future of Data Citation in Practice Author: Mark A. Parsons, Ruth E. Duerr, Matthew B. Jones URL: http://doi.org/10.5334/dsj-2019-052

October 2019: Publications in the Data Science Journal

October 2019: Publications in the Data Science Journal

	Title: Different Preservation Levels: The Case of Scholarly Digital Editions Author: Elias Oltmanns, Tim Hasler, Wolfgang Peters-Kottig, Heinz-Günter Kuper URL: http://doi.org/10.5334/dsj-2019-051
	Title: A Method for Extending Ontologies with Application to the Materials Science Domain Author: Huanyu Li, Rickard Armiento, Patrick Lambrix URL: http://doi.org/10.5334/dsj-2019-050
	Title: Analysis of Several Years of DI Magnetometer Comparison Results by the Geomagnetic Network of China and IAGA Author: ufei He, Xudong Zhao , Dongmei Yang, Fuxi Yang, Na Deng, Xijing Li URL: http://doi.org/10.5334/dsj-2019-049

September 2019: Publications in the Data Science Journal

September 2019: Publications in the Data Science Journal

	Title: Data Sharing at Scale: A Heuristic for Affirming Data Cultures Author: Lindsay Poirier, Brandon Costelloe-Kuehn URL: http://doi.org/10.5334/dsj-2019-048
	Title: Building Infrastructure for African Human Genomic Data Management Author:Ziyaad Parker , Suresh Maslamoney, Ayton Meintjes, Gerrit Botha, Sumir Panji, Scott Hazelhurst, Nicola Mulder URL: http://doi.org/10.5334/dsj-2019-047
	Title: Analysis of Rainfall and Temperature Data Using Ensemble Empirical Mode Decomposition Author: Willard Zvarevashe, Symala Krishnannair, Venkataraman Sivakumar URL: http://doi.org/10.5334/dsj-2019-046
	Title: Policy Needs to Go Hand in Hand with Practice: The Learning and Listening Approach to Data Management Author: Maria Cruz, Nicolas Dintzner, Alastair Dunning, Annemiek van der Kuil, Esther Plomp, Marta Teperek, Yasemin Turkyilmaz-van der Velden, Anke Versteeg URL: http://doi.org/10.5334/dsj-2019-045
	Title: The Australian Research Data Commons Author: Michelle Barker, Ross Wilkinson, Andrew Treloar URL: http://doi.org/10.5334/dsj-2019-044
	Title: The Impact of Targeted Data Management Training for Field Research Projects – A Case Study Author: Jonathan L. Petters , George C. Brooks, Jennifer A. Smith, Carola A. Haas URL: http://doi.org/10.5334/dsj-2019-043

July 2019: Publications in the Data Science Journal

July 2019: Publications in the Data Science Journal

	Title: Real Estate Evaluation Model Based on Genetic Algorithm Optimized Neural Network Author: Yan Sun URL: http://doi.org/10.5334/dsj-2019-036
	Title: Abnormal Pattern Prediction: Detecting Fraudulent Insurance Property Claims with Semi-Supervised Machine-Learning Author: Sebastián M. Palacio URL: http://doi.org/10.5334/dsj-2019-035
	Title: A Regional Project in Support of the SADC Cyber-Infrastructure Framework Implementation: Weather and Climate Author: Mary-Jane Morongwa Bopape , Happy Marumo Sithole, Tshiamo Motshegwa, Edward Rakate, Francois Engelbrecht, Emma Archer, Anneline Morgan, Lwando Ndimeni, Joel Botai URL: http://doi.org/10.5334/dsj-2019-034
	Title: Designing Transnational Hydroclimatological Observation Networks and Data Sharing Policies in West Africa Author: Seyni Salack , Aymar Bossa, Jan Bliefernicht, Sina Berger, Yacouba Yira, Kamil A. Sanoussi, Samuel Guug, Dominicus Heinzeller, Adolphe S. Avocanh, Barro Hamadou, Symphorien Meda, Belko A. Diallo, Igor B. Bado, Inoussa A. Saley, Elidaa K. Daku, Namo Z. Lawson, Aida Ganaba, Safiétou Sanfo, Koufanou Hien, Arone Aduna, Gero Steup, Bernd Diekkrüger, Moussa Waongo, Antonio Rogmann, Ralf Kunkel, John P. A. Lamers, Mouhamadou B. Sylla, Harald Kunstmann, Boubacar Barry, Laurent G. Sedogo, Christian Jaminon, Paul Vlek, Jimmy Adegoke, Moumini Savadogo URL: http://doi.org/10.5334/dsj-2019-033
	Title: An Automated Machine Learning Based Decision Support System to Predict Hotel Booking Cancellations Author: Nuno Antonio, Ana de Almeida, Luis Nunes URL: http://doi.org/10.5334/dsj-2019-032
	Title: Indigenous Data Governance: Strategies from United States Native Nations Author: Stephanie Russo Carroll, Desi Rodriguez-Lonebear, Andrew Martinez URL: http://doi.org/10.5334/dsj-2019-031
	Title: Building Open Access to Research (OAR) Data Infrastructure at NIST Author: Gretchen Greene , Raymond Plante, Robert Hanisch URL: http://doi.org/10.5334/dsj-2019-030
	Title: The Landscape of Rights and Licensing Initiatives for Data Sharing Author: Sam Grabus, Jane Greenberg URL: http://doi.org/10.5334/dsj-2019-029
	Title: Data Sharing Practices among Researchers at South African Universities Author: Siviwe Bangani, Mathew Moyo URL: http://doi.org/10.5334/dsj-2019-028

Publishing an article in CODATA Data Science Journal

This article was first published by Ms. Neema Mduma https://neylicious.github.io/ml/2019/05/11/paper.html – Neema is an alumni of the CODATA-RDA School of Research Data Science.

In early 2017, I was privileged to work as a researcher in the Dropwall project (by Rose Funja) which was among the winning project of the Data for Local Impact Innovation Challenge (DLIIC). The main focus of the project was to develop a tool that will help fighting dropout among secondary school girls. The findings from this project show a high rate of dropout among secondary school students particularly girls, and coincide with reports from other studies which show that school dropout is a big challenge in developing countries. On addressing this problem, machine learning techniques has gained much attention in recent years. However, most of the work has been carried out in developed countries, there are only a handful of studies conducted in developing countries on school dropout using machine learning techniques with the consideration of local context and data imbalance problem. This motivated me to continue working (in my PhD) on school dropout using machine learning.

In August 2018, I attended a CODATA-RDA Research Data Science Summer School which was held at the Abdus Salam International Centre of Theoretical Physics (ICTP) in Trieste, Italy. The aim was on building competence in data analysis and security for participants from all disciplines and backgrounds from Sciences to Humanities. The level of engagements and interactions between participants and instructors was outstanding. We were introduced to various opportunities (by The Executive Director of CODATA, Dr. Simon Hodson) such as CODATA Data Science Journal where I later managed to publish the breathtaking findings from the Dropwall project titled A Survey of Machine Learning Approaches and Techniques for Student Dropout Prediction.

June 2019: Publications in the Data Science Journal

June 2019: Publications in the Data Science Journal

	Title: Developing a Model Guidelines Addressing Legal Impediments to Open Access to Publicly Funded Research Data in Malaysia Author: Haswira Nor Mohamad Hashim URL: http://doi.org/10.5334/dsj-2019-027
	Title: Proposed Guideline for Minimum Information Stroke Research and Clinical Data Reporting Author:Judit Kumuthini, Lyndon Zass, Melek Chaouch, Michael Thompson, Paul Olowoyo, Mamana Mbiyavanga, Faniyan Moyinoluwalogo, Gordon Wells, Victornia Nembeware, Nicola J. Mulder, Mayowa Owolabi, URL: http://doi.org/10.5334/dsj-2019-026
	Title: A Column Styled Composable Schema Matcher for Semantic Data-Types Author: Xiaofeng Liao, Jordy Bottelier, Zhiming Zhao URL: http://doi.org/10.5334/dsj-2019-025
	Title: Importance and Incorporation of User Feedback in Earth Science Data Stewardship Author: Hampapuram Ramapriyan, Jeanne Behnke URL: http://doi.org/10.5334/dsj-2019-024
	Title: Establishing, Developing, and Sustaining a Community of Data Champions Author: James L. Savage, Lauren Cadwallader URL: http://doi.org/10.5334/dsj-2019-023
	Title: The Definition of Reuse Author: Stephanie van de Sandt, Sünje Dallmeier-Tiessen, Artemis Lavasa, Vivien Petras URL: http://doi.org/10.5334/dsj-2019-022
	Title: Geoscientists’ Perspectives on Cyberinfrastructure Needs: A Collection of User Scenarios Author: Karen I. Stocks, Sam Schramski, Arika Virapongse, Lisa Kempler URL: http://doi.org/10.5334/dsj-2019-021
	Title: Data Distribution Centre Support for the IPCC Sixth Assessment Author: Martina Stockhause, Martin Juckes, Robert Chen, Wilfran Moufouma Okia, Anna Pirani, Tim Waterfield, Xiaoshi Xing, Rorie Edmunds URL: http://doi.org/10.5334/dsj-2019-020