TWAS Fellowship for Research and Advanced Training: Deadline 1 October

TWAS Research and Advanced Training Fellowship: 2017 Call for Applications 

TWAS, the academy of sciences for the developing world, is now accepting applications for the TWAS Research and Advanced Training Fellowship programme.

The fellowships are offered to scientists from developing countries and are tenable at centres of excellence in various developing countries.

Eligible fields include one or more of the following: agricultural and biological sciences, medical and health sciences, chemistry, engineering, astronomy, space and earth sciences, mathematics and physics.

Please see for the latest information regarding the above programme, including eligibility criteria, guidelines, etc.

Women scientists are especially encouraged to apply. The closing date is 1 October 2017.

Governance of domain specific data and metadata standards to support FAIR Data

By: Xiaogang (Marshall) Ma

On May 26, 2016, I attended the Workshop on Research Data Management at the 2016 Annual Meeting of the American Crystallographic Association, New Orleans, LA, USA and gave a talk on Open Science, FAIR Data and Data Standards.

The workshop was organized by the International Union of Crystallography (IUCr)’s Diffraction Data Deposition Working Group (DDDWG), and was co-chaired by John R. Helliwell and Brian McMahon, who are the DDDWG chair and the IUCr CODATA representative, respectively. The workshop had two plenary sessions: (1) What every experimentalist needs to know about recording essential metadata of primary (raw) diffraction data and (2) Research Data Management policy mandates and requirements on Principal Investigators (PIs). It also included a technical session on high-data-rate/high-performance-computing issues of research data management for MX. The first plenary session was closely related to the efforts within DDDWG, and the second session covered broad topics on open science trends, open data mandates, best practices and success stories. The technical session featured demonstrations of state-of-the-art progress from industry.

My 30-minute talk was in the second plenary session. The talk was originally intended to be given by Simon Hodson, CODATA executive director. Due to a travel schedule issue, he could not make it, but he helped provide the main body of the presentation slides. For me this was also a nice opportunity to refresh my knowledge about open science, FAIR Data, data standards and CODATA’s many activities in relation to these issues. I especially enjoyed introducing a slide in which Simon put together the historical events of the policy push for Open Access, Open Data and Open Science. To explain the slide in detail I also did some background study. For example, the three B activities (Budapest, Berlin and Bethesda) during 2002-2003 are well known for promoting Open Access. We can see the significant increase in the number of open access publications since then. Then, how about Open Data and the efforts ongoing now, such as FAIR Data? Can we foresee that after 10 or 15 years there will be positive results similar to Open Access? To achieve that, more effort is needed from all the stakeholders, including every one of us. Within CODATA I have been working together with Dr. Lesley Wyborn and other colleagues in a Task Group that aims at surveying and coordinating data standard efforts amongst scientific unions.

During the past months, our Task Group has been contributing to efforts led by CODATA to broaden inter-union coordination and collaboration. Besides giving the talk, another role for me at the New Orleans workshop was to set up deeper connections between IUCr and CODATA. IUCr has done excellent work on data standards and open data. It is also one of the first scientific bodies that endorsed the Science International Accord on Open Data in a Big Data World. IUCr also published a position paper as a response to the accord. Prof. John Helliwell will be the IUCr representative to attend the Inter-Union Workshop on 21st Century Scientific and Technical Data – Developing a roadmap for data integration. The workshop is sponsored by CODATA’s new Commission on Data Standards for Science and will take place in Paris, France on 19-21 June 2017. The workshop’s purpose is to share details of our data and information activities, agree on good practice, and seek consensus about how unions and disciplinary groups can best work together in establishing a global network of scientific research data that is consistent with the four principles of FAIR Data – i.e., that data produced by research and for research should be Findable, Accessible, Interoperable and Reusable. Based on the outputs of the workshop, a substantially larger workshop or conference will take place in late 2017 or early 2018 to discuss the potential and scope of a broad coordinated effort across the scientific community and the establishment of an ICSU and CODATA Commission as part of a decadal initiative to promote the data standards necessary for inter-disciplinary research, including research that addresses the priority global challenges.

Humans of Data 16

“I wonder whether any ethics are applied in collection of samples of Ebola and HIV/AIDS in emergency situations. When I talk to doctors about it, they are aware that some researchers from the developed world provide expertise and fund research in pandemic situations. But there are issues on data collection ethics based on informed consent by subjects that deserve scrutiny, given the emergency situations and language barriers under which data is collected. Are there Memoranda of Understanding between African governments and researchers under these conditions? There is a need for transparency and openness. Given the extensive ethical regulations for research on human subjects in the developed world, African countries – which are prone to pandemic disease problems – must engage in the discourse on ethics of data collected under the unique situations that they experience.

It’s the same in the humanities and social sciences: researchers come and take and go.  It is rare for research projects to include funding in the initial project proposal for reporting back to the subjects of the research.  In Botswana, there was a national scan for indigenous knowledge.  We were promised there would be a report back [to the community], but the [research team] never came back.  And then researchers are surprised that participants don’t trust them!

Colonialism was first about land resources.  Now, without open access, globalisation of research may become the next wave of colonisation.  Lower and middle income country researchers need to engage in open debates among themselves on the ownership of data, and how to develop collaboration from collection to analysis with a view to facilitating shared benefits and innovative re-use.  Only in this way will the issues of intellectual property rights be negotiated in an equal exchange.  All researchers – but especially Africa’s researchers – should reflect on the necessary policy and regulatory frameworks that should be negotiated with local institutions and national governments, as part of their intellectual contributions to evidence-based solutions and sustainable development.

Openness is about exposing your strengths and weaknesses.  No one should be intimidated that some have more money.  Others have ideas.”

Humans of Data 15

“I’m very passionate about open science. From where I stand in the context of Africa, there’s so much data we create in government, universities or communities. But at the moment, data, which is the base for reports and provides evidence for government decisions, is not accessible to all except the researcher and specialist research reference committees. The Botswana Government has a closed research culture, as does the local research community within the academic and private sector circles. As a librarian, that has always been my concern. When work is done in that way, you find that the resultant data is archived and owned by the funding authority.

The current system is dysfunctional due to a lack of regulatory mechanisms, appropriate follow-up processes and systems for creating national open databases. Without reliable databases for research reports, research data cannot be open or accessible.”

Humans of Data 14


“My background is in the political and social sciences, and I can connect with a lot of the ethical issues, the equality issues, making data sharing genuinely equal here and in other countries – that’s important to me.

I get a kick out of helping researchers. Working in a library context I was helping a student find an article they needed. Now I can sit down with a researcher and provide reassurance on their data management plan and that’s important too.

Having enough technical knowledge so that you can understand what’s going on, and also using liaison skills – I really like that combination.  There’s a whole community at the university that is interested in open data, there are all these people who are really excited about it.  I’m excited because there’s a community who is excited about the same things I am.  It’s good that everyone’s still working out solutions for data sharing; you want to help build these resources, and a culture change.

But in the social sciences and humanities, we need to recognise we actually have data.  There needs to be a default to open, which relies upon a change in culture and policy.  We can archive some of these datasets through liaison with young researchers and learning how researchers work.  Young researchers are going to be sustaining the effort, but this needs everyone’s participation.”

International Training workshop on Big Data for Science and Sustainability

Opening ceremony of training workshop

The International Training Workshop on Big Data for Science and Sustainability in Developing Countries, a joint action of CODATA PASTD and IGU, was successfully held on 17-19 March 2017 in Hyderabad, India. The training workshop was an academic event of the Xth IGU International Conference on “Urbanization, Health & Well Being and Sustainable Development Goals”. Supported by the International Geographical Union (IGU) and the Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences (IGSNRR, CAS), the Hyderabad training workshop is one of CODATA PASTD’s three capacity building activities in 2017. Other training activities will be held in Madagascar in September and in China in November.

The training course introduces young scientists to the ideas of open data, data sharing and data publication. The training also covers Big Data, data analysis and applications in order to develop skills as ‘data scientists’. The three-day training workshop included lectures and hands-on practice, aimed at developing the skills and capacity necessary for preservation of and open access to research data in developing countries.

Prof. R.B. Singh, Vice President of the International Geographical Union (IGU), delivered an opening speech

Prof. R.B. Singh, Vice President of the International Geographical Union (IGU) and Co-Chair of Strategy and Policy Sub-group of CODATA PASTD, and Yukio Himiyama, President of the International Geographical Union, attended the opening and closing ceremonies respectively. Fifty-six students from 13 universities in India attended the training courses. CODATA PASTD member, Dr. Yunqiang Zhu, Co-chair of Capacity Building Sub-group of CODATA PASTD and professor from IGSNRR, CAS, and Dr. B. Srinagesh from Osmania University organized the training as co-chairs. Chinese scientists worked along with Indian colleagues to give courses on open Big Data discovery, data publication and sharing, the Indian Earth observation system, geospatial data interoperability, geospatial data infrastructure and data sharing principles.

Yukio Himiyama, President of the International Geographical Union (IGU), awarded certificates to the trainees

Participation in the training workshop was active and enthusiastic, and students reported that they found it beneficial. Professor R.B. Singh and Dr. V. Raghavaswamy, Deputy Director of the National Remote Sensing Centre, India, expressed their hope that the PASTD training course will continue in future and cultivate a new generation of young data scientists with growing awareness of developments in data science and the benefits of international cooperation.

The closing ceremony of the training workshop

Humans of Data 13

“Science is about discovering that things aren’t as you expected. The more I learn, the more I realise I don’t know. One of the fun things about what I do just now is that I get to see a lot of different research communities and how they conceive of and represent data, and what data mean to them.

There are really a lot of different discipline-oriented communities. I come from a domain repository – we just called it a data center – and for me, it’s interesting coming from that environment as opposed to the library, institution, repository, or iSchool environments, who are dealing with very similar issues and approaching them with different perspectives.

I do think in some areas there is emerging consensus and that’s exciting to see. The very fact that everyone accepts PIDs on data, that’s almost universal, we might argue about which one, but the strong consensus is that there should be something.  We’re seeing greater convergence about metadata standards, too, particularly in my field.  I think we’re getting better at listening to each other from different domains – historians and ecologists discover they have the same data problems.  This makes them feel they’re not alone but also that their problems are generic and can have common solutions.  There is a community.  When I first started at a data center 25 years ago, I’d be the data person at science conferences. That’s not the way it works any more.

We are in dire times just now. We seem to be in an age of growing authoritarianism, and some people are trying to pretend there isn’t evidential knowledge. This makes research all the more important. Data sharing, open knowledge, open data, it’s more important than it’s ever been.”

Humans of Data 12


“My first job after university, I was doing computer stuff in a medical research place. I got a reputation as someone who was good at rescuing things off of old tapes and punchcards.  It had been expensive to collect that data, and people had sometimes suffered in providing it. But it was also a detective job and it was important.  But it was disappointing (though great for me professionally) when years later, I could come back into the field, and the sense of what was wrong then was still there. We still lose data because it’s on some piece of media that someone neglects or we’ve lost the documentation. Or we lose it because nobody knows where it is. If we don’t know it exists someone goes and repeats the work.

Now being able to work with this community of other people is great, making sure that stuff that could be of value in the future gets kept – it matters in lots of ways.  It matters because it saves us money, and that is important because it’s our taxes. And it matters because collectively as a society we’ll learn stuff from it: data can help prevent disasters, it can help improve crops, and many other important things in society.

This community is important to what I do every day. The only negative thing is that it gives you the sense of too many possibilities.  And you think, ‘Yeah, I can help you do this thing’.  And you don’t have time to do it all, which can be a crushing disappointment.  But it’s so nice to learn a bunch of things, and it’s an embarrassment of riches – things you can go and do, people you can collaborate with.  My job is often telling one group of people, ‘Hey, you should know about this other group’.  If that helps someone to reach out and collaborate, I feel like I’ve done something positive.”

Humans of Data 11


“I really love this group of people who work on data management and sharing.  I’m excited to be part of this very welcoming community.  I never experienced this elsewhere – it’s very nice to collaborate, to network. People are really happy to do work voluntarily. They are people who want to do not just their day to day job, but to change the world!”

Humans of Data 10


“Helping researchers to manage and share their data is what really motivates me. I was a researcher before, and much of research is not shared because the only incentive is to publish in ‘high impact factor’ journals. Nobody cares about what you’ve found out as an early career researcher, unless it’s published in a ‘high impact factor’ journal. I want to share more of the science of discovery. I love contributing to this change.

Data sharing is such an important part of opening up science. What’s really rewarding is when you explore with researchers how they can open up their research. People get a sparkle in their eyes. For me to get one convert really matters.  That’s what I’m most happy about.

It’s really important to understand the people you’re speaking with, to have this connection. There is never enough talking and advocacy, having a personal connection and understanding their motivation. That can’t be solved by any technical solution. It’s social change, cultural change. I strongly believe as an ex-scientist that it’s so important to change the reward system for research. It’s got to be transparent and get beyond only valuing what’s in the ‘high impact factor’ journal.”