Note: This is the second blog of a series of blog posts on data availability within the context of TCdata360, wherein each post will focus on a different aspect of data availability. The first blog post can be viewed here.
With open data comes missing data. In this blog series, we hope to explore data availability by looking at it from various perspectives within the context of the TCdata360 platform: by country, dataset, topic, and indicator.
In our previous blog post, we took a look at the country-level data availability over time through an interactive motion bubble plot inspired by the famous Gapminder visualization. In this follow-up post, we’ll still look at data availability from a geographical lens – but now looking into country classifications and other details that aren’t evident in a bubble plot, as well as the data availability leaders and laggers over time.
Overall Data Availability Leaders and Laggers
First, let’s focus on comparing individual countries to get a better sense of country-level differences in data availability. We computed for each country’s overall data availability by taking the median data availability across all years (1955-2016). Looking at the top 20 and bottom 20 countries in terms of overall data availability generates a few interesting patterns.