[CKAN Monthly Live] A recap from our November meetup #5
This month’s CKAN Monthly Live highlighted The Pacific Data Hub - a central repository of reliable, current Pacific data. PDH provides authoritative data on a wide range of topics, from education, fisheries, and geoscience to energy, the environment and economic development.
ckan, CKAN instances, open data portal, ckan monthly live
20 Nov 2021
This month’s CKAN Monthly Live highlighted The Pacific Data Hub - a central repository of data about the Pacific and from the Pacific. The platform serves as a gateway to the largest and most diverse collection of data and information about the Pacific across key areas including population statistics, fisheries, geoscience, agriculture, aquaculture, energy, education, human rights, climate change and oceans. The Pacific Data Hub is set up in a way that it allows Pacific countries to own and access the data as their own. Any government, donor, academic, or member of the community can both store, and access, good quality and reliable data.
We had a presentation from The Pacific Community (SPC) who talked about some of the good practices when building the Pacific Data Hub on a modern platform to help streamline data delivery to the public and showed us an overview of some datasets and a demo of how it all works in real. Sioeli Tonga and his colleagues Phil Bright and Stanislas Ozier shared some interesting insights about their data journey with CKAN and their experience in trying to establish a sustainable data infrastructure. For them CKAN was a proven data management platform with an active development community and a pool of development agencies who are ready and willing to help them get the ball rolling.
The Pacific Community (SPC) has gathered 70 years of data. That’s a lot of data! It’s been stored in different platforms, databases, systems or tools like Indicator Database, Microdata Library, etc. The problem was that the SPC was not the only one holding data in the Pacific - there were also other organizations having their own platforms and systems. So the main idea behind the Pacific Data Hub was to provide an entry point to all this data. What was the best way to gather all this data together? Of course, CKAN. CKAN was the best way to collect all the data they already had through harvesters, provide a Data store for new data that was coming in and provide a Search. On top of that they developed some dashboards based on specific topics such as SDG, maritime boundaries, education, etc. that can get the data from CKAN or sometimes directly from the other systems:
The Pacific Data Hub has five components, specially designed for functionality and ease of navigation:
the data catalogue - that has the largest central data and publications repository for the region
the SDG Dashboard - that holds country-specific data on the 132 Pacific Sustainable Development Indicators selected by the region
PDH.stat - a data explorer for country-specific development indicators and for official statistics.
The Microdata Library - a gateway to the Pacific region’s survey, census, and administrative-based microdata and documentation.
the PacificMap - a spatial data explorer that allows for visualisation of geographic data.
Some other interesting topics the SPC touched upon were connected with the main challenges the team had to face:
metadata quality was not great (harvested and manually published data)
publication workflow had to be improved
standardized schemas had to be implemented
monitoring and maintenance of harvesters was not easy (especially when the resources that were harvested from change)
had to define meaningful topics
usage analytics - not only gathering user statistics but actually analyzing it
Beyond the technology challenges, engagement with stakeholders, establishing and maintaining trust, and giving them confidence in what they were doing has been a real challenge for the SPC .
The SPC are continuously improving the Pacific Data Hub adding new features and usability improvements such as DOI and citations, trying to improve visioning and user documentation especially for publishers.
About the project
The Pacific Data Hub is a result of almost two years of working round the clock to create the central repository of open data and knowledge products about the Pacific region. After a successful testing phase, on 1 December 2020 the full production phase was launched. PDH is developed as a CKAN+Drupal installation that seamlessly combines both platforms into a best-of-both-worlds CMS plus open data platform solution. It is hosted on an AWS infrastructure and optimised for CKAN+Drupal deployments. This foundation supports custom UI and data visualisations to provide an open data portal that is both beautiful and highly functional to the public.
About the Pacific Community
The Pacific Data Hub is an innovative programme of work, led by the Pacific Community-SPC and supported by the New Zealand Ministry of Foreign Affairs & Trade. The Pacific Community (SPC) has been a custodian of Pacific data for decades and has been charged with a critical role in the compilation, updating and dissemination of the Pacific Sustainable Development indicator to assist countries in meeting the Sustainable Development Goals reporting requirements. The PDH is part of an emerging Pacific Data Ecosystem, a partnership between Pacific Island Countries and Territories, SPC and the Secretariat of the Pacific Regional Environment Programme to promote greater coordination in data management, dissemination and uptake initiatives. All members and development partners work together to strengthen data management capabilities, develop capacity and improve data literacy within the Pacific region, ensuring that all data resources are well-managed, shared responsibly and used ethically.
We are not going to have a meetup in December due to Christmas and holidays. Our next meeting will be on 19 January 2022! You can sign up here to be notified for future meetups. We’ll announce the topic very soon! We look forward to seeing you all on CKAN MonthlyLive!
This month’s CKAN Monthly Live was about harmonising health data around the globe. We had Fjelltopp - a technical consultancy building and maintaining CKAN instances for the United Nations AIDS relief fund (UNAIDS) tak about the AIDS Data Repository project that aims to improve the quality, accessibility and consistency of HIV data and HIV estimates by providing a centralised platform with tools to help countries manage and share their HIV data.
This month’s CKAN Monthly Live highlighted the Open Data Portal of The NHS Business Services Authority (NHSBSA). Our speakers - Nick Greenley and Craig Anderson - told us more about the main challenges they stumbled upon when trying to make more of their data open, encourage use and re-use while protecting the confidentiality of patients and keeping their data