Support Us

You are browsing the archive for Events.

Next meetup: previewing with Recline, 31 May

May 24, 2012 in Events

The next online community meetup will be on CKAN’s Recline Data Viewer, on Thursday 31 May at 5pm UK time. Recline lets users do simple data exploration from within CKAN, including graphs and map views, without having to download the data. See this recent blog post on its latest features.

Anyone with an interest in data previewing is welcome to attend – whether you have questions, points for discussion or would just like to listen in. To sign up, add your Skype ID to the Etherpad here.

Cities, CKAN and Open Data

May 1, 2012 in Events

A quick note about the Open Data Cities Conference in Brighton a couple of weeks ago. I attended with my colleagues Ira, who’s CKAN’s Product Manager, and Laura James, who was one of the many excellent speakers. My write-up of the day is over on the Open Knowledge Foundation blog – it was an interesting day so take a look. The distilled essence is that there is a lot of interest in Open Data at the city level at the moment.

By good timing, CKAN’s entry to the Living Labs award was a proposal for an Open Data package for cities with CKAN at its heart, and is one of the shortlisted entries – final announcements are due later this week!

News from the CKAN team 19 April 2012

April 19, 2012 in Deployments, Events, Extensions, Feature, News

Here’s an update on what the CKAN team have been doing lately. It’s been a while since the last one and we’ve been busy, so there’s plenty to report.

Features

  • Adrià has implemented a great map view in Recline, CKAN’s built-in data viewer. If some structured data resource contains latitude and longitude information, this will enable it to be viewed on a map from within CKAN. A sneak preview is here (select the ‘map’ view).

  • Ross has done some work on a ‘Related Stuff’ extension, to allow you to link a dataset to other things (an app, a news article, etc) that use it. (We’re not sure about the name – suggestions welcome!)

  • Ross has been improving CKAN groups (which are becoming ‘Organizations’). These will link together both users and datasets to give fine control over access to datasets within a group, and enable better workflow for data publishing organisations.

  • David has finished working on a CKAN extension for Google Docs, allowing users to upload, download and edit CKAN resources from a Google Docs spreadsheet. More info on this coming soon!

  • Ian has been getting ATOM feeds working, enabling users to get updates on changes to datasets or groups of interest.

CKAN sites

  • Adrià’s also fixed some problems with the IATI extension, used to run the aid data registry by the International Aid Transparency Initiative.

  • We’ve done various bits of work for the forthcoming EC portal. Among other things, Toby has fixed some internationalisation issues, John has worked on the theme and user interface, and Ross has worked on importing and exporting records in RDF format.

Other news

  • Mark and Ira went to the plenary meeting in Vienna of the LOD2 project – a research and development project on Linked Data, of which CKAN is a small but important part. There is a post on the overview session on the OKFN blog.

  • The next release of CKAN will be version 1.7. We hope to be able to release it on 2 May.

CKAN @ Open Data Cities conference

April 16, 2012 in Events

The first Open Data Cities conference takes place this Friday, 20 April, in Brighton – and the Open Knowledge Foundation is one of the organisations supporting it. Cities are the new frontier for Open Data – there is more and more government data online, but a wealth of data at city level whose release will unlock huge potential. In the organisers’ words, the conference will focus on how to “engage with citizens to build more creative, prosperous and accountable communities”.

Greg Hadfield, founder of Open Data Brighton & Hove, has put together a terrific schedule, including a talk from the Open Knowledge Foundation’s own Laura James. The many other excellent speakers include Emer Coleman of the Government Digital Service, Ian Holt from Ordnance Survey, and MySociety’s Tom Steinberg.

Tickets are still available – and OKF supporters can get a 25% discount by using the top secret code ‘OKFN’ on the booking page. If you attend, be sure to come and chat to us at the CKAN stall, ask questions, tell us your ideas, or give us feedback (or chocolate).

Online meetup: Drupal integration, 26 April

April 12, 2012 in Events

The next CKAN online meetup will be on Drupal integration, on Thursday 26 April at 5pm UK time. Anyone who is interested is welcome to attend.

Drupal is a powerful and very widely-used content management system. It’s already integrated with CKAN’s data management system in some deployments, notably at data.gov.uk. What needs to be done to make this easier to replicate? What features would help? Come along to discuss, add your ideas or just listen.

To attend the meeting, sign up with your skype id on the etherpad here. Please also add any specific topic areas you’d like to discuss!

Online meetup this week: CKAN groups

March 26, 2012 in Events, Feature

This Thursday’s online meetup (29 March, 5pm UK time) will be on CKAN Groups. A lot of ongoing development work has gone into improving the functionality and interface of groups, especially with the publisher profile – see here. ‘Publishers’ are being renamed as ‘Organizations’.

Organizations allow you to group your users and datasets together so that they are easier to manage. The visibility of datasets within an Organization can be controlled to limit access to members of that Organization or to make it public.

Anyone is welcome to attend the meeting – to find out more, ask questions, provide input or feedback or just listen. If you’d like to attend, please sign up on the Etherpad here.

CKAN at the LOD2 plenary

March 24, 2012 in Deployments, Events

I am in Vienna, where the plenary meeting of partners on the LOD2 project finished yesterday afternoon. LOD2 is a project funded by the European Union to develop ways of working with Linked Open Data. The Open Knowledge Foundation is one of the partners and has built the CKAN-powered publicdata.eu as part of the project. publicdata.eu aggregates metadata from data catalogues all over Europe, using CKAN’s harvesting and metadata import and export. Rufus recently posted here slides from the presentation he gave on publicdata.eu at the ePSI conference last week.

I’ve written a post on the Open Knowledge Foundation blog about the opening session of the LOD2 plenary on Wednesday afernoon, where all the different parts of the project gave a presentation of their work and their current state of play. It was an interesting overview of the LOD2 project and of the state of Linked Data technology in general.

Online meetup this week: catalogue metadata standards

March 12, 2012 in Events

This week’s CKAN online meetup will be on catalogue metadata standards. It will be on Thursday (15 March) at 1700 GMT (5pm UK time, 6pm Central European time). Anyone is welcome to attend – details below.

Data catalogues are springing up everywhere these days. This profusion of available data is great, but to maximise its potential, catalogues need to be able to share metadata – thus enabling, for example, searching across multiple catalogues. DCAT is one emerging standard for catalogue metadata, based on standard Linked Data classifications. What should be in such a standard and what should CKAN be doing about it?

How to attend

If you are interested in attending, please sign up on the etherpad. If you’ve any particular questions you’d like to talk about, note them down there, too.

Past and future meetings

Notes from the last meetup, on CKAN’s Datastore, are in the etherpad here. Thanks to those who attended this very interesting meeting.

After this week, the next meeting will be on 29 March, on groups in CKAN and the Datahub. A group is a curated collection of datasets, and there have been a number of new features and improvements for group maintainers recently. As usual, all are welcome!

Online Community Meetup: Data Stores and Data APIs

March 1, 2012 in Events, News

Today’s online meetup will be on the topic of Data Stores and Data APIs in relation to CKAN and will discuss specifically CKAN’s current Data Store and Data API. All are welcome.

  • When: 5-6pm GMT
  • Where: Via Skype – add your name on the etherpad here.
  • Subject: CKAN’s DataStore and Data storage capabilities – details below.

Previously, data indexed on CKAN was usually stored offsite (e.g. on the publisher’s own site). For over a year CKAN has had general (“blob”) storage capabilities. Over the last 6 months CKAN has also developed a DataStore in which structured data can be stored. The DataStore provides a Data API, allowing the data to be queried, visualisations to be built on top, etc.

The API is used by the Recline Data Explorer which is integrated into CKAN and is deployed on thedatahub.org.

There are exciting developments under way in this area – if you’d like to talk or hear about them, or have suggestions, come to the meetup!

Data = Seized, Sanitised and Sanity-checked. Open Data Day 2011

December 12, 2011 in Events

This post is by Mark Brough, Research Officer at Publish What You Fund, Lucy Chambers, Community Coordinator for OpenSpending, and Irina Bolychevsky, Product Owner for CKAN. It is cross-posted on the OpenSpending Blog and the Open Knowledge Foundation Blog and Mark Brough’s contribution is also featured on aidinfolabs.org.

Saturday, December 3rd was Open Data Day, and London took the challenge to throw a hackday to help data be opened, cleaned and shown off to the world…

Fuelled only by enthusiasm, caffeine and 5 packets of ready-made popcorn, the CKAN, OpenSpending and IATI teams, along with some new faces, joined forces to liberate as much data as they could…

OpenSpending + IATI + CKAN

As part of the IATI Open Data Day challenges, Mark Brough did some work to get the existing IATI Data into OpenSpending. David Read, from the CKAN team, and a new face to the data wrangling crew, Johannes, scraped data on aid donations from France and Austria that were locked-up in web apps in order to help fill in the gaps in the global aid data jigsaw puzzle.

  • France:
  • Austria: (Now also loaded into OpenSpending!)

These, along with many other datasets discovered on the day via tweets and emails have been added to the Open Data Day Group on theDataHub.org.

You can see the results of the IATI wrangling process on OpenSpending.org/iati. This following section is written by Mark.

1. Getting the data

Downloading the existing IATI data has already become quite a big task; with 19 publishers so far, the data currently amounts to over 750MB with 1169 packages. Fortunately this is made easier by the IATI Registry, which provides an API to access all existing datasets, and a simple script (links at end) can retrieve all of the data.

2. Extracting the data

Extracting the data from the XML files is more complicated. Although IATI data uses a standard schema, there are a few cases where publishers have either used the markup incorrectly, or else interpreted the definitions slightly differently. This can be simple problems such as stating that an organisation is “implementing” rather than “Implementing”, or placing the date within the text of the tag and not the “iso-date” attribute of that tag, or more significant problems such as placing implementing organisations in the “accountable” organisation field.

However, these problems are still fairly limited and follow fairly regular patterns, so they are not too hard to overcome. There are more significant problems when some donors have for example used three-letter (ISO-3) country codes, rather than two-letter (ISO-2) country codes. (This is considered below in “next steps”.)

3. Wrangling the data

OpenSpending is designed to show spending data, and has a powerful aggregation system to show large collections of transactions in a meaningful way. However, IATI data is organised by activities, with transactions nested within activities (projects), and – reflecting the business models of funders – activities sit within other activities (e.g., projects within programs), although they are not nested in the actual XML. Furthermore, one of the significant advantages of IATI compared to other aid data formats is that it permits multiple sectoral classifications, allowing you to assign a proportion of the value of an activity to each sector. So, you might have an activity that is 50% related to health and 50% to education.

To prepare the data for OpenSpending, each transaction inherits the properties of its activity (and, if that activity has a parent, that parent activity’s title and description). Then, the transaction is broken out into mini transactions, with the proportion of the activity assigned to each sector used to assign a proportion of the value of the transaction to each sector. So, from transactions, you get mini “sector-transactions”.

This takes about 40 minutes to compile, and then one final step remains: to convert the currencies to a single currency. Currently, USD, EUR and GBP amounts are used in the IATI data. All data is converted to USD using the average for 2010 from the OECD’s Financial Indicators (MEI) dataset. (This is also considered below in “next steps”.)

4. Loading the data

OpenSpending’s new web-based loading interface makes it relatively easy to load data in, although you currently also have to write a model and views (links at end).

Results

The results can be viewed in the OpenSpending IATI dataset. You can explore the data by recipient country, sectors, funding organisation, and drill down through the data to see the data for an individual country.

Problems with the data

So far I’ve noticed the following problems:

  • “Unknown” recipient location is incorrectly marked as “South Sudan”
  • Recipient countries are listed twice, as Spain has used ISO3 rather than ISO2 country codes.
  • Sweden is listed as “Ministry of Foreign Affairs” (this is how they have listed themselves as the Funding Organisation in the data)
  • Sweden’s implementing organisations have been lost as they placed them in the accountable organisation field.

Please let me know if you see anything else problematic, if you have and criticisms of feedback of the way the data has been presented, or if you think there are other ways you’d like to be able to explore the data, based on the available dimensions.

Next steps

As mentioned above, there are some problems with the data which should properly be dealt with at the level of the donor agency. But there are others that will probably have to be dealt with by users of the data:

  • Mapping between different sector vocabularies, so that you can see all “Health” projects, and not only the health projects according to a single vocabulary
  • Mapping between countries and regions, so that every project in a country has a related region
  • Correctly converting currencies using the “value-date” column to get a more precise (at least month-specific) conversion.

What else have you noticed with the data? Is there anything else that should be changed? Anything interesting?

You can contact Mark about this data via the OpenSpending mailing list

Useful Links

Please create an account to get started.

Subscribe via email

Archives

Twitter

michelinecaseymichelinecasey: RT @HarlanH: #DataScience DC's event on Data Platforms is next Tuesday! http://t.co/2WFkFm0TIy #OpenData @CKANproject @Socrata 
17 hours ago
davefauthdavefauth: RT @HarlanH: #DataScience DC's event on Data Platforms is next Tuesday! http://t.co/2WFkFm0TIy #OpenData @CKANproject @Socrata 
19 hours ago
DataCommunityDCDataCommunityDC: RT @HarlanH: #DataScience DC's event on Data Platforms is next Tuesday! http://t.co/2WFkFm0TIy #OpenData @CKANproject @Socrata 
19 hours ago
HarlanHHarlanH: #DataScience DC's event on Data Platforms is next Tuesday! http://t.co/2WFkFm0TIy #OpenData @CKANproject @Socrata 
19 hours ago
amercaderamercader: Thanks to wonders of modern technology @shevski and I will be joining the "State of the Portal" panel at @foss4gna repr @CKANproject
2 days ago