https://defradigital.blog.gov.uk/2016/02/18/feeding-the-hunger-for-data/

Feeding the hunger for data

This week we have published a collection of datasets from the National Food Survey, covering 1974 to 2000, as open data.

This rich data collection comprises food diary records from approximately 150,000 households across the 70s, 80s and 90s, who were asked to keep 7-day diaries of the food and drink they brought into their homes. This data has historically been used to produce official government statistical estimates of average expenditure, purchases and derived nutrient intakes for the population of the country. Understanding the nation’s buying and eating habits has been a key piece of evidence to support Government policy since World War 2, and continues to be so, albeit for reasons very different from those at the start.

The data you can now download as open data provides lots of information about how British families’ diets have changed over time. While we’ve had to remove some detail to make it safe for publication as open data, there are still lots of fascinating insights about the households who participated: who kept hens, who owned freezers or microwaves!

For me, as the head of the statistics team responsible for the survey we now call Family Food, it’s been a journey of discovery. I am conscious that I’m just the current custodian of a series which began in 1940. Uniquely in the Civil Service, under the Statistics Code of Practice, we statisticians have to publish our contact details on our statistical products, and we are acutely aware of our watchdogs in the Information Commissioner's Office, the UK Statistics Authority and elsewhere. Our names are on our data outputs and there are potentially severe consequences for us (and irreversible damage to public trust) if we break the rules on disclosure and confidentiality. However, our Code also tells us to publish as much as we possibly can - we always want people to use and engage with our data. So going beyond statistics into open datasets has been an exciting step forward for the team.

Our challenge

Our task has been to maintain confidentiality whilst maximising the value of our data for users. We’ve done much more than just prepare this data for publication: the small cross-department project team working on this has established a robust internal procedure which will inform and support future Defra activities in this area. There have been missteps and delays along the way, but the path should be clearer for those who follow us. Alongside the data release, we have published a privacy impact assessment (PIA) which documents some of this process. We’ve also created a commentable version of our PIA so that you can give feedback on what else we might do in future.

The Secretary of State for Defra has set a challenge for the department - to become a more open, collaborative and data-driven organisation - and a benchmark for Whitehall. It’s exciting for a statistician to see data and evidence given such a high profile in this way. It presents an opportunity and a test for us, and for me personally it has been a great learning experience.

Defra holds lots of data that can and should be published openly - it helps us do our work, and might have value for other people too. There’s also the data that we simply have to keep closed, because it’s sensitive and needs to be kept secure. And in the middle, Defra has lots of data that sits in a bit of a grey area - there’s a risk that, if combined with other data, it might enable people to be identified, or it might contain data provided by third parties. The Family Food Survey data falls into this grey area, and so we’ve introduced new processes and taken extra care to try to make available a version of the data we hold that is both safe and usable. Publishing anonymised data can feel scary, but we have expertise to draw upon and a well-established framework to define the safe limits we can work within. We can meet this challenge.

The future

For users out there, you get access to a version of the longest-running continuous household survey of its kind in the world. The survey results are already published annually as statistical datasets, but access to the underlying diary data itself might open up new kinds of uses. We have already heard of possible applications that people are interested in: schoolchildren could keep their own food diaries and then compare them with actual historic survey data as part of Science or Design and Technology lessons. Or local history groups could scour the datasets for interesting local data. Or something else totally out of the blue. The exciting thing is that we just cannot predict what you’ll do!

For a survey which started during WW2, this is not the end. It is not even the beginning of the end. But it is, perhaps, the end of the beginning (sorry, Winston!). As I noted in my first post, there are two other phases to go, and not even this stage is over for good. We have had to make judgement calls in treating this data for release, and so some information has been removed. The untreated data is still available under a more restrictive licence via the UK Data Service.

Get in touch

We want to hear what you are doing with the data, and if there is more you want from it.

So, please get in touch:

Leave a comment