Download presentation
Published byReynard Lloyd Modified over 7 years ago
1
Open data to fight corruption: Training for journalists
Turkey, 11 November 2015 Jean Brice Tetka Data and Technology Coordinator, People Engagement Programme
2
What is Open Data? Open data is data that can be freely used, modified and shared by anyone for any purpose.
3
key features of openness
Availability and access The data must be available as a whole and at no more than a reasonable reproduction cost, preferably by downloading over the internet. The data must also be available in a convenient and modifiable form.
4
key features of openness
Reuse and redistribution The data must be provided under terms that permit reuse and redistribution including the intermixing with other datasets. The data must be machine-readable.
5
key features of openness
Universal participation Everyone must be able to use, reuse and redistribute data. There should be no discrimination against fields of endeavour or against persons or groups.
6
Why open data? Transparency
In a well-functioning, democratic society citizens need to know what their government is doing. To do that, they must be able to freely access government data and information and to share that information with other citizens. Transparency isn’t just about access, it is also about sharing and reuse: to understand material it needs to be analyzed and visualised and this requires that the material is open.
7
Why open data? Releasing social and commercial value
In a digital age, data is a key resource for social and commercial activities. Everything from finding your local post office to building a search engine requires access to data, much of which is created or held by government. By opening up data, government can help drive the creation of innovative business and services that deliver social and commercial value.
8
Why open data? Participation and engagement
It is needed for participatory governance and for business and organisations engaging with your users and the audience. Much of the time citizens are only able to engage with their own government sporadically, maybe just at an election every 4 or 5 years. By opening up and communication data, citizens are enabled to be much more directly informed and involved in decision-making. This is more than transparency: it’s about making a full “read/write” society, not just about knowing what is happening in the process of governance but being able to contribute to it.
9
Why Open Data is important for journalists
Journalists can gather, filter and visualise what is happening beyond what the eye can see.
10
Why Open Data is important for journalists
Journalists can analyze the dynamics of a complex situation like riots or political debates, show the fallacies and help everyone to see possible solutions to complex problems.
11
Why Open Data is important for journalists
Journalists can write a strong, beautiful, impactful story and provide informative data visualisation.
12
Examples of articles using Open Data
Politicians’ health priorities: This article is a mix of text, statistics and a picture
13
Examples of articles using Open Data
Women and work: An article with some statistics and visuals
14
Examples of articles using Open Data
How long would it take you to earn a top footballer’s salary?
15
OPEN Data sets Available: TURKEY
16
National Data sets National Statistics
17
National Data sets Government Budget
hptt://
18
National Data sets Legislation
19
National Data sets Company Register
20
International Data sets
The World Bank
21
International Data sets
The Organisation for Economic Co-operation and Development (OECD)
22
International Data sets
Transparency International
23
Practice
24
Practice: Collect and Clean Data
How to Collect Data Before you start collecting data you should be really clear about the topic you want to present. Identify what variables you need to present that topic. Check who collects those variables. This can be agencies or organisations, such as governments or corporations. You can also collect data from other journalists.
25
Practice: Collect and Clean Data
How to Clean Data The information that can be found might have omissions or is misleading. Before you start the cleaning of your data, you should save the original version and work on a copy. If your data is not in an Excel format, you have to convert it and obtain a table that you can work on. Check duplicate entries and empty entries. You can create default values where no information was held. Correct formatting errors (e.g. words instead of numbers). Standardise your data by assigning the same value to information meaning the same thing (e.g. BBC and B.B.C. and British Broadcasting Corporation). Find missing data from other sources like another datasets or a newspaper.
26
Practice: Collect and Clean Data
Some tools Using “Open Refine” to clean messy data: Open Refine is downloadable software, which can quickly sort and reconcile the imperfections in real-world data. Convert PDFs: Convert PDF documents into usable spreadsheets with Tabula.
27
Practice: Analyse Data
Analysing is about applying statistical techniques to the data. A popular tool is Excel where you can use simple spreadsheet functions like: Sort and filter functions Pivot tables
28
Practice: Analyse Data
Tutorials
29
Practice: Extract pertinent information
As a model, we will follow similar steps as used in investigation of crimes: Imagine that you are an inspector. You have a statement that you want to make and you have a lot information. You will start tracking evidences now.
30
Practice: Extract pertinent information
Document Everything Keep a detailed record of all information you have collected, don’t trust your memory. Organise very well all of your information by topics, date and type of files. Corruption Mining International statistics National statistics 2015 Excel PDF 2014 Health Defence
31
Practice: Extract pertinent information
Nail Down the Timeline All information needs to refer to a specific time. You must align all your information to a timeline, to be able to represent the chronology of how information was provided.
32
Practice: Extract pertinent information
Now, you have to investigate on relationships between information. Write down all hypothesis arising from your analysis and represent them on a chart mentioning all actors involved. Follow every lead
33
Practice: Extract pertinent information
The most important is not just to collect hypothesis or evidence, but to determine what is true or right. You should be able to justify the statistics or statements you will make. Be ready to demonstrate to any audience how you moved from data to your statement. Try to present your assumptions to a few people before you publish them. Verify your new assumptions
34
Practice: Extract pertinent information
It can be tricky to navigate into multiple datasets to find information. You need to be patient and check whether you need more datasets or a technical support to analyse your data. Sometimes, looking for an information for a statement you will find something else. The best is to write it down and keep looking for what you are searching. Persevere
35
Practice: Extract pertinent information
Some tools Graphs, networks, connections and relations Visualise Timeline Extract text from images (OCR) A manual for investigative journalists
36
Sources
37
www.transparency.org facebook.com/transparencyinternational
twitter.com/anticorruption blog.transparency.org © 2014 Transparency International. All rights reserved.
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.