<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
    xmlns:admin="http://webns.net/mvcb/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:content="http://purl.org/rss/1.0/modules/content/">

    <channel>
    
    <title>DDJ &#45; News &amp; Analysis</title>
    <link>http://datadrivenjournalism.net/news_and_analysis</link>
    <description>DDJ &#45; News &amp; Analysis</description>
    <dc:language>en</dc:language>
    <dc:creator>support@ejc.net</dc:creator>
    <dc:rights>Copyright 2012</dc:rights>
    <dc:date>2012-02-14T13:06:19+00:00</dc:date>
    <admin:generatorAgent rdf:resource="http://expressionengine.com/" />
    

    <item>
      <title>The New British Invasion: Scraper Wiki hits the Big Apple</title>
      <link>http://datadrivenjournalism.net/news_and_analysis/scraperwiki_new_york_data_camp</link>
      <guid>http://datadrivenjournalism.net/news_and_analysis/scraperwiki_new_york_data_camp#When:13:06:19Z</guid>
      <description><![CDATA[<p>
	At our <a href="https://scraperwiki.com/events/jdcny/">New York data camp</a>,&nbsp;we set out to liberate data, teach people to liberate data,&nbsp;and find stories in data.&nbsp;About 100 people showed up for the event, and&nbsp;about 40 of them attended the &#39;Learn to Scrape&#39; sessions.</p>
<p>
	The hacking was punctuated by talks by&nbsp;Tom Lee of the <a href="http://www.sunlightfoundation.com">Sunlight Foundation</a>&nbsp;and Jake Porway of <a href="http://datawithoutborders.cc">Data Without Borders</a>.</p>
<p style="text-align: center; ">
	<a href="https://scraperwiki.com/"><img alt="logo_large1.png" src="http://datadrivenjournalism.net/uploads/logo_large1.png" /></a></p>
<p>
	<strong>Projects</strong></p>
<p>
	<a href="https://twitter.com/#!/dancow">Dan Nguyen</a>&nbsp;scraped Florida mugshots from and used <a href="http://face.com">face.com</a>&#39;s API to analyse each photo to tell you the arrestee&#39;s mood.</p>
<p>
	<a href="https://twitter.com/#!/mhkeller">Michael Keller</a>,&nbsp;<a href="https://twitter.com/#!/marcgeorges">Marc Georges</a>&nbsp;et al.&nbsp;related the NYPD <a href="http://www.nyc.gov/html/nypd/html/analysis_and_planning/stop_question_and_frisk_report.shtml">stop, question and frisk data</a>&nbsp;to&nbsp;nine mosques referenced in an NYPD report on surveillance&nbsp;in order to see whether there had been unusual changes&nbsp;in stopping activity around these mosques.</p>
<p>
	The dataset is insanely messy, but they fortunately had access to a relatively clean version that Data Without Bordes had <a href="http://wiki.datawithoutborders.cc/index.php?title=Project:Current_events:NYC_DD:NYCLU">developed</a> in November.</p>
<p>
	They were still going strong&nbsp;<a href="https://twitter.com/#!/mhkeller/status/165937116671524865">after</a> the data camp.&nbsp;<a href="https://twitter.com/#!/mhkeller/status/166008574621913092">Refusing to leave</a>,&nbsp;they moved to a <a href="https://twitter.com/#!/SusanEMcG/status/165950415945478148">different</a> room after getting kicked out of the data camp space.</p>
<p>
	<a href="https://twitter.com/mik3cap/">Mike Caprio</a> and team cleaned a spreadsheet&nbsp;of 80,000 records from the New York lobbiest website to power a site on New&nbsp;York lobbyists based on the <a href="http://chicagolobbyists.org">Chicago Lobbyists site</a>.&nbsp;It appears that $120 million was spent on New York on lobbiests in 2011.</p>
<p>
	I helped one team relate contracts from&nbsp;<a href="http://scraperwiki.com/scrapers/open_book_new_york">Open Book New York</a>&nbsp;to data that they had scraped by hand (!) from hand-written forms&nbsp;in order to identify pontential conflicts of interest.</p>
<p>
	I helped another team identify potential stories (outliers) in the&nbsp;<a href="http://nycopendata.socrata.com/Other/Graffiti-Locations/2j99-6h29">NYC Open Data graffiti locations dataset</a>.</p>
<p>
	<a href="https://twitter.com/SusanEMcG">Susan McGregor</a>&nbsp;was&nbsp;<a href="https://twitter.com/#!/SusanEMcG/status/166346113014706178">&quot;clearly hooked&quot;</a>&nbsp;because she liberated <a href="http://www.nyc.gov/lobbyistsearch/">lobbyist contract details</a>&nbsp;the next evening&nbsp;<a href="https://twitter.com/#!/SusanEMcG/status/166354779499536385">instead of watching the Superbowl</a>.</p>
<p>
	<strong>Technical Awards &nbsp;</strong></p>
<p>
	<a href="https://twitter.com/mik3cap/">Mike Caprio</a>&nbsp;won *Best Data Liberator*&nbsp;for <a href="https://scraperwiki.com/scrapers/iowa_accident_reports/">liberating</a> the Iowa accident reports database.</p>
<p>
	<a href="https://twitter.com/michellekoeth">Michelle Koeth</a>&nbsp;won *Best Creation of an API* for <a href="https://scraperwiki.com/scrapers/hospitalcompare/">scraping</a>&nbsp;New York, NY hospitals from <a href="http://www.hospitalcompare.hhs.gov/hospital-search.aspx">Medicare Hospital Compare</a>.</p>
<p>
	Jeremy Baron, from UN peacekeeping team, won *Best Use of ScraperWiki* for <a href="https://scraperwiki.com/scrapers/un-memstates-contribs-toregbudget-2012/">scraping&nbsp;United Nations PDFs</a>.&nbsp;This team also scraped peacekeeping&nbsp;<a href="http://scraperwiki.com/scrapers/un_peacekeeping_statistics">statistics</a>&nbsp;and <a href="https://scraperwiki.com/scrapers/un_resolution_55235_peacekeeping_contributions/">contributions</a>.</p>
<br />
<p>
	<strong>Honorary ScraperWikian &nbsp;</strong></p>
<p>
	Susan McGregor was awarded &quot;Honorary ScraperWikian&quot;. We haven&#39;t decided what that means yet.&nbsp;</p>
<p>
	<strong>Learning</strong></p>
<p>
	Teaching the Learn to Scrape sessions and working with many of the project teams, I got&nbsp;the impression that we had opened participants to thinking more about how data can be scraped,&nbsp;transformed and analyzed to identify unusual subsets and potential stories.&nbsp;<a href="http://plyr.had.co.nz/">Split-applied-combined</a>, if you will.</p>
<p>
	Our &#39;Learn to Scrape&#39; sessions seemed to work as well; I found several participants&nbsp;who had claimed no knowledge of webscraping prior to the sessions&nbsp;to be creating reasonably complex scrapers by the next afternoon.</p>
<p>
	<strong>What Next?</strong></p>
<p>
	<a href="https://scraperwiki.com/events/">More data camps</a> are coming up,&nbsp;and several groups plan on contining to work on their projects.&nbsp;But in the mean time, we now have&nbsp;<a href="https://scraperwiki.com/scrapers/nyc_school_budgets/">lots</a>&nbsp;of&nbsp;<a href="https://scraperwiki.com/scrapers/nyc_lobbyist_directory_browser/">data</a>&nbsp;for you to analyse!</p>
<p>
	&nbsp;</p>
]]></description> 
      <dc:date>2012-02-14T13:06:19+00:00</dc:date>
    </item>

    <item>
      <title>The Data Journalism Handbook: Final call for contributions</title>
      <link>http://datadrivenjournalism.net/news_and_analysis/the_data_journalism_handbook_final_call_for_contributions</link>
      <guid>http://datadrivenjournalism.net/news_and_analysis/the_data_journalism_handbook_final_call_for_contributions#When:12:44:36Z</guid>
      <description><![CDATA[<p>
	<em>This article is cross posted on the Open Knowledge Foundation blog.</em></p>
<p>
	&nbsp;</p>
<p>
	Where can I find data and how can I request access to it? What tools are available to me? How can I find useful stories within datasets? And, most importantly, how can I make a living through the practise of data journalism?<br />
	<br />
	Do these questions sound familiar? They should, because these are all questions that any data journalist, who is just getting started, should be asking.</p>
<p>
	Leading data journalists from the New York Times, the Guardian, the BBC, and other top media organisations from around the world, are working together to answer all of these questions (and more) in the <a href="https://docs.google.com/a/ejc.net/document/d/18YOaGj0LyRn6x1tcCH2wIWHYqwnMiDCGInbVHe210rM/edit?authkey=CLrotIQH&amp;hl=en_GB">Data Journalism Handbook</a>. The handbook is the first comprehensive practical guide to data journalism. As Aron Pilhofer, editor of Interactive News at the New York Times, noted at the beginning of the project: &ldquo;A project like this is quite necessary. It&rsquo;s kind of surprising that someone hasn&rsquo;t tried to do this until now.&rdquo;</p>
<p>
	&nbsp;</p>
<p style="text-align: center; ">
	<img alt="Screen_shot_2012-02-06_at_12.50.36_PM.png" src="http://datadrivenjournalism.net/uploads/Screen_shot_2012-02-06_at_12.50.36_PM.png" style="width: 300px; height: 225px; " /></p>
<p>
	&nbsp;</p>
<h3>
	<strong>Interested in contributing? We want YOU!</strong></h3>
<p>
	Work on the handbook was kick-started in November of last year at the <a href="https://mozillafestival.org/">Mozilla Festival</a> in London. In just two days, 55 contributors drafted 60 pages (20,000 words) for six chapters. The handbook is a community project; therefore, anyone who has experience in data journalism can help draft the book. The work, in turn, goes back to the community as anyone is allowed to freely use, modify, adapt, and reuse the handbook.</p>
<p>
	Contributions have been flooding in and since Mozilla we have been editing, updating, filling in gaps, and restructuring. We are very close to a first complete draft of the book and we need your help to get there. We are looking for authors, editors and peer reviewers to draft chapters, review content, style and accuracy of the book.</p>
<p>
	In the table of contents below you can see the progress that has been made and where input is needed:</p>
<ul>
	<li>
		<strong>0. Preface</strong> (<span style="color:#0000ff;">in progress</span>)
		<ul>
			<li>
				0.1 The purpose of this book</li>
			<li>
				0.2 Add to this book</li>
			<li>
				0.3 Share this book</li>
		</ul>
	</li>
	<li>
		<strong>1. Introduction</strong> (<span style="color:#40e0d0;">done</span>)
		<ul>
			<li>
				1.1 What is data journalism?</li>
			<li>
				1.2 Why is it important?</li>
		</ul>
	</li>
	<li>
		<strong>2. Introducing data journalism in the newsroom</strong>
		<ul>
			<li>
				2.1 Changes in the newsroom (<span style="color:#ff8c00;">contributors needed</span>)</li>
			<li>
				2.2 How is it done: journo-developers vs. coders for hire (<span style="color:#0000ff;">in progress</span>)</li>
		</ul>
	</li>
	<li>
		<strong>3. Types of outcomes/projects and case studies</strong>
		<ul>
			<li>
				3.1 Data powered stories (<span style="color:#0000ff;">in progress</span>)</li>
			<li>
				3.2 Data served with stories (<span style="color:#0000ff;">in progress</span>)</li>
			<li>
				3.3 Data driven applications (<span style="color:#ff8c00;">contributors needed</span>)</li>
		</ul>
	</li>
	<li>
		<strong>4. Working on the data story</strong>
		<ul>
			<li>
				4.1. Step 1: Getting your data
				<ul>
					<li>
						4.1.1 Where does data live?&nbsp;(<span style="color:#0000ff;">in progress</span>)</li>
					<li>
						4.1.2 Asking for data&nbsp;(<span style="color:#0000ff;">in progress</span>)</li>
					<li>
						4.1.3 Getting your own data&nbsp;(<span style="color:#0000ff;">in progress</span>)</li>
					<li>
						4.1.4 Crowdsourcing data (<span style="color:#ff8c00;">contributors needed</span>)</li>
				</ul>
			</li>
			<li>
				4.2 Step 2: Understanding your data
				<ul>
					<li>
						4.2.1 Data literacy (<span style="color:#40e0d0;">done</span>)</li>
					<li>
						4.2.2 Working with data tips (<span style="color:#0000ff;">in progress</span>)</li>
					<li>
						4.2.3 Tools and techniques for analysing data (<span style="color:#ff8c00;">contributors needed</span>)</li>
					<li>
						4.2.4 Harnessing expert opinions: Annotating datasets (<span style="color:#ff8c00;">contributors needed</span>)</li>
				</ul>
			</li>
			<li>
				4.3 Step 3: Finding a story in your data (<span style="color:#0000ff;">in progress</span>)
				<ul>
					<li>
						4.3.1&nbsp; From datasets to stories - approaches&nbsp;</li>
				</ul>
			</li>
			<li>
				4.4 Step 4: Delivering your data project (<span style="color:#ff8c00;">contributors needed</span>)
				<ul>
					<li>
						4.4.1 Serving data with stories&nbsp;</li>
					<li>
						4.4.2 Visualising data&nbsp;</li>
					<li>
						4.4.3 Data driven applications&nbsp;</li>
				</ul>
			</li>
		</ul>
	</li>
	<li>
		<strong>5. Engagement, outreach and community</strong> (<span style="color:#ff8c00;">contributors needed</span>)</li>
</ul>
<ul>
	<li>
		<strong>6. How to make data journalism sustainable</strong>
		<ul>
			<li>
				6.1 Measuring impact (<span style="color:#ff8c00;">contributors needed</span>)</li>
			<li>
				6.2 Business models (<span style="color:#ff8c00;">contributors needed</span>)</li>
		</ul>
	</li>
	<li>
		<strong>7. Appendix</strong> (<span style="color:#0000ff;">in progress</span>)
		<ul>
			<li>
				7.1 Further resources&nbsp;</li>
			<li>
				7.2 Glossary</li>
		</ul>
	</li>
</ul>
<p>
	Whether you are a budding data journalist putting the manual through its paces or someone with years of experience, we&#39;d love your help to fine-tune the book and get it ready for the press (no pun intended).</p>
<p>
	If you&#39;re interested to contribute to the first Data Journalism Handbook please <strong>fill in <a href="https://docs.google.com/a/ejc.net/spreadsheet/viewform?formkey=dDhrMnBMMEI5RUlWUldaanJMSFhObXc6MQ">this form</a> as soon as possible</strong> indicating your level and area of expertise and the chapter that you would like to contribute to. We will get in touch with you as soon as we can once you&#39;ve submitted the form.</p>
<h3>
	<strong>Deadline</strong></h3>
<p>
	A first complete draft is planned to be ready by the <strong>end of February</strong>.<br />
	<br />
	The official launch of the The Data Journalism Handbook will be at the <a href="http://datadrivenjournalism.net/news_and_analysis/diving_into_data_the_school_of_data_journalism_at_the_international_journal">International Journalism Festival</a> in Perugia from 25-29 April. The book will be available online as an e-book. Participants at the festival will have the opportunity to buy a printed copy of the book and enjoy a meet and greet with the authors where they may exchange knowledge and learn helpful tips about how to successfully become data journalists.&nbsp;</p>
<h3>
	<strong>Questions?</strong></h3>
<p>
	For questions get in touch with the Data Journalism Handbook coordinators: bounegru@ejc.net&nbsp;(European Journalism Centre) or lucy.chambers@okfn.org&nbsp;(Open Knowledge Foundation).</p>
<p>
	&nbsp;</p>
<p>
	<em>Image by <a href="http://projeqt.com/kate#fsi89948ci21866q">Kate Hudson</a>.</em></p>
]]></description> 
      <dc:date>2012-02-06T12:44:36+00:00</dc:date>
    </item>

    <item>
      <title>Open Data London presents: 5 exciting new data&#45;driven projects</title>
      <link>http://datadrivenjournalism.net/news_and_analysis/Open_Data_London_presents_5_exciting_new_data-driven_projects</link>
      <guid>http://datadrivenjournalism.net/news_and_analysis/Open_Data_London_presents_5_exciting_new_data-driven_projects#When:18:45:15Z</guid>
      <description><![CDATA[<p>
	Monday 30 January saw the third Open Data London meet-up take place at the Centre for Creative Collaboration. The event was co-hosted by Kat Baybrooke, community coordinator for the Open Knowledge Foundation, and Oliver Keyes, of Wikimedia.</p>
<p style="text-align: left; ">
	Following an opening talk on upcoming developments from Wikimedia, such as&nbsp;<a href="http://wikihadoop.com/index.php/Main_Page">Wikihadoop</a>, five presentations were given; each covering a specific group or project concerned with the use or pursuit of open data.</p>
<p style="text-align: center; ">
	<img alt="Open_Data_london.jpg" src="http://datadrivenjournalism.net/uploads/Open_Data_london.jpg" style="width: 300px; height: 300px; " /></p>
<p>
	First, Velichka Dimitrova introduced the Open Economics working group, which is seeking to break down the barriers between members of the public and the economic data which is increasingly influential in their daily lives. The group is currently focusing on a project aimed at modelling social progress in Italy, which it hopes to enter into the <a href="http://www.appsforitaly.org/en/blog/che-cosa-e-apps4italy/">Apps4Italy</a> competition. Ideas for future tasks were also discussed, of which two of the most interesting were an analysis of &lsquo;author cartels&rsquo; within the economic journal publishing industry, and the proposal of an &lsquo;open data accreditation badge&rsquo; which could be awarded to organisations and companies who seek to make open to the public any data they hold.</p>
<p>
	Next, Julian Tate, of <a href="http://opendatamanchester.wordpress.com/">Open Data Manchester</a>, gave a fascinating glimpse of his group&rsquo;s ongoing analysis of public transport data in the city. Somewhat perversely, competing bus companies currently refuse to publish fare information online, fearing that rivals will undercut them by tiny margins and steal their customers. According to Julian&rsquo;s research, thousands of potential commuters are put off using the service because of the lack of transparency, and far more would use public transport if prices for the various journeys were openly available. Julian and his team are now attempting to do what Transport for Greater Manchester have as yet been unable to do, and compile - through a mixture of crowdsourcing and pleading with bus companies - a comprehensive and open list of fares for journeys throughout the region.</p>
<p>
	Kevin Carter then introduced his venture; the ingeniously titled <a href="http://www.landscape-portrait.com/">Landscape Portrait</a>. The project aims to blend art and data to produce what is effectively an &lsquo;identity map&rsquo;. The idea is that individuals provide answers to a number of questions about their aspirations, feelings and concerns, and these are then compared with statistically-generated regional stereotypes. The result is an intriguing spatial patchwork of social outlooks and aspirations.</p>
<p>
	Jo Pugh then announced a hack day at the <a href="http://www.nationalarchives.gov.uk/">National Archives</a>, which will take place on 24/25 March at the Archives in Kew. Pugh described how the very existence of the Archives in only justified if the records held there are examined and shared - precisely what the hack day is intended for. Anyone who has an idea for making use of the records - some of which stretch back for centuries&nbsp; - should come along.</p>
<p>
	Finally, <a href="http://keiichimatsuda.com/">Keiichi Matsuda</a> outlined his project; &lsquo;Prism: a Sculptural aggregator of live data in London&rsquo;. While the name might be something of a handful, the idea itself is beautifully simplistic. Matsuda is creating an exhibition for the <a href="http://www.londondesignfestival.com/">London Design Festival</a> where he plans to project interesting London-centric statistics onto a series of angular sculptures in what he refers to as a display of &ldquo;urban informatics&rdquo;.</p>
]]></description> 
      <dc:date>2012-02-03T18:45:15+00:00</dc:date>
    </item>

    <item>
      <title>Open Economics hack day models social progress in Italy</title>
      <link>http://datadrivenjournalism.net/news_and_analysis/open_economics_hackday</link>
      <guid>http://datadrivenjournalism.net/news_and_analysis/open_economics_hackday#When:18:27:43Z</guid>
      <description><![CDATA[<p>
	On Saturday 28 January an eclectic mix of coders, researchers, designers, academics and data journalists gathered at the Barbican Centre for an <a href="http://openeconomics.net/">Open Economics</a> hackday, coordinated by Velichka Dimitrova. One of the younger members of the <a href="http://okfn.org/">Open Knowledge Foundation</a> family, Open Economics is a group concerned with increasing the transparency of economic data, aiming to tear down - or at least reduce - the barriers between the public and the economic data which has an ever-growing significance for their lives.</p>
<p>
	<a href="http://www.flickr.com/photos/iliasbartolini/6782917515/in/set-72157629083857811/lightbox/"><img alt="open_economics_hackday.jpg" src="http://datadrivenjournalism.net/uploads/open_economics_hackday.jpg" /></a></p>
<p>
	The Eurozone crisis has dominated headlines across the continent in recent months, and will not be leaving the spotlight any time soon. Along with Greece, Italy has been one of the biggest causes for concern, and the Italian economy provided the backdrop for the day&rsquo;s main focus.</p>
<p>
	Following the success of <a href="http://yourtopia.net/">Yourtopia</a>, Open Economics&rsquo; main project to date, the central aim of Saturday&rsquo;s activities was to develop an application to map social progress in Italy over time and across its 21 administrative regions.</p>
<p>
	<object height="360" width="640"><param name="movie" value="http://www.youtube.com/v/Nv-00MAxa0g&amp;hl=en_US&amp;feature=player_embedded&amp;version=3" /><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><embed allowfullscreen="true" allowscriptaccess="always" height="360" src="http://www.youtube.com/v/Nv-00MAxa0g&amp;hl=en_US&amp;feature=player_embedded&amp;version=3" type="application/x-shockwave-flash" width="640"></embed></object></p>
<p>
	Over the years, different bodies have come up with different methods of measuring social change, from the qualitative - public surveys - to the quantitative - unemployment statistics and income distribution - but none can be considered sufficiently holistic to provide a fully representative overview of the state of affairs.</p>
<p>
	This new project, Yourtopia 2, will take a wide variety of measures and combine them into a broad model, with the hope of providing an unprecedented analysis of socioeconomic welfare which can then be applied to other countries and regions. All being well, the final application will be submitted as an entry to the <a href="http://www.appsforitaly.org/en/blog/che-cosa-e-apps4italy/">Apps4Italy</a> competition.</p>
<p>
	As with Yourtopia, the aim for Yourtopia 2 is to end up with an interactive tool which users can customize depending on which factors they consider most important to achieving social progress. As a result, Saturday&rsquo;s focus was on gathering data for the different variables, beginning design of the user interface, and carrying out further research into how the eventual model will be built.</p>
<p>
	In one corner, a handful of data researchers scoured the web for sources of regional unemployment data and Gini coefficients, in another, coders and designers worked tirelessly to iron out glitches in the model and create a slick user interface, while elsewhere academics debated which factors the model should incorporate in order to be most effective.</p>
<p>
	The project is ongoing. Anyone wishing to contribute in some way had the possibility to take part in the online follow-up session last Saturday. Should you wish to take part in future meetings keep an eye on the Open Economics <a href="http://openeconomics.net/">website</a>.&nbsp;</p>
<p>
	&nbsp;</p>
<p>
	<em>Photo by <a href="http://www.flickr.com/photos/iliasbartolini/with/6782917515/">Ilias Bartolini</a>.&nbsp;</em></p>
]]></description> 
      <dc:date>2012-02-03T18:27:43+00:00</dc:date>
    </item>

    <item>
      <title>Diving into Data: The School of Data Journalism at the International Journalism Festival in Perugia</title>
      <link>http://datadrivenjournalism.net/news_and_analysis/diving_into_data_the_school_of_data_journalism_at_the_international_journal</link>
      <guid>http://datadrivenjournalism.net/news_and_analysis/diving_into_data_the_school_of_data_journalism_at_the_international_journal#When:13:05:05Z</guid>
      <description><![CDATA[<p>
	<em>This article is cross posted on the Open Knowledge Foundation blog and journalismfestival.com.</em></p>
<p>
	&nbsp;</p>
<p>
	In the past investigative reporters would suffer from a scarcity of information relating to questions they were trying to answer. While this is still the case, today journalists are also faced with an overwhelming abundance of data. In an age of information overload, to stay relevant to society journalists need to learn to separate signal from noise in order to provide valuable insights. Journalists need to be equipped with knowledge of the tools, techniques and tactics of working with data in order to derive maximum value from for their readers.</p>
<p>
	The European Journalism Centre and the Open Knowledge Foundation are pleased to invite you to the <strong>School of Data Journalism</strong> hosted at the sixth edition of Italy&#39;s leading journalism event, the <strong>International Journalism Festival</strong>. The 2012 edition takes place in the beautiful city of <strong>Perugia</strong> between <strong>25-29 April</strong>. Entry to the School of Data Journalism panels and workshops is free. Each workshop has a limited number of places and therefore registration will be necessary. Please note that not all requests to participate in the workshops will be accepted.</p>
<h3>
	&nbsp;</h3>
<h3 style="text-align: left; ">
	What is the School of Data Journalism and who is it for?</h3>
<p style="text-align: center; ">
	<img alt="school_bus_.jpg" src="http://datadrivenjournalism.net/uploads/school_bus_.jpg" /><br />
	&nbsp;</p>
<p>
	The School consists of <strong>three panel discussions</strong> and <strong>five workshops</strong>.</p>
<p>
	The panels attempt to provide answers to crucial questions for aspiring data journalists, editors and decision-makers in newsrooms:</p>
<ul>
	<li>
		What can aspiring data journalists learn from the successes of the past?</li>
	<li>
		How can data journalism save your newsroom?</li>
	<li>
		How do you start a data journalism operation?</li>
	<li>
		How can you become a data journalist and what do you need to do?</li>
</ul>
<p>
	In the workshops journalists who are interested to get started with reporting with data and budding data journalists will learn from experienced data journalists and open data experts essential skills related to how to get the data you need, how to analyse it, how to get stories from data and how to present your stories.</p>
<h3>
	&nbsp;</h3>
<h3 style="text-align: center; ">
	Panels</h3>
<h4>
	&nbsp;</h4>
<h4>
	Panel 1: News and numbers: from CAR to data journalism (Thursday, 26 April)</h4>
<p>
	Journalists have always used data and numbers to produce stories&hellip;and win Pulitzers. From Philip Meyer&rsquo;s coverage of the Detroit riots in 1967 to Steve Doig&rsquo;s &#39;What Went Wrong&#39; analysis of the damage patterns from Hurricane Andrew, data-driven reporting has brought valuable public service and won journalists recognition and prizes.</p>
<p>
	Whereas there may be distinguishing aspects about the data journalism of today and the computer-assisted reporting of the past, it is crucial to learn from successful examples, techniques and approaches of the past.</p>
<ul>
	<li>
		How can we bring the data journalism community and the CAR community closer together?</li>
	<li>
		What can aspiring data journalists learn from the successes of the past?</li>
	<li>
		What is the future of data journalism?</li>
</ul>
<h4>
	&nbsp;</h4>
<h4>
	Panel 2: How can data journalism save your newsroom? (Friday, 27 April)</h4>
<p>
	Journalism is under siege. Traditional models are collapsing. Developing the know-how to use the available data more effectively, to understand it, communicate and generate stories based on it, could be a huge opportunity to breathe new life into journalism.</p>
<ul>
	<li>
		What is the potential of data journalism?</li>
	<li>
		How do you start a data journalism operation?</li>
	<li>
		How can you start thinking about making money with data journalism?</li>
</ul>
<h4>
	&nbsp;</h4>
<h4>
	Panel 3: You Too Can Be A Data Journalist! (Saturday, 28 April)</h4>
<p>
	Where can I find data? How can I request data? What tools can I use? How can I find stories in data? How can I make money with data journalism?</p>
<p>
	Several leading data journalists, CAR specialists and journalism professors from the Guardian, the New York Times, Financial Times, Chicago Tribune, Medill School of Journalism and Cronkite School of Journalism worked together to answer these questions in the Data Journalism Handbook.</p>
<p>
	The handbook, the first comprehensive practical guide to data journalism, will be officially launched in this session. The session will provide the opportunity to meet and greet authors of the book, exchange knowledge and learn from them what you need to know to be a data journalist, as well as get a printed copy of the book.</p>
<p>
	&nbsp;</p>
<p>
	<strong>Speakers</strong></p>
<ul>
	<li>
		<strong>Caelainn Barr</strong>, EU data journalist, formerly with the Bureau of Investigative Journalism in London</li>
	<li>
		<strong>Heather Brooke</strong>, journalist and FOI campaigner</li>
	<li>
		<strong>Sarah Cohen</strong>, Knight Professor of the Practice of Journalism and Public Policy, DeWitt Wallace Center for Media and Democracy, Duke University (Pulitzer prize winner)</li>
	<li>
		<strong>Steve Doig</strong>, Knight Chair in Journalism, Walter Cronkite School of Journalism, Arizona State University (Pulitzer prize winner)</li>
	<li>
		<strong>Mirko Lorenz</strong>, data journalism trainer, Deutsche Welle</li>
	<li>
		<strong>Aron Pilhofer</strong>, editor of Interactive News at The New York Times and co-founder ofDocumentCloud.org</li>
	<li>
		<strong>Dan Nguyen</strong>, news application developer, ProPublica</li>
	<li>
		<strong>Simon Rogers</strong>, editor of the Guardian Data Blog</li>
</ul>
<h3>
	&nbsp;</h3>
<h3 style="text-align: center; ">
	Workshops</h3>
<h4>
	&nbsp;</h4>
<h4>
	Workshop 1: Scraping data &amp; cracking PDFs (Thursday, 26 April)</h4>
<p>
	Hands up who knows what machine-readable data is? You will soon, and more importantly, how to get it and what you can do with it once you have it!</p>
<p>
	A workshop targeted at scraping from scratch, including:</p>
<ul>
	<li>
		What PDFs and webpages look like to your computer - An introduction to machine-readable / non-machine readable data</li>
	<li>
		The Scraper Cookbook - an overview of the key things you need to know to write a scraper</li>
	<li>
		Hands on session - learning to screen-scrape. Main focus: hands on session using tools such as ScraperWiki. If sufficient interest &amp; time, we will also touch on some of the tools &amp; skills needed to extract data from PDFs.</li>
	<li>
		Error checking - how to check what you have makes sense, spotting the types of errors sometimes introduced if you don&rsquo;t get it quite right!&nbsp;</li>
</ul>
<h4>
	&nbsp;</h4>
<h4>
	Workshop 2: Information wants to be free - Freedom of information requests and how to use them (Friday, 27 April)</h4>
<p>
	Freedom of information requests are constantly evolving. Law changes and technological advancements make it increasingly easier to file and systematise FOI requests, and importantly track their progress through the system. This workshop includes demonstrations and case studies examining the current state-of-play with FOI requests in Europe and beyond and looking into what&rsquo;s next for the freedom of information movement.&nbsp;</p>
<h4>
	&nbsp;</h4>
<h4>
	Workshop 3: Making data pretty (Friday, 27 April)</h4>
<p>
	Journalism is no longer just a block of prose on a page. The modern reader often demands maps, infographics and visualisations to make the story jump out at them, particularly in digital environments.</p>
<p>
	There are a vast array of free tools available on the web to allow data-journalists to quickly and easily digest, process and display the data powering their stories. This workshop aims to give a good overview of what is currently available and delve into depth on one of the most powerful: Google Fusion Tables.</p>
<h4>
	&nbsp;</h4>
<h4>
	Workshop 4: Getting Stories from Data (Saturday, 28 April)</h4>
<p>
	Enormous datasets can often prove extremely daunting to the unfamiliar. Mistakes and crimes have historically benefited from, and triumphs and good decisions been obscured by, a mask of bewildering numbers and statistics and gone unreported.</p>
<p>
	Large datasets often hold a wealth of undiscovered stories for those willing to invest the time into exploring them. This workshop is a &lsquo;spotters&rsquo;-guide&rsquo; for things to look out for and where to look for datasets.</p>
<h4>
	&nbsp;</h4>
<h4>
	Workshop 5: Spending Stories (Sunday, 29 April)</h4>
<p>
	To get to the bottom of a story, you need only to &#39;follow the money&#39;. The same is true of government: budgeting is where policies and priorities are broken down into figures. Financial programming has a direct influence on all political areas: while other data on health or social help us understand what challenges society faces, looking at spending data allows us to see how government reacts to all of these.</p>
<p>
	There are many spending databases available on the web, some impenetrable, some accessible for analysis. We&rsquo;ll show how we enable journalists and researchers to make sense of the data and what strategies can be used to investigate stories and policies.</p>
<ul>
	<li>
		Overview: What datasets are out there? Where can you look for more?</li>
	<li>
		Interrogating databases, how to extract the maximum amount of data out of tricky databases</li>
	<li>
		Tools for spending analysis, how to slice and dice once you have your data</li>
</ul>
<p>
	&nbsp;</p>
<p>
	<strong>Workshop leaders</strong></p>
<ul>
	<li>
		<strong>Caelainn Barr</strong>, formerly with the Bureau of Investigative Journalism, working on the award-winning Structural Funds investigation</li>
	<li>
		<strong>Lucy Chambers</strong>, Open Knowledge Foundation, Community Coordinator for OpenSpending and the &lsquo;Spending Stories&rsquo; project.</li>
	<li>
		<strong>Steve Doig</strong>, Knight Chair in Journalism, Cronkite School of Journalism, Arizona State University (Pulitzer prize winner)</li>
	<li>
		<strong>Friedrich Lindenberg</strong>, Open Knowledge Foundation Deutschland, Developer on OpenSpending</li>
	<li>
		<strong>Dan Nguyen</strong>, news application developer, ProPublica</li>
	<li>
		<strong>Simon Rogers</strong>, editor of the Guardian Data Blog</li>
</ul>
<h3>
	<br />
	When and where?</h3>
<p>
	<br />
	The Data Journalism School takes place at the International Journalism Festival in Perugia between 25 and 29 April 2012. The schedule of the Data Journalism School, with confirmed speakers for each panel and workshop, will be posted on the <a href="http://www.journalismfestival.com/">festival website</a> in early February.</p>
<h3>
	&nbsp;</h3>
<h3>
	How to register?</h3>
<p>
	<br />
	Entry to the festival and the School are free. There is no registration process to attend the festival. For the workshops there is a <strong>limited number of available seats</strong>. To secure a seat in the workshops please register via <a href="https://docs.google.com/spreadsheet/viewform?hl=en_US&amp;formkey=dEJQOXphTHFfdWFvbDVMVDJkWldyTnc6MQ#gid=0">this form</a>. The deadline for workshop registration is <strong>20 March 2012</strong>. You will be notified by email by 25 March at the latest if we were able to confirm you a seat. The workshops are entry-level. Consideration will be given to your experience, skills and motivation to attend the workshop when making the selection.</p>
<h3>
	&nbsp;</h3>
<h3>
	What do you need to bring?</h3>
<p>
	A lot of enthusiasm and a laptop are required for the workshop sessions. Please note for hands-on workshops tablet PC&#39;s will not be appropriate.</p>
<h3>
	&nbsp;</h3>
<h3>
	Questions?</h3>
<p>
	<br />
	If you have questions about the School of Data Journalism get in touch with the coordinators: bounegru@ejc.net&nbsp;or lucy.chambers@okfn.org.</p>
<p>
	&nbsp;</p>
]]></description> 
      <dc:date>2012-02-01T13:05:05+00:00</dc:date>
    </item>

    <item>
      <title>The inverted pyramid of data journalism</title>
      <link>http://datadrivenjournalism.net/news_and_analysis/the_inverted_pyramid_of_data_journalism</link>
      <guid>http://datadrivenjournalism.net/news_and_analysis/the_inverted_pyramid_of_data_journalism#When:14:38:19Z</guid>
      <description><![CDATA[<p>
	<em>Originally published by Paul Bradshaw on the <a href="http://onlinejournalismblog.com">Online Journalism Blog</a> on 7 July 2011. This article is republished with permission.</em></p>
<p>
	&nbsp;</p>
<p>
	I&rsquo;ve been working for some time on picking apart the many processes which make up what we call data journalism. Indeed, if you read the chapter on data journalism (<a href="http://onlinejournalismblog.com/2010/04/21/data-journalism-pt1-finding-data-draft-comments-invited/">blogged draft</a>) in my <a href="http://www.amazon.co.uk/Online-Journalism-Handbook-Survive-Digital/dp/140587340X/ref=as_li_ss_mfw?&amp;camp=2486&amp;linkCode=wey&amp;tag=onlijourblog-21&amp;creative=8882">Online Journalism Handbook</a>, or <a href="http://www.slideshare.net/onlinejournalist/data-journalism-city-online-journalism-wk8">seen</a> me <a href="http://www.slideshare.net/onlinejournalist/10-things-ive-learned-about-data-journalism">speak</a> on the subject, you&rsquo;ll have seen my previous diagram that tries to explain those processes.</p>
<p>
	I&rsquo;ve now revised that considerably, and what I&rsquo;ve come up with bears some explanation. I&rsquo;ve cheekily called it the inverted pyramid of data journalism, partly because it begins with a large amount of information which becomes increasingly focused as you drill down into it until you reach the point of communicating the results.</p>
<p>
	What&rsquo;s more, I&rsquo;ve also sketched out a second diagram that breaks down how data journalism stories are communicated &ndash; an area which I think has so far not been very widely explored. But that&rsquo;s for a future post.</p>
<p>
	I&rsquo;m hoping this will be helpful to those trying to get to grips with data, whether as journalists, developers or designers. This is, as always, work in progress so let me know if you think I&rsquo;ve missed anything or if things might be better explained.</p>
<p>
	UPDATE: <a href="http://onlinejournalismblog.com/2011/07/08/the-inverted-pyramid-of-data-journalism-in-spanish/">Also in Spanish</a>.</p>
<h3>
	The inverted pyramid of data journalism</h3>
<p style="text-align: left; ">
	<img alt="" src="http://farm8.staticflickr.com/7160/6755025107_a95c52dd7c.jpg" style="width: 449px; height: 500px; " /></p>
<p style="text-align: left; ">
	Here are the stages explained:</p>
<h3>
	Compile</h3>
<p>
	Data journalism begins in one of two ways: either you have a question that needs data, or a dataset that needs questioning. Whichever it is, the compilation of data is what defines it as an act of data journalism.</p>
<p>
	Compiling data can take various forms. At its most simple the data might be:</p>
<ol>
	<li>
		supplied directly to you by an organisation (how long until we see &lsquo;data releases&rsquo; alongside press releases?),</li>
	<li>
		found through using advanced search techniques to plough into the depths of government websites;</li>
	<li>
		compiled by scraping databases hidden behind online forms or pages of results using tools like <a href="http://www.outwit.com/products/hub/">OutWit Hub</a> and <a href="http://scraperwiki.com/">Scraperwiki</a>;</li>
	<li>
		by converting documents into something that can be analysed, using tools like DocumentCloud;</li>
	<li>
		by pulling information from APIs;</li>
	<li>
		or by collecting the data yourself through observation, surveys, online forms or crowdsourcing.</li>
</ol>
<p>
	This compilation stage is the most important &ndash; not only because everything else rests on that, but because it is probably the stage that is returned to the most &ndash; at each of the subsequent stages &ndash; cleaning, contextualising, combining and communicating &ndash; it may be that you need to compile further information.</p>
<h3>
	Clean</h3>
<p>
	Having data is just the beginning. Being confident in the stories hidden within it means being able to trust the quality of the data &ndash; and that means cleaning it.</p>
<p>
	Cleaning typically takes two forms: removing human error; and converting the data into a format that is consistent with other data you are using.</p>
<p>
	For example, datasets will often include some or all of the following: duplicate entries; empty entries; the use of default values to save time or where no information was held; incorrect formatting (e.g. words instead of numbers); corrupted entries or entries with HTML code; multiple names for the same thing (e.g. BBC and B.B.C. and British Broadcasting Corporation); and missing data (e.g. constituency). You can probably suggest others.</p>
<p>
	There are <a href="http://www.datadrivenconsulting.com/2011/04/data-analysts-toolbox-cleaning-data-in-excel/">simple ways to clean up data in Excel</a> or Google Docs such as find and replace, sorting to find unusually high, low, or empty entries, and using filters so that only duplicate entries (i.e. those where a piece of data occurs more than once) are shown.</p>
<p>
	<a href="http://code.google.com/p/google-refine/">Google Refine</a> adds a lot more power: <a href="http://onlinejournalismblog.com/2011/07/05/cleaning-data-using-google-refine-a-quick-guide/">its &lsquo;common transforms&rsquo; function will, for example, convert all entries to lowercase, uppercase or titlecase</a>. It can remove HTML, remove spaces before and after entries (which you can&rsquo;t see but which computers will see as different to the same data without a space), remove double spaces, join and split cells, and format them consistently. It will also &lsquo;cluster&rsquo; entries and allow you to merge those which should be the same. Note: this will work for BBC and B.B.C. but not BBC and British Broadcasting Corporation, so some manual intervention is often needed.</p>
<h3>
	Context</h3>
<p>
	Like any source, data cannot always be trusted. It comes with its own histories, biases, and objectives. So like any source, you need to ask questions of it: who gathered it, when, and for what purpose? How was it gathered? (The methodology). What exactly do they mean by that?</p>
<p>
	You will also need to understand jargon, such as codes that represent categories, classifications or locations, and specialist terminology.</p>
<p>
	All the above will most likely lead you to compile further data. For example, knowing the number of crimes in a city is interesting, but only becomes meaningful when you contextualise that alongside the population, or the numbers of police, or the levels of crime 5 years ago, or perceptions of crime, or levels of unemployment, and so on. Statistical literacy is a must here &ndash; or at least show your work to someone who has read <a href="http://www.amazon.co.uk/gp/product/0007240198?ie=UTF8&amp;tag=onlijourblog-21&amp;linkCode=xm2&amp;creativeASIN=0007240198">Ben Goldacre&rsquo;s book</a>.</p>
<p>
	Having a clear question at the start of the whole process, by the way, helps ensure you don&rsquo;t lose your focus at this point, or miss an interesting angle.</p>
<h3>
	Combine</h3>
<p>
	Good stories can be found in a single dataset, but often you will need to combine two together. After all, given the choice between a single-source story and a multiple-source one, which would you prefer?</p>
<p>
	The classic combination is the maps mashup: taking one dataset and combining it with map data to provide an instant visualisation of how something is distributed in space: <a href="http://kijamedia.com/2011/03/men-timeline-cuts-map/">where are the cuts hitting hardest</a>? <a href="http://www.guardian.co.uk/news/datablog/interactive/2011/jan/12/school-tables-mapped">Which schools are performing best</a>? <a href="http://trendsmap.com/">What are the most talked-about topics around the world on Twitter right now</a>?</p>
<p>
	This is so common (largely because the Google Maps API was one of the first journalistically useful APIs) it has almost become a cliche. But still, cliches are often &ndash; if not always &ndash; effective.</p>
<p>
	A more mundane combination is to combine two or more datasets with a common data point. That might be <a href="http://dharmafly.com/theywriteforyou/">a politican&rsquo;s name</a>, for example, <a href="http://www.schooloscope.com/">or a school</a>, or <a href="http://mapumental.com/">a location</a>.</p>
<p>
	This often means ensuring that the particular data point is formatted in the same name across each dataset.</p>
<p>
	In one, for example, the first and last names might have separate columns, but not in the other (you can concatenate or split cells to solve this).</p>
<p>
	Or you might have local authority names in one, but local authority codes in another (find another dataset that has both together and use a tool like <a href="http://google.com/fusiontables">Google Fusion Tables</a> to merge them).</p>
<p>
	One might use latitude and longitude; another postcodes, or easting and northing (<a href="http://onlinejournalismblog.com/2010/12/16/adding-geographical-information-to-a-spreadsheet-based-on-postcodes-google-refine-and-apis/">a postcodes API and Google Refine can help</a>). But once you&rsquo;ve got them formatted right, you may find some interesting stories or leads for further questions to ask.</p>
<h3>
	Communicate</h3>
<p>
	In data journalism the all-too-obvious thing to do at this point is to visualise the results &ndash; on a map, in a chart, an infographic, or an animation. But there&rsquo;s a lot more here to consider &ndash; from the classic narrative, to news apps, case studies and personalisation. In fact there&rsquo;s so much in this stage alone that I&rsquo;ve <a href="http://onlinejournalismblog.com/2011/07/13/the-inverted-pyramid-of-data-journalism-part-2-6-ways-of-communicating-data-journalism/">written a separate post</a> (diagram below). Meanwhile, comments very much welcome.</p>
<p>
	<img alt="" src="http://farm8.staticflickr.com/7147/6755026759_5fde2956b0.jpg" style="width: 500px; height: 389px; " /></p>
]]></description> 
      <dc:date>2012-01-24T14:38:19+00:00</dc:date>
    </item>

    <item>
      <title>First ever Data Journalism Awards open for submissions!</title>
      <link>http://datadrivenjournalism.net/news_and_analysis/first_ever_data_journalism_awards_open_for_submissions</link>
      <guid>http://datadrivenjournalism.net/news_and_analysis/first_ever_data_journalism_awards_open_for_submissions#When:13:59:32Z</guid>
      <description><![CDATA[<p>
	The European Journalism Centre (EJC) is proud to announce the launch of the first annual Data Journalism Awards (DJA).</p>
<p>
	Journalists and media organisations have increasingly been using data to improve reportage of complex issues and to give readers direct access to the sources behind the headlines. From the Guardian to the New York Times, La Stampa to Die Zeit, journalists and media organisations are experimenting with new ways of sourcing, delivering and enabling readers and users to engage with news.&nbsp;</p>
<p>
	The DJA is the first international contest recognizing outstanding work in this field. It is organised by the Global Editors Network in collaboration with the EJC, and is sponsored by Google.</p>
<p>
	<img alt="IMG_1487.jpg" src="http://datadrivenjournalism.net/uploads/IMG_1487.jpg" style="margin-left: 9px; margin-right: 9px; float: left; width: 300px; height: 345px; " /></p>
<p>
	A jury of data journalism experts and editors from all over the world, including from prestigious organisations like New York Times, Reuters, and Les Echos will award a total of &euro;45,000 to six winners. The jury is headed by Paul Steiger, founder of ProPublica.</p>
<p>
	There are three award categories awarded at both (i) national and international and (ii) local and regional levels to give a total of six prizes. The three categories are:</p>
<p>
	1. Data-driven investigative journalism<br />
	2. Data visualisation &amp; storytelling<br />
	3. Data-driven applications</p>
<p>
	&nbsp;The work submitted must have been published or aired between 11 April 2011 and 10 April 2012.</p>
<h3>
	<strong>How to apply?</strong></h3>
<p>
	As of yesterday, applicants are welcome to submit their best data journalism projects at <a href="http://datajournalismawards.org/submit-your-work/">http://datajournalismawards.org/submit-your-work/</a> The competition is open to media companies, non-profit organisations, freelancers and individuals until 10 April 2012. The six winning projects will be announced the annual News World Summit (NEWS!) hosted by the Global Editors Network, which will take place in Paris, France, from 30 May to 01 June 2012.</p>
<p>
	Find out more about the competition and how to apply at&nbsp;http://datajournalismawards.org/. Get in touch with the organisers by info@datajournalismawards.org, on Twitter using the hashtag #dja or by posting a message on the <a href="http://www.facebook.com/pages/Data-Journalism-Awards/305250662849826">DJA Facebook page</a>.</p>
<p>
	&nbsp;</p>
<p>
	<i>Image: EJC Director, Wilfried Ruetten, at DJA press conference,&nbsp;</i><i>January 19, Google, London</i></p>
]]></description> 
      <dc:date>2012-01-20T13:59:32+00:00</dc:date>
    </item>

    <item>
      <title>How to build news apps: Best practices from NPR</title>
      <link>http://datadrivenjournalism.net/news_and_analysis/how_to_build_news_apps_best_practices_from_npr</link>
      <guid>http://datadrivenjournalism.net/news_and_analysis/how_to_build_news_apps_best_practices_from_npr#When:12:07:46Z</guid>
      <description><![CDATA[<p>
	In the following video <a href="http://@stiles">Matt Stiles</a>, data journalist at <a href="http://www.npr.org/">NPR</a>, previously data applications editor at The Texas Tribune, talks about best practices in building news apps in an internal data webinar at NPR:</p>
<ul>
	<li>
		What is a news app?</li>
	<li>
		Who are news apps for?</li>
	<li>
		How do you decide on when to build an app?</li>
	<li>
		What should a good news app include?</li>
	<li>
		What are the successful and not so successful examples? What can be learned from them?</li>
	<li>
		How do you package apps?</li>
</ul>
<p>
	&nbsp;</p>
<p>
	<iframe allowfullscreen="" frameborder="0" height="350" mozallowfullscreen="" scrolling="no" src="http://player.vimeo.com/video/28728286?title=0&amp;byline=0&amp;portrait=0" webkitallowfullscreen="" width="600"></iframe></p>
<p>
	<a href="http://vimeo.com/28728286">Data Best Practices</a> from <a href="http://vimeo.com/user7691280">StateImpact</a> on <a href="http://vimeo.com">Vimeo</a>.</p>
<p>
	&nbsp;</p>
<p>
	You can read more about data practices at NPR on the <a href="http://stateimpact.npr.org/teamblog/">StateImpact blog</a>.</p>
]]></description> 
      <dc:date>2011-12-15T12:07:46+00:00</dc:date>
    </item>

    <item>
      <title>&#8216;Data journalism&#8217; draws the line between the quick and the dead</title>
      <link>http://datadrivenjournalism.net/news_and_analysis/data_journalism_draws_the_line_between_the_quick_and_the_dead</link>
      <guid>http://datadrivenjournalism.net/news_and_analysis/data_journalism_draws_the_line_between_the_quick_and_the_dead#When:09:20:55Z</guid>
      <description><![CDATA[<p>
	<em>Originally published by <a href="http://dailymaverick.co.za/opinionistas/mandy-de-waal">Mandy de Waal</a> on <a href="http://freeafricanmedia.com">Free African Media</a> on 5 December 2011. This article is republished with permission.</em></p>
<p>
	&nbsp;</p>
<p>
	<strong>As global newspaper audiences decline, some media organisations are latching onto emerging tools and open-source technologies to make news more relevant, personalised, and &#39;hyper-local&#39;. Investigative journalism is being reinvented and reporters are doing their jobs in unimagined ways. &#39;Data journalism&#39; is becoming the competitive differentiator for global news.&nbsp;</strong></p>
<p>
	When Gordon Patterson released South Africa&#39;s latest Audit Bureau of Circulation data, there was bad news and more bad news for most newspapers. Patterson expects 2012 to be the toughest year yet. Readerships are declining and fragmenting and, with disposable income levels plummeting in South Africa, consumer decision making was becoming illusive. As if that wasn&rsquo;t enough to put the owners of Avusa, Napers&rsquo; Media24 and Independent in the doldrums, ABC&rsquo;s circulation data showed more misery for the daily and weekend press.</p>
<p>
	The aggregated data showed circulation for daily newspapers dropped by 5.1% for the third quarter of 2011, when compared with the second quarter. The bloodletting was slowing marginally, but none-the-less figures were still headed downwards. Except for The Citizen, the worst pain was felt by English dailies, while Afrikaans dailies remained stable and vernacular press showed moderate growth.</p>
<p>
	Weekend newspapers like Rapport (down 7.33%), City Press (down 6.02%) and Sunday World (down 9.37) didn&rsquo;t fare much better. Sunday Times also dropped, but at 0.4% this decline was negligible. isiZulu Ilanga Langesonto showed good growth with a circulation increase of 6.45%.</p>
<p>
	Weekly newspapers, on the other hand, were smiling. Mail &amp;Guardian continued its growth trend and reported a pleasing 2.56% increase in circulation. Ilanga, the isiZulu weekly published by Independent, spiked by a massive 31.81%, underscoring the rich success of vernacular press.</p>
<p>
	As the likes of The Star, Sunday World and Sowetan face dwindling readerships, they could be asking themselves: &ldquo;What are those other newspapers doing right that are we doing wrong?&rdquo; The obvious answer in a sector where news has become a commodity is there&rsquo;s little distinction between competitive papers and most of these media brands have lost relevance. As these newspapers flail, there&rsquo;s a sea of emerging open-source technology freely available to media owners which could help hone content, increase relevance and add value to investigative reporting skills.</p>
<p>
	But most of the press aren&rsquo;t even dipping their toes in that water, despite the great interest across the continent in tech tools that enable better journalism, particularly investigative and data journalism. Ghana recently wrapped up an open data and data journalism conference which was <a href="http://www.ghana.gov.gh/index.php?option=com_content&amp;view=article&amp;id=9271:journalists-urged-to-utilise-data-in-reportage-&amp;catid=28:general-news&amp;Itemid=162">welcomed by the state</a> which appealed to reporters to use data to hold public officials more accountable &ndash; even as South Africa stares into the barrel of the Secrecy Bill which would do precisely the opposite.</p>
<p>
	The rest of Africa&rsquo;s interest in &ldquo;technology for journalism&rdquo; is, in part, being stimulated by Justin Arenstein, contracted by Google to help media owners build more robust business models and diversified revenue streams. Arenstein&rsquo;s helping media improve the quality of content and by showing publishers how better to engage audiences using new technologies.</p>
<p style="text-align: center; ">
	<img alt="" src="http://farm8.staticflickr.com/7166/6514933111_9179bb16e7.jpg" style="width: 500px; height: 330px; " /></p>
<p style="text-align: center; ">
	<em>Justin Arenstein (courtesy of Justin Arenstein)</em></p>
<p>
	&quot;A lot of what I&#39;m doing with Google is based on the lessons learnt from the collapse of audiences and the advertising model in the US and Europe. Africa escaped much of that disruption three years ago because of our dismal Internet penetration stats. We are, however, now starting to see some of the same seismic changes in Africa&#39;s mature media markets in SA, Kenya, and Nigeria. The causes are different, driven by mobile and not Internet usage, but the trends and results are the same,&quot; says Arenstein.</p>
<p>
	Interestingly enough Arenstein was fired from his first media management job at a local newspaper, because he insisted on equal pay for black and white journalists. The former reporter has won awards, set up Mpumalanga&rsquo;s first commercial radio station and helped establish a number of organisations to support investigative work on the continent. He also built African Eye News Service from the ground up to become southern Africa&rsquo;s largest independent newswire service and the first rural-based wire agency on the sub-continent.</p>
<p>
	Recently Arenstein took a year off to explore &ldquo;data journalism&rdquo;, &ldquo;civic engagement media&rdquo; and &ldquo;augmented reality&rdquo; as a Knight Fellow at Stanford University. The sabbatical led to his gig with Google, which sees him earning more than his fair share of frequent-flyer miles.</p>
<p>
	&ldquo;There&rsquo;s a huge range of brilliant open-source and non-Google tools out there just waiting to be used, and we actively help people find them,&rdquo; he says.</p>
<p>
	&ldquo;When the scandal broke about dodgy expense claims by members of the UK parliament, journalists were swamped by almost 500,000 pages of evidence and receipts. Instead of being overwhelmed, The Guardian <a href="http://mps-expenses.guardian.co.uk/">recruited - or crowdsourced - thousands of volunteers</a> to help their journalists comb through and verify the evidence. Then, they used the new tools for <a href="http://www.guardian.co.uk/news/datablog/2010/feb/05/mps-expenses-houseofcommons">data analysis and visualisation</a> to extract examples, identify trends and to identify <a href="http://www.guardian.co.uk/politics/2009/dec/16/mps-expenses-what-we-learned">conflicts of interest</a> in the data,&rdquo; says Arenstein.</p>
<p style="text-align: center; ">
	<img alt="" src="http://farm8.staticflickr.com/7019/6515054659_45402d16b5.jpg" style="text-align: center; width: 500px; height: 320px; " /></p>
<p style="text-align: center; ">
	<em>The Guardian - Data visualisation that illustrates where tax pounds are spent</em></p>
<p>
	An astounding 28,800 readers helped review almost 225,000 documents. By involving readers in the investigative phase of the story, and by granting access to the raw evidence, The Guardian built widespread public interest and momentum for a story that has run for well over a year. The media&rsquo;s focus on the corruption resulted in dismissals, repayments, and a reform of financial management systems in the British parliament.</p>
<p>
	&ldquo;But, these new tools are not just great for mobilising readers. They also help journalists do the kind of forensic analysis that we would previously have had to use consultants for. The Guardian&#39;s use of demographic data and mapping to <a href="http://www.guardian.co.uk/news/datablog/2011/aug/16/riots-poverty-map-suspects">compare the recent public rioting in British cities</a> with poverty data is a good example. <a href="http://www.guardian.co.uk/news/datablog+uk/london-riots">The results</a> have helped move the public debate beyond recriminations, to a more informed discussion. Why aren&#39;t we seeing South African media doing something similar with census and other data, to analyse the ongoing service delivery protests?&quot; he asks.</p>
<p>
	Arenstein concedes there are pitfalls. &ldquo;Journalists need to be very clear about why they&#39;re using any new tool. Does it add real, tangible value?&rdquo; he says.</p>
<p style="text-align: center; ">
	<img alt="" src="http://farm8.staticflickr.com/7018/6515055773_243ccca649.jpg" style="width: 500px; height: 373px; " /></p>
<p style="text-align: center; ">
	<em>The Guardian &ndash; Infographic of MP Expense Claim Scandal which explains how data was visualised</em></p>
<p>
	The real challenge is in making information relevant by augmenting narrative reporting with &ldquo;utility news&rdquo; tools. With the help of these readers can calibrate decisions on personal issues like where they live to who they vote for. &ldquo;Every year, the South Africa media devote massive resources and coverage to the national budget, but 99% of the reportage is boring, impenetrable or just plain irrelevant to ordinary citizens,&rdquo; he says.</p>
<p>
	The knee-jerk reaction from local journalists, he adds, is often that the data doesn&#39;t exist yet in Africa for this kind of journalism. Or that the technology is too expensive, or that audiences aren&#39;t ready. &quot;This simply isn&#39;t true. South African journalists are sitting on a goldmine of census and other socio-economic data going back decades. We could easily be using property, education, and health information to build things like <a href="http://www.theglobeandmail.com/news/technology/chinese-rival-puts-heat-on-rim-for-key-african-sales/article2251291/">SinCountyRankings</a> to help people compare lifestyle standards in their cities or neighbourhoods,&quot; Arenstein responds. &quot;Even when the data isn&#39;t perfect, there are free and easy-to-use tools like <a href="http://scraperwiki.com/">ScraperWiki</a> and <a href="http://code.google.com/p/google-refine/wiki/GettingStarted">Google Refine</a> to scrape and crunch data, and scores of tools to then visualise or analyse connections between different sets of data.&quot;</p>
<p>
	A few exceptions aside, South African news media are ignoring the wealth and depth of information available to them because they don&rsquo;t know how to analyse the data, and aren&rsquo;t au fait with the tools. Mail &amp; Guardian&rsquo;s amaBhugane and Andrew Trench&rsquo;s team at Media24 investigations are among the few local &ldquo;data journalism&rdquo; pioneers, and the results are telling. Late last year Trench, together with Jacques Pauw and Julian Rademeyer, broke the news of the <a href="http://www.citypress.co.za/SouthAfrica/ANCs-mine-grab-20101121">ANC&rsquo;s mining rights grab</a> through its investment front, Chancellor House.</p>
<p>
	In an environment where so much news that embarrasses the ANC comes from informants, the City Press expos&eacute; on the party&rsquo;s mine grab was a direct result of putting data mining tools to good use. &ldquo;A couple of months back the department of mineral resources published records of all mining and prospecting right applications in South Africa over the last couple of years,&rdquo; Trench writes on his blog &ldquo;<a href="http://www.andrewtrench.com/2010/11/21/data-crunching-mapping-tools-and-bit-scoop/">Notes from the revolution</a>&rdquo;.</p>
<p>
	&ldquo;As is the habit of SA government departments, the records were published in PDF format which makes them pretty useless for any useful intensive examination of the data.&rdquo; Fortunately Trench had taught himself Python, a relatively easy, but incredibly powerful programming language. Trench downloaded all the PDF files and created a Python script to enable rapid searches of the documents. Once relevant information was extracted the investigative journalist and his team could start getting a better handle on what was really going on.</p>
<p>
	When Trench and his team broke the story they reported that Chancellor House, through its Chancellor House Mineral Resources subsidiary, had scored prospecting rights to 560 sites in Northern Cape, North West and Mpumalanga. The lesson here is that the Secrecy Bill&rsquo;s intention to <a href="http://dailymaverick.co.za/article/2011-11-23-sa-investigative-greats-secrecy-bill-will-not-silence-us">intimidate whistle-blowers</a> will collapse in on itself as &ldquo;data journalism&rdquo; becomes increasingly important to expose hidden connections or connect the dots in a way that&rsquo;s not immediately evident. Data tools can become an important source for stories if informants start clamming up.</p>
<p>
	&quot;The building blocks are in place. All it takes is one data journalism project to go viral, and we could see a quantum shift in a very short period. The trick, of course, will be to build business models to support the new approach,&quot; says Arenstein. All the tools are there. Most of them are free. Now what&rsquo;s required is the will to use them.</p>
]]></description> 
      <dc:date>2011-12-15T09:20:55+00:00</dc:date>
    </item>

    <item>
      <title>African News Innovation Challenge: $1 million in funding, but more about brains than money</title>
      <link>http://datadrivenjournalism.net/news_and_analysis/african_news_innovation_challenge_1_million_in_funding_but_more_about_brain</link>
      <guid>http://datadrivenjournalism.net/news_and_analysis/african_news_innovation_challenge_1_million_in_funding_but_more_about_brain#When:08:21:19Z</guid>
      <description><![CDATA[<p>
	<em>Originally published by <a href="http://www.bizcommunity.com/Profile/TheresaMallinson_1">Theresa Mallinson</a> on <a href="http://freeafricanmedia.com">Free African Media</a> on 13 December 2011. This article is republished with permission.</em></p>
<h4>
	&nbsp;</h4>
<h4>
	It&#39;s called the African News Innovation Challenge, and it has $1 million to award in start-up grants. Anic, which was announced in October, and had its soft launch in last week, will formally launch in January. Theresa Mallinson talked to project manager Justin Arenstein about the thinking behind the initiative.</h4>
<p>
	&nbsp;</p>
<p>
	The premise of the African News Innovation Challenge is simple: it&#39;s a fund that will enable African news organisations to experiment in the digital technologies and other newsroom innovations, at no financial risk to themselves. But it&#39;s not about simply throwing money at problems; the projects that are chosen will also receive technical support and individual mentoring.</p>
<p style="text-align: center; ">
	<img alt="" src="http://farm8.staticflickr.com/7166/6514933111_9179bb16e7.jpg" style="width: 500px; height: 330px; " /></p>
<p style="text-align: center; ">
	<em>Justin Arenstein</em></p>
<p>
	Anic&#39;s project manager Justin Arenstein, who currently holds a Knight International Journalism Fellowship, is a passionate advocate of news innovation and, in particular, <a href="http://dailymaverick.co.za/article/2011-12-05-data-journalism-draws-the-line-between-the-quick-and-the-dead">data journalism</a>. &ldquo;People often say there are not many data sources in Africa; it&#39;s bullshit,&rdquo; Arenstein says. &ldquo;Even if it&#39;s not available in country, it&#39;s often available out of country.&rdquo; Institutions such as the International Monetary Fund and the World Bank have detailed country data available, even where individual governments don&#39;t make it readily accessible to their citizens. Arenstein says one of the aims of Anic is about &ldquo;realising the wealth of information that&#39;s available (and) the use it could have in the newsroom.</p>
<p>
	&ldquo;A lot of African media... just regurgitates,&rdquo; says Arensteins. &ldquo;We seldom have the tools to go out and aggressively test what they&#39;re saying.&rdquo; Elaborating further, he says: &ldquo;The problem is we (journalists) don&#39;t have the programming and IT skills necessarily and IT geeks don&#39;t have any concept of content, so they build fantastic tools, and they sink without a trace. The idea was how do we bring together these two very different constituencies?&rdquo;</p>
<p>
	Answering his own question, Arestein says that: &ldquo;One of the ways that&#39;s worked internationally is to dangle a very big carrot in front of them.&rdquo; And so the concept of the African News Innovation Challenge was born. Initially the Omdiyar Network put up $500,000 for the project, and this was soon matched. &ldquo;It actually only took us two-and-a-half months to raise a million dollars,&rdquo; says Arenstein. &ldquo;There is money available for innovation and experimentation.&rdquo;</p>
<p>
	The challenge will work like this: in January, at the formal launch, Anic&#39;s advisory council and judging panel will be announced. Arenstein says these two bodies already include people from the Bill &amp; Melinda Gates Foundation, the Knight Foundation, the International Centre for Journalists, and Google. In February, a call for entries will go out. &ldquo;We envision a two- to three-month window period, during which people will post their idea on a public forum, and we&#39;ll casually invite the rest of the industry to help refine that idea,&rdquo; says Arenstein. &ldquo;This is not really an initiative for people who want to build top-secret (projects). Ideas are cheap; it&#39;s actually execution that builds sucessful products and successful projects.&rdquo;</p>
<p>
	Winners will be announced in the third quarter of 2012, and then the hard work really begins. &ldquo;We&#39;ve tried to learn from what people have done in the US and Europe and elsewhere,&rdquo; says Arenstein. &ldquo;We&#39;ve looked at the innovation funds that seem to have had an impact, like the Knight News Challenge. They&#39;ve learnt from what works best, and what doesn&#39;t work best. Simply throwing money at a bunch of inonvators, and giving them all that money upfront, doesn&#39;t really help.&rdquo;</p>
<p>
	To this end, money given to Anic awardees will be released in tranches, with each project receiving between $12,500 to $100,000 depending on its scope. &ldquo;The model that we&#39;re looking at using is helping people sit down and very critically reassess their proposal, and break it down into bite sizes,&rdquo; says Arenstein. And if any of the start-ups fail, that won&#39;t be a problem; it&#39;s the nature of the business.</p>
<p>
	Innovators that are awarded funding will find themselves in an open environment. &ldquo;Ideas that win (can) evolve during course of development,&rdquo; says Arenstein, adding that development time will be anything between 12 and 18 months. &ldquo;We&#39;re not going to hold people religiously or fanatically to the original concept,&rdquo; he says. And they&#39;ll be given plenty of help and support along the way. &ldquo;Where there are gaps in their own team, we&#39;ll try and plug those gaps by bringing in external people,&rdquo; Arenstein says. &ldquo;And right from the beginning we&#39;re going to try and give them some of the world&#39;s best media economists, to develop some of the most robust models.&rdquo;</p>
<p>
	Commenting on the innovation the project intends to foster in African media, Arenstein says: &ldquo;You almost need a crisis where there&#39;s real pain in a marketplace, for people to go out and invent something to solve it.&rdquo; With the backing of the African Media Initiative, the Omidyar Network, the International Centre for Journalists, Google, the Knight Foundation, Konrad-Adenauer-Stiftung, the US Department of State, and the World Association of Newspapers and News Publishers &ndash; all of whom are contributing either funding or expertise &ndash; the African News Innovation Challenge has already garnered high-level support. Now it&#39;s up to the journalists and media workers of Africa to meet the challenge. We look forward to seeing what they come up with.&nbsp;</p>
]]></description> 
      <dc:date>2011-12-15T08:21:19+00:00</dc:date>
    </item>

    
    </channel>
</rss>
