What’s the global data journalism community tweeting about this week? GIJN's NodeXL #ddj mapping from April 8 to 14 finds a zine focused on machine learning-powered investigative journalism produced by @bxrobertz, a video explainer from @FT on if big corporations are really generous or just avoiding taxes, @knowtheory and @amandabeereviewing seven optical character recognition tools, and @workbenchdata with a tutorial on visualizing @Twitter data.
AI-Powered Investigative Journalism
Computational journalist Brandon Roberts produced an interesting zine on how machine learning can be applied to investigative journalism. In the first issue, he interviews the Minneapolis Star Tribune’s Chase Davis, profiles a new web scraping tool and dissects a machine learning-powered investigation into local property tax evasion.
Just checked out this awesome first edition of Artificial Informer, a web zine about machine learning and investigative journalism. Slick design and a great interview with @chasedavis https://t.co/AfvoFrBPvF
— Ben Hancock (@benghancock) April 10, 2019
Philanthropy or Tax Avoidance?
At the World Economic Forum this year, economic historian Rutger Bregman questioned whether big corporations and their bosses were truly being generous in donating billions to causes, or just avoiding taxes. Financial Times’ journalists Federica Cocco and John Burn-Murdoch attempt to investigate Bregman’s question and explain their findings with sketched charts, Monopoly money and candy.
An interesting (and accessible) description of just how much corporations are paying in tax versus what they are giving away philanthropically. An interesting add-on to what historian Rutger Bregman argued at Davos a few months ago. https://t.co/mgEFOW5jLX #philanthropy #davos
— Thoughtful Philanthropy (@laurenjanus) April 11, 2019
Extracting Data From PDFs
Trying to get data out from pesky PDFs but not sure which tool to use? Ted Han and Amanda Hickman, from Factful, went through seven optical character recognition tools so that you don’t have to. Here is their side-by-side comparison and review of the tools.
Nice comparison of #OCR engines: https://t.co/c74qysNx8I #ddj
— Timo Grossenbacher (@grssnbchr) April 9, 2019
Visualize Twitter Data
Want to analyze and visualize Twitter data in four steps? Workbench prepared a tutorial on how to use the Twitter API to load tweets and associated data from an account and then visualize results of how often that account tweets a specific word. Data Journalism Turkey translated the tutorial into Turkish here.
Here's detailed explainer on how @sergiospagnuolo used Workbench to power a Twitter monitoring bot - https://t.co/D8M1NoUUJO …
— Workbench (@workbenchdata) April 4, 2019
Get started working with Twitter data in few minutes with this tutorial:https://t.co/rQQL6PreQT … ⚡️#ddj https://t.co/O6xNUXGc26
Google’s Data Offerings
Google News Initiative is helping to boost the field of data journalism with more trainings, online resources and tools. What’s coming: free data training for local newsrooms in the United States and Canada, in partnership with Investigative Reporters and Editors, as well as Google tools training in collaboration with the Society of Professional Journalists. Also, data journalism MOOCs will be launched with the Knight Center for Journalism in the Americas this fall.
SPJ and @GoogleNewsInit expect to train an estimated 4,000 journalists at major conferences, conventions and regional SPJ events, reaching an estimated 4,000 journalists by March 2020. @nickdigitalhttps://t.co/0v9X5BZYs8 pic.twitter.com/Eii1VqFUP5
— Society of Professional Journalists (@spj_tweets) April 10, 2019
Python for Journalists Course
Datajournalism.com has released a four-course module on Python for data journalism. Listen to data journalism trainer Winny de Jong teach you how to set up Python on your computer, clean up messy datasets, analyze data and conduct web scraping. Best of all, the course is free!
Python for Journalists - FREE video series! 4+ hours, videos, very clear. https://t.co/Trs6Lvzndt #python #journalism
— Mindy McAdams (@macloo) April 4, 2019
Unequal Income Distribution
“The more beautiful the view, the higher the income inequality.” SRF Data took a hard look at wage differentials in Switzerland and created an interactive map showing the income distribution in each community. The most uneven income distribution was found in Anières, a municipality in the canton of Geneva. (In German.)
Einkommensverteilung in der 🇨🇭
— Sandro Lüscher (@sandroluescher) April 11, 2019
▪️Einkommensstarke & ungleiche Gemeinden in Seegebieten
▪️Einkommensstarke & gleiche Gem. v.a. im Mittelland/Agglo
▪️Einkommensschwache & ungleiche Gem. in Tourismusregionen
▪️Einkommensschwache & gleiche Gem. auf dem Land.https://t.co/HJedKEgY7g
The UK’s Gender Pay Gap
BBC journalists have dug into data reported by British firms on the difference between what they pay men and women. They found that 8,124 companies pay men more while just 1,424 pay women more.
The numbers tell the real story. Lack of balance at senior levels across all industries is the key driver of the gender pay gap. That’s the action we need to focus on as businesses @BalanceInBizIE @30percentclubIE https://t.co/iecBEo66xm
— Gillian Harford (@gillian_harford) April 10, 2019
Relationship Between Pharma and Doctors
In Switzerland, pharmaceutical companies inject millions into medical companies that conduct training for doctors. Le Temps dives into the issues surrounding the relationship between big pharma and the medical community, and finds a debate casting doubt on the independence and impartiality of doctors. Here’s how the team investigated this story. (In French.)
Excellente enquête sur ces financements qui jettent le doute sur l'impartialité de nos médecins: Les sociétés médicales infiltrées par les pharmas, par @sylvielogean et @MarieMaurissehttps://t.co/D8RxGDcl77 par @sylvielogean
— Koller Frédéric (@frederickoller) April 11, 2019
Goal Psychology
Does a goal scored just before half-time significantly affect the outcome of the game? Der Spiegel analyzed data from more than 45,000 matches from the four European leagues to find out.
In German football journalism a goal scored just before half time (min 45 and extra time) is referred to as 'psychologically important'. @PatrickStotz analysed the data and it turns out that such goals are actually very impactful. Source: https://t.co/tJ4dVdo9un pic.twitter.com/Tr4hoDo80v
— Simon Kuestenmacher (@simongerman600) April 14, 2019
This article was originally published on GIJN. It was republished on IJNet with permission.
Thanks, once again, to Marc Smith of Connected Action for gathering the links and graphing them. The Top Ten #ddj list is curated weekly.
Eunice Au is GIJN’s program coordinator. Previously, she was a Malaysia correspondent for Singapore’s The Straits Times, and a journalist at the New Straits Times. She has also written for The Sun, Malaysian Today and Madam Chair.
Main image CC-licensed by Unsplash via Markus Spiske.