Historians and journalists alike have long prized one source of information above all others: the on-the-record, primary source. They scour the attics for diaries and journals and fly across the country for interviews. But now we have a glut of documentation.
Thanks to social media, millions of people go on the record, publicly, every single day. People sent billions of tweets, Facebook posts, and WhatsApp messages last year. They have expressed wonder 😮, anger 😤, and love ❤️ for social and political issues.
At present, most journalists treat social sources like they would any other — individual anecdotes and single points of contact. But to do so with a handful of tweets and Instagram posts is to ignore the potential of hundreds of millions of others.
Many stories lay dormant in the vast amounts of data produced by everyday consumers because journalists are still only starting to acquire the large-scale data-wrangling expertise needed to tap them. As more and more people conduct their lives online, and as smartphones are penetrating previously unconnected regions around the world, this trove of stories is only becoming larger.
The kinds of stories journalists can tell using this data are wide ranging. We can reconstruct online encounters in ways more precise than a source may recall from memory. Le Monde, for instance, retraced the journey of Syrian refugees through their WhatsApp messages. At Al Jazeera America, we analyzed and chronicled the evolution of a Hong Kong pro-democracy movement in Facebook chatrooms.
Journalists can try to find insight to people’s personality and character or hold powerful people accountable. Data scientist David Robinson did a sentiment analysis of Donald Trump’s tweets and found that Trump’s own tweets are much more negative than those his campaign staff tweeted. My colleague Charlie Warzel and I looked at the links Trump tweeted to explore the news he chooses to circulate, as a proxy for the news he may consume.
Journalists can examine the ways in which technology disadvantages groups by looking at social data. ProPublica’s Julia Angwin and Terry Parris Jr. bought a Facebook ad and established that the social media company allows advertisers to exclude customers based on race, while Vox’s Alvin Chang expanded on ProPublica’s analysis by looking at whether Facebook’s algorithm excludes already disadvantaged populations from being offered opportunities that their more affluent counterparts receive.
When journalists venture into this kind of story mining, I hope that they also continue to discuss the ethics surrounding it, the blurred lines between what is considered public and what is considered private, and the caveats that come with each dataset.
Last but not least, I hope that journalists will dig into social data to gain insights into whom they reach and, perhaps more importantly, whom they do not reach. People live in their filtered worlds in which algorithms serve them information that tends to affirm rather than question political views.
Maybe social data can allow us to understand these bubbles better in an effort to pierce them.
Lam Thuy Vo is a fellow in BuzzFeed’s Open Lab for Journalism, Technology, and the Arts.
Umbreen Bhatti A sense of journalists’ humanity
Hillary Frey Forests need to burn to regrow
Swati Sharma Failing diversity is failing journalism
Megan H. Chan Cultural reporting goes mainstream
Mario García Virtual reality on mobile leaps forward
Aja Bogdanoff Comments start pulling their weight
Richard Tofel The country doesn’t trust us — but they do believe us
Francesco Marconi The year of augmented writing
Matt Karolian AI improves publishing
Doris Truong Connecting with diverse perspectives
Cory Haik Navigating power in Trump’s America
Moreno Cruz Osório The year of transparency in Brazilian journalism
Dan Colarusso Let’s make live video we can love
Kathleen Kingsbury Print as a premium offering
Andrew Losowsky Building our own communities
Vivian Schiller Tested like never before
Emi Kolawole From empathy to community
Ståle Grut The battle for high-quality VR
Ken Schwencke Disaggregation and collection
Bill Adair The year of the fact-checking bot
Keren Goldshlager Defining a focus, and then saying no
Carrie Brown-Smith We won’t do enough
Jeremy Barr A terrible year for Tiers B through D
Andrew Ramsammy Rise of the rebel journalist
Renée Kaplan Pure reach has reached its limit
Nushin Rashidian A rise in high-price, high-value subscriptions
Joanne Lipman The year of the drone, really
Pablo Boczkowski Fake news and the future of journalism
Tanya Cordrey The resurgence of reach
Alberto Cairo Communicating uncertainty to our readers
Rebekah Monson Journalism is community-as-a-service
Taylor Lorenz “Selfie journalism” becomes a thing
Erin Millar The bottom falls out of Canadian media
Annemarie Dooling UGC as a path out of the bubble
Andrew Haeg The year of listening
Juan Luis Sánchez Your predictions are our present
Mary Walter-Brown Getting comfortable asking for money
Peter Sterne A dangerous anti-press mix
P. Kim Bui The year journalism teaches again
Anita Zielina The sales funnel reaches (and changes) the newsroom
Jonathan Hunt Measurement companies get with the times
Emily Goligoski Incorporating audience feedback at scale
Mary Meehan Feeling blue in a red state
Liz Danzico The triumph of the small
Tim Griggs The year we stop taking sides
Mathew Ingram The Faustian Facebook dance continues
Margarita Noriega From pinning tweets to tweeting pins
Millie Tran International expansion without colonial overtones
Alexis Lloyd Public trust for private realities
Sydette Harry Facing journalism’s history
Tim Herrera The safe space of service journalism
Carla Zanoni Prioritizing emotional health
Rachel Sklar Women are going to get loud
Dannagal G. Young The return of the gatekeepers
Rasmus Kleis Nielsen News after advertising may look like news before advertising
Melody Kramer Radically rethinking design
Laura Walker Authentic voices, not fake news
Helen Havlak Chasing mobile search results
Ray Soto VR moves from experiments to immersion
Olivia Ma The year collaboration beats competition
S.P. Sullivan Baking transparency into our routines
Liz McMillen The year of deep insights
Gabriel Snyder The aberration of 20th-century journalism
Rachel Schallom Stop flying over the flyover states
Juliette De Maeyer and Dominique Trudel A rebirth of populist journalism
Matt Waite The people running the media are the problem
Eric Nuzum Podcasting stratifies into hard layers
Mike Ragsdale A smarter information diet
Sarah Wolozin Virtual reality on the open web
Maria Bustillos “It’s true — I saw it on Facebook”
Sara M. Watson There is no neutral interface
Ryan McCarthy Platforms grow up or grow more toxic
Jon Slade Trusted news, at a premium
Adam Thomas The coming collaboration across Europe
Bill Keller A healthy skepticism about data
Nicholas Quah Podcasting’s coming class war
Mira Lowe News literacy, bias, and “Hamilton”
Tressie McMillan Cottom A path through the media’s coming legitimacy crisis
Michael Kuntz Trust is the new click
Steve Henn The next revolution is voice
M. Scott Havens Quality advertising to pair with quality content
Alice Antheaume A new test for French media
Scott Dodd Nonprofits team up for impact
Michael Oreskes Reversing the erosion of democracy
Robert Hernandez History will exclude you, again
David Chavern Fake news gets solved
Ole Reißmann Un-faking the news
Corey Ford The year of the rebelpreneur
Errin Haines Chaos or community?
Ernst-Jan Pfauth Earn trust by working for (and with) readers
Tracie Powell Building reader relationships
Sarah Marshall Focusing on the why of the click
Molly de Aguiar Philanthropists galvanize around news
Mandy Velez The audience is the source and the story
Katie Zhu The year of minority media
Almar Latour Thanks, #fakenews
Julia Beizer Building a coherent core identity
Geetika Rudra Journalism is community
Rubina Madan Fillion Snapchat grows up
Andy Rossback The year of the user
Jonathan Stray A boom in responsible conservative media
Kawandeep Virdee Moving deeper than the machine of clicks
David Weigel A test for online speech
Samantha Barry Messaging apps go mainstream
Erin Pettigrew A year of reflection in tech
Amy Webb Journalism as a service
Cindy Royal Preparing the digital educator-scholar hybrid
Christopher Meighan Unlocking a deeper mobile experience
Andrea Silenzi Podcasts dive into breaking news analysis
Burt Herman Local news gets interesting
Lee Glendinning A call for great editing
Javaun Moradi What can we own?
Nathalie Malinarich Making it easy
Reyhan Harmanci Bear witness — but then what?
Valérie Bélair-Gagnon Truthiness in private spaces
Lam Thuy Vo The primary source in the age of mechanical multiplication
An Xiao Mina 2017 is for the attention innovators
Dhiya Kuriakose The year of digital detoxing
Ariane Bernard Better data about your users
Sue Schardt Objectivity, fairness, balance, and love
Ashley C. Woods Local journalism will fight a new fight
Dan Gillmor Fix the demand side of news too
Zizi Papacharissi Distracted journalism looks in the mirror
Asma Khalid The year of the newsy podcast
Libby Bawcombe Kids board the podcast train
Amy O'Leary Not just covering communities, reaching them
Elizabeth Jensen Trust depends on the details
Amie Ferris-Rotman Вслед за Россией
Priya Ganapati Mobile websites are ready for reinvention
Claire Wardle Verification takes center stage
Guy Raz Inspiration and hope will matter more than ever
Sam Ford The year we talk about our awful metrics