Nieman Foundation at Harvard
HOME
          
LATEST STORY
Cold, hard numbers will drive the stories on this Internet-crawling company’s new media arm
ABOUT                    SUBSCRIBE
June 13, 2014, 2:07 p.m.
Reporting & Production
LINK: www.nickdiakopoulos.com  ➚   |   Posted by: Joshua Benton   |   June 13, 2014

Robot journalists are having a moment. The idea of narrative stories built by algorithms opens up a lot of possibilities — and taps into a lot of fears for human journalists worried about becoming outmoded.

But many of the most prominent algorithms doing journalism are black boxes, unknowable to the public. How do robot journalists do their work?

Nick Diakopoulos has an interesting piece that takes advantage of patent filings to outline the basics. Based on what Narrative Science has filed, here’s the basic outline: “(1) ingest data, (2) compute newsworthy aspects of the data, (3) identify relevant angles and prioritize them, (4) link angles to story points, and (5) generate the output text.” (He’s got far more detail in his post.)

Some parts of that are straightforward; others, like how “newsworthiness” is determined, can be more contentious:

From my reading, I’d have to say that the Narrative Science patent seems to be the most informed by journalism. It stresses the notion of newsworthiness and editorial in crafting a narrative…What still seems to be lacking though is a broader sense of newsworthiness besides “deviance” in these algorithms. Harcup and O’Neill identified 10 modern newsworthiness values, each of which we might make an attempt at mimicking in code: reference to the power elite, reference to celebrities, entertainment, surprise, bad news, good news, magnitude (i.e. significance to a large number of people), cultural relevance to audience, follow-up, and newspaper agenda. How might robot journalists evolve when they have a fuller palette of editorial intents available to them?

Show tags Show comments / Leave a comment
 
Join the 45,000 who get the freshest future-of-journalism news in our daily email.
Cold, hard numbers will drive the stories on this Internet-crawling company’s new media arm
Fintech company Thinknum has lots of interesting data to sell access to. Now it wants to build public-facing stories out of it.
At The Boston Globe, the editorial pages are looking for new ways to engage readers
“We learned how important it is to have writers and editors and digital producers working collaboratively, near each other. It’s a model for the future.”
Americans say greater access to news sources is actually making it harder to stay informed
But they’re evenly split on whether or not the news selection algorithms on sites like Facebook and Twitter should be regulated.