Nieman Foundation at Harvard
HOME
          
LATEST STORY
How The Washington Post built — and will be building on — its “Knowledge Map” feature
ABOUT                    SUBSCRIBE
Dec. 15, 2010, 1 p.m.

Coming soon to journalism: Matt Thompson sees the “Speakularity” and universal instant transcription

Editor’s Note: We’re wrapping up 2010 by asking some of the smartest people in journalism what the new year will bring.

We also want to hear your predictions: Take our Lab reader poll and tell us what you think we’ll be talking about in 2011. We’ll share those results later this week.

Here’s Matt Thompson, he of Newsless, Snarkmarket, and NPR fame.

At some point in the near future, automatic speech transcription will become fast, free, and decent. And this moment — let’s call it the Speakularity — will be a watershed moment for journalism.

So much of the raw material of journalism consists of verbal exchanges — phone conversations, press conferences, meetings. One of journalism’s most significant production challenges, even for those who don’t work at a radio company, is translating these verbal exchanges into text to weave scripts and stories out of them.

After the Speakularity, much more of this raw material would become available. It would render audio recordings accessible to the blind and aid in translation of audio recordings into different languages. Obscure city meetings could be recorded and auto-transcribed; interviews could be published nearly instantly as Q&As; journalists covering events could focus their attention on analyzing rather than capturing the proceedings.

Because text is much more scannable than audio, recordings automatically indexed to a transcript would be much quicker to search through and edit. Jon Stewart’s crew for The Daily Show uses expensive technology to process and search through the hundreds of hours of video the various news programs air each week. Imagine if that capability were opened up to citizens — if every on-air utterance of every pundit, politician, or policy wonk were searchable on Google.

The likeliest path to the Speakularity runs through Google. The company has already taken significant steps in this direction. They’ve trained their speech processing algorithms through the millions of queries submitted to Google 411, so that now, my Android phone is already pretty good at recognizing my voice commands. They automatically add captions to YouTube videos and transcribe voicemails through Google Voice. Developers can already call on Google’s voice recognition system when developing apps for Android devices.

The Speakularity itself probably won’t happen in 2011, but I think a key moment might. Let’s say that sometime in 2011, Google unveils a product called Google Transcribe. Not for charity, of course; better transcription = more relevant ads. The core of the product is a speech transcription API: send it audio and get back text in return. But there’s a front end to Transcribe where non-techies can get their mp3s auto-transcribed. Crucially, that app allows the user to manually correct the transcription (highlight a passage and it plays automatically), enabling a human feedback loop that makes the machine better and better over time. In addition to captioning, YouTube videos appear by default next to an automatically generated transcript that users can use for navigation, Debate-Viewer-style.

Constant social feedback plus machine learning could improve automatic speech transcription to the point where it’s finally ready for prime time. And when it does, the default expectation for recorded speech will be that it’s searchable and readable, nearly in the instant. I know this sounds totally retrograde, but I think it’s something like the future.

POSTED     Dec. 15, 2010, 1 p.m.
PART OF A SERIES     Predictions for Journalism 2011
SHARE THIS STORY
   
Show comments  
Show tags
 
Join the 15,000 who get the freshest future-of-journalism news in our daily email.
How The Washington Post built — and will be building on — its “Knowledge Map” feature
The Post is looking to create a database of “supplements” — categorized pieces of text and graphics that help give context around complicated news topics — and add it as a contextual layer across lots of different Post stories.
How 7 news organizations are using Slack to work better and differently
Here’s how Fusion, Vox, Quartz, Slate, the AP, The Times of London, and Thought Catalog are using Slack for workflow — and which features they wish the platform would add.
The New York Times built a robot to help make article tagging easier
Developed by the Times R&D lab, the Editor tool scans text to suggest article tags in real time. But the automatic tagging system won’t be moving into the newsroom soon.
What to read next
1119
tweets
New Pew data: More Americans are getting news on Facebook and Twitter
A new study from the Pew Research Center and Knight Foundation finds that more Americans of all ages, races, genders, education levels, and incomes are using Twitter and Facebook to consume news.
701Newsonomics: The halving of America’s daily newsrooms
If you’re lucky enough to have the right deep-pocketed owner buy your paper and steady it, you’ve won the lottery. If you’re in a town whose paper is owned by the better chains, or committed local ownership, your loss will probably be mitigated. Otherwise, you’re out of luck.
575How 7 news organizations are using Slack to work better and differently
Here’s how Fusion, Vox, Quartz, Slate, the AP, The Times of London, and Thought Catalog are using Slack for workflow — and which features they wish the platform would add.
These stories are our most popular on Twitter over the past 30 days.
See all our most recent pieces ➚
Encyclo is our encyclopedia of the future of news, chronicling the key players in journalism’s evolution.
Here are a few of the entries you’ll find in Encyclo.   Get the full Encyclo ➚
Drudge Report
New England Center for Investigative Reporting
The Chronicle of Higher Education
NPR
The New Yorker
Quora
The Boston Globe
Minneapolis Star Tribune
CBS News
Newser
Futurity
PolitiFact