Nieman Foundation at Harvard
HOME
          
LATEST STORY
The Wall Street Journal website — paywalled from the very beginning — turns 20 years old today
ABOUT                    SUBSCRIBE
Sept. 24, 2012, 9:55 a.m.

Pop Up Archive is open-source software to help producers preserve sound, painlessly

The Knight News Challenge winners want to help radio producers archive and organize their raw tape with simple web-based software.

Journalists, especially those who collect sound and pictures, are hoarders. To dump raw material onto a hard drive is effortless, thoughtless. To delete it — what if you need something again some day? — is harder. And to find that something again some day is, often, excruciating.

Anne Wootton and Bailey Smith, winners of a $300,000 Knight News Challenge grant, are the creators of Pop Up Archive. They are building open-source software and partnering with the Internet Archive to streamline the entire workflow of a radio producer — from ingestion to cataloging to eventual distribution.

“What’s closest to our heart is saving oral history. It’s actually easier to lose your material that’s digital than it is to lose something that was on a tape,” Smith told me. Data gets corrupted, or just lost. Thousands of files called 000001.WAV become meaningless to humans.

Wootton and Smith have developed plugins for Omeka, the open-source archiving and publishing software. Once the user drops raw audio files onto an FTP site, she can tag those files in Omeka with rich metadata (record date, air date, location, interviewer, interviewee) using a web-based form. Their software normalizes the metadata using PBCore, an emerging standard in public media. (For example: Is it called a story, feature, or piece?) The user could also upload copies of the audio to the Internet Archive, which has agreed to host the files free of charge and forever, as well as SoundCloud.

Pop Up Archive graphic

Wootton and Smith said their software is in alpha. Their first test case is The Kitchen Sisters, Nikki Silva and Davia Nelson, a pair of independent radio producers who have 30 years’ worth of tape, the last decade of it digital.

“They are a relatively small shop that, like many in public media, survive largely on grants and private funding, private donors and…lack any sort of archivist or technologist on staff,” Wootton said, “and yet are looking — especially as their production workflow is increasingly digital — to refine, reuse, repackage content, collaborate with others.”

The Kitchen Sisters’ stacks of filled-up hard drives amount to a “huge stew of dead media,” Silva says in a video promoting the project. “People call us and say, ‘Oh, you guys did that piece about the sonic memorial about 9/11, where is all that raw material? Where’s the original?”

Wootton and Smith are working with The Kitchen Sisters to upload all of their raw material for a public archive. That could be a boon for projects such as Blank on Blank, which mines old interviews and forgotten footage for new radio stories.

Wootton said she got interested in digital archiving as a student at Brown, where she led an effort to digitize the 120-year history of The Brown Daily Herald. She “saw these bound volumes of pages from 60 years ago just literally rotting on the shelves of the Brown library, and managed that project, did fundraising for that project, really enjoyed it — and was incredibly flustered by the fact that I didn’t understand a lot of the technology that they were using.”

That’s the kind of person they hope to reach with Pop Up Archive: someone good at creating content but who isn’t a professional archivist.

There’s one missing piece that would complete the perfect archive, of course: transcription. That would let people search through years of audio archives as though they were documents. Speech-to-text software is still too immature to translate audio into text on the cheap. PBS, for its part, is transcribing thousands of hours of video archives using a combination of advanced software and human editors. Amara, formerly Universal Subtitles, is a crowdsourced approach to transcription used by PBS NewsHour and others.

Wootton said she hopes their project would one day harmonize with the Public Media Platform, a so-called “super API” that would link content from all public media producers and distributors (PBS, NPR, APM, PRI, PRX). That long-planned project so far has not gotten off the ground.

The Knight grant will let the duo focus on the project full-time and, they hope, scale the project up to work for more producers and stations. Also on the team are researcher Christen Penny and software engineer Daniel Vizzini.

POSTED     Sept. 24, 2012, 9:55 a.m.
PART OF A SERIES     Knight News Challenge 2012
SHARE THIS STORY
   
Show comments  
Show tags
 
Join the 15,000 who get the freshest future-of-journalism news in our daily email.
The Wall Street Journal website — paywalled from the very beginning — turns 20 years old today
“From the very beginning it was very clear we needed to cover all the same concerns and sensibilities of the print Journal even though we were online and even though we were a young staff.”
Newsonomics: In the platform wars, how well are you armed?
“Think about platforms as fishing places where you can find large, engaged audiences and build a relationship with them by providing content. Then offer these users some other services off-platform.”
Wired’s making the long and slow switch to HTTPS and it wants to help other news sites do the same
With its HTTPS implementation, Wired’s starting with its security vertical and for users who pay for the ad-free version of the site.
What to read next
0
tweets
In the room where it happens: The host of NPR’s new show Embedded talks about news in podcast form
Kelly McEvers: “A lot of the great storytelling podcasts happen in the studio. I hope ours opens the door to people thinking more about what you can do in the field, when things don’t go as planned and are unexpected.”
0What a group of USC students learned shooting lots of VR video (hint: duct tape is involved)
The students traveled to Houston over spring break to shoot footage to accompany a ProPublica/Texas Tribune project on what a hurricane could do to the city.
0Audible, long known only for audiobooks, is branching out into podcasts — and news
The podcast/audio world has been waiting for Audible to make its big move into the space. It’s here, including original content from major publishers like The New York Times, The Wall Street Journal, and Jeff Bezos’ Washington Post.
These stories are our most popular on Twitter over the past 30 days.
See all our most recent pieces ➚
Encyclo is our encyclopedia of the future of news, chronicling the key players in journalism’s evolution.
Here are a few of the entries you’ll find in Encyclo.   Get the full Encyclo ➚
Minneapolis Star Tribune
Flipboard
Demand Media
Instapaper
The Guardian
AOL
Daily Kos
Quora
West Seattle Blog
GateHouse Media
El Faro
The Dish