Nieman Foundation at Harvard
HOME
          
LATEST STORY
The Wall Street Journal website — paywalled from the very beginning — turns 20 years old today
ABOUT                    SUBSCRIBE
Sept. 26, 2012, 10:43 a.m.
Mobile & Apps
donkey-elephant-cc

New York Times, Washington Post developers team up to create Open Elections database

Standardized state-by-state elections database gets a Knight News Challenge boost.

Calling all data hounds!

Senior developers from The New York Times and The Washington Post are looking for volunteers to help collect more than 10 years of federal elections data from each state. With their help — and $200,000 in Knight News Challenge funding — Serdar Tumgoren and Derek Willis are working on creating a free, comprehensive source of official U.S. election results.

The goal is to end up with electoral data that can then be linked to different types of data sets — campaign finance, voter demographics, legislative histories, and so on — in ways that previously haven’t been possible on this scale.

Tumgoren, of The Washington Post, says the idea for Open Elections came from “mutual frustration that there is no single, free source of data — and more importantly, nicely standardized data.” Soothing this frustration isn’t necessarily going to be pretty. The task of finding state elections data — at least some of which will be a godawful, inextricable mess — will require some “brute-forcing,” Tumgoren says.

“If you look at Mississippi’s data, they make me not very happy — they make me sad, in fact,” Tumgoren said. “Just a sampling of a few states I randomly picked, they run the gambit from pretty good to oh-my-God-how-are-we-going-to-get-this-data.”

Tumgoren estimates it will take about two years to get to where he wants Open Elections to be, but the entire process will be open to the public. As data comes in, the team will clean it up, put it in a standardized format, and share it. What that format will be is still up in the air — as are many of the details, which Tumgoren says they’ll have to figure out as they begin to get a better sense of the state of the data they’ll get.

“There are going to be some states like Virginia that are wonderful and have very clean data,” Tumgoren said. “Other places — we don’t even know which ones yet — data is going to be less accessible because it’s not centralized or it’s in formats like image PDF.”

For now, Open Elections is building the infrastructure to begin collecting and sorting data. As they recruit volunteers, they’ll be looking for people who can dig up U.S. Senate, House, presidential, and gubernatorial elections results from the past 10 years or so.

“This is such a big project we’re limiting the scope initially,” Tumgoren said. “Governor, Senate, House, president: Whatever else we can get, we’re not going to turn our noses up at it.” While they may not be able to clean up, link, and standardize data from other races, Tumgoren says his team will still work to centralize it.

“It’s just an untapped resource,” Tumgoren said. “The ability to do this is very limited right now. We almost don’t know what we don’t know. I have a vague sense of some of the questions I’d like to ask but I bet there are tons of journalists and developers who are going to think of things that never even occurred to me. The possibilities for so-called data mashups are limitless.”

Photo by DonkeyHotey used under a Creative Commons license.

POSTED     Sept. 26, 2012, 10:43 a.m.
SEE MORE ON Mobile & Apps
PART OF A SERIES     Knight News Challenge 2012
SHARE THIS STORY
   
Show comments  
Show tags
 
Join the 15,000 who get the freshest future-of-journalism news in our daily email.
The Wall Street Journal website — paywalled from the very beginning — turns 20 years old today
“From the very beginning it was very clear we needed to cover all the same concerns and sensibilities of the print Journal even though we were online and even though we were a young staff.”
Newsonomics: In the platform wars, how well are you armed?
“Think about platforms as fishing places where you can find large, engaged audiences and build a relationship with them by providing content. Then offer these users some other services off-platform.”
Wired’s making the long and slow switch to HTTPS and it wants to help other news sites do the same
With its HTTPS implementation, Wired’s starting with its security vertical and for users who pay for the ad-free version of the site.
What to read next
0
tweets
In the room where it happens: The host of NPR’s new show Embedded talks about news in podcast form
Kelly McEvers: “A lot of the great storytelling podcasts happen in the studio. I hope ours opens the door to people thinking more about what you can do in the field, when things don’t go as planned and are unexpected.”
0What a group of USC students learned shooting lots of VR video (hint: duct tape is involved)
The students traveled to Houston over spring break to shoot footage to accompany a ProPublica/Texas Tribune project on what a hurricane could do to the city.
0Audible, long known only for audiobooks, is branching out into podcasts — and news
The podcast/audio world has been waiting for Audible to make its big move into the space. It’s here, including original content from major publishers like The New York Times, The Wall Street Journal, and Jeff Bezos’ Washington Post.
These stories are our most popular on Twitter over the past 30 days.
See all our most recent pieces ➚
Encyclo is our encyclopedia of the future of news, chronicling the key players in journalism’s evolution.
Here are a few of the entries you’ll find in Encyclo.   Get the full Encyclo ➚
New York
Seattle Post-Intelligencer
El País
Hearst
San Diego News Network
Hacks/Hackers
Associated Press
Center for Public Integrity
PubliCola
The Washington Post
INDenverTimes
Kaiser Health News