Nieman Foundation at Harvard
HOME
          
LATEST STORY
How a titan of 20th-century journalism transformed the AP — and the news
ABOUT                    SUBSCRIBE
Sept. 26, 2012, 10:43 a.m.
Mobile & Apps

New York Times, Washington Post developers team up to create Open Elections database

Standardized state-by-state elections database gets a Knight News Challenge boost.

Calling all data hounds!

Senior developers from The New York Times and The Washington Post are looking for volunteers to help collect more than 10 years of federal elections data from each state. With their help — and $200,000 in Knight News Challenge funding — Serdar Tumgoren and Derek Willis are working on creating a free, comprehensive source of official U.S. election results.

The goal is to end up with electoral data that can then be linked to different types of data sets — campaign finance, voter demographics, legislative histories, and so on — in ways that previously haven’t been possible on this scale.

Tumgoren, of The Washington Post, says the idea for Open Elections came from “mutual frustration that there is no single, free source of data — and more importantly, nicely standardized data.” Soothing this frustration isn’t necessarily going to be pretty. The task of finding state elections data — at least some of which will be a godawful, inextricable mess — will require some “brute-forcing,” Tumgoren says.

“If you look at Mississippi’s data, they make me not very happy — they make me sad, in fact,” Tumgoren said. “Just a sampling of a few states I randomly picked, they run the gambit from pretty good to oh-my-God-how-are-we-going-to-get-this-data.”

Tumgoren estimates it will take about two years to get to where he wants Open Elections to be, but the entire process will be open to the public. As data comes in, the team will clean it up, put it in a standardized format, and share it. What that format will be is still up in the air — as are many of the details, which Tumgoren says they’ll have to figure out as they begin to get a better sense of the state of the data they’ll get.

“There are going to be some states like Virginia that are wonderful and have very clean data,” Tumgoren said. “Other places — we don’t even know which ones yet — data is going to be less accessible because it’s not centralized or it’s in formats like image PDF.”

For now, Open Elections is building the infrastructure to begin collecting and sorting data. As they recruit volunteers, they’ll be looking for people who can dig up U.S. Senate, House, presidential, and gubernatorial elections results from the past 10 years or so.

“This is such a big project we’re limiting the scope initially,” Tumgoren said. “Governor, Senate, House, president: Whatever else we can get, we’re not going to turn our noses up at it.” While they may not be able to clean up, link, and standardize data from other races, Tumgoren says his team will still work to centralize it.

“It’s just an untapped resource,” Tumgoren said. “The ability to do this is very limited right now. We almost don’t know what we don’t know. I have a vague sense of some of the questions I’d like to ask but I bet there are tons of journalists and developers who are going to think of things that never even occurred to me. The possibilities for so-called data mashups are limitless.”

Photo by DonkeyHotey used under a Creative Commons license.

POSTED     Sept. 26, 2012, 10:43 a.m.
SEE MORE ON Mobile & Apps
PART OF A SERIES     Knight News Challenge 2012
Show tags
 
Join the 60,000 who get the freshest future-of-journalism news in our daily email.
How a titan of 20th-century journalism transformed the AP — and the news
“If one man fails to file a story of a millionairess marrying a poor factory hand because that man understands such a story is not properly A.P. stuff, such an error of news judgment ought to be generally made known to other employees.”
The New York Times launches a free, geo-targeted extreme weather newsletter
Readers can opt in to receive morning emails explaining the level and type of extreme weather risk in up to four different places. The newsletter is free for everyone, not just subscribers.
Gannett journalists across the U.S. will strike on June 5
Gannett has around 200 newsrooms, and editorial employees at around two dozen of those will go on strike.