Sept. 26, 2012, 10:43 a.m.
Mobile & Apps

New York Times, Washington Post developers team up to create Open Elections database

Standardized state-by-state elections database gets a Knight News Challenge boost.

Calling all data hounds!

Senior developers from The New York Times and The Washington Post are looking for volunteers to help collect more than 10 years of federal elections data from each state. With their help — and $200,000 in Knight News Challenge funding — Serdar Tumgoren and Derek Willis are working on creating a free, comprehensive source of official U.S. election results.

The goal is to end up with electoral data that can then be linked to different types of data sets — campaign finance, voter demographics, legislative histories, and so on — in ways that previously haven’t been possible on this scale.

Tumgoren, of The Washington Post, says the idea for Open Elections came from “mutual frustration that there is no single, free source of data — and more importantly, nicely standardized data.” Soothing this frustration isn’t necessarily going to be pretty. The task of finding state elections data — at least some of which will be a godawful, inextricable mess — will require some “brute-forcing,” Tumgoren says.

“If you look at Mississippi’s data, they make me not very happy — they make me sad, in fact,” Tumgoren said. “Just a sampling of a few states I randomly picked, they run the gamut from pretty good to oh-my-God-how-are-we-going-to-get-this-data.”

Tumgoren estimates it will take about two years to get to where he wants Open Elections to be, but the entire process will be open to the public. As data comes in, the team will clean it up, put it in a standardized format, and share it. What that format will be is still up in the air — as are many of the details, which Tumgoren says they’ll have to figure out as they begin to get a better sense of the state of the data they’ll get.

“There are going to be some states like Virginia that are wonderful and have very clean data,” Tumgoren said. “Other places — we don’t even know which ones yet — data is going to be less accessible because it’s not centralized or it’s in formats like image PDF.”

For now, the Open Elections team is building the infrastructure to begin collecting and sorting data. As they recruit volunteers, they’ll be looking for people who can dig up U.S. Senate, House, presidential, and gubernatorial election results from the past 10 years or so.

“This is such a big project we’re limiting the scope initially,” Tumgoren said. “Governor, Senate, House, president: Whatever else we can get, we’re not going to turn our noses up at it.” While they may not be able to clean up, link, and standardize data from other races, Tumgoren says his team will still work to centralize it.

“It’s just an untapped resource,” Tumgoren said. “The ability to do this is very limited right now. We almost don’t know what we don’t know. I have a vague sense of some of the questions I’d like to ask but I bet there are tons of journalists and developers who are going to think of things that never even occurred to me. The possibilities for so-called data mashups are limitless.”

Photo by DonkeyHotey used under a Creative Commons license.

Part of a series: Knight News Challenge 2012