Nieman Foundation at Harvard
HOME
          
LATEST STORY
This is how The New York Times is using bots to create more one-to-one experiences with readers
ABOUT                    SUBSCRIBE
Sept. 10, 2013, 12:58 p.m.
LINK: www.journalism.co.uk  ➚   |   Posted by: Joshua Benton   |   September 10, 2013

Sarah Marshall at journalism.co.uk:

A tool which helps non-coding journalists scrape data from websites has launched in public beta today.

Import.io lets you extract data from any website into a spreadsheet simply by mousing over a few rows of information.

Until now import.io, which we reported on back in April, has been available in private developer preview and has been Windows only. It is now also available for Mac and is open to all.

Although import.io plans to charge for some services at a later date, there will always be a free option.

The London-based start-up is trying to solve the problem of the fact that there is “lots of data on the web, but it’s difficult to get at”, Andrew Fogg, founder of import.io, said in a webinar last week.

Those with the know-how can write a scraper or use an API to get at data, Fogg said. “But imagine if you could turn any website into a spreadsheet or API.”

If learning the basics (and the not-so-basics) of scraping is of interest to you, I can recommend Paul Bradshaw’s Scraping for Journalists.

Show tags Show comments / Leave a comment
 
Join the 35,000 who get the freshest future-of-journalism news in our daily email.
This is how The New York Times is using bots to create more one-to-one experiences with readers
“I’m not worried about this technology driving the humanity out of journalism. I’m really excited about the promise of technology bringing more humanity to journalism.” Also: a Michael Barbaro bot.
These are the bots powering Jeff Bezos’ Washington Post efforts to build a modern digital newspaper
“It’s this great, simple experience, and the technology is getting so much better for it: AI’s getting better. big data’s more accessible.” Also: a Marty Baron bot.
The Information’s new Briefing is a continuous update of opinionated takes on other people’s articles
Briefing is meant to be more Politico Playbook than Techmeme. It’s updated around the clock, but is also being sent out as a daily email newsletter for subscribers.