Another new tool for journalists to scrape web content

LINK: www.journalism.co.uk ➚ | Posted by: Joshua Benton | September 10, 2013

A tool which helps non-coding journalists scrape data from websites has launched in public beta today.

Import.io lets you extract data from any website into a spreadsheet simply by mousing over a few rows of information.

Until now import.io, which we reported on back in April, has been available in private developer preview and has been Windows only. It is now also available for Mac and is open to all.

Although import.io plans to charge for some services at a later date, there will always be a free option.

The London-based start-up is trying to solve the problem of the fact that there is “lots of data on the web, but it’s difficult to get at”, Andrew Fogg, founder of import.io, said in a webinar last week.

Those with the know-how can write a scraper or use an API to get at data, Fogg said. “But imagine if you could turn any website into a spreadsheet or API.”

If learning the basics (and the not-so-basics) of scraping is of interest to you, I can recommend Paul Bradshaw’s Scraping for Journalists.

Show tags

Sarah Scire

Seeking “innovative,” “stable,” and “interested”: How The Markup and CalMatters matched up

Nonprofit news has seen an uptick in mergers, acquisitions, and other consolidations. CalMatters CEO Neil Chase still says “I don’t think we’ve seen enough yet.”

Jonathan Stray

“Objectivity” in journalism is a tricky concept. What could replace it?

“For a long time, ‘objectivity’ packaged together many important ideas about truth and trust. American journalism has disowned that brand without offering a replacement.”

Renee DiResta, Abhiram Reddy, and Josh A. Goldstein

From shrimp Jesus to fake self-portraits, AI-generated images have become the latest form of social media spam

Within days of visiting the pages — and without commenting on, liking, or following any of the material — Facebook’s algorithm recommended reams of other AI-generated content.