Nieman Foundation at Harvard
HOME
          
LATEST STORY
How The Washington Post built — and will be building on — its “Knowledge Map” feature
ABOUT                    SUBSCRIBE
April 22, 2009, 10:56 p.m.

Blogs: One person’s curation is another person’s scraping

Curation has become a popular term in media circles, in the sense of a human editor who filters and selects content, and then packages it and delivers it to readers in some way. Many people (including me) believe that, in an era when information sources are exploding online, aggregation and curation of some kind is about the only service left that people might be willing to pay for. That’s why it’s been interesting to watch one prominent website — All Things Digital, the online blog property that is owned by the Wall Street Journal, but run as a separate entity by Kara Swisher and Walt Mossberg — wrestling with how to handle that kind of aggregation, amid criticism from some prominent bloggers that it has been doing it wrong.

As described by Andy “Waxy” Baio in an excellently reported roundup of the brouhaha, the fuss seemed to start with comments from Wall Street Journal editor Robert Thompson about how Google and other aggregators of news are “parasites” in the intestines of the Internet, because they republish the content of others and then make money from it. Pretty soon, some bloggers were pointing out that All Things Digital did exactly the same thing in a section called Voices — namely, published long excerpts from a variety of prominent bloggers, displayed in exactly the same way as the rest of the site’s content, and surrounded by ads.

Josh Schachter, founder of Delicious, noted this behaviour in a Twitter message, and Metafilter founder Matt Haughey said that “apparently The Wall Street Journal’s All Things D does a reblogging thing. I sure wish they asked me first though. That’s a hell of a lot of ads on my ‘excerpt’.” Merlin Mann, who blogs at 43folders, said on Twitter that “republishing online work without consent and wrapping it in ads is often called ‘feed scraping.’ At AllThingsD, it’s called ‘a compliment.”

In a conversation with Andy about the issue, Kara Swisher agreed that some of the excerpts were too long, and that the site didn’t really make it clear enough that these pieces were pulled from outside sources. This week, she wrote a long post describing how All Things D was changing the format of the Voices section to address some of these complaints, by shortening excerpts and providing better disclosure. “While we did not agree with all the complaints in the story,” she said, “the debate did make us realize we needed to be a lot clearer and more explicit about what we are doing, and to make those policies–which we had not posted in as much detail as we have, for example, about our ethics statements… more prominent and transparent.”

What I found particularly fascinating about this whole affair was the differing opinions on what All Things D had been doing. While Merlin Mann was incensed (and has since written a long polemic on the issue of re-use of his content), and Schachter and Haughey seemed miffed, some other writers that Andy Baio talked to said they were very happy to have the WSJ-owned site link to their content — something that they saw as a promotional effort on their behalf. “I think it’s helpful in driving some additional traffic to my blog,” said Eric Savitz of Tech Trader Daily (which is part of Barron’s, which in turn is owned by Dow Jones, parent of the WSJ). “It also gets me some higher visibility with a valuable audience. I have no complaints at all.” (Full disclosure: I know Kara Swisher and have had my content excerpted on the ATD site, and I was quite happy with the arrangement)

In a nutshell, this is why re-use, and the related concept of fair use, is such a tough nut to crack. Google uses a few sentences from a newspaper article and links back to the paper site — some papers see that as beneficial, but others see that as Google stealing something, and trying to repay them with a few cheap trinkets (i.e., web traffic). A site like All Things D excerpts content that it sees as worthy, and displays it in such a way as to elevate it to the same status as its own content — some writers see that as a favour and are happy to receive it, while others feel the site is taking something without permission and trying to give the impression that it’s theirs. Some authors want Google to scan and display chunks of their books, so that readers can find them — others see that as copyright infringement or even outright theft.

I wish I had the answer to this problem, but I don’t. Obviously, checking with the author before you excerpt something — which All Things D apparently didn’t do in these cases — is one way to avoid problems. But how is Google supposed to do that? How is any sufficiently large aggregator or curator supposed to do that? Should the onus be on the aggregator to ask, or should the onus be on the content creator to protest and ask that it be removed? Lots of questions, very few hard answers.

POSTED     April 22, 2009, 10:56 p.m.
SHARE THIS STORY
   
Show comments  
Show tags
 
Join the 15,000 who get the freshest future-of-journalism news in our daily email.
How The Washington Post built — and will be building on — its “Knowledge Map” feature
The Post is looking to create a database of “supplements” — categorized pieces of text and graphics that help give context around complicated news topics — and add it as a contextual layer across lots of different Post stories.
How 7 news organizations are using Slack to work better and differently
Here’s how Fusion, Vox, Quartz, Slate, the AP, The Times of London, and Thought Catalog are using Slack for workflow — and which features they wish the platform would add.
The New York Times built a robot to help make article tagging easier
Developed by the Times R&D lab, the Editor tool scans text to suggest article tags in real time. But the automatic tagging system won’t be moving into the newsroom soon.
What to read next
1119
tweets
New Pew data: More Americans are getting news on Facebook and Twitter
A new study from the Pew Research Center and Knight Foundation finds that more Americans of all ages, races, genders, education levels, and incomes are using Twitter and Facebook to consume news.
788Newsonomics: The halving of America’s daily newsrooms
If you’re lucky enough to have the right deep-pocketed owner buy your paper and steady it, you’ve won the lottery. If you’re in a town whose paper is owned by the better chains, or committed local ownership, your loss will probably be mitigated. Otherwise, you’re out of luck.
575How 7 news organizations are using Slack to work better and differently
Here’s how Fusion, Vox, Quartz, Slate, the AP, The Times of London, and Thought Catalog are using Slack for workflow — and which features they wish the platform would add.
These stories are our most popular on Twitter over the past 30 days.
See all our most recent pieces ➚
Encyclo is our encyclopedia of the future of news, chronicling the key players in journalism’s evolution.
Here are a few of the entries you’ll find in Encyclo.   Get the full Encyclo ➚
Sacramento Press
Houston Chronicle
The Wall Street Journal
Fwix
Baristanet
Bayosphere
Crosscut
The Christian Science Monitor
New West
Hacks/Hackers
Yahoo
New York