HOME
          
LATEST STORY
Newsonomics: Buying Yelp — and making it the next core of the local news and information business
ABOUT                    SUBSCRIBE
Aug. 13, 2009, 3:22 p.m.

What The Associated Press’ tracking beacon is — and what it isn’t

So what about THE BEACON?

When The Associated Press said last month that it was building a “news registry” of AP content, most reaction focused on the so-called “tracking beacon” that will monitor usage across the web. I use quotation marks because, well, those are metaphors for technology that’s still in development: The AP document we’ve obtained says the registry, set to launch on Nov. 15, will “require capabilities not currently available.”

But there’s nothing particularly magical about the beacon, which will amount to JavaScript embedded in the online feeds that are distributed to clients. So when you read an AP article on the New York Times website, a script running in the background will take note of that usage. (It’s unclear how news organizations like the Times, which is particularly neurotic about the weight of its pages, will feel about the script.)

Tracking readers

The point, of course, is to identify uses of AP and potentially member content that isn’t licensed. So if someone copied an article’s source code onto his own site, by hand or automation, the beacon would follow along and, according to the document distributed to some AP members, “send reports back to the core database each time the item is clicked on by an end user. The beacon will identify each piece of content, the IP address of the content viewer, the referring Web server and the time of use.”

I immediately flagged “IP address of the content viewer.” In recent years, the recording industry has used the IP addresses of downloaders to pursue legal action against people sharing music online, leading to lots of ill will toward the RIAA. That said, recording such data isn’t all that unusual. Websites using basic analytics software already record the IP addresses of their users.

When I asked the AP’s general counsel, Srinandan Kasi, about it, he said the AP wasn’t interested in monitoring who specifically reads their content on unauthorized sites: “In writing this” — he meant the document — “obviously, theoretically anything is possible. But what you actually make the final available piece is a different thing. This is simply: These are the capabilities that are possible.” Later, he added, “If at some point this business goes there, they’ll be completely transparent about it. There’ll be all the disclosure and compliance issues.”

Removing the beacon

There was another passage in the document that struck me as weirdly written:

Because the News Registry’s active tracking beacon would not be effective if the beacon were removed, the Registry also has a backup enforcement system. Based on current Web behavior, it is safe to assume that some users will intentionally or inadvertently remove the beacon. A “passive” tracking service will crawl the Web searching for AP content and identify the publishing Web page, an image of usage and the time of discovery. Matches will be queried against the active tracking database, and unauthorized uses will be pursued.

By “intentionally or inadvertently remove the beacon,” doesn’t the AP simply mean copy and paste the text of an article? While there’s been some innovation in the field lately, it’s difficult to imagine how the beacon could survive the magic of Ctrl+C, and that’s an obvious limitation of the tracking system. But referring to that as removing the beacon called to mind the anti-circumvention portions of the Digital Millennium Copyright Act, which criminalize attempts to get around copyright controls like burning an encrypted DVD. Now, I can’t imagine the AP would have a legitimate claim there, but I gave it a go with Kasi, who said, “You may be giving it a lot more heavy reading than” intended.

So why use that language? “We need to worry about that sort of thing because, right, Zach, we’ve seen that happen….If some of these formats get stripped out, including the mythical beacon, then we need to have a way of knowing and being able to address that.” (I had just referred to the beacon as mythical.)

Rhetoric and reality

I told Kasi that there seemed to be a persistent disconnect between the AP’s rhetoric on copyright and what it actually cares about. He acknowledged that the consortium had not always effectively communicated its intent: “It’s easy to think that, when you read ‘beacon’ and given the issues of some other companies and so on, you can immediately jump to the conclusion, ‘oh, this is a persistent cookie that’s going to track this user across all kinds of sites.’ No.” That was surely a reference to Facebook’s poorly received advertising platform, also known as Beacon.

The AP’s graphic explaining the beacon and a new microformat was easily mocked and labeled “magic beans” by prominent tech blogger John Gruber. In the clip from the actual graphic at right, doesn’t it look like a faceless news consumer will be deposited in a toxic waste receptacle?

But in talking to Kasi, I came away with the same impressions as Columbia Journalism Review’s Ryan Chittum: that the AP isn’t interested in broadly pursuing copyright claims against republication of its content. In fact, they’re hoping to encourage distribution in certain ways, and there’s plenty of innovative stuff in the microformats they’re adopting. The AP just needs to clear up what kind of “rampant unauthorized use of AP content” — that’s from the document — they want to combat.

And that’s the topic of my next post.

This is the third in a series of posts on the AP’s online strategy. Photo of beacon, by Brenda Anderson, used under a Creative Commons license.

POSTED     Aug. 13, 2009, 3:22 p.m.
PART OF A SERIES     AP’s online strategy
SHARE THIS STORY
   
Show comments  
Show tags
 
Join the 15,000 who get the freshest future-of-journalism news in our daily email.
Newsonomics: Buying Yelp — and making it the next core of the local news and information business
The pricetag would be high, but it might be worth it to reassemble one part of the old newspaper bundle — tying together local news and local services.
Crossing the streams: Why competing publications are deciding to team up on podcasts
Low financial risk and a desire for word-of-mouth sharing have led news sites to collaborate, sharing audience and infrastructure.
Listicles, aggregation, and content gone viral: How 1800s newspapers prefigured today’s Internet
“Many 19th-century newspapers are comprised primarily of content from other newspapers.”
What to read next
953
tweets
The State of the News Media 2015: Newspapers ↓, smartphones ↑
The annual omnibus report from Pew outlines a story of continued trends more than radical change.
561The Upshot uses geolocation to push readers deeper into data
The New York Times story changes its text depending on where you’re reading it: “It’s a fine line between a smarter default and being creepy.”
422Knight Foundation invests $1 million in creator-driven podcast collective Radiotopia
The money will help PRX’s collective of public media-minded shows develop sustainable business models and expand with new shows and producers.
These stories are our most popular on Twitter over the past 30 days.
See all our most recent pieces ➚
Encyclo is our encyclopedia of the future of news, chronicling the key players in journalism’s evolution.
Here are a few of the entries you’ll find in Encyclo.   Get the full Encyclo ➚
The Economist
The UpTake
The Times of London
Davis Wiki
TBD
Wired
Newser
Mother Jones
The Miami Herald
Animal Político
St. Louis Beacon
St. Louis Globe-Democrat