Nieman Foundation at Harvard
Aug. 13, 2009, 3:22 p.m.

What The Associated Press’ tracking beacon is — and what it isn’t

So what about THE BEACON?

When The Associated Press said last month that it was building a “news registry” of AP content, most reaction focused on the so-called “tracking beacon” that will monitor usage across the web. I use quotation marks because, well, those are metaphors for technology that’s still in development: The AP document we’ve obtained says the registry, set to launch on Nov. 15, will “require capabilities not currently available.”

But there’s nothing particularly magical about the beacon, which will amount to JavaScript embedded in the online feeds that are distributed to clients. So when you read an AP article on the New York Times website, a script running in the background will take note of that usage. (It’s unclear how news organizations like the Times, which is particularly neurotic about the weight of its pages, will feel about the script.)

Tracking readers

The point, of course, is to identify unlicensed uses of AP content and, potentially, member content. So if someone copied an article’s source code onto his own site, by hand or by automation, the beacon would follow along and, according to the document distributed to some AP members, “send reports back to the core database each time the item is clicked on by an end user. The beacon will identify each piece of content, the IP address of the content viewer, the referring Web server and the time of use.”
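To make those four fields concrete, here is a minimal sketch of what a single beacon report might look like once it reaches the "core database." This is an illustration only, not the AP's design: the class name, field names, the `build_report` and `is_licensed` helpers, and the idea of a set of licensed hosts are all assumptions of mine. Only the four reported items (content, viewer IP, referring server, time of use) come from the document.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from urllib.parse import urlparse


@dataclass(frozen=True)
class BeaconReport:
    """One hypothetical report, holding the four items the AP document lists."""
    content_id: str        # assumed identifier for the piece of content
    viewer_ip: str         # IP address of the content viewer
    referring_server: str  # host name of the page where the item appeared
    used_at: str           # time of use, as an ISO 8601 string in UTC


def build_report(content_id: str, viewer_ip: str, page_url: str,
                 now: datetime) -> BeaconReport:
    """Assemble a report from the details a tracking script could observe."""
    host = urlparse(page_url).hostname or ""
    timestamp = now.astimezone(timezone.utc).isoformat()
    return BeaconReport(content_id, viewer_ip, host, timestamp)


def is_licensed(report: BeaconReport, licensed_hosts: set) -> bool:
    """Hypothetical check: is the referring server a known licensee?"""
    return report.referring_server in licensed_hosts
```

Under these assumptions, a report arriving from a host that is not in the licensed set is what would flag a possible unlicensed use. Nothing in the sketch requires keeping the viewer's IP address to make that determination, which is part of why that field drew attention.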

I immediately flagged “IP address of the content viewer.” In recent years, the recording industry has used the IP addresses of downloaders to pursue legal action against people sharing music online, leading to lots of ill will toward the RIAA. That said, recording such data isn’t all that unusual. Websites using basic analytics software already record the IP addresses of their users.

When I asked the AP’s general counsel, Srinandan Kasi, about it, he said the AP wasn’t interested in monitoring who specifically reads their content on unauthorized sites: “In writing this” — he meant the document — “obviously, theoretically anything is possible. But what you actually make the final available piece is a different thing. This is simply: These are the capabilities that are possible.” Later, he added, “If at some point this business goes there, they’ll be completely transparent about it. There’ll be all the disclosure and compliance issues.”

Removing the beacon

There was another passage in the document that struck me as weirdly written:

Because the News Registry’s active tracking beacon would not be effective if the beacon were removed, the Registry also has a backup enforcement system. Based on current Web behavior, it is safe to assume that some users will intentionally or inadvertently remove the beacon. A “passive” tracking service will crawl the Web searching for AP content and identify the publishing Web page, an image of usage and the time of discovery. Matches will be queried against the active tracking database, and unauthorized uses will be pursued.
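The document doesn't say how the "passive" service would recognize AP content once the beacon is gone. One common approach, which I'm assuming here purely for illustration, is to compare overlapping runs of words ("shingles") between a registered article and a crawled page, then check any match against the hosts already known to the active tracking database. Every name below, the five-word shingle size, and the 0.8 threshold are hypothetical.

```python
import re


def shingles(text: str, size: int = 5) -> set:
    """Return the set of overlapping word runs of the given length."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    return {" ".join(words[i:i + size]) for i in range(len(words) - size + 1)}


def overlap(registered: str, crawled: str, size: int = 5) -> float:
    """Fraction of the registered article's shingles found in the crawled page."""
    reg = shingles(registered, size)
    if not reg:
        return 0.0
    return len(reg & shingles(crawled, size)) / len(reg)


def find_unauthorized(registry: dict, crawled_pages: list,
                      tracked_hosts: dict, threshold: float = 0.8) -> list:
    """List (content_id, host, discovered_at) for matches on unknown hosts.

    registry:      {content_id: article text}
    crawled_pages: [(host, page text, discovery time), ...]
    tracked_hosts: {content_id: set of hosts in the active tracking database}
    """
    hits = []
    for host, text, discovered_at in crawled_pages:
        for content_id, original in registry.items():
            known = tracked_hosts.get(content_id, set())
            if overlap(original, text) >= threshold and host not in known:
                hits.append((content_id, host, discovered_at))
    return hits
```

A scheme like this looks only at the text itself, so it would still find a pasted copy that carries no script at all.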

By “intentionally or inadvertently remove the beacon,” doesn’t the AP simply mean copying and pasting the text of an article? While there’s been some innovation in the field lately, it’s difficult to imagine how the beacon could survive the magic of Ctrl+C, and that’s an obvious limitation of the tracking system. But referring to that as removing the beacon called to mind the anti-circumvention portions of the Digital Millennium Copyright Act, which criminalize attempts to get around copyright controls, such as copying an encrypted DVD. Now, I can’t imagine the AP would have a legitimate claim there, but I gave it a go with Kasi, who said, “You may be giving it a lot more heavy reading than” intended.

So why use that language? “We need to worry about that sort of thing because, right, Zach, we’ve seen that happen….If some of these formats get stripped out, including the mythical beacon, then we need to have a way of knowing and being able to address that.” (I had just referred to the beacon as mythical.)

Rhetoric and reality

I told Kasi that there seemed to be a persistent disconnect between the AP’s rhetoric on copyright and what it actually cares about. He acknowledged that the cooperative had not always communicated its intent effectively: “It’s easy to think that, when you read ‘beacon’ and given the issues of some other companies and so on, you can immediately jump to the conclusion, ‘oh, this is a persistent cookie that’s going to track this user across all kinds of sites.’ No.” That was surely a reference to Facebook’s poorly received advertising platform, also known as Beacon.

The AP’s graphic explaining the beacon and a new microformat was easily mocked and labeled “magic beans” by prominent tech blogger John Gruber. In the clip from the actual graphic at right, doesn’t it look like a faceless news consumer will be deposited in a toxic waste receptacle?

But in talking to Kasi, I came away with the same impressions as Columbia Journalism Review’s Ryan Chittum: that the AP isn’t interested in broadly pursuing copyright claims against republication of its content. In fact, they’re hoping to encourage distribution in certain ways, and there’s plenty of innovative stuff in the microformats they’re adopting. The AP just needs to clear up what kind of “rampant unauthorized use of AP content” — that’s from the document — they want to combat.

And that’s the topic of my next post.

This is the third in a series of posts on the AP’s online strategy. Photo of beacon, by Brenda Anderson, used under a Creative Commons license.
