How Tribune Co. plans to rid itself of SEO-killing duplicate content

By Zachary M. SewardSept. 8, 2009  /  9 a.m.  

Last month, I wrote about how The Associated Press plans to leverage its network of members and customers with centralized topic pages linked to content distributed by the consortium. That post has sprung at least three noteworthy legs:

  • an intelligent comments thread on Wikipedia’s strength in search results
  • some informed skepticism of automated pages from Reuters’ Felix Salmon
  • an epic comment by Brent Payne, director of search engine optimization at Tribune Co., on the perils of duplicated content on news sites

This was all very exciting for me, because while I’m deeply interested in the mystical arts of SEO, it’s not something I knew tons about. Intrigued, I contacted Payne for more on how he’s dealing with a problem faced by all networks of news sites: multiple copies of the same article. When the Tribune Co.’s Washington bureau files a story, for instance, it may appear on lots and lots of pages across the company’s many news sites, diluting its power in search. And while Google attempts to make sense of duplicate content, publishers are still advised “to avoid over-syndicating the articles that you write.”

Payne said he’s readying a plan to rid the Tribune Co. of duplicate content. “The goal will be to always have only a single URL for a piece of content across all of our sites,” he told me in an email. For example, when The Los Angeles Times writes a story, it exists, of course, at latimes.com. But when The Chicago Tribune picks up the piece, the current system creates a duplicate of the article with a chicagotribune.com URL.

Under Payne’s plan, Tribune readers would instead visit the Times domain. Meanwhile, a cookie or URL parameter would make the page look like the Tribune’s site and serve the Tribune’s ads. One article, one URL, maximum SEO.

Though I’m hardly the one to judge, it sounds like a neat trick. As news organizations of all sizes act more like networks, they’ll need to make sure that syndication doesn’t, paradoxically, cannibalize their ability to be found by readers. (Another tool for avoiding the duplicate-content blues is the canonical tag recently adopted by Google and other major search engines, but that doesn’t work across domains.)

Payne also shared an unrelated but similarly smart SEO initiative he has implemented at Tribune Co.: Soon, when reporters and editors file a story, they’ll be required to include a phrase they think is most likely to lead a Google searcher to the article.

Payne is convinced that the SEO game is won by ranking high on the search results for three- and four-word phrases. So while he’s happy that The Chicago Tribune’s topic page for Barack Obama ranks on the first page when searching for the president’s name in Google, Payne is more concerned with doing well with phrases like “obama birth certificate,” “obama’s secret service code names,” and “michelle obama and new do” — to name a few searches that lead lots of readers to the Tribune Co.

Some additional advice from a guy who says he doubled the Tribune Co.’s SEO traffic between 2007 and 2008:

When you do SEO for a huge media company with more content than you can possibly digest with a team so small it’s frustrating, it’s all about what you can automate and what you can force the masses of content creators to do via the CMS to help be more intelligent about SEO. It’s about training the masses and finding evangelists for SEO from those groups that are trained and encouraging them when there are wins.

If you like the way Payne thinks, check out some more SEO tips in the slides of a presentation he gave this year to the Society of Professional Journalists. And last week, we shared some tips for optimizing content in Google News.


13 comments:

  1. Michel Leconte at 11:13 am, September 8, 2009

    Hello,

    I think Payne is on the right track there. I believe automation is the way to go when it comes down to execution, especially if you’re working across a large number of domains. For the rest of us publishers with less of a clout and nimbler technical team than Tribune Co., I recommend looking at the news content management module built by my organization. To the best of my knowledge, it is the only system that builds Google news compliant architecture out of the box, RSS feeds, newssitemap xml feeds, and a number of other neat things. A description can be found at the bottom of the following page:
    http://www.seosamba.com/seo-technology.html

     
  2. Andy Kruse at 3:38 pm, September 8, 2009

    Linking to the original source is a huge win for you, Brent. It probably wasn’t an easy sell, so congratulations.

    But I think the AP’s problem (and one that I deal with as well) is that most of its nationally shared content has no home. It’s created for the masses and they can’t pick just one destination URL.

    AP seems to get this, at least, and is doing something about it. I’ve seen their plans for 2010 Olympic content and it involves a lot of exclusive hosting and not as much syndicating/duplicating.

     
  3. Gina Chen at 9:08 am, September 9, 2009

    I think this is a great idea. I’m no SEO expert, so I can’t really say if it will work, although it sounds promising. But what I find really important about this idea is that it’s an example of a media organization changing itself to get its stories better play in Google searches, rather than just moaning that Google’s system is unfair. That’s a big step in the right direction. Will be interesting to see how this works, and how others can use this idea.

     
  4. SEO Company at 11:18 am, September 9, 2009

    It will be interesting to see if the cookie or URL parameter does not appear to be a form of cloaking, which is also looked down upon by search engines. Other than that, eliminating duplicates also helps search engines (and the internet) by reducing the amount of unnecessary content online.

     
  5. Inisheer at 11:29 am, September 9, 2009

    Google is a national, even worldwide platform. It has no reason to send it’s readers to multiple newspaper websites that are running the same content. This ia a basic problem the newspapers just refuse to understand. Search engines want original content, because their readers do.

     
  6. Andy Batten at 5:14 pm, January 7, 2010

    Just an update to this, Google now supports a cross-domain canonical tag:

    http://googlewebmastercentral.blogspot.com/2009/12/handling-legitimate-cross-domain.html

    This should make many of the duplicate content issues moot, except in those cases where articles are illegally redistributed on other domains and don’t include the tag.

     

Trackbacks:

  1. Optimal Use Of Technology And Content | Hartford Courant Alumni Association and Refugee Camp at 2:48 pm, September 8, 2009

    [...] you online geeks in the audience, here’s Tribune’s latest innovation: maximum search engine optimization. The plan is to eliminate duplicate stories on all Tribune [...]

     
  2. SearchCap: The Day In Search, September 8, 2009 at 4:58 pm, September 8, 2009

    [...] How Tribune Co. plans to rid itself of SEO-killing duplicate content, Nieman Journalism Lab [...]

     
  3. 5 O’Clock Roundup: Sprint’s disappearing promo, stem cell breakthrough (sort of), search engine optimization | UpOff.com at 10:14 pm, September 8, 2009

    [...] Newspapers figuring out how to block duplicate copies of stories. Here’s why you care — Duplicate stories in Google’s index cause the individual copies of those stories to get a lower relevance ranking score among Google search results. So the multiple copies of a story may actually make it less likely you’ll ever see it, if you’re among the roughly 50 percent of newspaper traffic that comes from Google searches. The Tribune Co., which owns the Los Angeles Times, has a tech trick in the works that will result in “One article, one URL, maximum [search engine optimization].” [...]

     
  4. links for 2009-09-08 « Glenna DeRoy at 11:25 pm, September 8, 2009

    [...] How Tribune Co. plans to rid itself of SEO-killing duplicate content » Nieman Journalism Lab (tags: seo onlinejournalism technology web) Possibly related posts: (automatically generated)links for 2009-08-03links for 2009-08-06links for 2009-09-01 [...]

     
  5. How Tribune Co. plans to rid itself of SEO-killing duplicate …- SFWEBDESIGN.com at 1:32 am, September 9, 2009

    [...] from: How Tribune Co. plans to rid itself of SEO-killing duplicate …Share this on del.icio.usDigg this!Share this on RedditBuzz up!Stumble upon something good? Share it [...]

     
  6. Some Newspapers Get It, And Some Still Don’t at 1:06 pm, September 10, 2009

    [...] first article that caught my eye was titled “How Tribune Co. plans to rid itself of SEO-killing duplicate content.” This post was published on the same blog that published the post about the AP’s new [...]

     
  7. Newspaper’s top 5 search queries are commercial brands » Nieman Journalism Lab at 8:01 am, September 17, 2009

    [...] quick lesson in search engine optimization: Brent Payne, the Tribune Co.’s head of SEO, whose wisdom I wrote about last week, posted a video yesterday demonstrating a new feature in Google Webmaster [...]

     

Leave a comment

Check out these related posts