Twitter  Study from Newspaper Association of America finds papers engage readers more effectively than other media. nie.mn/XENPs3  
Nieman Journalism Lab
Pushing to the future of journalism — A project of the Nieman Foundation at Harvard

Dennis Mortensen: Are news orgs worrying too much about search and not enough about the front page?

Editor’s Note: This is a guest post from Dennis R. Mortensen, former director of data insights at Yahoo and founder of Visual Revenue, a New York firm that sells its predictive-analytics services to news organizations.

Dennis saw my talk at the Canadian Journalism Foundation in January and wanted to comment on the role of a news site’s front page in its success in assembling an audience. He argues that paying too much attention to SEO on current articles could backfire on news orgs.

In Josh’s talk in Toronto, he hypothesized that:

[The front page is] still an enormously powerful engine of traffic. I would say actually that for most American newspapers…it’s probably 70 percent in a lot of cases.

Josh is saying you should view the front page as a traffic channel unto itself, just as you’d think of Facebook or Google — something I wholeheartedly agree with. If you choose to view your front page as a traffic channel, you’ll also sign up for a different kind of data analysis — analysis that mixes external referrers with internal referrers. In other words, a link from the Drudge Report is no different than a link from your own front page, in the sense that they both should be viewed as channels to optimize.

I argue that the front page is the most important engine of traffic for news media publishers today. I would also argue that this whole notion of news media publishers being held hostage by Google — and the slightly suboptimal idea of optimizing your articles for search to begin with — is somewhat misguided. It certainly seems wrong when we look at the data.

In this analysis, it’s important to distinguish between two core segments: current article views and archived article views. To begin, I’ve chosen a set of very strict and non-favorable definitions to my conclusion. A current article is defined as an item of content that is directly being exposed on the front or section front page right now. Any other content not currently exposed on a front page or section front page is deemed to be an archived article.

We looked at a sample of about 10 billion article views, across a sample of Visual Revenue clients, and found that on any given day, 64 percent of views are on current articles, and 36 percent of views are on archived articles.

So on a typical day, for most if not all news media publishers, the largest portion of article views comes off of their current article segment — stories published today or perhaps yesterday and still being actively promoted. I find this analysis fascinating and almost empowering, for the simple reason that most current news events are initially non-searchable. If a revolution breaks out in Egypt, I won’t know until I’m told about it. Non-searchable translates into a need for the stories to be discoverable by the reader in a different way, such as on a front page, through RSS, or in their social stream — all channels the publisher either owns or can influence.

There is no doubt that search, as a channel, owns the archive. One can discuss the data of why that is and why it is or isn’t optimal — I’ll leave that for a later discussion. But today, let’s focus on the current article segment, by far the bigger of the two. Where do those views come from? Looking at the same dataset from our clients, we get a very clear indication of where one’s focus should lie:

Sources of current article views:

78 percent come from front pages
7 percent come from search
6 percent come from social media
5 percent come from news aggregators
3 percent come from news originators
1 percent come from RSS & email

(We’re defining “news originators” as sites where at least two-thirds of the stories excerpted and promoted on their front page are original pieces generated by the news organization — which includes most traditional media. “News aggregators” are sites where less than two-thirds are original, such as Google News, Techmeme, or Drudge.)

I doubt this front-page dominance is much of a surprise to most editors — but for some reason, it seems like we aren’t taking the appropriate action today. We have 78 percent of all views on current articles coming from the front page — that’s 49 percent of all your article views on any given day — and what do we do to optimize it? And why is it that so many news organizations think immediately of search when we write a new story, when search has minimal initial impact? Even worse, writing an SEO-optimized article excerpt title for your front page probably deoptimizes your results on current articles.

The front page is indeed still an enormously powerful engine of traffic. We now know that about half of your article views can be attributed to the primary front page or the section front pages — and with it a huge chunk of any news organization’s online revenue. The question, then, is what kind of processes and optimization methodologies have you applied to take advantage of this fact?

                                   
What to read next
aereo
Mark Coddington    April 12, 2013
Plus: BuzzFeed’s native advertising model, protecting anonymous sources at Fox News, and the rest of the week’s news about the future of news.
  • http://twitter.com/Ebyline Ebyline

    Exactly. We speak with publishers across the country on a daily basis and there are clearly two camps. The “SEO Extremists” who tend to develop content simply for the sake of indexing well and the “Core Audience Loyalists” who are committed to creating valuable content for their audience but don’t have the resources in-house to sustain it all. The fact that the front page still dominates traffic patterns supports the value of trusted brands and shows that they do still have significance when it comes to quality content.

  • http://tangled-in-wires.blogspot.com Steve Shoe

    Can you explain how an SEO-optimized title on the front page deoptimizes current article results? I’m not sure I follow the thinking on that.

    From a local news perspective, I found one benefit of an SEO-friendly headline on the front page is clarity for the reader. Writing a headline like “Town, county taxes set to increase” doesn’t do much to inform a reader, whereas the addition of proper names for the town and county does. That’s a simple example, but I think the exercise makes a huge difference. In fact, I consider it the most important consideration in writing a webhead.

    The searchability of that headline is really the second consideration, in my opinion.

    You’re right, though: This probably won’t come as a surprise to editors (or former editors, like me) who are (were) plugged into their analytics. In some ways these numbers deflate the “referral model” of news consumption via social media that many argue is becoming the norm. Do you view these stats as testament to the power of a brand’s name in drawing users directly to their site?

  • http://tangled-in-wires.blogspot.com Steve Shoe

    Can you explain how an SEO-optimized title on the front page deoptimizes current article results? I’m not sure I follow the thinking on that.

    From a local news perspective, I found one benefit of an SEO-friendly headline on the front page is clarity for the reader. Writing a headline like “Town, county taxes set to increase” doesn’t do much to inform a reader, whereas the addition of proper names for the town and county does. That’s a simple example, but I think the exercise makes a huge difference. In fact, I consider it the most important consideration in writing a webhead.

    The searchability of that headline is really the second consideration, in my opinion.

    You’re right, though: This probably won’t come as a surprise to editors (or former editors, like me) who are (were) plugged into their analytics. In some ways these numbers deflate the “referral model” of news consumption via social media that many argue is becoming the norm. Do you view these stats as testament to the power of a brand’s name in drawing users directly to their site?

  • http://www.niemanlab.org/ Joshua Benton

    Steve, I think Dennis’ thinking is that while a keyword-dense, SEO-friendly headline may be attractive to Google, it may not be as attractive to a human being who’s deciding which (if any) headlines to click on your homepage. Maybe some of the traditional print strategies — a tease, a pun, something to draw the reader in without revealing all — might work better.

    Of course, many (most?) large news organizations are already writing separate headlines for the front page, leaving the more dedicated SEO optimizing for article pages.

  • http://twitter.com/mlicudine Michelle Licudine

    Very interesting analysis. We’ve been conditioned over the last few years to view every page as a potential entry point — which is still true. But the data does show (not just here, but for us at The Palm Beach Post as well) that homepage traffic dominates. And our exec editor has asked for renewed focus on the homepage.

    It seems worth pointing out that many people land on home pages through branded searches. So, in that sense, you can’t totally ignore SEO when optimizing a front page; imagine the jump-off-a-cliff feeling you’d have if you searched for your name and *didn’t* find your site. (Not likely, but it could happen.)

    BTW, Steve makes a great point that sometimes gets lost in the SEO frenzy: that you’re not writing for engines, you’re writing for people.

    Joshua mentions writing a separate headline for the homepage versus the article page (and, I assume, versus any printed version.) This seems smart, but I also wonder about social optimization of page titles — say, to stay within the 140-character Twitter limit but still leave room for a bit.ly link or attribution to another account. So that’d add yet another step to the mix, and that’s just one of many social outlets.

    Has anyone figured out how to optimize breaking news for multiple outlets without sacrificing speed?

  • http://tangled-in-wires.blogspot.com Steve Shoe

    Understood, but I stand by my point of useability and clarity for the human reader. I think writing a good webhead is part science and part art form. You want to balance clarity, brevity, and archival considerations. The latter is especially true for smaller news organizations that probably don’t have the time to go in and edit the headlines of their archived articles to more SEO-friendly forms.

    I can only speak for the content management systems I used during my newspaper stint, but here’s what should be included in any media outlet’s CMS: A/B/C headlines where you can write multiple heads to see which gets the most clicks (WordPress started doing this sometime last year), and perhaps even the ability to compose an SEO-friendly headline and set it to go live on the article once it archives. Maybe that’s already out there, though.

    That said, I fully agree it’s WAY too easy to go overboard and write a dense headline in the hopes of attracting those spiders. “Late model blue Ford Taurus crashes into utility pole on Main Street near Thomas Jefferson Elementary School in Example City,” is a very slight exaggeration of a headline someone wrote in my old newsroom last year (and which I quickly pared down).

  • http://www.niemanlab.org/ Joshua Benton

    FYI, fellow WordPress users, more on that A/B testing plugin for WP:

    http://www.niemanlab.org/2010/11/ab-testing-for-headlines-now-available-for-wordpress/

  • http://twitter.com/mattoperry mattoperry

    Though we are not a traditional newspaper, our site sees a much much larger portion of traffic driven to new content by social media, and indeed it is largely those pieces that quickly do well on Facebook or Twitter that perform for us in general. Though there is sometimes tension between the two, we’ve begun to think about social media optimization equally as much as SEO. The good news is that optimizing headlines and other content for social media sharing can result in really good front-page optimization since click-ability, rather than things like keyword density, are emphasized.

    Does anyone else share this experience? Is 7% traffic to new pieces from social networks really typical?

  • http://visualrevenue.com/blog/ Dennis R. Mortensen

    Hi Matt,

    >>Does anyone else share this experience? Is 7% traffic to new pieces from social networks really typical?

    The dataset we have is conclusive and I honestly think the data paints a very positive picture for publishers – news publishers in particular. I believe the misperception, or slight surprise perhaps, arrives from publishers who look at their web property traffic influx in full (not segmenting out the archive) AND from not viewing the front page as a channel (essentially discarding internal referrers as site navigation).

    If you disregard the front page(s) the social media channel wrongly quadruples of course, which is the number most publishers will look at. My argument is that such analysis is sub-optimal and that you end up focusing on the wrong channels.

    >>The good news is that optimizing headlines and other content for social media sharing can result in really good front-page optimization since click-ability, rather than things like keyword density, are emphasized.

    That’s a very good point and I fully agree!

    In my slightly biased world it would be of less damage than writing current content as if its pushed directly to the archive (aka optimizing for search when writing news)

    Cheers
    d. :-)

  • Pingback: Il Valore della Prima Pagina | Il Giornalaio

  • Neil

    One way to optimize your front page is to have it personalized to each user’s interests. DailyMe’s Newstogram platform can do just that.

  • http://twitter.com/mattoperry mattoperry

    Thanks Dennis,

    This all makes me want to filter our referrer data as you describe and see what the numbers say.

    I also wonder if there is a sort of “ceiling of benefit” that the front page can impart to any given piece of content. We know that our very highest traffic stories do not get that way because of homepage placement. They receive rivers of traffic because they spread around the social net, or occasionally are linked from the yahoo homepage or huffpo, etc. Editors have a tendency to swing for the fences, so I wonder if the steady traffic contribution of front pages, like the unswerving love of a loyal but slothful dog, is sometimes under-appreciated because of more spectacular, but less frequent, traffic events.

    I hereby apologize for the preceding simile.

    In any case, thanks so much for your article Dennis.

    M

  • http://visualrevenue.com/blog/ Dennis R. Mortensen

    >>so I wonder if the steady traffic contribution of front pages, like the unswerving love of a loyal but slothful dog, is sometimes under-appreciated because of more spectacular, but less frequent, traffic events.

    This is exactly how it is for some properties – and one can only aspire to your eloquence in describing the phenomena in chasing hits versus that of serving your most loyal users.

    I’m not saying that one should not chase the news aggregation lottery, as there’s certainly ways you can work that semi-effectively. AND therein lays my view, that the news aggregation lottery is just another channel, smaller and less effective in total than the front page, but surely a channel in itself.

    have a great weekend
    d. :-)

  • http://visualrevenue.com/blog/ Dennis R. Mortensen

    That’s a paradox of sorts isn’t it, because once you serve only unique personal front pages, you do not have a front page any more. Jokes aside, this is actually something we spend a lot of time thinking about, just how personal the front page is supposed to be.

    Cheers
    d. :-)

  • http://lavrusik.com Vadim Lavrusik

    I’m quite skeptical of the numbers of above even by segmenting the data in the way you have, we see far more traffic come from social media and search.

  • http://www.niemanlab.org/ Joshua Benton

    I’m certain you do see a lot more social media traffic — you’re Mashable! :)

    Dennis’ numbers also don’t match up for us at the Lab either — only about 9-11% of our visits start on the front page, and 30-40% of our overall traffic is from Twitter. But I suspect sites like ours are still very much the exception. (E.g., the NYT editor saying 50-60% of their visits start on the homepage.)

  • http://visualrevenue.com/blog/ Dennis R. Mortensen

    Hi Vadim,

    >>I’m quite skeptical of the numbers of above even by segmenting the data in the way you have.

    The data is probably slightly controversial, but it is very difficult to dispute the results beyond questioning whether the segment of two dozen or so large volume publishers, which are included in the study, are a fair representative of the publishing universe as a whole. I personally believe that the publishers included are very traditional in usage patterns. Let me try to explain the results using a fictive but still fair case.

    Let’s take the following example of an average 150M page views per month news-media destination (arriving from 30M Visits). A likely distribution of Visits could be something like this:

    10M Visits from: Direct (primarily the Front Page)
    10M Visits from: Search
    10M Visits from: Referrers (such as Social)

    This is usually where it goes wrong. When less data savvy publishers look at this picture I’ve seen them go look at something as simple as a top referring sites (or URLs) report in their web analytics solution. This is a good start, but probably not the best analysis to figure out the most important channels for Article Views. What almost always happens in this scenario is that Google ends up on top of that list and from there on a belief of being ‘search dependent’ starts.

    This list is not an optimal analysis as Search is being dominated by one player (Google), Referrers are highly distributed among multiple sites (from Social sites to News Aggregators), and Direct traffic is essentially excluded (it is not a referrer in the typical web analytics sense).

    Let‘s make a few additional (and very standard) assumptions for our 150M publishers above. The first assumption being that 7% of all search referrers are branded terms that take the visitor directly to a front page – in my book that is not search, that is navigational traffic. There is no real difference between a visitor writing “San Francisco Chronicle” in Google and that of going directly to sfgate.com. For referrers, a link with the anchor text “San Francisco Chronicle” linking to the Front Page of sfgate.com is again direct traffic to me, let’s make a cautious assumption of 3% of arriving on from branded anchor texts.

    If we move the visits around based on those assumptions, we have the following visit distribution:

    11.0M Visits from: Direct (primarily the Front Page)
    9.3M Visits from: Search
    9.7M Visits from: Referrers (such as Social)

    Let’s make the assumption (widely accepted) that the average page-views-per-visit is not the same from channel to channel (equally loyal), and that those coming directly to the destination because of the brand are the most loyal. We could do a decent average of 7 PPV for Direct, 3.5 PPV for Search and finally 4 PPV for Referrers. Sounds fair?

    If we move back into Page Views we get the following distribution:

    77,000,000 Page views (11.0M * 7) from: Direct (primarily the Front Page)
    32,550,000 Page views (9.3M * 3.5) from: Search
    38,800,000 Page views (9.7M * 4) from: Referrers (such as Social)

    This is approximately 150M (the size of the property as stated) and a distribution that concludes that 52% of all page views can be attributed back to the Front Pages (plural). In other words about half of all the page views on any given day can be attributed back to the front pages.

    The Data in the Article itself concludes that of the 64% article views in the current article segment, 78% could be attributed to the Front Pages – which is 50% (64% * 78%). In other words about half of all the page views on any given day can be attributed back to the front page.

    In conclusion it is natural to be skeptical, I was too, but the data tells us a different story than what we’ve heard the industry at large push over the last half decade. There is simply no escaping it – The Front Page is very likely the most important channel for most publishers today!

    cheers
    d. :-)

  • http://visualrevenue.com/blog/ Dennis R. Mortensen

    I agree. They are Mashable indeed and I would almost be disappointed if they did not master Social as a Channel :-)

    That said, I went all-in in explaining how one might end up with a different picture of channel dependency in regards to Article Views. It is very easy to look at a referring-domains report and believe that that is a fair channel representation – which it unfortunately is not.

    But as you say, there are certainly edge cases and exceptions to this distribution for media properties in general.

    Cheers
    d.

  • http://lavrusik.com Vadim Lavrusik

    Thanks for the thorough response, and a well articulated one. However, I think the number by itself may be a bit misleading based on how most publishers analyze their traffic and as you mention would likely be quite different for non-traditional publishers.

  • Pingback: Share your data! Tell us how your readers arrive at your site: search, social media, the front door? » Nieman Journalism Lab » Pushing to the Future of Journalism

  • Pingback: News sites: The Nieman Lab is hungry for your data // Future News

  • Pingback: Da NON leggere: l’importanza della prima pagina - Misurarelacomunicazione

  • Pingback: Liens vagabonds (10 mars) » Metamedia | La révolution de l'information

  • Pingback: The NYT adds recommendation features to its article pages » Nieman Journalism Lab » Pushing to the Future of Journalism

  • Pingback: The NYT adds recommendation features to its article pages » Nieman Journalism Lab » Pushing to the Future of Journalism

  • Pingback: Article to Article optimization – and the need for a Virtual Front Page | Visual Revenue

  • http://www.nextlevelofnews.com Steffen Konrath

    I can confirm the 64:36 split you discovered, Dennis. Would like to add, that as long as Twitter was part of Google’s real-time strategy there was nearly no time lag between initial post and appearance in Google’s index, the search result page if you’ve searched for the most recent posts. The same with Google News, which indexes at a higher rate and speed. – But what to do with the findings? Shall publishers increase their rate article/day ratio? No. More articles would not necessarily lead to more visitors unless they receive the update notice via push to their device. Why should they know, without any kind of notification? 
    - Thanks for taking the time to share your experience!-
    Steffen
    Future of Journalism
    Blog: http://www.nextlevelofnews.com
    Twitter: http://www.twitter.com/#!/stkonrath