Nieman Foundation at Harvard
HOME
          
LATEST STORY
How The Washington Post built — and will be building on — its “Knowledge Map” feature
ABOUT                    SUBSCRIBE
Jan. 6, 2010, noon

What qualifies as a Spotlight story on Google News? Here’s a few clues

Google News launched a Spotlight section back in September to highlight “in-depth pieces of lasting value.” Initial response was positive, but with a few months under its belt I checked in to see if the feature is living up to that first flush of excitement.

The verdict?

It all depends on how you define “in-depth” and “lasting value.” The material on the page is certainly different from what you typically find on Google News. It’s a nice sample of deeper stories. But visiting the section doesn’t inspire the curiosity and intellectual satisfaction you’d get from a great magazine, newspaper or documentary film. “Lasting” isn’t a word that springs to mind. I’m guessing that has something to do with the algorithm.

Getting around the algorithm issue

The Spotlight page, like all of Google News, is automatically generated by one of Google’s secret algorithms. It’s impossible to discern exactly how stories are selected because Google guards algorithms the way Kentucky Fried Chicken protects those 11 herbs and spices.

But if Google News’ general ranking rules apply to the Spotlight page, there might be a few clues within this video from Maile Ohye, a tech lead at Google (full transcript is here). In the video, Ohye notes that Google uses keywords to categorize articles within Google News. That’s how a story ends up in business, sports, etc. Ohye used the following example to describe the classification process:

So you can see on this article, “The Millions Kozlowski Didn’t Steal.” We actually take out individual words, like business, Tyco, money, and CFO, and understand that this article pertains to the section of business.

Carrying this out a bit, it’s possible Spotlight articles are partially determined by a list of keywords and phrases. I’m thinking words like society, impact, and trend could signal the kind of bigger/deeper stories appropriate for Spotlight. On a lark, I combined all the text from 10 Spotlight stories into a Wordle cloud to see if any “lasting” words stood out. No luck on that front, though.

Truth is, there’s no way to fully understand how Spotlight stories become Spotlight stories because Google goes mum whenever algorithms are discussed. I asked. They politely declined.

So I went with the next best thing: grunt work. I took a snapshot of the Spotlight page on Jan. 4, 2010 at 12:02 p.m. and dug into the top 10 stories to see if any obvious commonalities were at play. (These are the same 10 stories I plugged into Wordle.) Here’s what I found:

Length: five of the 10 stories were more than 1,000 words long.

Posting date: seven stories were published four days before I took the snapshot (Dec. 31, 2009).

Comments: six stories had received more than 50 comments.

Source: nine stories were from what I’d consider to be major publishers.

The stories were all over the map topic-wise: straight news, financial analysis, sports, and even a Wall Street Journal column from Karl Rove. If there’s topical targeting here, I couldn’t find it.

As for the lingering criteria — “in-depth” and “lasting value” — I’ll say yes on the former and no on the latter. Many of the stories were deep dives into a particular issue, so those certainly qualify as in-depth. Something achieves “lasting value” in my mind if it goes beyond strict just-the-facts reporting or knee-jerk reactions. By that criteria, the New York Times’ “Safety of Beef Processing Method Is Questioned” is the only story that fits. Everything else was fleeting. Interesting, certainly, but not likely to be relevant in a few weeks.

Here’s the raw data from my analysis. Let me know if you spot any wayward trends I might have missed.

Story No. 1. The Biggest Losers
(Wall Street Journal, Jan. 3, 2010)

Type: Opinion
Word count: 953
Comments: 70

2. Google Plans Google Voice Enhancements
(TMCnet, Dec. 31, 2009)

Type: News analysis
Word count: 430
Comments: 0

3. Come Buy With Me and Be My Love
(New York Times, Dec. 31, 2009)

Type: Feature story
Word count: 1,865
Comments: Not enabled

4. Civil rights hero caught in corruption probe to begin serving sentence
(CNN, Jan. 4, 2010)

Type: News story
Word count: 1,219
Comments: 103

5. It’s All in How You See It: The Resolution Revolution
(Huffington Post, Dec. 31, 2009)

Type: Advice column from Mehmet Oz, M.D.
Word count: 1,346
Comments: 133

6. New Year’s Resolutions for Washington
(Wall Street Journal, Dec. 30, 2009)

Type: Opinion piece by Karl Rove
Word count: 830
Comments: 236

7. 2010 Draft prospects in BCS games
(SI.com, Dec. 31, 2009)

Type: Sports analysis
Word count: 1,847
Comments: Not enabled

8. Hole in the Moon Could Shelter Colonists
(FOXNews.com, Dec. 31, 2009)

Type: News story
Word count: 403
Comments: 14

9. Safety of Beef Processing Method Is Questioned
(New York Times, Dec. 31, 2009)

Type: Investigative report
Word count: 3,090
Comments: 383

10. 3 reasons home prices are heading lower
(CNNMoney.com, Dec. 31, 2009)

Type: Financial analysis
Word count: 695
Comments: 86

POSTED     Jan. 6, 2010, noon
SHARE THIS STORY
   
Show comments  
Show tags
 
Join the 15,000 who get the freshest future-of-journalism news in our daily email.
How The Washington Post built — and will be building on — its “Knowledge Map” feature
The Post is looking to create a database of “supplements” — categorized pieces of text and graphics that help give context around complicated news topics — and add it as a contextual layer across lots of different Post stories.
How 7 news organizations are using Slack to work better and differently
Here’s how Fusion, Vox, Quartz, Slate, the AP, The Times of London, and Thought Catalog are using Slack for workflow — and which features they wish the platform would add.
The New York Times built a robot to help make article tagging easier
Developed by the Times R&D lab, the Editor tool scans text to suggest article tags in real time. But the automatic tagging system won’t be moving into the newsroom soon.
What to read next
1119
tweets
New Pew data: More Americans are getting news on Facebook and Twitter
A new study from the Pew Research Center and Knight Foundation finds that more Americans of all ages, races, genders, education levels, and incomes are using Twitter and Facebook to consume news.
701Newsonomics: The halving of America’s daily newsrooms
If you’re lucky enough to have the right deep-pocketed owner buy your paper and steady it, you’ve won the lottery. If you’re in a town whose paper is owned by the better chains, or committed local ownership, your loss will probably be mitigated. Otherwise, you’re out of luck.
575How 7 news organizations are using Slack to work better and differently
Here’s how Fusion, Vox, Quartz, Slate, the AP, The Times of London, and Thought Catalog are using Slack for workflow — and which features they wish the platform would add.
These stories are our most popular on Twitter over the past 30 days.
See all our most recent pieces ➚
Encyclo is our encyclopedia of the future of news, chronicling the key players in journalism’s evolution.
Here are a few of the entries you’ll find in Encyclo.   Get the full Encyclo ➚
Talking Points Memo
Wikipedia
Instapaper
GateHouse Media
Medium
Facebook
Lens
BuzzFeed
The Daily
Investigative Reporting Workshop
Associated Press
Los Angeles Times