Nieman Foundation at Harvard
June 26, 2014, 4:51 p.m.
Aggregation & Discovery
LINK: bost.ocks.org  ➚   |   Posted by: Liam Andrew   |   June 26, 2014

Mike Bostock is one of data visualization’s leading lights. As the creator of the hugely popular visualization library D3.js and an editor in The New York Times’ graphics department, he has had a hand (visibly and invisibly) in most of the widely shared interactives on the web.

Today Bostock posted an adaptation of a celebrated talk he gave at Eyeo 2014 about visualizing algorithms. Full of ideas and gorgeous patterns, it elegantly flips the script of the typical data visualization.

Computer programs are sometimes conceptually divided into data structures and algorithms, and we usually visualize the data while ignoring the processes that manipulate it. But Bostock argues that “visualization is more than a tool for finding patterns in data.”

He breaks down various methods for sampling, shuffling, sorting, and making mazes, ably explaining (via text and gorgeous graphics) why there are different types of randomness, for example, or how to most effectively sort a list.
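To make the shuffling case concrete, here is a minimal sketch of the Fisher-Yates shuffle, a standard unbiased method. It is offered as an illustration only and is not necessarily the exact variant shown in Bostock's post; the function name and the optional `rng` parameter are assumptions made for this example.

```python
import random


def fisher_yates_shuffle(items, rng=None):
    """Return a shuffled copy of items (illustrative sketch)."""
    rng = rng or random.Random()
    result = list(items)
    # Walk from the last index down to 1, swapping each slot with a
    # randomly chosen slot at or before it. Each permutation of the
    # input is then equally likely.
    for i in range(len(result) - 1, 0, -1):
        j = rng.randint(0, i)  # randint is inclusive on both ends
        result[i], result[j] = result[j], result[i]
    return result
```

Passing a seeded `random.Random` instance makes a run repeatable, which is handy when the goal is to replay and draw the same shuffle step by step.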

[Image: Bostock's quicksort visualization]

Bostock is interested in the value of visualizing algorithms for learning about and understanding complex processes. A novice could use a visualization to peer into an algorithm’s black box; an expert might visualize an algorithm in order to debug and rethink it.

He classifies algorithm visualizations based on the level of introspection they give into the data — some only show the output, while others let you peer fully into how data points are being manipulated.

The goal here is to study the behavior of an algorithm rather than a specific dataset. Yet there is still data, necessarily — the data is derived from the execution of the algorithm. And this means we can use the type of derived data to classify algorithm visualizations.
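One way to picture "data derived from the execution of the algorithm" is to instrument a sort so that it records every swap it makes. The sketch below is an assumption-laden illustration, not Bostock's code: it uses quicksort with the Lomuto partition scheme (last element as pivot), and the function name and trace format are hypothetical.

```python
def quicksort_with_trace(values):
    """Sort a copy of values and return (sorted_list, trace).

    trace is a list of (i, j) index pairs, one per swap, in the order
    the swaps happened. This is the derived data a visualization of
    the algorithm could draw.
    """
    data = list(values)
    trace = []

    def swap(i, j):
        if i != j:
            data[i], data[j] = data[j], data[i]
            trace.append((i, j))

    def partition(lo, hi):
        pivot = data[hi]  # Lomuto scheme: last element is the pivot
        store = lo
        for k in range(lo, hi):
            if data[k] < pivot:
                swap(k, store)
                store += 1
        swap(store, hi)  # move the pivot into its final position
        return store

    def sort(lo, hi):
        if lo < hi:
            p = partition(lo, hi)
            sort(lo, p - 1)
            sort(p + 1, hi)

    sort(0, len(data) - 1)
    return data, trace
```

Showing only the returned list corresponds to the "output only" end of the spectrum; replaying the trace one swap at a time exposes how each element moves.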

Using his work on the Times’ revamped rent-versus-buy calculator as an example, he shows how opening up the algorithm allows for new questions:

To output an accurate answer, the calculator needs accurate inputs. While some inputs are well-known (such as the length of your mortgage), others are difficult or impossible to predict. No one can say exactly how the stock market will perform, how much a specific home will appreciate or depreciate, or how the renting market will change over time.

We can make educated guesses at each variable — for example, looking at Case–Shiller data. But if the calculator is a black box, then readers can’t see how sensitive their answer is to small changes.

To fix this, we need to do more than output a single number. We need to show how the underlying system works.
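The idea of showing sensitivity rather than a single number can be sketched as a parameter sweep. The model below is a deliberately crude, hypothetical stand-in and is not the Times calculator's formula: it ignores mortgages, taxes, and investment returns, and every function and parameter name is an assumption made for illustration.

```python
def total_rent(monthly_rent, years, rent_growth):
    """Total rent paid over the period, with rent rising once a year."""
    return sum(12 * monthly_rent * (1 + rent_growth) ** y for y in range(years))


def net_cost_of_buying(home_price, years, appreciation, annual_cost_rate):
    """Toy ownership cost: yearly carrying costs minus appreciation gained."""
    carrying = home_price * annual_cost_rate * years
    gain = home_price * ((1 + appreciation) ** years - 1)
    return carrying - gain


def sensitivity(appreciation_rates, home_price, monthly_rent, years,
                rent_growth, annual_cost_rate):
    """Return (rate, buy_cost - rent_cost) for each appreciation rate.

    A negative difference means buying comes out cheaper in this toy model.
    """
    rent = total_rent(monthly_rent, years, rent_growth)
    return [
        (rate, net_cost_of_buying(home_price, years, rate, annual_cost_rate) - rent)
        for rate in appreciation_rates
    ]
```

Plotting the returned pairs as a curve, instead of reporting one value at a single guessed rate, lets a reader see how quickly the answer changes as the uncertain input moves.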

[Image: the Times' rent-versus-buy calculator]

Some of the examples are fairly technical and outwardly trivial; after all, what are the social implications of a sorting algorithm as long as the sorting happens? But they do demonstrate the sheer number of ways to solve a seemingly simple problem, and in some cases (such as sampling algorithms), the results matter immensely.

The examples also demonstrate an opportunity to rethink what a visualization can tell us. Whether static or dynamic, or whether describing a state or a process, a visualization can show and hide as much as it needs.
