Nieman Foundation at Harvard
HOME
          
LATEST STORY
What Scribd’s growing pains mean for the future of digital content subscription models
ABOUT                    SUBSCRIBE
June 26, 2014, 4:51 p.m.
Aggregation & Discovery
LINK: bost.ocks.org  ➚   |   Posted by: Liam Andrew   |   June 26, 2014

Mike Bostock is one of data visualization’s leading lights. As creator of the hugely popular visualization library D3.js and editor in The New York Times’ graphics department, he has had a hand (visibly and invisibly) in most of the widely shared interactives on the web.

Today Bostock posted an adaptation of a celebrated talk he gave at Eyeo 2014 about visualizing algorithms. Full of ideas and gorgeous patterns, it’s an elegant flip to the script of the typical data visualization.

Computers are sometimes conceptually divided between data structures and algorithms, and we usually visualize the data, while ignoring the processes that manipulate it. But Bostock argues that “visualization is more than a tool for finding patterns in data.”

He breaks down various methods for sampling, shuffling, sorting, and making mazes, ably explaining (via text and gorgeous graphics) why there are different types of randomness, for example, or how to most effectively sort a list.

bostock-quicksort

Bostock is interested in the value of visualizing algorithms for learning about and understanding complex processes. A novice could use a visualization to peer into an algorithm’s black box; an expert algorithm builder might visualize in order to debug and reframe it.

He classifies algorithm visualizations based on the level of introspection they give into the data — some only show the output, while others let you peer fully into how data points are being manipulated.

The goal here is to study the behavior of an algorithm rather than a specific dataset. Yet there is still data, necessarily — the data is derived from the execution of the algorithm. And this means we can use the type of derived data to classify algorithm visualizations.

Using his work on the Times’ revamped rent-versus-buy calculator as an example, he shows how opening up the algorithm allows for new questions:

To output an accurate answer, the calculator needs accurate inputs. While some inputs are well-known (such as the length of your mortgage), others are difficult or impossible to predict. No one can say exactly how the stock market will perform, how much a specific home will appreciate or depreciate, or how the renting market will change over time.

We can make educated guesses at each variable — for example, looking at Case–Shiller data. But if the calculator is a black box, then readers can’t see how sensitive their answer is to small changes.

To fix this, we need to do more than output a single number. We need to show how the underlying system works.

rent-vs-buy

Some of the examples are fairly technical and outwardly trivial — in a sense, what are the social implications of a sorting algorithm as long as the sorting happens? But they do demonstrate the sheer number of ways to solve a seemingly simple problem, and in the case of some of these examples (such as sampling algorithms), the results matter immensely.

The examples also demonstrate an opportunity to rethink what a visualization can tell us. Whether static or dynamic, or whether describing a state or a process, a visualization can show and hide as much as it needs.

Show tags Show comments / Leave a comment
 
Join the 15,000 who get the freshest future-of-journalism news in our daily email.
What Scribd’s growing pains mean for the future of digital content subscription models
It turns out that ebook subscription models don’t work very well when people read too much. So what happens next?
How research (and PowerPoints) became the backbone of National Journal’s membership program
“We no longer look at National Journal simply as a news source, but as a collection of resources, as well as a collection of experts we can turn to on occasion.”
Added value: How Harvard Business Review thinks it can add subscribers while getting more expensive
By creating new products and taking advantage of its extensive archives, HBR’s plan is to both offer more to and ask more of subscribers.
What to read next
2843
tweets
A blow for mobile advertising: The next version of Safari will let users block ads on iPhones and iPads
Think making money on mobile advertising is hard now? Think how much more difficult it will be with a significant share of your audience is blocking all your ads — all with a simple download from the App Store.
1763For news organizations, this was the most important set of Apple announcements in years
A new Flipboard-clone with massive potential reach, R.I.P. Newsstand, and news stories embedded deeper inside iOS — it was a big day for news on iPhones and iPads.
762Newsonomics: 10 numbers that define the news business today
From video to social, from mobile to paywalls — these data points help define where we are in the “future of news” today, like it or not.
These stories are our most popular on Twitter over the past 30 days.
See all our most recent pieces ➚
Encyclo is our encyclopedia of the future of news, chronicling the key players in journalism’s evolution.
Here are a few of the entries you’ll find in Encyclo.   Get the full Encyclo ➚
La Nación
Instapaper
News Corp
NBCNews.com
New York
Politico
ABC News
Tumblr
Bayosphere
The Orange County Register
Backfence
Bureau of Investigative Journalism