Twitter  “The great decline in print advertising continues to swamp much of the other progress news companies are making.” nie.mn/1mZvm1C  
Nieman Journalism Lab
Pushing to the future of journalism — A project of the Nieman Foundation at Harvard

A researcher at Stanford has some new insight into how content — specifically, visual content — becomes massively popular on Facebook, a phenomenon he calls cascade sharing.

Justin Cheng wanted to see if it was possible to predict what content would be shared over and over again. With the help of some people at Facebook, they were able to get access to data that showed “which people (nodes) reshared each photograph and at what time.”

Cheng and pals use a portion of their data to train a machine learning algorithm to search for features of cascades that make them predictable.

These features include the type of image, whether a close-up or outdoors or having a caption and so on; the number of followers the original poster has; the shape of the cascade that forms, whether a simple star graph or more complex structures; and finally how quickly the cascade takes place, its speed.

Having trained their algorithm, they used it to see whether it could make predictions about other cascades. They started with images that had been shared only five times, so the question was whether they would eventually be shared more than 10 times.

It turns out that this is surprisingly predictable. “For this task, random guessing would obtain a performance of 0.5, while our method achieves surprisingly strong performance: classification accuracy of 0.795,” they say.

There’s a lot more work to be done in this area of research, but some of Cheng’s findings — for example, content that is shared rapidly is likely to become viral — could be useful in a publishing context.

— Caroline O'Donovan
                                   
What to read next
nytimes-building-990-cc
Ken Doctor    
Is the rise of reader revenue stopping not long after it started?