Nieman Foundation at Harvard
HOME
          
LATEST STORY
Why “Sorry, I don’t know” is sometimes the best answer: The Washington Post’s technology chief on its first AI chatbot
ABOUT                    SUBSCRIBE
April 4, 2016, 2:04 p.m.
Reporting & Production
LINK: panamapapers.icij.org  ➚   |   Posted by: Ricardo Bilton   |   April 4, 2016

Sunday’s report of the Panama Papers detailed a global, decades-long story of bribery, arms deals, tax evasion, and financial fraud that implicates over 100 politicians and public officials. Equally impressive, however, is the project itself, which was kept secret for nearly two years despite the involvement of hundreds of reporters.

Here are some details about how the project came together.

It was indeed a major leak. The entire document set, which consists of data going back 40 years, came in at 2.6 terabytes — the largest that reporters have ever had to work with. It’s a thousand times larger than even the 2010 WikiLeaks Cablegate leak, totaled a mere 1.73 gigabytes. In total, the Panama Papers leak consisted 11.5 million files, including 4.8 million emails, 1 million images, and 2.1 million PDFs.

panama-papers2

The process was high tech. The process started way back in 2014, when an unknown source contacted a reporter at German newspaper Süddeutsche Zeitung via encrypted chat, as Wired reports. Later, to process the millions of leaked, developers from The International Consortium of Investigative Journalists built a custom search engine, which the group shared with partner news organizations around the world. The effort also included a real-time chat system for reporters to talk to each others, algorithms that searched for links between names mentioned in the leaked documents, and software that converted raw data into digital formats.

It was a group effort. While Süddeutsche Zeitung was the initial recipient of the leaked documents, the newspaper didn’t analyze them alone. Working with the ICIJ, the organization tapped teams at France’s Le Monde, Argentina’s La Nación, Switzerland’s Sonntagszeitung, and the U.K.’s Guardian and BBC. In total, over 400 reporters across organizations worked on the project.

Few U.S. outlets were involved. While over 100 news organizations across the world analyzed the Panama Papers, besides the Washington D.C.-based ICIJ, just a small handful of the partner organizations were based in the United States: McClatchy’s Charlotte Observer and Miami Herald, Univisión, and Fusion. (Columbia professor Giannina Segnini is also working with the documents, along with five of her j-school students.) None of The New York Times, The Washington Post, and The Wall Street Journal featured coverage of the Panama Papers on their front pages.

The ICIJ doesn’t plan to release the full set of documents. Unlike WikiLeaks, which dumped its entire set of Cablegate documents on the web for anyone to see, the ICIJ says it doesn’t plan to do the same with the Panama Papers. “We’re not WikiLeaks. We’re trying to show that journalism can be done responsibly,” ICIJ director Gerard Ryle told Wired.

Show tags
 
Join the 60,000 who get the freshest future-of-journalism news in our daily email.
Why “Sorry, I don’t know” is sometimes the best answer: The Washington Post’s technology chief on its first AI chatbot
“For Google, that might be failure mode…but for us, that is success,” says the Post’s Vineet Khosla
Browser cookies, as unkillable as cockroaches, won’t be leaving Google Chrome after all
Google — which planned to block third-party cookies in 2022, then 2023, then 2024, then 2025 — now says it won’t block them after all. A big win for adtech, but what about publishers?
Would you pay to be able to quit TikTok and Instagram? You’d be surprised how many would
“The relationship he has uncovered is more like the co-dependence seen in a destructive relationship, or the way we relate to addictive products such as tobacco that we know are doing us harm.”