Shailesh Prakash, CIO and CTO of The Washington Post, kicked off a series of speakers at the Big Data for Media Week conference in London on Thursday by sharing some of the Big Data tools that can help publishers succeed.
The American publisher currently has 100 million unique visitors per month just within the United States and publishes 1,200 stories per day.
While content is at the heart of the company, The Washington Post recognises that product — especially the design, speed, engagement level — has become its key to success, Prakash told the audience of 200+ participants from news media companies around the world.
Some of the Big Data tools the company has built internally and used are:
- Clavis: The first tool built by The Washington Post, Clavis is a suite of audience targeting technologies The Post has successfully monetised. Clavis automatically analyses everything the media company publishes and arranges them into topics, while at the same time tries to understand who the readers are. It then personalises content and brand messaging. and The Washington Post has seen click-through-rate (CTR) increase significantly through the years.
- Virality: This tool tries to predict an article’s popularity. It allows editors to prioritise content, identifies under-performing articles, and eventually supports advertising opportunities.
- Bandito: This multi-armed bandit for content variation testing is a dynamic optimisation of variants using real-time user engagement feedback. It takes a combination of headlines and images, experiments on them, and finally exploits the best performing combination by automatically directing the winning variants.
- Headliner: This is a somewhat controversial tool, whereby The Washington Post attempts to automatically generate headlines based on the story content. It can also suggest different headlines for different channels and devices. The three algorithms used are: hedge trimmer, multi-sentence compression, and neural machine translation.
- Heliograf: This is an intelligent, automated storytelling agent. It uses Artificial Intelligence to automatically write stories based on structured data and deliver them to specific channels and personalise stories for readers.
The Washington Post has successfully used these tools during the Olympics and the U.S. election. Prakash does not believe these tools will replace journalists in the newsroom; instead, they can allow journalists to focus on investigative pieces as machines take over the more mundane reporting pieces.
Some of the other tools that they use are Tau (an article-scoring tool), Loxodo (a real-time data analysis), Riveting (to understand how riveting a story is), and BreakFast (measuring how successful their alerts are).
The tools presented work across multi-media platforms, be they texts, audio, or video. Prakash emphasised that all these tools rely on relatively new Big Data and cloud technologies — all these simply would not have been possible 10 years ago.