Content Harvesting

No News Is Good News – Monitoring Average Sentiment By News Network With Diffbot’s Knowledge Graph

Ever have the feeling that news used to be more objective? That news organizations — now media empires — have moved into the realm of… Keep Reading

Merrill Cook

Download This Dataset of 12,118 Yahoo Answers for $1

With only 2 weeks left till May 4th (be with you), the internet is bursting with excitement over all the work that needs to be… Keep Reading

Jerome Choo

These Are The Hardest Page Types To Scrape — With Workarounds For Each

Phrases like “the web is held together by [insert ad hoc, totally precarious binding agent]” have been around for a while for a reason. While… Keep Reading

Merrill Cook

The 25 Most Covid-Safe Restaurants in San Francisco (According to NLP)

A few weeks ago, we ran reviews for a Michelin-reviewed restaurant through our Natural Language API. It was able to tell us what people liked… Keep Reading

Merrill Cook

Context Matters, Tracking Quote Spread Across The Web In A Historic Year

Hindsight is 20/20. And as we usher in a new president in what has been one of the most tumultuous years in American history, we… Keep Reading

Merrill Cook

Robotic Process Automation Extraction Is A Time Saver. But it’s Not Built For the Future

Enough individuals have heard the siren song of Robotic Process Automation to build several $1B companies. Even if you don’t know the “household names” in… Keep Reading

Merrill Cook

How to Track Market Indicators Using News Monitoring Scheduling

The public web is chock full of indicators with implications for stock prices, commodities prices, supply chain issues, or the general perceived value of an… Keep Reading

Merrill Cook

Stories By DQL: Tracking the Sentiment of a City

The story: sentiment of news mentions of Gaza fluctuate by as much as 2000% a week. 90% of news mentions about Minneapolis have had negative… Keep Reading

Merrill Cook

Stories By DQL: George Floyd, Police, and Donald Trump

We will get justice. We will get it. We will not let this door close. – Philonise Floyd, Brother of George Floyd News coverage this… Keep Reading

Merrill Cook

Comparison of Web Extraction Providers: Import.io vs. Diffbot

Harnessing the public web as data is one of the smartest things product, marketing, PR, and machine learning teams can do. It also opens up… Keep Reading

Merrill Cook