Analyzing Consumer Marketplaces Using Crawlbot and the Product API

Miles Grimshaw of Thrive Capital recently used Crawlbot and our Product API to analyze product availability and extract pricing data from a number of online fashion marketplaces — to help determine the scale, margins, customer profile and trends of each site, and to inform their investment decision-making. Miles writes about his experience and analysis on his blog. Nice […]

Read More

Announcing Semantic Hack (June 1, 2013)

What could you build if the entire web was your database? Could you do it in a day? We’re glad to be working with the fine folks at SemanticWeb.com to host the inaugural Semantic Hack at the Semantic Technology & Business Conference in San Francisco on June 1, 2013. See additional details and registration at http://semantichack.eventbrite.com, and more […]

Read More

Diffbot Leads in Text Extraction Shootout

In a recent benchmark, Diffbot placed first overall among text extraction APIs on an academic evaluation set and one sampled from Google News. Tomaz Kovacic, a university student in artificial intelligence, recently conducted a comprehensive benchmark of text extraction methods as part of his thesis. Included in the study are commercial vendors as well as open-source APIs […]

Read More