Crawly: Never Write Another Web Scraper

Dru Wynings • March 22, 2016

Turn websites into data in seconds. Crawly spiders and extracts complete structured data from an entire website.

Input a website and we’ll crawl and automatically extract the article’s:

Title
Text
HTML
Comments
Date
Entity Tags
Author

AuthorUrl
Images
Videos
Publisher Country
Publisher Name
Language

which you can download in a CSV or as JSON.

Crawly is a free tool built on top of Diffbot’s Crawlbot and Automatic Web Data Extraction APIs.

I encourage you to give it a try and see for yourself Diffbot’s industry-leading accuracy.

Up Next