Turn websites into data in seconds. Crawly spiders and extracts complete structured data from an entire website.
Input a website and we’ll crawl and automatically extract the article’s:
- Title
- Text
- HTML
- Comments
- Date
- Entity Tags
- Author
- AuthorUrl
- Images
- Videos
- Publisher Country
- Publisher Name
- Language
which you can download in a CSV or as JSON.
Crawly is a free tool built on top of Diffbot’s Crawlbot and Automatic Web Data Extraction APIs.
I encourage you to give it a try and see for yourself Diffbot’s industry-leading accuracy.
You must be logged in to post a comment.