GrabzIt's Web Scraper API, Scrape the Web!
The Web Scraper API allows you to control when scrapes start and stop, and to integrate scraped data back into your application. This integration is achieved through a callback handler, which is a script or application at a publicly accessible URL that processes the data sent from GrabzIt's Web Scraper. Complete files are posted to this callback handler sequentially, so, for instance, it could receive a series of images before ending with a JSON file.
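As a rough illustration of what a callback handler could look like, the sketch below uses only the Python standard library. It assumes each file arrives as a multipart/form-data POST containing one file part; this text does not state the actual request layout, so treat that as an assumption. The names `extract_upload`, `CallbackHandler` and `serve` are hypothetical and are not part of any GrabzIt library.

```python
from email.parser import BytesParser
from email.policy import default
from http.server import BaseHTTPRequestHandler, HTTPServer


def extract_upload(content_type, body):
    """Return (filename, data) for the first file part of a multipart body, or None."""
    # Prepend the Content-Type header so the email parser can locate the boundary.
    header = ("Content-Type: " + content_type + "\r\n\r\n").encode("ascii")
    message = BytesParser(policy=default).parsebytes(header + body)
    if not message.is_multipart():
        return None
    for part in message.iter_parts():
        filename = part.get_filename()
        if filename:
            return filename, part.get_payload(decode=True)
    return None


class CallbackHandler(BaseHTTPRequestHandler):
    # Filenames in arrival order, shared by all requests (assumed one file per POST).
    received = []

    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        body = self.rfile.read(length)
        upload = extract_upload(self.headers.get("Content-Type", ""), body)
        if upload is None:
            status, reply = 400, b"no file found in request"
        else:
            filename, data = upload
            self.received.append(filename)
            reply = ("received %s (%d bytes)" % (filename, len(data))).encode("utf-8")
            status = 200
        # A descriptive response body makes the handler easier to diagnose.
        self.send_response(status)
        self.send_header("Content-Type", "text/plain")
        self.send_header("Content-Length", str(len(reply)))
        self.end_headers()
        self.wfile.write(reply)


def serve(port=8000):
    """Run the handler until interrupted; the port is an arbitrary choice."""
    HTTPServer(("", port), CallbackHandler).serve_forever()
```

Calling `serve()` starts the handler locally; it would still need to be deployed at a publicly accessible URL before the Web Scraper could reach it.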
To get started, first create a scrape, then choose the Callback URL option from the Export Options tab and enter the URL of your callback handler, e.g.
If you are having any issues with your callback handler, choose Debug mode from the Scrape Options tab. This writes the response returned by the callback handler to the logs.
To process scraped data inside your callback handler, choose the JSON or XML option on the Export Options tab, as these return the data in a format that can easily be read by any object-oriented language.
For data that is not JSON or XML, your processing options are limited because the data is not very machine readable, so the best option may be to save the file to disk or to a database.
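The choice described above can be sketched as a small dispatch function: parse JSON and XML, and save anything else to disk. This is a minimal sketch that assumes the handler already has the file name and raw bytes of each posted file, and that the file extension identifies the format. The function name `process_scraped_file` and the `scraped_files` directory are hypothetical.

```python
import json
import os
import xml.etree.ElementTree as ET


def process_scraped_file(filename, data, save_dir="scraped_files"):
    """Parse JSON or XML data; save any other file type to disk.

    Returns a (kind, value) pair where kind is "json", "xml" or "saved".
    """
    extension = os.path.splitext(filename)[1].lower()
    if extension == ".json":
        # Parsed into ordinary Python dicts and lists.
        return "json", json.loads(data.decode("utf-8"))
    if extension == ".xml":
        # Parsed into an ElementTree element that can be searched.
        return "xml", ET.fromstring(data)
    # Anything else (images, PDFs, ...) is written to disk unchanged.
    os.makedirs(save_dir, exist_ok=True)
    # basename() discards any directory components in the supplied name.
    path = os.path.join(save_dir, os.path.basename(filename))
    with open(path, "wb") as handle:
        handle.write(data)
    return "saved", path
```

For example, `process_scraped_file("results.json", b'{"title": "Example"}')` returns the parsed dictionary, while an image file is written into `scraped_files` and its path is returned.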
To help with the integration process, GrabzIt provides some scraper APIs. Below are the languages we have currently created APIs for. However, as our code is open source and available on GitHub, there is no reason for you not to make one for a programming language not listed here. If you do, why not share it with the world?