Archive for scraping

Wrapper Development Tools

This is a useful survey of wrapper development toolkits.

Marmite: Re-purposing Web Content through End-User Programming

Marmite is a tool for making mashups. It also uses Solvent for wrapper creation.

Using Solvent to extract data from structured pages

Crowbar

Crowbar – SIMILE
Crowbar is a web scraping environment built on a server-side, headless, Mozilla-based browser.
It is a research prototype for investigating how to run Piggy Bank JavaScript scrapers from the command line and thus automate web site scraping.
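
To give a feel for what driving a headless browser from a script looks like, here is a minimal Python sketch. It is not taken from the Crowbar documentation: the local address, the port, and the `url` and `delay` form parameters are assumptions based on my recollection of how Crowbar is driven over HTTP, and the helper names `build_crowbar_request` and `fetch_rendered` are made up for illustration. Check them against the Crowbar page before relying on any of it.

```python
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# Assumption: a Crowbar instance is already running on this machine.
# The address and the "url"/"delay" parameter names are not confirmed
# by this post; adjust them to match your setup.
CROWBAR_ENDPOINT = "http://127.0.0.1:10000/"


def build_crowbar_request(page_url, delay_ms=3000, endpoint=CROWBAR_ENDPOINT):
    """Build (but do not send) a POST asking the headless browser to load page_url."""
    form = urlencode({"url": page_url, "delay": delay_ms}).encode("ascii")
    return Request(endpoint, data=form, method="POST")


def fetch_rendered(page_url, delay_ms=3000, endpoint=CROWBAR_ENDPOINT, timeout=30):
    """Send the request and return the response body as text.

    The body is expected to be the page as the browser sees it after its
    scripts have run, which is what a scraper would then work on.
    """
    request = build_crowbar_request(page_url, delay_ms, endpoint)
    with urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")
```

With the server running, `fetch_rendered("http://simile.mit.edu/")` would return the rendered page for a scraper to process. Building the request is kept separate from sending it so the first half can be inspected without any server at all.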
