This is a useful survey of wrapper development toolkits.
Archive for scraping
Marmite: Re-purposing Web Content through End-User Programming
Marmite: Re-purposing Web Content through End-User Programming
This is about making mashups. It also uses Solvent for wrapper creation.
Using Solvent to extract data from structured pages
~wingerz ยป Using Solvent to extract data from structured pages
This is another tutorial for Solvent.
Crowbar
Crowbar – SIMILE
Crowbar is a web scraping environment based on the use of a server-side headless mozilla-based browser.
It is used as a research prototype to investigate how to enable the running of Piggy Bank javascript scrapers from the command line and thus automating web sites scraping.