1: Process Review

As you'll remember from the previous tutorial, extracting information from web sites using screen-scraper typically involves four main steps:

Use the proxy server to determine the exact files that need to be requested in order to get the information you're after.

Create a scraping session with scrapeable files that define the sequence of pages screen-scraper will request.

Generate extractor patterns to define the exact information you need screen-scraper to grab from each page.

Write small scripts or programming code to invoke screen-scraper and/or work with the data it extracts.