1: Process Review
Screen Scraping Process
As you'll remember from the previous tutorial, extracting information from web sites using screen-scraper typically involves four main steps:
- Use the proxy server to determine exactly which files need to be requested to get the information you're after.
- Create a scraping session with scrapeable files that define the sequence of pages screen-scraper will request.
- Generate extractor patterns to define the exact information you need screen-scraper to grab from each page.
- Write small scripts or programming code to invoke screen-scraper and/or work with the data it extracts.
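The last three steps can be illustrated with a minimal, tool-agnostic sketch in Python. This is not screen-scraper's API. The sample page, the `PRODUCT_PATTERN` regular expression, and the `extract_products` helper are all hypothetical names invented for illustration. The hard-coded HTML stands in for a page a scrapeable file would request, the regular expression plays the role of an extractor pattern, and the final loop is the kind of small script that works with the extracted data.

```python
import re

# Hypothetical stand-in for a page that a scrapeable file would request.
# In a real session this text would come from the web server.
SAMPLE_PAGE = """
<table>
  <tr><td class="title">Blue Widget</td><td class="price">$4.50</td></tr>
  <tr><td class="title">Red Widget</td><td class="price">$12.00</td></tr>
</table>
"""

# Rough analogue of an extractor pattern: fixed page text surrounds
# named placeholders (title, price) marking the data to pull out.
PRODUCT_PATTERN = re.compile(
    r'<td class="title">(?P<title>.*?)</td>\s*'
    r'<td class="price">\$(?P<price>[\d.]+)</td>'
)


def extract_products(html):
    """Apply the pattern to a page and return one record per match."""
    return [
        {"title": m.group("title"), "price": float(m.group("price"))}
        for m in PRODUCT_PATTERN.finditer(html)
    ]


# Rough analogue of a small script that works with the extracted data.
products = extract_products(SAMPLE_PAGE)
for product in products:
    print(f"{product['title']}: {product['price']:.2f}")
```

The pattern is tied to the exact markup around each value, which is why the first step, finding out precisely which files are requested and what they contain, comes before writing any patterns.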
scraper on 07/16/2010 at 4:25 pm