1: Process Review
Screen Scraping Process
As you'll remember from the previous tutorial, extracting information from web sites using screen-scraper typically involves four main steps:
- Use the proxy server to determine exactly which files need to be requested to get the information you're after.
![](/files/media/tutorials/tutorial1/how_the_proxy_server_works.png)
- Create a scraping session with scrapeable files that define the sequence of pages screen-scraper will request.
![](/files/media/tutorials/tutorial1/convert-proxy-transactions.png)
- Generate extractor patterns to define the exact information you need screen-scraper to grab from each page.
![](/files/media/tutorials/tutorial1/extractor-pattern-tokens-like-stencils.gif)
- Write small scripts or programming code to invoke screen-scraper and/or work with the data it extracts.
![](/files/media/tutorials/tutorial2/scripting_paper-hand-draw.png)
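To make the "stencil" idea behind extractor patterns more concrete, here is a minimal sketch in Python. It is not screen-scraper's own code or API. The `~@NAME@~` token syntax, the helper names `pattern_to_regex` and `extract`, and the sample HTML are all assumptions made up for this illustration. The sketch treats everything outside a token as literal text that must match the page, and everything inside a token as a hole through which data is captured.

```python
import re

# Assumed token syntax for this sketch only: ~@NAME@~ marks a value to capture.
TOKEN = re.compile(r"~@([A-Za-z_]\w*)@~")


def pattern_to_regex(pattern):
    """Convert a stencil-like pattern into a regex with one named group per token."""
    parts = []
    pos = 0
    for match in TOKEN.finditer(pattern):
        # Text between tokens must match the page literally.
        parts.append(re.escape(pattern[pos:match.start()]))
        # Each token becomes a non-greedy named capture group.
        parts.append("(?P<%s>.*?)" % match.group(1))
        pos = match.end()
    parts.append(re.escape(pattern[pos:]))
    return re.compile("".join(parts), re.DOTALL)


def extract(pattern, page):
    """Apply the pattern to the page and return one dict per match."""
    regex = pattern_to_regex(pattern)
    return [m.groupdict() for m in regex.finditer(page)]


# Hypothetical page content standing in for a requested file.
page = (
    '<li><a href="/p/1">Widget</a> $9.99</li>'
    '<li><a href="/p/2">Gadget</a> $19.50</li>'
)

# Hypothetical extractor pattern: literal HTML with three tokens.
pattern = '<a href="~@URL@~">~@TITLE@~</a> $~@PRICE@~</li>'

records = extract(pattern, page)

# A tiny stand-in for the "script" step: do something with the extracted data.
for record in records:
    print(record["TITLE"], record["PRICE"], record["URL"])
```

Each match yields one record, so a pattern that matches twice on a page produces two records. That mirrors the flow above: a page is requested, a pattern is laid over it, and a small script works with whatever shows through.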