10: Review
Quick Summary
When learning to do something new, it is important to see what you have done, not just what you still don't know how to do. With that in mind, take a moment to review what you have accomplished in this tutorial. To help you reflect, we have provided a list of the major steps from the tutorial. If you want to review any of the steps, you can click on its link to return to that section of the tutorial.
- Configure Proxy Server
- Record Page Transactions
- Create Scraping Session
- Add Scrapeable Files
- Inspect Scrapeable File Parameters
- Create Initialization Script
- Use Session Variables in Scrapeable File Parameters
- Add Script Association
- Create Details URL Extractor Pattern
- Add Regular Expressions to Extractor Tokens
- Use Extractor Patterns to Make Pattern More Stable
- Get Next Page Link From Extractor Pattern
- Invoke Scrapeable File Manually From Script (Details Page)
- Iterate Through Pages Using Next Link
- Use Sub-Extractor Patterns to Extract Product Details
- Write Data to Tab-delimited File
- Login to Site
scraper on 08/14/2010 at 12:48 pm