screen-scraper support for licensed users
How to change the URL in a scrapeable file?
One of the tasks I will have to do after I learn screen-scraper is to check web pages whose URLs change periodically, perhaps once a day or more. In order to handle this, I am thinking about doing this if possible: The .NET program will determine what the URL of the web page is likely to be. Use a .NET program to change the URL contained by a scrapeable file before the program tells it to scrape the web page.
Is it possible for a .NET program to change the name of the URL contained by a scrapeable file before it scrapes a web page? If so, how would you do that?
Repeat a scraped field
I have this website I am trying to scrape that list one field only once, but I would like for it to repeat next to each tracking number. Basically it is the container number and numerous tracking numbers are inside the container number. I would like for the container number to repeat for each tracking number in the scrape. Is that possible. Below is a snippet of html code from the website.
The part I would like to repeat for each tracking number in the scrape
Ability of screen-scraper to simulate user interaction with a web site
I have a basic question about the ability of screen-scraper to simulate user interaction with a web site:
Can you pass parameters via the HTTP GET method to ALL web sites in order to insert data into fields? Or does a web site have to be coded to accept these parameters? Are there any web sites that screen-scraper won't work with?
Is there anything that simulates clicking a specific button on a web page?
Waiting on page to load
Hi all-
I was wondering if its possible to have screen scrape wait until a page loads so I can get the data off of it? Right now its wanting to scrape the processing page vs wait on the main page to load. The website works where you submit a tracking# and hit retrieve and it goes to a processing page then after a few seconds it displays the results. Any way around this?
VB.NET program not getting data returned
I created the Shopping Site scraping sesison in tutorial 2. Then I wrote a VB.NET program to use it. The program is basically the VB.NET code provided with the documentation. It does have the reference to the RemoteScrapingSession dll and it has an Imports Screenscraper at the top of the file. The computer is running Windows 7 and IE8. I started the screen scraper service using the Windows 7 task manager.
Strategy for multiple tables
What would be the best strategy to harvest data from the following set of tables?
All the data appears on one page, but the number of table rows may vary.
2009
1st quarter
50
$80,802
$82,500
2nd quarter
55
$80,948
$72,112
3rd quarter
74
$48,495
$65.445
4th quarter
58
$68,455
$86,445
Total:
307
$89,290
$80,000
2008
So Lost on this Extractor
Here is a snippet of code that is returned after you submit a tracking request on our internal tracking website. The code that is returned seems to be XML vs HTML. For the life of me I can not get an extractor pattern to work on any of it except the tracking field. I would like to get the other fields, but completely lost.
Token Parsing Error
Any ideas why I would be getting this error? Token Parsing Error? I am scraping from a website where the date is in an XML format. Below is a snippet of the code in a text file I am extracting the field called "tracking-nbr"
Session Variables
Hi all-
I am new to screen scraper and I am having a problem. I have a scrape session setup to capture some scan information off our internal website here. It scrapes a field called CONS. I then have another scrape session setup to scrape off a different website using the CONS number that I scraped off the first website. Can this field be stored in a session variable and used later for a different scrape session?
Thanks
Jason
How to extract data from PDF and Excel files?
I would like to extract data that a web site makes it available in Excel spreadsheet form. Is there a way that I can get that data using screen-scraper? If it's not built into the product, is there an indirect way to get it, such as a program that converts the speadsheet into an XML or HTML file that SS can assign to a variable that it makes available to a .NET program?