screen-scraper support for licensed users

Questions and answers regarding the use of screen-scraper. Only licensed Professional and Enterprise Edition users can post; anyone can read. Licensed users please contact support with your registered email address for access. This forum is monitored closely by screen-scraper staff. Posts are generally responded to in one business day.

How to interpret cookies and headers and how to use them?

I am scraping the site https://committing.efanniemae.com/eCommitting/eCommitting (Fannie Mae Oak Bank). I created a proxy session, a scraping session, and created a scrapeable file for each transaction. When I ran the scraping session, it failed right away. I think it failed because cookies and headers changed between the proxy and scraping sessions. I haven't been able to successfully handle this yet.

Simultaneously New Scraping Session and Running Web Interface

Hello Support,

Can I work on a new scraping session while the web interface is running another scraping session?

Best,

Adrian

411.com going to the next page

Hi all-

I am a little stuck on how to make this happen and get screen scraper to go to the next page and scrape the required information. I looked over the API's and the other scripts for the NEXT page, but I am still LOST. I have attached the scrape file that I have made so far.

Jason

How to handle periodic changes of session state values and cookies that relate to denial of access

I set up a up a proxy session and a scraping session, then added scrapeable files. Then I added sesison variables that allow me to insert userID and password into a scrapeable file. It all works. However, I did the same thing previously and it worked too, but then it stopped working after a few hours. After while the scraping session began issuing a 404 code, which is the status code that means the web site denied access. I logged on to the web site manually and that worked, so the userID and password were OK. So something else changed.

Having trouble getting file to save

See Resolved Section Below...

I'm attempting to save the results of the following URL:

"http://wogcc.state.wy.us/DumpByApdA.cfm?bmon=01&bday=01&byear=2001&emon=03&eday=14&eyear=2010&cType=1&Points=Y"

This URL runs fine in a browser.

However, when I run this in a scrape (attached) I get the following error:

ERROR: Failed to retrieve the file: "http://wogcc.state.wy.us/APD20010101_20100315.txt". The server returned a status code of 404.

Here is the script I call to save the file:

CSV File returned as attachment. How do I save this to a file?

I've recieved an attachment as a response. A part of this is shown below:

TTP/1.1 200 OK
X-Powered-By: ASP.NET
Content-Type: application/csv
Connection: close
Server: Microsoft-IIS/6.0
Date: Fri, 12 Mar 2010 15:09:11 GMT
Content-Disposition: attachment; filename="permits.csv"

"http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">

"Coudn't load main class." Error

When trying to start screen scraper professional addition I receive the error "Couldn't load main class.".
This error also occurs when trying to uninstall the program.

Has anyone ran into this problem in the past and found a solution?

DataSet scope

Having a problem with the scope for DataSet.
This is what I'm doing.

Fetching a zip from a file of zipcodes.
Passing it to the url.

Extractor pattern 1 looks for the place in the code that contains 12 months of data, and saves that portion of the page to ~@YearData@~.
Fine.
Calls "DataExtract" script "after each pattern application"

Clearing an existing cookie and setting a new one

Hi,

One of the sites we're trying to crawl sets the language to view the pages in via a cookie. So we're trying to set the cookie so that we get the correct language content. We have two scraping files. Product and French Product. In the scripts before Product is executed, we set the cookie to en_US. In the scripts before French Product is executed, we set the cookie to fr_FR.

What we end up seeing in the request is the cookie being set twice. Once with en_US and once with fr_FR.

It looks like this:

Mile-Long Viewstate

Hello Support,

I have a mile-long viewstate and eventvalidation that is truncated, but only when I click into value portion of the viewstate and eventvalidation. I noticed everything works fine until I click into the value portion and then the screen-scraping does not work anymore. As a result, I created viewstate and eventvalidation variables and set the mile-long variables.