screen-scraper public support
How to reference a scraping file from a different scraping session
I have a series of scraping sessions with scraping files that are consistent across all sessions. How do you reference a scraping file in a different session? How do you pass the variable information between the sessions?
If sessions are accessing the scraping files concurrently is there anyway of keeping instance data separate and specific to the session accessing them?
Scrape Site
Hi,
I'm trying to scrape the following page\url: http://www.racingpost.com/horses/result_home.sd
passing in 2 parameters to form the url:
http://www.racingpost.com/horses/result_home.sd?race_id=517517&r_date=2010-11-27
The issue I'm having is that none of the html I need for the results is showing in the last response. When I click view in browser the results box doesn't even appear. Can anyone help me please.
beginner questions: searching forum, renaming sessions
1. Is there a search function for this forum?
2. Following Tutorial 1 on Ubuntu 10.04 LTS, I initially was able to save Proxy Server as 'HelloWorld'. After a few minutes I was not able to rename the New Scraping Session and also, not able to rename the HelloWorld Proxy Server.
Can not pass parameters for Details page
Hi,
I'm try to scape details off of the CA Secretary of State website: http://kepler.sos.ca.gov/
I have no trouble getting results based on a search, but can not figure out how to pass paramaters for them in order to get to the Details page. The variable passed to get to the details page seem to be encrypted...is there any way to use them in Screen Scraper?
Any help would be greatly apprecaited.
Thanks,
Chris
Never mind...I figured it out. This was the first ASP.net site I tried to scrape...it had me totally stumped.
storing session variable not working, problem with tokens?
I'm a NEWBIE tring the following scripts, following your tutorials but on my text file, i'm getting just nulls, plus when i create an extractor pattern "edit token" is gray, any help is much appreciate
here are scripts, extractor patterns and logs
SCRIPTS
TITOLO_SPIAGGE
session.log( "SCRIVO IL TESTO." );
// Open up the file to be appended to.
out = new FileWriter( "testo_spiagge.txt", true );
// Write out the data to the file.
out.write(session.getVariable("TITOLO_SPIAGGE")+ "," );
out.write("\n" );
// Close up the file.
out.close();
TESTO_SPIAGGE
scrapeableFile.extractData being invoke by script within script
when I call a script which calls another script, the scrapeableFile.extractData which resides on the second script doesn't work. Is this not possible?
Secured site login issue..
How to resolve login issue for secured site like gmail..i keeps on redirecting back to login page.
Problems writing to a CSV file with null values
I need to be able to find a way of filling a null with a blank. The
out.write (datarecord.get ("token")+ "t");
does not work, as some of the field are sometimes missing on the site.
I read a suggestion here about using:
out.write(nullToEmptyString(session.getVariable("make")) + "," );
instead, but I keep getting an error message that I can't understand, (newbie to all this)
An error occurred while processing the script: WriteDataToAFile
The error message was: class bsh.ParseException (line 31):
403 Error received when trying to download images
My goal is to download multiple gif images from a particular site.
I have an extractor pattern as follows:
a href="/members/product_images/filestore/~@GIF_IMAGE_END_URL@~.GIF" onclick="return true" class="reddownload"
The token is stored as a session variable.
I wrote the following script to handle the downloading (it runs after each pattern match - not sure if this is correct):
how to set login credentials when getting redirected to login page frm search page?
I am using the basic edition of screen scraper. I need to scrape a page that i get after valid login credentials.But even after following the procedure mentioned in Logging in section of Tutorial 2 i am not able to get the source code of the particular page in the last response tab. The response is redirected to login page. Please mention as to how to get the source code of the search page on which i can apply extractor patterns.