screen-scraper public support

Questions and answers regarding the use of screen-scraper. Anyone can post. Monitored occasionally by screen-scraper staff.

Removing unwanted html tags in output file...

Hey guys,

After all the data I've collected has been written to my text document, how do I prevent HTML tags from appearing in it? I guess what I'm asking is how do I make a filter that replaces certain unwanted tags with, let's say, a space or a comma.

thanks a lot!

By the way, I'm coding in interpreted Java.
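One way to approach this kind of filter is a regular-expression replace applied to each value before it is written out. The following is a minimal sketch in plain Java, not screen-scraper's own API: the class name `TagFilter` and the method `replaceTags` are hypothetical, and the pattern assumes simple tags with no `>` inside attribute values.

```java
import java.util.regex.Matcher;

public class TagFilter {
    /**
     * Replaces every HTML-like tag (anything between '<' and '>') in the
     * text with the given replacement, for example a space or a comma.
     */
    public static String replaceTags(String text, String replacement) {
        // quoteReplacement keeps '$' and '\' in the replacement literal
        return text.replaceAll("<[^>]+>", Matcher.quoteReplacement(replacement));
    }

    public static void main(String[] args) {
        System.out.println(replaceTags("Line one<br>Line two", " "));
    }
}
```

The same `replaceAll` call could be applied to an extracted value inside a script just before the value is written to the file.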

Regular Expression matches for 'Houses For Sale'

Hi everybody,
I am working on scraping a real estate classifieds site.
I am having trouble getting values for the 'houses for sale' section of that site. I think the issue is in my regular expression, because I specify 'houses for sale' but the results I get are for the 'houses' section.

What I actually need is to exclude the 'houses' section values and get only the 'houses for sale' section values.

Please help me to solve the above issue.

Thanks in advance,
Vivek
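The site's markup is not shown, so the actual cause cannot be confirmed. A common reason for this symptom is a pattern that is satisfied by the shorter heading. Here is a minimal sketch in plain Java that requires the whole phrase to match, assuming the section headings are available as plain strings. The class and method names are hypothetical.

```java
import java.util.regex.Pattern;

public class SectionMatch {
    // Requires the complete phrase; whitespace between words may vary
    private static final Pattern FOR_SALE =
            Pattern.compile("Houses\\s+For\\s+Sale", Pattern.CASE_INSENSITIVE);

    /** Returns true only when the heading is exactly "Houses For Sale". */
    public static boolean isHousesForSale(String heading) {
        // matches() tests the entire string, so a bare "Houses" is rejected
        return FOR_SALE.matcher(heading.trim()).matches();
    }

    public static void main(String[] args) {
        System.out.println(isHousesForSale("Houses For Sale"));
        System.out.println(isHousesForSale("Houses"));
    }
}
```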

Replacing a Variable in the URL field of a Scrapeable file

In the URL field of a scrapeable file I have the following hypothetical value

http://yahoo.com/~@QUOTE@~

I want this variable "QUOTE" to be replaced with a value when I invoke "Run Scraping Session".

This is what I tried, without success.

Created a script with the following code.

session.setVariable( "QUOTE","GOOG" );

I attached it to the scrapeable file and set "When to run" to "Before file is scraped".
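To make the expected behaviour concrete, here is a minimal sketch in plain Java of the substitution the post is hoping for: each `~@NAME@~` token in the URL is replaced by the value stored under `NAME`. This is not screen-scraper's own code. The class `UrlTokens`, the method `expand`, and the use of a `Map` to stand in for session variables are all assumptions made for illustration.

```java
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class UrlTokens {
    // Matches tokens of the form ~@NAME@~ and captures NAME
    private static final Pattern TOKEN = Pattern.compile("~@(\\w+)@~");

    /** Replaces each ~@NAME@~ token with vars.get(NAME); unknown tokens are left as-is. */
    public static String expand(String url, Map<String, String> vars) {
        Matcher m = TOKEN.matcher(url);
        StringBuffer out = new StringBuffer();
        while (m.find()) {
            String value = vars.get(m.group(1));
            String replacement = (value != null) ? value : m.group();
            m.appendReplacement(out, Matcher.quoteReplacement(replacement));
        }
        m.appendTail(out);
        return out.toString();
    }
}
```

With the variable `QUOTE` set to `GOOG`, the URL in the post would be expected to expand to `http://yahoo.com/GOOG`.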

Need help: navigate to next page

Hi,
I am using screen-scraper to extract data from a web page. I am stuck when I need to go to the second page of the search results.
The website is built on ASP.NET. From the tutorial I learned that it is possible to navigate to the next page if you have a next-page link, and that you can grab that link by extracting it from the page. But on my website there is no ordinary link to the next page. Because the site is built on ASP.NET, the address of the next page looks like this:

javascript:__doPostBack('ctl00$C1$pag2$lnkPage2','')
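In ASP.NET Web Forms, `__doPostBack(target, argument)` copies its two arguments into the hidden form fields `__EVENTTARGET` and `__EVENTARGUMENT` and submits the form. A request for the next page therefore usually has to be a POST carrying those two values, along with the page's other hidden fields such as `__VIEWSTATE`. As a minimal sketch in plain Java, the two values can be pulled out of the link text as follows. The class and method names are hypothetical.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class PostBackLink {
    // Captures the two single-quoted arguments of __doPostBack('target','argument')
    private static final Pattern POSTBACK =
            Pattern.compile("__doPostBack\\('([^']*)','([^']*)'\\)");

    /**
     * Returns {eventTarget, eventArgument} parsed from the href,
     * or null when the href is not a __doPostBack call.
     */
    public static String[] parse(String href) {
        Matcher m = POSTBACK.matcher(href);
        if (!m.find()) {
            return null;
        }
        return new String[] { m.group(1), m.group(2) };
    }
}
```

For the link above, the first element would be `ctl00$C1$pag2$lnkPage2` and the second an empty string.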

Using scraper to log into Google..

So I have a site I want to scrape, but it requires that I log into a Google account first.

However, when I run the proxy session and go to a page that requires me to log into Google (e.g. the sign-in link on the top right of the home page, or gmail.com), it gives me a page error.

Anyone know any way around this?

EDIT: Looks like I was having a secure login problem with everything--I reinstalled S-S and all works well now :D

Iterate only x times through a pattern match

Hi,

I have a 'results' page which can list many items in repeating table cells. I can extract them fine; however, I only want to run the 'details' page for the first 5 items it finds.

How can that be achieved?

Thank you.
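The general idea is to cap the loop at the smaller of the limit and the number of matches. A minimal sketch in plain Java, assuming the extracted items are available as a list. The class `FirstN` and its method are hypothetical names, not part of screen-scraper.

```java
import java.util.ArrayList;
import java.util.List;

public class FirstN {
    /** Returns a copy of at most the first 'limit' items, preserving order. */
    public static <T> List<T> firstN(List<T> items, int limit) {
        // Math.min guards against lists shorter than the limit
        int end = Math.min(limit, items.size());
        return new ArrayList<>(items.subList(0, end));
    }
}
```

Inside a script, the same cap could instead be applied with a counter that is incremented after each match and checked before the 'details' page is requested.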

Testing on multiple machines

I have a co-worker who wants to look at the extracted data and patterns from a scraping session that I have created. He installed screen-scraper on his local machine and imported what I have created. We set up the proxy and did all the steps that were outlined in the tutorials; however, when he looks at the text file created and applies the pattern, he gets an empty set of data where I have data in the patterns. As a test he created a new scrape file and copied one pattern over. After running that he started seeing data for that session variable.

Looking for help with SS

I was wondering if anyone could give me about an hour of their time to help with a project that I'm working on with screen-scraper? Just to help plan things and keep me organized on what I should be focusing on, via Skype/MSN, that kind of thing. Please let me know :)

Help with long loop

I currently have a single scraping session and scrapeable file that I need to run through a VERY long loop. I currently have a batch file that runs it from the command line, but I would like to have multiple sessions running at the same time. Do I need to run tandem sessions? If so, how? How would I designate different starting points for ID numbers to pass along, with different output files for the results?

Thanks in advance
Kelly
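One way to think about the starting points is to split the ID range into contiguous chunks, one per session, and derive a separate output file name from each chunk's index. The following is a minimal sketch in plain Java. The class `IdRanges`, its methods, and the `results_N.txt` naming are assumptions for illustration; how the start ID, end ID, and file name are handed to each session depends on how the sessions are launched.

```java
public class IdRanges {
    /**
     * Splits the inclusive range [firstId, lastId] into 'workers' contiguous
     * chunks. Each row is {start, end}, both inclusive. If there are more
     * workers than IDs, the surplus rows have end < start (empty range).
     */
    public static long[][] split(long firstId, long lastId, int workers) {
        long total = lastId - firstId + 1;
        long[][] ranges = new long[workers][2];
        long start = firstId;
        for (int i = 0; i < workers; i++) {
            // Spread the remainder over the first (total % workers) chunks
            long size = total / workers + (i < total % workers ? 1 : 0);
            ranges[i][0] = start;
            ranges[i][1] = start + size - 1;
            start += size;
        }
        return ranges;
    }

    /** Hypothetical naming scheme: one output file per worker index. */
    public static String outputFile(int worker) {
        return "results_" + worker + ".txt";
    }
}
```

Each session would then loop only over its own chunk and write to its own file, so the sessions never touch the same ID or the same output.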