screen-scraper public support

Questions and answers regarding the use of screen-scraper. Anyone can post. Monitored occasionally by screen-scraper staff.

newbie - selecting only first instance of pattern

Hi,

I'm working on my first project and I'm trying to deal with a next link. From the tutorial (excellent btw) I've managed to dig down into multiple sub pages from a list page, but I'm having a little difficulty with the next link.

I've got two issues - firstly, the next link appears twice on the page (top and bottom). How do I scrape just the first instance?
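
For readers with the same question, here is a minimal plain-Java sketch of "take only the first match". It is not screen-scraper's own API; the class name, the method name and the link markup in the pattern are all assumptions made up for illustration, and the real page's HTML will differ.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class FirstNextLink {
    // Hypothetical markup: an anchor whose visible text is "Next".
    private static final Pattern NEXT_LINK = Pattern.compile(
        "<a\\s+href=\"([^\"]+)\"[^>]*>\\s*Next\\s*</a>",
        Pattern.CASE_INSENSITIVE);

    // Returns the href of the first "Next" link, or null when there is none.
    static String firstNextHref(String html) {
        Matcher m = NEXT_LINK.matcher(html);
        // A single find() stops at the first occurrence, so the duplicate
        // link at the bottom of the page is never looked at.
        return m.find() ? m.group(1) : null;
    }

    public static void main(String[] args) {
        String html = "<a href=\"list?page=2\">Next</a> ... rows ... "
                    + "<a href=\"list?page=2\">Next</a>";
        System.out.println(firstNextHref(html)); // prints list?page=2
    }
}
```

The point is simply to call `find()` once instead of looping over every match.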

newbie - Remove unwanted characters

Firstly - Great product, thanks.

As part of my first project I'm trying to scrape postcodes embedded within a link, so I have to deal with the "+" character. I think there are two ways to do this: create two extractor patterns, one for each half of the postcode, and then stitch them together; or, when exporting the data, use some form of embedded Java to remove the "+".
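
The second option can be sketched in plain Java roughly as follows. The class name, method name and sample postcode are made up for illustration, and how the cleaned value gets written back into the scraped record depends on the script it is used in.

```java
class PostcodeCleaner {
    // Replaces every "+" with a space and trims the ends.
    static String clean(String scraped) {
        // String.replace treats "+" as a literal character; replaceAll
        // would read it as a regex quantifier and would need "\\+".
        return scraped.replace("+", " ").trim();
    }

    public static void main(String[] args) {
        // Sample value only, not taken from the original post.
        System.out.println(clean("SW1A+1AA")); // prints SW1A 1AA
    }
}
```

If the "+" should disappear entirely rather than become a space, pass an empty string as the replacement.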

My current extractor pattern is

alter code created

I was wondering if there is a way to view/alter the code created by screen-scraper? I'm trying to collect data from a site which already has an export option, so I wanted to simplify it as much as possible, or just write it from scratch.
Thanks!

Migrating from 3.0 to 4.0

We're finally doing the migration from 3.0 to 4.0 and I had a question or two. I have a feeling this might be useful info for the general community.

The big question for me is:

What's the difference between .xml and .sss files? Can a 4.0 exported scrape be imported by 3.0, and can a 3.0 exported scrape be imported by 4.0?

Tutorial 3: Data Duplication in output file

Hi,

I have just started with screen-scraper basic edition and have reached Tutorial 3. I followed the steps given in the tutorial, but the output text file, which otherwise has all the details, shows the last record duplicated.

Could someone be so kind as to help me out?

Selection of drop down values

Hi Everybody,
I am a Java programmer and I have a site to scrape which has a drop-down list. When an item in that list is clicked, a JavaScript function is called with a full page refresh, and I get a new page with the contents I need to scrape.

The new page has 1, 2, 3, next links on it, which in turn call a JavaScript function.

Can anyone please tell me how to select each item of the drop-down list through screen-scraper and how to navigate to the next pages?

Thanks in advance,
Vivek.

Date formatting incorrectly with Interpreted Java

Having some problems with Interpreted Java. It is providing me a date approximately one month ago:

// get the date
Calendar date = Calendar.getInstance();

My system date is correct. Any suggestions? Thank you!
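
The rest of the script is cut off here, so the cause can't be confirmed, but one common reason for a date that looks a month behind is that `Calendar.MONTH` is zero-based (January is 0). A minimal sketch, assuming the script builds its date string from `date.get(Calendar.MONTH)`; the class and method names are made up for illustration:

```java
import java.text.SimpleDateFormat;
import java.util.Calendar;
import java.util.Locale;

class CalendarMonthDemo {
    // Calendar.MONTH counts from 0, so add 1 to get the usual 1-12 value.
    static int humanMonth(Calendar c) {
        return c.get(Calendar.MONTH) + 1;
    }

    // Letting SimpleDateFormat do the formatting avoids the off-by-one.
    static String format(Calendar c) {
        return new SimpleDateFormat("yyyy-MM-dd", Locale.ROOT).format(c.getTime());
    }

    public static void main(String[] args) {
        Calendar date = Calendar.getInstance();
        System.out.println("raw month field: " + date.get(Calendar.MONTH));
        System.out.println("calendar month:  " + humanMonth(date));
        System.out.println("formatted date:  " + format(date));
    }
}
```

If the script already formats the date through `SimpleDateFormat`, the cause lies elsewhere.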

Can someone clean this up for me please? Messy syntax

Hey all, I'm using this script and although it makes sense to me, the BeanShell interpreter says otherwise haha. If anyone could just rewrite this script so that it works, that would be amazing!

Bypassing intermittent warning page

I'm trying to scrape a site that sometimes displays a warning page at the same URL as the detail page that interests me.

How could I test for that and bypass it?

PS: no, the site I'm scraping is not an adult site... ;-) it's just a console games catalogue (the warning is for ESRB ratings over Everyone)!
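
One generic way to approach this, sketched in plain Java rather than screen-scraper's own API: look for a phrase that only appears on the warning page and request the URL again when it is found. The marker text, class name and method names are all hypothetical, and the retry only helps under the assumption that the site remembers the first visit (for example with a cookie) and then serves the real page.

```java
import java.util.function.Supplier;

class WarningPageCheck {
    // Hypothetical marker; pick a phrase unique to the real warning page.
    static final String MARKER = "Please confirm your age";

    static boolean isWarningPage(String html) {
        return html != null && html.contains(MARKER);
    }

    // Calls request up to maxAttempts times and returns the first response
    // that is not the warning page, or null if every attempt hit it.
    static String fetchDetail(Supplier<String> request, int maxAttempts) {
        for (int i = 0; i < maxAttempts; i++) {
            String html = request.get();
            if (!isWarningPage(html)) {
                return html;
            }
        }
        return null;
    }
}
```

Here `request` stands in for whatever code actually performs the HTTP request. If the site needs a form submission to get past the warning, that step would go inside the loop in place of the plain retry.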

scrape multiple accounts from same site

I'm scraping a secure site that has a login (my admin), a couple of pages of navigation, then a customer account input. This results in a page that I write out to a file, and I strip out the HTML table I need for that account. When I do a single account, I get what I need. However, the total time for the entire process is approx 20 seconds.