screen-scraper public support
csv problem : extract > loop > save
Hi all,
this program is really great..i was searching for something like that long long time...
i have to extract data from a local dir containing files that have a variable in url, and save them to a csv.
All works quite perfect but with "little" problem in final csv that produce something like that, ID, empty,empt than empty ADDRESS empty,..and so on.. :
CSV looks like :
UK Postcode RegEx
Found this and thought it may be of use for UK Postcodes.
^([Gg][Ii][Rr] 0[Aa]{2})|((([A-Za-z][0-9]{1,2})|(([A-Za-z][A-Ha-hJ-Yj-y][0-9]{1,2})|(([A-Za-z][0-9][A-Za-z])|([A-Za-z][A-Ha-hJ-Yj-y][0-9]?[A-Za-z])))) {0,1}[0-9][A-Za-z]{2})$
Problem in 'onchange="new Ajax.Request('...')' calls.
Hi Everybody,
I have a site to scrape, which has three drop down boxes for country, province and city respectively. For country and province dropdown box, I have "onchange="new Ajax.Request('...')" calls. When i select a Country it automatically populates the option values of province, and when i select a province it populates the city drop down values without any page refresh.
Regular Expression - How to stop on punctuation
Hi,
Firstly the more I use this product, the better it seems, I'm now working on a site that has a tree 4 deep and things are going great.
The only issue I'm now having is with my regular expression skills. I'm trying to extract the venue name from an address string but I've got the issue that different characters are being used to either separate or terminate the part of the string I want. Any suggestions are welcome. I assume I need to do something like capture anything that is an alpha numeric character or a space, then stop on anything else.
newbie in need of help.
I have been getting on OK with Screen-Scraper until this...
Loop through products
Rather than constructing a rather elaborate scraping session that submits a search (with lots of different search terms), works through the results pages and the calls each of the details pages, I was planning to just call the details pages with a call to the sequential product ID.
I realise this is a little lazy but it is bit quicker and will do what I need at the moment.
noob question - splitting data in extractor Pattern
i am writing a basic script to grab some contact info from a website.
however the first half of the post code is getting included in with the county
the extractor Pattern code:
~@TNAME@~
~@TROAD@~, ~@TTOWN@~, ~@TCOUNTY@~ ~@TPCODE@~
Tel: ~@TPHONE@~
can anyone help?
just can't seem to find a solution.
thanks in advance
Ash
Login issues
I'm trying to scrape data from behind a login page and I have some valid credentials but I cannot get past the login using Screen Scraper but I can by entering the data directly in either firefox or Safari.
I've tried adding the login details in the advanced tab, I've also tried adding them as parameters and setting the type to POST. Neither works.
Any suggestions?
Dealing with Captcha images
I see from some of the forum posts that it is possible to throw up a dialogue box with the captcha image showing to allow a user to manually enter the captcha information.
My Java skills are weak, does anyone have a sample script they are willing to share?
Interpreted Java - Set data and time
Hi,
I'd like to set the filename of my export file to include the current date and time so making it easy to identify the results of a particular session. My Java skills are weak (well, non-existent but I'm learning), but I assume it is something like:
where [b]my_data_time[/b] is some form of date function (any suggestions, or even a pointer where to look for documentation).
rgds/alex