screen-scraper public support
Web site encoding
Hi,
How does screen scraper know which character incoding to use to scrape a site? Does it use the http response header information? If not, is there a way to tell screen scraper which encoding to use for a particular site?
Thanks,
Brendan
Start scraping where you left off
Hi,
If the scraper crashes or stops responding, is it possible to restart the scraping session and have the program continue where it left off?
Thanks,
Brendan
Incrementing session variables
Hi there,
Before I start, I’d just like to say thanks for a fantastic product. I’m currently evaluating SS to see if it can make a planned project viable and having scraped around 11,000 products so far, it is looking extremely promising. Nothing else I’ve tried has come close to being up to the job and it’s great to finally find something that actually does what it says it can!
Scripting error
Hi,
I need some help. I am running a scape session that fills a dataRecord called(ReadMoreUrl) with a url and then calls a script after each pattern match. My problem is when the dataRecord is filled with the url it has an 'amp;' right after the amper (&) sign so I need to remove it before my script tries to use it. Here is what is in my script (VBScript):
ReadMore = Replace(DataRecord.Get( "ReadMoreUrl" ),"amp;","",1,50)
session.setVariable( "Url", "ReadMore" )
Call session.ScrapeFile( "Classifieds - Follow link scrape" )
Version 1.1.5 user upgrading to 2.0
I have been using version 1.1.5 for a while and my script stopped working so I decided to install the basic version 2.0. I have everything up an running and imported all of my scripts. I am having a problem with a vbscript that basically takes my extracted data and writes it to an XML file. I used the old Slash Dot example that created a text file to adapt it to writing an XML file. This is the error message I receive in the scraping session log.
Proxy server problems
I can't get passed Tutorial One. I have downloaded the latest version of ss_basic, and am running IE6. I have amended the settings of IE6 as described in Tutorial 1, but still nothing registers in "HTTP Transactions" pane of ss while I'm browsing. So I am guessing that nothing is being passed to ss.
Any ideas what else I can try? I'd really like to get this trial version working - because it does sound ideal for what I want.
Thanks.
Errors listed in Progress log using Windows Proxy Server
When running a Scraping session, I selected "Don't log binary files", yet I continue to see Error entries for each gif and jpg enountered in the response file.
This doesn't appear to affect the eventual HTML response file, but it clutters up the Progress log table :( ...
Using the latest release 2.0.5-14a
Vangoghnads...
select a string
Hello, I need to do a selection of the datarecords befere print they in a web.
for example, if I search for pen, the searcher try :
pen
pencil
ballpen
I only want to extract pen. How can I do?
I use php. thanks.
COM component problems
Hello,
Guys, could you please, pay more attention to ur COM component? Make it more adequate to main features, create more methods and properties ? You even can add events, it will be very helpful. For now it works very unstable, and I can not expect it to work as SS GUI does.
scraping the shopping site example
Hi,
I tried to scrape the shopping site example, by scraping all the articles and the details at once and store the data in the dataset variable and then read the records containing the product info and some of the detail info of the detail page.
E.g.
A Bug's Life "Multi Pak" Model: DVD-ABUG
Microsoft IntelliMouse Explorer Model: MSIMEXP
The product would come from scraping the page:
http://www.screen-scraper.com/shop