screen-scraper public support
RegEx help
I need a bit of help with the regex for a pattern. The URL has parameters like this:
x=foo bar&y=more foo bar
but sometimes the parameters look like:
x=foo bar(foo&bar)&y=more foo bar
The ampersand inside the first parameter confuses the parser, which treats it as the end of the token. Can someone suggest a way to parse this?
TIA.
Jim
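One possible approach, sketched in Python rather than in screen-scraper's own extractor syntax: split only on ampersands that are not followed by a closing parenthesis before the next opening one. This assumes the parentheses are balanced and never nested; split_params is just an illustrative name.

```python
import re

# Match "&" only when it is NOT inside (...): the lookahead rejects any "&"
# that is followed by ")" with no "(" or ")" in between.
# Assumption: parentheses are balanced and not nested.
AMP_OUTSIDE_PARENS = re.compile(r"&(?![^()]*\))")


def split_params(query):
    """Return a dict of name -> value for a string like 'x=a&y=b'."""
    pairs = {}
    for token in AMP_OUTSIDE_PARENS.split(query):
        # partition splits on the first "=" only, so values may contain "=".
        name, _, value = token.partition("=")
        pairs[name] = value
    return pairs


if __name__ == "__main__":
    print(split_params("x=foo bar(foo&bar)&y=more foo bar"))
```

The same lookahead, `&(?![^()]*\))`, should work in most regex engines that support negative lookahead, including Java's.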
"Write Access Error"...
"In order for screen-scraper to function properly, please ensure you have
write access to the folder in which screen-scraper is installed as well
as all of its sub-folders.
For example, I found that you don't have write access to the following file(s):
D:\Screen-Scraper\screen-scraper Basic Edition\resource\db\ss.data"
Seems like Screen-Scraper wants to delete SS.data and re-create it
each time it runs. Have I got that part right?
And I *do* have access.
To Wit:
- Checked the folder and file props all the way up and down the chain
Unable To Start Screen Scraper: "java.lang.NullPointerException"
"Basic" edition. v6.0.0.0 running on Windows 7 Home Premium.
I'm pretty sure it's the 32-bit version of screen-scraper because the
installer put it in Program Files (x86), but I cannot be 100% positive
because the error does not give the version and I can't get any further.
It was working last night and nobody has been on the machine since then.
But this morning I am getting what is below.
Am I the only one? Something with my install of Java, maybe?
I'm trolling for thoughts before I try a re-install from the 64-bit
Stop session if pattern exists?
Sorry, this one is probably pretty simple. How do you stop scraping if an extractor pattern is found?
More specifically, I have 5 or so scrapeable files. If an extractor pattern is found in the third one, I want it to skip the last two scrapeable files and end the session.
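The control flow being asked for can be sketched generically as below. This is plain Python, not screen-scraper's scripting API; run_session, the page names, and the "SOLD OUT" pattern are all made up for illustration.

```python
import re


def run_session(pages, stop_pattern):
    """Process (name, html) pairs in order; stop once stop_pattern matches.

    Returns the names of the pages that were actually processed.
    """
    processed = []
    for name, html in pages:
        processed.append(name)
        if re.search(stop_pattern, html):
            # Pattern found: skip every remaining page and end the run.
            break
    return processed


if __name__ == "__main__":
    pages = [
        ("file1", "<p>ok</p>"),
        ("file2", "<p>ok</p>"),
        ("file3", "<p>SOLD OUT</p>"),
        ("file4", "<p>never fetched</p>"),
        ("file5", "<p>never fetched</p>"),
    ]
    print(run_session(pages, r"SOLD OUT"))
```

In screen-scraper the equivalent would presumably be a script triggered by the extractor pattern that ends the session; check the API documentation for the exact call.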
Emulate browser?
A site previously scraped now gives me an error of "we no longer support your browser." Is there an easy way to emulate another web browser version using the free software you provide?
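Sites that show this message usually decide based on the User-Agent request header, so sending a current browser's value is often enough. A minimal Python sketch of the idea follows; the URL is a placeholder and the User-Agent string is only an example value, not something taken from the site in question.

```python
import urllib.request

# Example value only: substitute the User-Agent string of any current browser.
FIREFOX_UA = (
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:109.0) "
    "Gecko/20100101 Firefox/115.0"
)


def build_request(url, user_agent=FIREFOX_UA):
    """Build a request that identifies itself with the given User-Agent."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


if __name__ == "__main__":
    request = build_request("https://example.com/")  # placeholder URL
    # urllib stores header names capitalized as "User-agent".
    print(request.get_header("User-agent"))
```

Passing the request to urllib.request.urlopen would send it. In screen-scraper the same effect should come from changing the User-Agent header sent with the request; where that is set may depend on the edition.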
database connection error - alert
When I set up a connection to my MySQL database and the connection fails, I would like to send an email notifying the team of the error.
Here is the error I'm getting:
ERROR--test: An error occurred while processing the script: _test db connection
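The general pattern is to wrap the connection attempt in a try/except and send the message from the handler. Here is a rough Python sketch using only the standard library. The connect and send callables, the addresses, and the SMTP host are all assumptions for illustration; a real MySQL driver call would be passed in as connect.

```python
import smtplib
from email.message import EmailMessage


def build_alert(error, sender, recipients):
    """Build the notification email for a failed connection."""
    msg = EmailMessage()
    msg["Subject"] = "Database connection failed"
    msg["From"] = sender
    msg["To"] = ", ".join(recipients)
    msg.set_content(f"Could not connect to the database: {error}")
    return msg


def connect_or_alert(connect, send, sender, recipients):
    """Call connect(); on any failure, send an alert and return None."""
    try:
        return connect()
    except Exception as error:
        send(build_alert(error, sender, recipients))
        return None


def send_via_smtp(msg, host="localhost", port=25):
    """Deliver msg through an SMTP server (host and port are assumptions)."""
    with smtplib.SMTP(host, port) as smtp:
        smtp.send_message(msg)
```

Typical use would be connect_or_alert(my_connect, send_via_smtp, "scraper@example.com", ["team@example.com"]), where my_connect is whatever function opens the database connection.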
HELP! Error on startup
I re-installed screen-scraper. When I start it up I get this:
java.lang.IncompatibleClassChangeError
A lot of messages follow but the screen won't let me copy and paste. If I can get it some other way just let me know. Thanks.
Unable to scrape website
I have been able to scrape the site www.theice.com for a couple of years. Then a week ago screen-scraper stopped being able to see it. When I run the scrape now the log shows:
Scraping file: "ICE Test"
ICE Test: Resolved URL: https://www.theice.com
ICE Test: Sending request.
ICE Test: An input/output error occurred while connecting to 'https://www.theice.com'. The message was peer not authenticated.
I am able to see it in a browser, but if I run the proxy I get an ERR_SSL_PROTOCOL_ERROR message.
Has anyone experienced this before?
scrape details from multiple pages
I am new to screen-scraper and work as a Research Assistant. I want to scrape uspto.gov. There are thousands of pages, and each page lists 50 patents. I want to scrape the details of all these patents, navigating from page to page automatically.
Can anyone help me with how to do it?
Any help will be appreciated.
Thank you
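The usual way to handle this is to work out how many result pages there are and generate one URL per page, then request each in turn. A small Python sketch of that step follows. The URL template and its page parameter are hypothetical, not the real uspto.gov search format, which would need to be read off the site's own "next page" links.

```python
def page_urls(template, total_results, per_page=50):
    """Return one URL per result page, numbering pages from 1.

    template must contain a "{page}" placeholder.
    """
    # Ceiling division: 120 results at 50 per page need 3 pages.
    page_count = -(-total_results // per_page)
    return [template.format(page=n) for n in range(1, page_count + 1)]


if __name__ == "__main__":
    # Hypothetical template, for illustration only.
    for url in page_urls("https://example.com/search?p={page}", 120):
        print(url)
```

Each generated URL would then be fetched and the 50 patent entries on it extracted before moving on to the next one.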
Login uses a realm authorization, uses no post data.
I can see in the response a statement that says Authorization: basic followed by what looks like a base64-encoded string, but that's in the header, and as far as I can tell, even when watching the binary data, I still can't see where the username and password are entered.
I did find that the site sets a cookie. Oddly enough, once that cookie is set, no matter how it gets set, the site allows me administrative access. Since I'm using the free version of screen-scraper, I can't set the cookie.
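For what it's worth, with HTTP Basic authentication the username and password are not posted anywhere: they are the base64 string itself, encoded as username:password. A short Python sketch that builds and decodes such a header value is below. The credentials are the well-known example pair from the HTTP Basic specification, not anything from the site in question.

```python
import base64


def encode_basic(username, password):
    """Return the value for an 'Authorization' header using Basic auth."""
    raw = f"{username}:{password}".encode("utf-8")
    return "Basic " + base64.b64encode(raw).decode("ascii")


def decode_basic(header_value):
    """Recover (username, password) from a 'Basic <base64>' header value."""
    scheme, _, token = header_value.partition(" ")
    if scheme.lower() != "basic":
        raise ValueError("not a Basic credential")
    decoded = base64.b64decode(token).decode("utf-8")
    # Only the first ":" separates the two; passwords may contain colons.
    username, _, password = decoded.partition(":")
    return username, password


if __name__ == "__main__":
    header = encode_basic("Aladdin", "open sesame")
    print(header)
    print(decode_basic(header))
```

Decoding the string seen in the header this way should show which credentials the browser is sending after the realm prompt.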