screen-scraper support for licensed users

Questions and answers regarding the use of screen-scraper. Only licensed Professional and Enterprise Edition users can post; anyone can read. Licensed users please contact support with your registered email address for access. This forum is monitored closely by screen-scraper staff. Posts are generally responded to in one business day.

Issue with non english characters

One of the records, we are trying to scrape contains a non english word

ménage

On applying an extractor pattern it gets converted to

m?ge

Since we are trying to save the same as text inside a CSV file,
Could you kindly tell us the approach to scrap ménage as ménage
and save the same as text inside the CSV file.

Regards-Diptirmaya

Issue with changing cookies on web site

-We are trying to scrap a website which contians cookies.
-These cookies are used as parameters while scraping the sessions
-Also the cookies value keep changing on a regular basis
-
Is it possible for screen scraper to fetch the cookies when we hit
a particular website. If yes then how??

Appreciate your response- Diptirmaya

Screen- Scraper to take dynamic value at runtime

Hi,

I need to make the screen-scraper take dynamic values for the parameter at runtime from an external application to return the value. I don't know which method(s) i need to call.
Second is that i want to return the content of the file that is scraped to my application so that i can read it and display the result on my screen.

Thanks,
Dapo.

Developing first scrape for my work

Hi there-

I first off want to commend on what a great product this is! I have wrote screen scrape programs in the past via MS Access and it was a PAIN! I am an engineer for FedEx and I am trying to use screen scraper to pull some employee scanning information from our internal website each day.

Here are the problems I am running into

Replace uppercase for lowercase

As I am scraping descriptions from sites some appear completely in uppercase. Is there a swift way of replacing "all" uppercase chars for lowercase?

Even better would be if I could have a way to set the first letter and all ". " beginning with uppercase.

Export failed

I get the following error when trying to export a fresh scraping session:

----
Export Failed
An error occurred while exporting session.
The error message was: The character ' ' is an invalid XML character
Generally errors that occur when exporting a scraping session are a a result of an uncommon character set.
Please check this in the "Settings" dialog box.
----

Freeze on Web Interface

We have screen-scraper running on a server, and we access it using the web interface. It is one of the more recent alpha releases, and we are getting some pretty consistent freezing. Basically what happens is that we have a few scrapes running at once (usually at least 3), then we try to import another one. It simply hangs and never imports. If we exit the page and try to reload the interface, the top of the GUI loads, but none of the actual scrapes load and it sits there, spinning. All running scrapes have stopped as well. I can't find any error messages in the logs.

Capturing subsection data

I'm trying to capture physician data from the following URL: http://www.scanhealthplan.com/article/discoverscan/findascandoctor/provi...

Any given physician can have nultiple offices, and there may be multiple addresses listed for a given office (see example below). Is there a way to get a single record for each name/office/address combination?

Screen-Scraper Hangs when I Try to Save

I am using Version 4.5 - enterprise edition

I've created a scraping session in which the only file that exists is a scrapeable file that contains numerous post parameters (~20). I'm encountering a problem when I try to save my session. The "Save" never completes. It just hangs...

Is this a common occurence and/or is there a solution?

justmarc

Issue of encoded url parameters even when adding to url in properties tab

One more issue w.r.t get parameters in a url.

Referring to this posting titled

url variables automatically hex encoding - how can i get it to stop?

We have to send a parameter to a website in the following format

somewebsite.com/someaction.do?RIPSESSION={[*!1D1A020B050

I add this parameter to the url on the properties tab,
but in the logs i am observing it gets converted to

somewebsite.com/someaction.do?RIPSESSION=%7B%5B*!1D1A020B050,