encoding problem with some french caracters : – œ

Hi,

I m in trouble with french character encoding since 2 days now and i really dont khow what to do.
Here is the page i want to scrap http://sortir.telerama.fr/concerts/ahmad-jamal-quartet,39949.php

I'm in trouble with the Em dash in the descriptif :– Idris Muhammad – who appears as ? when i get it in Java

The proces is :
in ScreeScraper the character set is define as UTF-8 and everything is ok in the log.
But when i invoke it in java, i get a ?
instead of –
I use :RemoteScrapingSession(SiteCritiqueEncours,"localhost",8778,EncodageCar)

And i tried with different EncodageCar : UTF-8 ; ISO-8859-1;

Hope you can help me

Gilles

PS : youre product is great !

I tested this with UTF-8 and

I tested this with UTF-8 and it worked, so I know you can get it ... just need some information first.

  1. What version of screen-scraper are you using?
  2. On what platform?
  3. In screen-scraper settings, what is the default font?
  4. Are you setting the encoding globally or on the scraping session?

encoding problem with some french caracters : – œ

Hello,

1.What version of screen-scraper are you using?
Version 5 and 5.5
2.On what platform?
Mac OS

3.In screen-scraper settings, what is the default font?
ArialUnicodeMS But once again, the problem is only on java side

4.Are you setting the encoding globally or on the scraping session?
On the scraping session
In ScreeScraper the character set is define as UTF-8 and everything is ok in the log.
In java, if i use RemoteScrapingSession with UTF-8, all the accent are displaying badly.
If i use ISO-8859-1, everything is ok except for certains caracter like Em dash, word qote, œ ect….

My workaround for the moment, is to replace theses caracters on the screenscraper side.
Sorry for the delay in response
Gilles