Issue with non english characters

One of the records, we are trying to scrape contains a non english word

ménage

On applying an extractor pattern it gets converted to

m?ge

Since we are trying to save the same as text inside a CSV file,
Could you kindly tell us the approach to scrap ménage as ménage
and save the same as text inside the CSV file.

Regards-Diptirmaya

Setting character sets

diptermaya,

In order to render the characters correctly in screen-scraper (and ultimately output them correctly), you'll want to adjust the character set being used. You can either set this globally or on a scraping session level. The global setting is available under the main settings dialog (click the wrench icon). To set it for specific scraping sessions see the option under the Advanced tab of the relevant scraping session.

You may need to experiment some to find which character set works. Two things to keep in mind.

1. The character set indicated in the last response of the HTML page may not be the character set that works in screen-scraper
2. You are able to specify any character set you like by typing in the drop-down menu

For the word you've indicated you may want to try the following.

UTF-8 (screen-scraper's default)
CP1256
ISO-8859-1
GB2312

Further reading...

http://community.screen-scraper.com/faq/80#80n868
http://community.screen-scraper.com/faq/80#80n861

-Scott