Foreign Language Characters
Hi there - I'm a brand new convert to SS and trying to make my way through the tutorials. I managed to get a scrapping session to work on a site that has some French names, however the output in the text document replaces all the French Characters (â è ê ć á) with a "?"
Is there anything I can do with my Screen Scraper settings?
Advance Gratitude,
-R
The FAQ about this is here:
The FAQ about this is here: http://community.screen-scraper.com/FAQ/International
It's always tricky to do international character sets, but it can most always be done.
Most of the time you just
Most of the time you just need to turn on HTML Tidy on the scrapeable file(s). If that's not enough, you need to edit the default character set in your screen-scraper settings.
HTML Tidy was turned on
Thanks for your comment. I did have HTML tidy on - but come to think of it - there was still plenty of HTML code in the output file. My Default character settings are set to UTF-8. Is there a different one I should set it to?
Much appreciated!
-R
I led you astray. You need
I led you astray. You need to turn HTML tidy off. I knew that, but not sure how I typed it so wrong.
UTF-8 is usually fine for Latin characters.
I have the same problem. I'm
I have the same problem. I'm trying to scrape the line:
If I turn off my HTML tidy, instead of seeing '?', I'm seeing square boxes '�'. If I switch to Jerico tidy mode, I get a bunch of '�'
How do I get the actual/literal text?
that is happening to me too
- any thoughts?