is it possible to scrape different sites in one ss without changing the character setting in the options menu?
For instance , pre-define the encoding in the script which can be invoked. And leave the "default character set" blank?
If these possible , then in the command-line we could run mult-scripts with changing the file screen-scraper.properties?
Well, screen-scraper will automatically try to detect the encoding of a website, based on the information the website itself gives to you. So, there isn't a strong requirement for that.
When it comes to writing out files, etc, you can specify the encoding as you write a file.
There could be specific instances where you need to constantly specify the encoding, and for those cases, I'm not sure what the best solution would be. Manually altering the screen-scraper.properties file is kind of clunky, to say the least.
Go to "Options" menu, and then choose "Settings". There is a box that says "Default character set". Set this to the same as the webpage you are scraping. If you are not sure which 'encoding' to use, use your web browser to "View source" of your webpage. Near the top there should be a line that says:
Set screen-scraper to use the same as the "charset" above (in this example, it was "UTF-8")
is it possible to scrape
is it possible to scrape different sites in one ss without changing the character setting in the options menu?
For instance , pre-define the encoding in the script which can be invoked. And leave the "default character set" blank?
If these possible , then in the command-line we could run mult-scripts with changing the file screen-scraper.properties?
let me know what u guys think.
//Max
Well, screen-scraper will
Well, screen-scraper will automatically try to detect the encoding of a website, based on the information the website itself gives to you. So, there isn't a strong requirement for that.
When it comes to writing out files, etc, you can specify the encoding as you write a file.
There could be specific instances where you need to constantly specify the encoding, and for those cases, I'm not sure what the best solution would be. Manually altering the screen-scraper.properties file is kind of clunky, to say the least.
Go to "Options" menu, and
Go to "Options" menu, and then choose "Settings". There is a box that says "Default character set". Set this to the same as the webpage you are scraping. If you are not sure which 'encoding' to use, use your web browser to "View source" of your webpage. Near the top there should be a line that says:
Set screen-scraper to use the same as the "charset" above (in this example, it was "UTF-8")