Scraping large chunks of html including line breaks
I am having trouble scraping large chunks of html code and preserving the line breaks. The extractor process works as I intend it to, however when I print the newly scraped output to the screen, all line breaks have been removed. Is there a way I can maintain the integrity of the page?
If the html page looks like this:
blah
blah
blah
Scraped results:
blah
blah
blah
Desired results:
blah
\n blah
\n blah
\n
Scraping large chunks of html including line breaks
Thanks - that seemed to do the trick.
Overall, it did not really effect the speed of the scrape, so I am pleased.
thanks again!
Scraping large chunks of html including line breaks
Hi,
Please see this FAQ, which might help
http//www.screen-scraper.com/support/faq/faq.php#WhiteSpace
Feel free to post a reply if we can clarify further.
Kind regards,
Todd Wilson