Unable to scrape an https website
When I try to scrape an https site, I get the following error:
Security Error: Domain Name Mismatch
You have attempted to establish a connection with "www.public.surreyheath-online.gov.uk".
However, the security certificate presented belongs to "screen-scrapper.com". It is possible
though unlikely, that someone may be trying to intercept your communication with this site.
If you suspect the certificate shown does not belong to "www.public.surreyheath-online.gov.uk",
please cancel the connection and notify the administrator.
I followed the instructions in the section "Viewing encrypted transactions" at:
http://community.screen-scraper.com/using_the_proxy_server
I am still unable to scrape the site.
-Dipti
Do you see this error when trying to proxy the site? If so (as I suspect), the "interception" it refers to is your own proxy server. In such cases you sometimes cannot use the proxy to capture your requests, and instead have to use something like HTTPFox or Charles Proxy to see the requests and then build them into your scraping session manually.
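To illustrate why the browser complains, here is a rough Python sketch of the kind of name check it performs. This is only an illustration, not how screen-scraper or any particular browser implements it. The certificate dict is hypothetical and merely shaped like what Python's ssl.SSLSocket.getpeercert() returns, the helper names are made up for this example, and the wildcard handling is deliberately simplified.

```python
def cert_dns_names(cert):
    """Collect the host names a certificate claims to cover.

    `cert` is assumed to be a dict shaped like the one returned by
    ssl.SSLSocket.getpeercert().
    """
    names = [value for kind, value in cert.get("subjectAltName", ()) if kind == "DNS"]
    if not names:
        # Fall back to the subject's commonName when there are no SAN entries.
        for rdn in cert.get("subject", ()):
            for key, value in rdn:
                if key == "commonName":
                    names.append(value)
    return names


def name_matches(pattern, hostname):
    """Simplified match: exact name, or a single leading '*.' wildcard label."""
    pattern, hostname = pattern.lower(), hostname.lower()
    if pattern.startswith("*."):
        # The wildcard stands for exactly one left-most label.
        head, sep, rest = hostname.partition(".")
        return bool(head) and sep == "." and rest == pattern[2:]
    return pattern == hostname


def hostname_covered(cert, hostname):
    """True if any name in the certificate matches the requested host."""
    return any(name_matches(name, hostname) for name in cert_dns_names(cert))


# Hypothetical certificate, standing in for the one the proxy presents.
proxy_cert = {"subject": ((("commonName", "screen-scrapper.com"),),)}
site = "www.public.surreyheath-online.gov.uk"
print(hostname_covered(proxy_cert, site))  # False
```

Because the proxy sits between the browser and the site, the browser sees the proxy's certificate rather than the site's, so the names do not match and the warning appears.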