Version 3.01a - gzip encoded response
Hi -
I'm using v3.01a and am encountering garbled responses when the remote webserver sends gzip encoded responses. I've tried to find how to enable HTTP client decoding, or change the HTTP client's Accept-Encoding: gzip parameter, but can't find either.
I've also tried to decode it using a java.util.zip.GZIPInputStream, but since scrapableFile only allows getContentAsString (and not getContentAsBinary), the gzip gets corrupted.
Any other ideas?
Version 3.01a - gzip encoded response
It was the "Unstable updates" checkbox... thanks for the tip!
Version 3.01a - gzip encoded response
Hi,
Could be. If you're upgrading from 2.7.2, see this blog entry: [url]http://blog.screen-scraper.com/2007/01/12/version-30-of-screen-scraper-now-available/[/url]. This might also help: [url]http://www.screen-scraper.com/support/faq/faq.php#NoUpdates[/url].
Kind regards,
Todd
Version 3.01a - gzip encoded response
Fantastic! What a quick response...
I'm trying to use the Options->Check for Update menu item but I get "No updates available at this time". I upgraded from 2.7.2. Is there a trick I'm missing?
Version 3.01a - gzip encoded response
Hi,
This turned out to be a result of a minor bug in screen-scraper. We've just made a fix in version 3.0.2a: [url]http://blog.screen-scraper.com/2007/02/09/version-302a-of-screen-scraper-available/[/url].
Also, we have a policy to offer a discount on the Professional Edition of screen-scraper to anyone who reports a previously unknown bug. If you'd like to take advantage of this, go ahead and email me directly. My email address is my first name at screen-scraper.com.
Kind regards,
Todd
Version 3.01a - gzip encoded response
Sure, here is an example site. I've checked that it is the gzip encoding that is the issue by using another User Agent to get the HTTP stream with and without the "Accept-Encoding: gzip" header. I saved the gzip stream to disk and gunziped it to check if the result was correct, and it was.
[url]http://online.musicmatch.com/genesis/cgi/search.cgi?TYPE=track&TEXT=California&COUNT=100[/url]
In SS, the response looks like this:
Content-Type: text/xml; charset=ISO-8859-1
Connection: close
Content-encoding: gzip
Server: Apache/1.3.34 (Unix) mod_perl/1.29
Transfer-Encoding: chunked
Date: Fri, 09 Feb 2007 05:54:13 GMT
‹ÈšçßקÃö‹é™ˆ®VÞ/gÃœ>áK]Û®rwWÔLll¨ ËÖZHÂŒ$ì¦?û¾ØLáÂX©Ä`@Îéî0Ø$JÂ~z®ÿçà Ÿõ“à NÃ¥Eϴ...
Version 3.01a - gzip encoded response
Hi,
screen-scraper should be handling this automatically. Could you provide us with a URL where it doesn't seem to be working so that we can investigate?
Kind regards,
Todd Wilson