Binary Data

I'm actually collecting the same data as the following forum topic was:

http://community.screen-scraper.com/node/1131

I found the JS file okay, and used the referrer and everything, but when I scrape the file, all I get is this:

HTTP/1.1 200 OK
Content-Type: application/javascript
Last-Modified: Sat, 05 Dec 2009 05:30:21 GMT
ETag: "12af-44aba-479f4853a1540"
Accept-Ranges: bytes
Content-Encoding: gzip
Date: Sun, 06 Dec 2009 07:04:02 GMT
Expires: Mon, 07 Dec 2009 07:04:02 GMT
Cache-Control: max-age=86400
Server: Apache/2.2.13 (Unix) mod_jk/1.2.27
Content-Length: 42476
Vary: Accept-Encoding,User-Agent

[Binary Data]

The URL is a bit different now (https://www.bankofthewest.com/static_files/botw2/home/special-publish/br...), but it shows up fine in a browser, even with just cut and paste. What is it that the browser can interpret that screen-scraper is having problems with?

chrishathaway on 12/06/2009 at 1:16 am

screen-scraper support for licensed users

Hi Chris, Glad to see you're

Hi Chris,

Glad to see you're still scrapin'. This was a result of screen-scraper not checking for a specific content-type header. Try it in version 4.5.24a, and let us know if it still doesn't work.

Thanks,

Todd

todd on 12/09/2009 at 10:49 am

Hi, I've tried version

Hi,

I've tried version 4.5.24a but still got [Binary Data]. The strange thing is after scraping the first few levels of a site, it is ok. But after, for example, the 3rd level, I only receive [Binary Data], and then 4th level and so on are ok again. I've checked the HTML and the headers are all the same. Is there anyway around this?

Test_Scraper on 03/06/2010 at 8:16 am

We have version 4.5.36a now;

We have version 4.5.36a now; have you tried that? Could you let me get to a page that returns this just so I can test it?

jason on 03/08/2010 at 9:44 am

Thank you for your

Thank you for your response!

Its still not working with the latest update. Here is the link:
http://tiny.cc/K4q9x

I can scrape other pages, but it is just this page that can't be scraped. It is the same whether logged in or not.

Test_Scraper on 03/09/2010 at 9:55 pm

You're right that there is a

You're right that there is a problem here. The web-server isn't providing a content type (which it really should), and our means of determination is getting it wrong right now. We just came up with an idea though, so watch for version 4.5.38a today or tomorrow, and the fix will be integrated.

jason on 03/10/2010 at 3:20 pm

Maybe in time we'll make this configurable but it's such a rarity that it's not a very high priority.

-Scott

swilsonmc on 12/28/2009 at 6:48 pm

Search

Community

screen-scraper

User login

Binary Data

Hi Chris, Glad to see you're

Hi, I've tried version

We have version 4.5.36a now;

Thank you for your

You're right that there is a

Yep, that worked perfect.

The binary data just refers

It's just text, but

Chris, Right now the