screen-scraper support for licensed users

Questions and answers regarding the use of screen-scraper. Only licensed Professional and Enterprise Edition users can post; anyone can read. Licensed users please contact support with your registered email address for access. This forum is monitored closely by screen-scraper staff. Posts are generally responded to in one business day.

Input/output Error

Hello ,

I am getting error while run the scraping session of this website "https://www.americanleather.com/our-products"

Error massage :-
"An input/output error occurred while connecting to 'https://www.americanleather.com/our-products'. The message was java.net.ConnectException: Connection timed out: no further information: www.americanleather.com/191.238.240.12:443."

I also select HTTP Client:- Ning Async Http Client.

But still getting same error.

Please suggest me.

Thanks
Shyam

Broxtowe Council - Requesting data from server

I am capturing Broxtowe council - weekly list

http://planning.broxtowe.gov.uk/ApplicationSearch

I have captured the page but when I click on Display response in Browser then it shows me webpage saying -
Requesting data from server...

Also For result page I am getting the result like below format in different format

Screen Scraper 7 hangs when scraping NGX website

We have been running a scrape of http://www.ngx.com/settlehistory.html for a few year and have had no problems, until we upgraded to 7. Now when we run the scrape it just hangs on the download (the exact message is "NGX - Download & Parse: Requesting URL: http://www.ngx.com/settlehistory.html"). When we look at the CPU and memory usage it becomes stagnant after about 10 mins, zero CPU usage for at least an hour. I have increased the memory to 1024 MB and that seems to have no effect (still hangs around 360MB).

Any suggestions?

Barrow council

Barrow council -
http://www.barrowbc.gov.uk/residents/planning/local-planning-applications/application-search/
capturing 500 applications.
Able to get the first page records but not able to collect next page records.
I have passed the parameters in next page

__EVENTARGUMENT - ~#NEXT_PAGE#~

parameter is - page$2
but it is converting $ into some value, showing as page value in last request - EVENTARGUMENT=Page%243

how to decode $value.

Wix scraping - Any success

Hi All,

Anyone had any success with scraping a Wix site? We've got a new client that needs their site scraped, but this is provding to be very tricky. Tehy use AJAX for just about the whole site and passing the values into Scraper to get the pages is proving impossible.

Any ideas?

Richard

Error 404: javax.servlet.ServletException: java.io.FileNotFoundException: SRVE0190E: File not found

Hi.
I'm having problems scrapping a page, at the key pages it return "Error 404: javax.servlet.ServletException: java.io.FileNotFoundException: SRVE0190E: File not found ###/####"
apparently, the requests sent are correct.
I have to use "Request entity" params and is in that pages where i get the error at the body of the "last Response".
Do you know this issue? can be solved?

Thanks a lot.
Br

PKIX path building failed

I've gone through the forum posts regarding this same issue but there seems to be no general solution that works for everyone. So I'm posting mine here again in case someone can help:

Starting scraper.
Running scraping session: Metrobank CIB
Processing scripts before scraping session begins.
Processing script: "Metrobank CIB Initialize"
https://live.dragonpay.ph/Bank/ScrapeHandler.aspx
=========================================================
=================== Log Variables with Message ===============
screen-scraper Instance Information

New https issue

Hi

I have read the posts on the blog about the known issues, made all the recommended changes before and everything was working - I have a problem with one site that returns null, (see below). Any thoughts It had been working for ages before.?

Starting scraper.
Running scraping session: HRN
Processing scripts before scraping session begins.
Scraping file: "HRN Test"
HRN Test: Requesting URL: https://hrntractors.com/stock/used-tractors/page/2/
HRN Test: An input/output error occurred while connecting to 'https://hrntractors.com/stock/used-tractors/page/2/'. The message was null.

n/a

n/a