scrapeableFile

Scrapeable File

setMaxResponseLength

void scrapeableFile.setMaxResponseLength ( int maxKBytes ) (professional and enterprise editions only)

Description

Limits the amount of data retrieved by the scrapeable file to maxKBytes kilobytes. This is useful for very large responses where the desired information appears in the first portion of the response, and it can make scraping more efficient by downloading only the data that is needed.
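To make the idea of a capped response concrete, here is a minimal stand-alone sketch in plain Java. It is not screen-scraper's implementation: the class ResponseLimitSketch and the helper readAtMostKBytes are hypothetical names, and the sketch assumes the limit is counted in units of 1024 bytes.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;

// Hypothetical illustration only; this is not part of the screen-scraper API.
final class ResponseLimitSketch {

    // Reads at most maxKBytes * 1024 bytes from the stream and leaves the rest unread.
    static byte[] readAtMostKBytes(InputStream in, int maxKBytes) {
        try {
            // readNBytes stops at the limit or at end of stream, whichever comes first.
            return in.readNBytes(maxKBytes * 1024);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        byte[] bigResponse = new byte[5 * 1024]; // stands in for a 5 KB response body
        byte[] kept = readAtMostKBytes(new ByteArrayInputStream(bigResponse), 2);
        System.out.println(kept.length); // prints 2048
    }
}
```

Inside a screen-scraper script, the documented call would take the form scrapeableFile.setMaxResponseLength( 2 ), where 2 is just an illustrative value.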

scraper on 07/16/2010 at 5:19 pm

setCharacterSet

void scrapeableFile.setCharacterSet ( String characterSet ) (professional and enterprise editions only)

Description

Sets the character set used to render the response of a specific scrapeable file. This is particularly helpful when characters on the page are rendered incorrectly.
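The effect of a wrong character set can be sketched in plain Java by decoding the same bytes twice. This is an illustration under assumptions, not screen-scraper code: CharacterSetSketch and decode are hypothetical names, and the sample text is invented.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical illustration only; this is not part of the screen-scraper API.
final class CharacterSetSketch {

    // Decodes raw response bytes using the named character set.
    static String decode(byte[] responseBytes, String characterSet) {
        return new String(responseBytes, Charset.forName(characterSet));
    }

    public static void main(String[] args) {
        // "caf\u00e9" is "cafe" with an acute accent on the e, encoded as UTF-8 (5 bytes).
        byte[] body = "caf\u00e9".getBytes(StandardCharsets.UTF_8);

        // Decoded with the matching character set, the text has 4 characters.
        System.out.println(decode(body, "UTF-8").length());      // prints 4

        // Decoded as ISO-8859-1, the accented letter turns into two wrong characters.
        System.out.println(decode(body, "ISO-8859-1").length()); // prints 5
    }
}
```

In a screen-scraper script, the corresponding documented call would look like scrapeableFile.setCharacterSet( "ISO-8859-1" ), with the character set name chosen to match the page being scraped.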

scraper on 07/16/2010 at 5:13 pm

addHTTPHeader

void scrapeableFile.addHTTPHeader ( String key, String value ) (professional and enterprise editions only)

Description

Add an HTTP header to be sent along with the request.
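As a rough picture of what adding a header to a request means, the sketch below builds a request with the standard java.net.http API (Java 11 or later) and attaches one extra header. It does not use screen-scraper: HttpHeaderSketch, withHeader, the example URL, and the header name and value are all assumptions chosen for illustration.

```java
import java.net.URI;
import java.net.http.HttpRequest;

// Hypothetical illustration only; this is not part of the screen-scraper API.
final class HttpHeaderSketch {

    // Builds a GET request that carries one additional key/value header.
    static HttpRequest withHeader(String url, String key, String value) {
        return HttpRequest.newBuilder(URI.create(url))
                .header(key, value) // appends the header to the request
                .GET()
                .build();
    }

    public static void main(String[] args) {
        HttpRequest request =
                withHeader("http://www.example.com/", "X-Requested-With", "XMLHttpRequest");
        // The request is only built here, never sent.
        System.out.println(request.headers().firstValue("X-Requested-With").orElse("(none)"));
    }
}
```

In a screen-scraper script, the documented equivalent would be a call such as scrapeableFile.addHTTPHeader( "X-Requested-With", "XMLHttpRequest" ).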

scraper on 07/16/2010 at 5:02 pm

resolveRelativeURL

String scrapeableFile.resolveRelativeURL ( String urlToResolve ) (professional and enterprise editions only)

Description

Resolves a relative URL to an absolute URL based on the current URL of this scrapeable file.
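The kind of resolution described here can be sketched with the standard java.net.URI class. This shows the general rule for resolving relative references, not necessarily the exact behavior of screen-scraper; RelativeUrlSketch, resolve, and the example URLs are hypothetical.

```java
import java.net.URI;

// Hypothetical illustration only; this is not part of the screen-scraper API.
final class RelativeUrlSketch {

    // Resolves urlToResolve against currentUrl and returns the absolute URL.
    static String resolve(String currentUrl, String urlToResolve) {
        return URI.create(currentUrl).resolve(urlToResolve).toString();
    }

    public static void main(String[] args) {
        String current = "http://www.example.com/products/list.php?page=2";

        // Relative to the current directory.
        System.out.println(resolve(current, "detail.php?id=7"));
        // prints http://www.example.com/products/detail.php?id=7

        // Relative to the site root.
        System.out.println(resolve(current, "/images/logo.gif"));
        // prints http://www.example.com/images/logo.gif

        // Relative to the parent directory.
        System.out.println(resolve(current, "../about.html"));
        // prints http://www.example.com/about.html
    }
}
```

In a screen-scraper script, the documented call would look like scrapeableFile.resolveRelativeURL( "detail.php?id=7" ), with the current URL of the scrapeable file playing the role of the base.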

scraper on 07/16/2010 at 5:02 pm

setForceMultiPart

void scrapeableFile.setForceMultiPart ( boolean forceMultiPart ) (professional and enterprise editions only)

Description

Sets whether the Content-Type header of the request is forced to multipart/form-data.
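For readers unfamiliar with multipart/form-data, the following sketch builds such a request body by hand for simple text parameters. It only illustrates the wire format; how screen-scraper encodes the request internally is not documented here. MultipartSketch, contentType, body, the boundary string, and the parameter names are all hypothetical.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical illustration only; this is not part of the screen-scraper API.
final class MultipartSketch {

    // The Content-Type header value that announces the boundary to the server.
    static String contentType(String boundary) {
        return "multipart/form-data; boundary=" + boundary;
    }

    // Encodes text parameters as a multipart/form-data body, one part per parameter.
    static String body(Map<String, String> params, String boundary) {
        StringBuilder sb = new StringBuilder();
        for (Map.Entry<String, String> p : params.entrySet()) {
            sb.append("--").append(boundary).append("\r\n");
            sb.append("Content-Disposition: form-data; name=\"")
              .append(p.getKey()).append("\"\r\n\r\n");
            sb.append(p.getValue()).append("\r\n");
        }
        // The closing delimiter carries two extra dashes.
        sb.append("--").append(boundary).append("--\r\n");
        return sb.toString();
    }

    public static void main(String[] args) {
        Map<String, String> params = new LinkedHashMap<>();
        params.put("q", "shoes");
        params.put("page", "1");
        System.out.println(contentType("XYZ"));
        System.out.print(body(params, "XYZ"));
    }
}
```

In a screen-scraper script, the documented call would be scrapeableFile.setForceMultiPart( true ).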

scraper on 07/16/2010 at 5:02 pm

getName

String scrapeableFile.getName ( )

Description

Get the name of the scrapeable file.

Parameters

This method does not receive any parameters.

Return Values

Returns the name of the scrapeable file, as a string.

scraper on 07/16/2010 at 4:57 pm

setRetainNonTidiedHTML

void scrapeableFile.setRetainNonTidiedHTML ( boolean retainNonTidiedHTML ) (enterprise edition only)

Description

Set whether or not non-tidied HTML is to be retained for the current scrapeable file.

scraper on 07/16/2010 at 4:57 pm

getRetainNonTidiedHTML

boolean scrapeableFile.getRetainNonTidiedHTML ( ) (enterprise edition only)

Description

Determine whether the scrapeable file is set to retain non-tidied HTML.

Parameters

This method does not receive any parameters.

Return Values

Returns true if non-tidied HTML is being retained and false otherwise.

scraper on 07/16/2010 at 4:57 pm

scrapeableFile

Overview

The scrapeableFile object refers to the current file being requested from a given server. It holds both the request for the file and the response, and it can be manipulated to meet any necessary requirements: GET and POST parameters, referer information, cookies, FILE parameters, HTTP headers, character set, and so on.

scraper on 07/16/2010 at 4:56 pm

setUserAgent

void scrapeableFile.setUserAgent ( String userAgent ) (professional and enterprise editions only)

Description

Explicitly set the user agent reported when making the request.
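The user agent travels to the server as the User-Agent request header. The sketch below shows this with the standard java.net.http API (Java 11 or later) rather than with screen-scraper itself; UserAgentSketch, withUserAgent, the example URL, and the user agent string are assumptions for illustration.

```java
import java.net.URI;
import java.net.http.HttpRequest;

// Hypothetical illustration only; this is not part of the screen-scraper API.
final class UserAgentSketch {

    // Builds a request whose User-Agent header is set to the given value.
    static HttpRequest withUserAgent(String url, String userAgent) {
        return HttpRequest.newBuilder(URI.create(url))
                // setHeader replaces any value previously set for this header name.
                .setHeader("User-Agent", userAgent)
                .build();
    }

    public static void main(String[] args) {
        HttpRequest request =
                withUserAgent("http://www.example.com/", "ExampleBrowser/1.0");
        // The request is only built here, never sent.
        System.out.println(request.headers().firstValue("User-Agent").orElse("(none)"));
    }
}
```

In a screen-scraper script, the documented call would look like scrapeableFile.setUserAgent( "ExampleBrowser/1.0" ).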

scraper on 07/16/2010 at 4:56 pm
