scrapeableFile
setMaxResponseLength
Description
Limit the amount of information retrieved by the scrapeable file. This method can be useful in cases of very large responses where the desired information is found in the first portion of the response. It can also help to make the scraping process more efficient by only downloading the needed information.
setCharacterSet
Description
Set the character set used in a specific scrapeable file's response renderings. This can be particularly helpful when the page renders characters incorrectly.
addHTTPHeader
Description
Add an HTTP header to be sent along with the request.
resolveRelativeURL
Description
Resolves a relative URL to an absolute URL based on the current URL of this scrapeable file.
setForceMultiPart
Description
Set content type header to multipart/form-data.
getName
Description
Get the name of the scrapeable file.
Parameters
This method does not receive any parameters.
Return Values
Returns the name of the scrapeable file, as a string.
setRetainNonTidiedHTML
Description
Set whether or not non-tidied HTML is to be retained for the current scrapeable file.
getRetainNonTidiedHTML
Description
Determine if the scrapeable file is set to retain non-tidied html.
Parameters
This method does not receive any parameters.
Return Values
Returns boolean flag for non-tidied contents being retained.
scrapeableFile
Overview
The scrapeableFile object refers to the current file being requested from a given server. It houses both the request for a file and response and can be manipulated to meet any necessary requirements: GET and POST parameters, referer information, cookies, FILE parameters, HTTP headers, characterset, and such.