screen-scraper public support

Questions and answers regarding the use of screen-scraper. Anyone can post. Monitored occasionally by screen-scraper staff.

Tiny minor bug

Latest pre-release version.

If you renamed a scrapeable file to include a : SS pops a warning about
'Sorry, the name cannot contain the following characters: : / \ . But still allows the ':' character.

Problem starting screen-scraper NT service

I'm a new user of SS and trying to evaluate your product.

I'm using a trial version of the Screen-Scraper Pro on Win2k3 R2. I am trying to start the NT service from "Service" control panel (let's call it SCP); it appears the service was started successfully but when I refresh SCP it report the service is actully stopped.

logging into site with .htaccess login

How would i login into a site that is protected with .htaccess
i tried doing
http://username:[email protected]
and setting that as the url in the scrapeable file but the response from the server was 401, Access Denied. Authentication failed or was missing.

Running multiple scrape sessions..

I'm not sure if this is possible so.. can I run a new scrape on data while one is already being run (both in the workbench)? Do scrapes run through the server faster than the workbench?

Thank you

main vs sub extraction

I couldn't find an answer to this, but it should be a simple answer. Is it better to have multiple 'Main' extractor patterns OR 1 'Main' and multiple sub-extractor's beneath?

I'm finding the CSS

tables change "position" or don't even exist in one page scrape vs the next; this is far from a normal site. :)

UI: Odd minor mapping bug

I've been able to reproduce this annoying bug, the second time only because I wasn't paying attention.

Within a main extractor pattern eg:

text to ~@GRAB@~

Scraping foreign language site

The site from which I’m trying to scrap are Korean sites (thus Korean fonts). So, I have set ‘Default character set’ as ‘euc_kr’ and ‘Default font’ as ‘Arial Unicode MS’. I am able to receive token results from scrapeable file except that they are unreadable texts (actually symbols and squares). Of course, when I then transfer the token results to database I can then see the results in readable Korean fonts. But, this isn't good enough..

Tidying HTML - Intermittent Issue

I have a nagging problem whereby I get occasional errors where "Tidying HTML" has failed and the program returns the original HTML and my extractor patterns fail. To combat the problem I went to Options - Settings and unticked the "Tidy HTML after scraping" box and adjusted the extractor patterns to match the original HTML. The problem is that in the new configuration the program will occasionally tidy the HTML despite the box being unticked. I then have to readjust my extractor patterns once more. Is there some way to stabilize the situation?

Writing Output - Error With Varying Data Set Length

I have a problem writing out data from a data set. Here is my output logic:

FileWriter out = null;

try
{
session.log( "Writing data to a file." );

out = new FileWriter( "Data.txt", true );

out.write( session.getVariable ("VAR1") + ",");
out.write( dataSet.get( 0, "VAR2" ) + "," );
out.write( dataSet.get( 1, "VAR2" ) + "," );
out.write( dataSet.get( 2, "VAR2" ) + "," );
out.write( session.getVariable ("VAR3") + ",");
out.write( session.getVariable ("VAR4") + "\r\n");

scraping session tree disappear

I'm running scraping session from command line.
But sometimes, scraping session stops soon after start, leaving the error message "Scraping session finished.".
Then I go check at screen scraper to find scraping session tree disappeared.

Does anyone have same problem?