screen-scraper public support

Questions and answers regarding the use of screen-scraper. Anyone can post. Monitored occasionally by screen-scraper staff.

screen-scraper on Linux

I start the screen-scraper server from a shell script using this command:

"jre/bin/java -Xmx128M -jar screen-scraper.jar --start-server --interactive"

and I get the following message in the shell:

"Initializing screen-scraper server. Please wait...
Ready.
Type "quit" to quit.
? Warning: You're not using a Java Development Kit (SDK) environment so JSP pages can not be compiled.
Starting service LiteWebServer Base Module (lws-base/3.0.3) (with LiteWebServer JSP Module (lws-jsp/1.1.2))"

New User - Scraping Categories URL for Site

On one of the sites I would like to scrape for content, all the URLs are search-engine-friendly, and I have played around trying to figure out the parameters of the search query but have not been able to.

With that said, I only need to scrape the categories of this site. Is there a way to set up the program to scrape all the category URLs and then go to the detail pages for the products and get the content?
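
The two-level crawl described above (category links first, then product detail links) can be sketched in plain Java. This is only an illustration under stated assumptions: the `/category/` and `/product/` URL prefixes, the sample HTML, and the class and method names are all hypothetical, and inside screen-scraper the same idea would normally be expressed with extractor patterns rather than a hand-written regex.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class CategoryLinkSketch {
    // Matches double-quoted href values only; single-quoted or unquoted attributes are ignored.
    private static final Pattern HREF = Pattern.compile("href=\"([^\"]+)\"");

    // Returns every link in the HTML that starts with the given prefix, without duplicates.
    static List<String> linksWithPrefix(String html, String prefix) {
        List<String> links = new ArrayList<>();
        Matcher m = HREF.matcher(html);
        while (m.find()) {
            String url = m.group(1);
            if (url.startsWith(prefix) && !links.contains(url)) {
                links.add(url);
            }
        }
        return links;
    }

    public static void main(String[] args) {
        // Hypothetical pages; a real scrape would download these.
        String categoryIndex = "<a href=\"/category/lamps\">Lamps</a> <a href=\"/about\">About</a>";
        String categoryPage = "<a href=\"/product/desk-lamp-42\">Desk lamp</a>";

        for (String category : linksWithPrefix(categoryIndex, "/category/")) {
            System.out.println("category: " + category);
            // The category URL would be requested here; this sketch reuses a canned page.
            for (String product : linksWithPrefix(categoryPage, "/product/")) {
                System.out.println("  detail page: " + product);
            }
        }
    }
}
```

The point of the sketch is that search parameters are not needed: if the category links can be recognised on an index page, each one can be followed in turn and the product links collected from the result.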

screen-scraper is telling me that it can't connect to its database or that the database is corrupted

Hello,

I want to start screen scraper enterprise for the first time, but I always get this error message. I tried what is written here:

http://community.screen-scraper.com/faq/CorruptedDatabase

but it doesn't help, even after a system shutdown.

I use Windows 7 x86 and I told Windows Firewall that all incoming and outgoing connections of screen-scraper are allowed.

What can I do to get screen scraper running?

Scraping Business on Yahoo - Extractor Pattern changes, then changes back

I am working on scraping businesses listed on Yahoo. I am having a problem with at least one of the businesses listed in the search results.

The one in position #4 is okay. Then on #5, the extractor pattern changed. Then on #6, the extractor pattern went back to the usual.

Position #4:

<td class="marker"><span class="yls-rs-marker yls-rs-pos4">4.</span></td>

Extracting Several Tables to a CSV File

Dear Screen-Scraper Community,

I've been wrestling with this for days now. It seems kind of basic, but I can't seem to figure out a way to make it work.

I want to extract this data to a CSV file:

<table width="100%" border="0" cellpadding="4" cellspacing="0">
<tr>
<td>
<hr size="1" width="100%" noshade="noshade" />
</td>
</tr>

<tr valign="top">
<td><b>Steve Abraham</b><br />
Yellow-Checker Cab Co., Inc.<br />
P.O. Box 25123<br />
 Albuquerque, NM 87125<br />
Reservations Phone Number: <b>505-247-8888</b><br />
 Fax: <b>505-243-7499</b><br />

Can Read Chinese Characters on SS but NOT on the Output File, Why?

I have a problem getting Chinese characters to display correctly in the output file. All I get is ??. The strange thing is that I can see the Chinese characters in the last response or the test pattern in screen-scraper when I turn off tidy, so I think the problem occurs when the values are written to the output file. Any idea how to fix this? Thank you!
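
If the values are written out from a script, one common cause of ?? is a writer that falls back to the platform default encoding (on older Java versions a `FileWriter` created without a charset does this). The sketch below shows a write with UTF-8 named explicitly, followed by a read-back check. The class and method names and the temporary file are made up for illustration, and whether this matches how the output file is actually produced here is an assumption.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.io.Writer;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

class Utf8OutputSketch {
    // Writes the text to a temporary file as UTF-8, reads it back as UTF-8, and returns what was read.
    static String writeAndReadBack(String text) {
        try {
            Path file = Files.createTempFile("scrape-output", ".txt");
            try {
                // The charset is passed explicitly instead of relying on the platform default.
                try (Writer out = Files.newBufferedWriter(file, StandardCharsets.UTF_8)) {
                    out.write(text);
                }
                return new String(Files.readAllBytes(file), StandardCharsets.UTF_8);
            } finally {
                Files.deleteIfExists(file);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        String chinese = "\u4e2d\u6587"; // two Chinese characters, written as Unicode escapes
        System.out.println(writeAndReadBack(chinese).equals(chinese));
    }
}
```

Even with a correctly encoded file, the program used to open it must also read it as UTF-8, otherwise the characters can still look wrong.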

Encoding problem with some French characters: – œ

Hi,

I have been struggling with French character encoding for two days now and I really don't know what to do.
Here is the page I want to scrape: http://sortir.telerama.fr/concerts/ahmad-jamal-quartet,39949.php

The problem is with the dash character in the description ("– Idris Muhammad –"), which appears as ? when I get it in Java.

The process is:
In screen-scraper the character set is defined as UTF-8 and everything is OK in the log.
But when I invoke it from Java, I get a ?
instead of –
I use: RemoteScrapingSession(SiteCritiqueEncours,"localhost",8778,EncodageCar)
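
A ? in place of – is what Java produces when a string is encoded with a charset that has no mapping for that character. Both – (U+2013) and œ (U+0153) are missing from ISO-8859-1, for example. The sketch below reproduces the symptom in isolation; it does not use the screen-scraper API, the class and method names are made up, and whether ISO-8859-1 is the charset actually involved in this setup is an assumption.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

class DashEncodingSketch {
    // Encodes the text to bytes with the charset, then decodes those bytes with the same charset.
    static String roundTrip(String text, Charset charset) {
        return new String(text.getBytes(charset), charset);
    }

    public static void main(String[] args) {
        String dash = "\u2013"; // the dash from the post, written as a Unicode escape

        // UTF-8 can represent the dash, so it survives the round trip.
        System.out.println(roundTrip(dash, StandardCharsets.UTF_8).equals(dash)); // prints "true"

        // ISO-8859-1 cannot represent it; getBytes substitutes '?'.
        System.out.println(roundTrip(dash, StandardCharsets.ISO_8859_1)); // prints "?"
    }
}
```

If the log shows the correct character but the Java client does not, it is worth checking which charset each side of the connection uses when turning bytes into strings.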

Possible to generate a list of a scrape's tokens/variables?

Hi everyone,

Is there a way to see or generate a list of all tokens/variables that are defined and/or used in a particular scrape?

My situation is this: when I was a less-experienced SS user, I created multiple scrapes to extract different data from the same website. Now that I have graduated to being a semi-noob, I want to standardize these to only 1-2 scrapes. However, I want to review my list of tokens/variables (and if possible see where they are used) to leverage the extractor patterns/scripts that I've already built.
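
One possible workaround, sketched here rather than taken from any built-in screen-scraper feature, is to run a regular expression over the text of the extractor patterns (for example pasted or exported into a file) and count each ~@TOKEN@~ occurrence. The class and method names are made up, and the sketch assumes token names consist only of letters, digits, and underscores.

```java
import java.util.Map;
import java.util.TreeMap;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class TokenListSketch {
    // Assumes token names contain only letters, digits, and underscores.
    private static final Pattern TOKEN = Pattern.compile("~@([A-Za-z0-9_]+)@~");

    // Returns each token name found in the text with the number of times it occurs, sorted by name.
    static Map<String, Integer> countTokens(String text) {
        Map<String, Integer> counts = new TreeMap<>();
        Matcher m = TOKEN.matcher(text);
        while (m.find()) {
            counts.merge(m.group(1), 1, Integer::sum);
        }
        return counts;
    }

    public static void main(String[] args) {
        String sample = "<b>~@name@~</b><br />~@address@~<br />Phone: ~@phone@~ or ~@phone@~";
        System.out.println(countTokens(sample)); // prints "{address=1, name=1, phone=2}"
    }
}
```

Running this over each scrape separately gives a per-scrape token list that can be compared side by side before merging.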

Screen Scraper Noob - Need help extracting to Excel/.CSV file

Hey guys,
I'm very new at this and I have (basically) no experience with coding. I am "tech-savvy", so hopefully I can understand what to do if anyone is willing to help me.

So basically, I've been trying to scrape doctors' names and addresses off of a website. I've been able to make extractor patterns that extract ~@name@~, ~@address@~, and ~@phone@~ from the website, but I am unable to take those extractions and make use of them. I basically want them in a .csv file, each in its own column, so I can use them later to create mailing labels. Could anyone help?
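
The CSV part of this can be sketched independently of screen-scraper: each record becomes one line, fields are separated by commas, and any field that itself contains a comma, a quote, or a line break is wrapped in double quotes. The class and method names and the sample record below are made up for illustration; how the extracted name, address, and phone values are handed to such code inside a scrape is not shown.

```java
import java.util.ArrayList;
import java.util.List;

class CsvRowSketch {
    // Quotes a field if it contains a comma, a double quote, or a line break; embedded quotes are doubled.
    static String escape(String field) {
        if (field.contains(",") || field.contains("\"") || field.contains("\n") || field.contains("\r")) {
            return "\"" + field.replace("\"", "\"\"") + "\"";
        }
        return field;
    }

    // Joins the escaped fields with commas to form one CSV line (without a trailing line break).
    static String row(String... fields) {
        List<String> escaped = new ArrayList<>();
        for (String field : fields) {
            escaped.add(escape(field));
        }
        return String.join(",", escaped);
    }

    public static void main(String[] args) {
        System.out.println(row("name", "address", "phone")); // header line
        System.out.println(row("Dr. Jane Doe", "1 Main St, Suite 2", "555-0100")); // made-up record
    }
}
```

The quoting matters for addresses in particular, since an unquoted comma in "1 Main St, Suite 2" would push the phone number into a fourth column when the file is opened in Excel.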

Problem connecting screen-scraper to Oracle

Hi,

I'm trying to connect a script to an Oracle database. I added a classpath entry inside the screen-scraper file and it works fine.

But I would like to run screen-scraper from the terminal, like this:

/usr/bin/java -jar screen-scraper.jar -s "City Booking" --params "DISPLAY_RECORD_MIN=1,10"

But I get this error:

An error occurred while processing the script: Comprova Fitxers CityBooking
The error message was: The application script threw an exception: java.lang.NoClassDefFoundError: oracle/dms/console/DMSConsole BSF info: null at line: 0 column: columnNo
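
One thing worth checking: when the JVM is started with -jar, the -cp option and the CLASSPATH environment variable are ignored, and only the JAR's own manifest classpath is used. A classpath entry that works through a launcher file may therefore be missing in the plain `java -jar` invocation. The sketch below is a standalone diagnostic, not screen-scraper code; the class and method names are made up, and a script running inside screen-scraper may see a different class loader than this check does.

```java
class ClasspathCheckSketch {
    // Returns true if the named class can be located by this class's loader, without initializing it.
    static boolean isLoadable(String className) {
        try {
            Class.forName(className, false, ClasspathCheckSketch.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Shows the classpath the JVM was actually started with.
        System.out.println("java.class.path = " + System.getProperty("java.class.path"));
        // The class named in the error message from the post.
        System.out.println("DMSConsole loadable: " + isLoadable("oracle.dms.console.DMSConsole"));
    }
}
```

Running it once with the classpath from the launcher file and once without should show whether the Oracle JAR that contains the class is visible in each case.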