screen-scraper public support

Questions and answers regarding the use of screen-scraper. Anyone can post. Monitored occasionally by screen-scraper staff.

Probably a common extraction problem for beginners

Look at these 2 different ways rows are represented i the html:
"...
<td class="c2 n"><span class="neg">-0,33%</span></td>
<td class="c3 o n"><span class="neg">-0,50</span></td>
..."

"...
<td class="c2 n">-</td>
<td class="c3 o n">-</td>
..."

I would like to extract "-0,33%" and "-0,50" in the first example and "-" and "-" in the second example.

The span-tags is of course what creates trouble for me. Until a solution is found, I must now choose to get

using screen-scraper as a crawler

Hi,

can you use screen scraper to crawl various multiples urls and search for something on each page, please?

I think reading in a url list is no problem.

On the site you probably have to extract all links from it and extract from there.

But how do I get all links, where different sites have different structures?

Regards

Ben

Finding People Scraping Session Repository Available?

Does anyone have a scraping session(s) they'd be willing to share that can help me find contact information for people using the web?

Some example sites I'd REALLY like sessions for include:

1) Whitepages.com: Search by Address to find a person's contact info.
2) Pipl.com: Search by Name to find a person's contact info.
3) Spokeo: Search by name, phone, or email to find a person or their contact info.
4) Facebook: Search by e-mail address to see if a facebook profile exists.

International characters problem

I have searched and looked at other forum posts, but haven't really found anything that solves it for me.

These are the results (examples) i get in log and in the text file i write my results to:

UTF-8 chosen in both Settings and set in session (in script):
"...
HOME_TEAM_NAME=Str?msgodset
Storing this value in a session variable.
..."

UTF-16 chosen in both Settings and set in session (in script):
"...
HOME_TEAM_NAME=Str?msgodset
Storing this value in a session variable.
..."

ISO-8859-1 chosen in both Settings and set in session (in script):
"...

Multiplying Variables before Writing to File

How can I multiply my variables before they are placed in my CSV file?

I'm trying to manipulate data before it is actually written to the file. I've placed the following code at the beginning of my "writing data to file script".

session.setVariable( "PRICE", "1.5 * COST" );
I completely guessed on this one and I'm not surprised it didn't work.

I'm trying to multiply the variable COST by 1.5 and have PRICE be the result. Both PRICE and COST will be written to the CSV file.

out.write( dataRecord.get( "COST" ) + "\t" );

write data to .txt file

Sorry for my horrible English,
I configured my extractore pattern, and the test shows exactly what I want to extract, but i can't export data.
I read all the tutorials and tried to adapt the examples to my needs but I can't do it in any way.
These are my pattern:
Scheda Azienda~@NOME@~"
with flag save in session variable
and the output is :
0 STUDIO XXXX XXXX" class="_lms _noc" href="http://www.xyvz.it

and i try with this script

// Output a message to the log so we know that we'll be writing the text out to a file.
session.log( "Writing data to a file." );

Tutorial Shopping Site Down?

Is the Tutorial Shopping Site down?

The tutorial shopping site at http://www.screen-scraper.com/shop/ is not looking good. There are all sorts PHP Deprecated and Warning messages printed on the screen.

Here is a snippet scraped from the homepage:

<b>Deprecated</b>:  Function ereg_replace() is deprecated in <b>/var/www/html/screen-scraper/shop/includes/classes/template_func.php</b> on line <b>49</b><br />
<br />
<b>Deprecated</b>:  Function ereg_replace() is deprecated in <b>/var/www/html/screen-scraper/shop/includes/functions/html_output.php</b> on line <b>86</b><br />

getting different value when setting the variable by session

by matching pattern l am getting the matched value as
>X3
but when l am setting this in a variable and fetching it on a page is shows %26gt%3BX3
thats why the system is not taking the value right

can you help me how can l write or set the exact value l am getting

Rolling back to previous version from ver 6.0

I recently upgraded to the Basic edition version 6.0, but have found it to be very unstable, closing if I attempt to click on any scrapeable file or buttons related to scrapeable files.

I'm wondering, do I need to simply uninstall it and download an older version?

general questions

Why does screen scraper use a proxy to record the transactions?

How do you export your session as a .sss file?