screen-scraper public support

Questions and answers regarding the use of screen-scraper. Anyone can post. Monitored occasionally by screen-scraper staff.

Outputting UTF-16 text files from SS?

Hi everyone,

I'm in the process of scraping information from a European website, which means I have to handle and correctly output diacritical marks (accents, umlauts, etc.).
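For anyone facing the same question, here is a minimal sketch of writing scraped text as UTF-16 in plain Java. The class and method names (`Utf16Output`, `encodeUtf16`, `writeUtf16`) are made up for illustration, and how the text leaves screen-scraper itself is not shown; this only assumes you already have the scraped value as a Java `String`.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper, not part of screen-scraper.
class Utf16Output {

    // Encodes text as UTF-16. Java's UTF_16 charset writes a big-endian
    // byte-order mark (FE FF) followed by big-endian code units.
    static byte[] encodeUtf16(String text) {
        return text.getBytes(StandardCharsets.UTF_16);
    }

    // Writes the encoded bytes to a file, replacing any existing content.
    static void writeUtf16(Path path, String text) throws IOException {
        Files.write(path, encodeUtf16(text));
    }

    public static void main(String[] args) throws IOException {
        Path path = Files.createTempFile("scraped", ".txt");
        // \u00fc is u-umlaut and \u00e9 is e-acute, escaped so the source file's
        // own encoding does not matter.
        writeUtf16(path, "M\u00fcller, caf\u00e9\n");
        System.out.println("Wrote " + Files.size(path) + " bytes to " + path);
    }
}
```

If the program reading the file expects little-endian UTF-16, `StandardCharsets.UTF_16LE` could be swapped in, though that charset does not write a byte-order mark on its own.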

Tips for how to approach scraping this site?

I'm trying to scrape this page: http://ipldata.msdlouky.org/IPLCISEntry.aspx

I have a list of property addresses in column "A" of an Excel spreadsheet. Ideally, I'd like to use this spreadsheet to populate the fields on this site. However, this website requires the street number and street name to be entered separately, so I could, if needed, export the data into columns "A" and "B" of an XLS spreadsheet (street number and street name) to make it easier.
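As an alternative to splitting the column by hand, the split into street number and street name could be sketched in Java roughly as follows. This assumes every address starts with a numeric house number (optionally followed by one letter, as in "45B"); the class name `StreetAddressSplitter` is hypothetical, and addresses that do not fit that shape are simply reported as unmatched.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration only.
class StreetAddressSplitter {

    // Group 1: leading digits plus an optional single letter (e.g. "45B").
    // Group 2: everything after the following whitespace, trimmed.
    private static final Pattern ADDRESS =
            Pattern.compile("^\\s*(\\d+[A-Za-z]?)\\s+(.+?)\\s*$");

    // Returns {streetNumber, streetName}, or empty if the address does not
    // begin with a house number.
    static Optional<String[]> split(String address) {
        Matcher m = ADDRESS.matcher(address);
        if (!m.matches()) {
            return Optional.empty();
        }
        return Optional.of(new String[] { m.group(1), m.group(2) });
    }

    public static void main(String[] args) {
        for (String address : new String[] { "123 Main St", " 45B Oak Ave ", "Main St" }) {
            Optional<String[]> parts = split(address);
            System.out.println(address + " -> "
                    + parts.map(p -> p[0] + " | " + p[1]).orElse("no house number found"));
        }
    }
}
```

The two returned values would then be fed into the site's separate street number and street name fields.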

Looking for commercial GERMAN support..

Before buying screen-scraper, I'm looking for
German-language support that would

1. create a scrape file
- 1 online shop
- 2 search terms
- approx. 100 items that then need to be scraped
- approx. 4-5 variables (item name, item number, price, website)

2. provide telephone support while I create my own scrape file
Billed by the hours required

If interested, please contact [email protected].

Many thanks
Boergi

A comparison site using screen scraper. How?

Hi,

I'm trying to build a site that takes search terms and then returns results from two or three
separate sites. I'm not quite sure how this would work with screen-scraper. Can anyone explain the process flow for a comparison site like this and, if possible, point me toward an
example or two here on the site?

Any help is appreciated; even a partial solution might help me out.

Thanks in advance

Pausing for random periods between each pattern match

How might I pause for a random period of time between each pattern match? I'm already running a script for each pattern match. Do I add code to that one or add an additional script to the list?

Can someone post the complete code of a script I could use?

I've seen many posts, but none of them seem to work all by themselves, and I'm a super-newbie.

This post seems to be most like what I'm looking for: http://community.screen-scraper.com/script_repository/BreakPoint

Code here:
------------

import java.util.Random;

// Pauses scraping session each time script is run.
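Since the excerpt above is cut off, here is a separate, minimal sketch of a random pause in plain Java. It is not the BreakPoint script from the repository; the class name `RandomPause` and the 2 to 5 second bounds are assumptions chosen for illustration, and it relies only on `java.util.Random` and `Thread.sleep`. Inside screen-scraper, the body of `pauseRandomly` would presumably go into the script that already runs after each pattern match.

```java
import java.util.Random;

// Hypothetical helper for illustration only.
class RandomPause {

    private static final Random RANDOM = new Random();

    // Picks a delay in milliseconds between minMillis and maxMillis, inclusive.
    static int randomDelayMillis(int minMillis, int maxMillis) {
        if (minMillis < 0 || maxMillis < minMillis) {
            throw new IllegalArgumentException("need 0 <= minMillis <= maxMillis");
        }
        // nextInt(bound) returns a value from 0 (inclusive) to bound (exclusive).
        return minMillis + RANDOM.nextInt(maxMillis - minMillis + 1);
    }

    // Blocks the current thread for a randomly chosen delay and returns it.
    static int pauseRandomly(int minMillis, int maxMillis) throws InterruptedException {
        int delay = randomDelayMillis(minMillis, maxMillis);
        Thread.sleep(delay);
        return delay;
    }

    public static void main(String[] args) throws InterruptedException {
        // Assumed bounds: wait somewhere between 2 and 5 seconds.
        int waited = pauseRandomly(2000, 5000);
        System.out.println("Paused for " + waited + " ms");
    }
}
```

Because the pause happens wherever the call is made, adding it to the existing per-match script should be enough; a second script would only be needed to keep the two concerns apart.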

Separating City, ST ZIP+5

I'm trying to split a mailing city, state, and zip into three separate sub-extractor patterns. I currently use the following sub-extractor pattern to get the mailing address of an individual:

Mailing Add ~@MAILING_STREET@~<

However, the result always includes the City, a comma, two-letter state code, and the zip code. Sometimes the zip code is only 5 numbers, and sometimes it includes a "-" and the other 4 numbers. "CITY, ST XXXXX-XXXX"

I want to create three DIFFERENT sub-extractor patterns rather than just 1.
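One way to picture the three pieces is as a regular expression with three groups, sketched here in plain Java rather than as screen-scraper sub-extractor patterns. The class name `CityStateZip` and the sample addresses are made up, and the sketch assumes the "CITY, ST XXXXX" or "CITY, ST XXXXX-XXXX" shape described above, with an upper-case two-letter state code.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration only.
class CityStateZip {

    // Group 1: city (everything before the comma).
    // Group 2: two-letter upper-case state code.
    // Group 3: five-digit zip, optionally followed by "-" and four digits.
    private static final Pattern LINE =
            Pattern.compile("^\\s*(.+?),\\s*([A-Z]{2})\\s+(\\d{5}(?:-\\d{4})?)\\s*$");

    // Returns {city, state, zip}, or empty if the text has a different shape.
    static Optional<String[]> parse(String text) {
        Matcher m = LINE.matcher(text);
        if (!m.matches()) {
            return Optional.empty();
        }
        return Optional.of(new String[] { m.group(1), m.group(2), m.group(3) });
    }

    public static void main(String[] args) {
        for (String line : new String[] { "Louisville, KY 40202", "St. Louis, MO 63101-1234" }) {
            String[] p = parse(line).orElseThrow(IllegalStateException::new);
            System.out.println(p[0] + " | " + p[1] + " | " + p[2]);
        }
    }
}
```

The same three character classes (`[A-Z]{2}` for the state, `\d{5}(?:-\d{4})?` for the zip, and "anything up to the comma" for the city) could likely serve as the regular expressions for three separate tokens, though how they are entered in screen-scraper is not covered here.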

How to scrape this website

Hi guys,

Can anyone help me with how to scrape data from this website?
www.urbanlyrics.com

I have tried to follow the tutorials on how to use screen-scraper, but the tutorials and video tour are near useless in my opinion.

Can anyone help?

Automatically Scraping Pages listed on an Index

Hi,

I have just downloaded screen-scraper and gone through a tutorial--it looks like an awesome application.

I was wondering if anybody could tell me how I would go about screen-scraping all of the websites listed on an index page?

I would normally think that I could scrape the index for all of the anchor tags (which I was successfully able to do thanks to the tutorial), but then I'd like to use a script to create new Scrape jobs with the URLs that were scraped from the index.
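The second half of that idea, turning the scraped anchor tags into a list of URLs to visit, might look roughly like this in plain Java. The class name `IndexLinks`, the sample HTML, and the `example.com` base URL are all hypothetical, and the regular expression is a deliberately simple one that only handles quoted `href` values; how each URL is then handed to a screen-scraper scraping session is not shown.

```java
import java.net.URI;
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration only.
class IndexLinks {

    // Matches <a ... href="..."> or <a ... href='...'> and captures the href value.
    private static final Pattern HREF = Pattern.compile(
            "<a\\s[^>]*?href\\s*=\\s*[\"']([^\"']+)[\"']", Pattern.CASE_INSENSITIVE);

    // Returns every link on the page as an absolute URL, resolved against baseUrl.
    static List<String> extractLinks(String html, String baseUrl) {
        URI base = URI.create(baseUrl);
        List<String> links = new ArrayList<>();
        Matcher m = HREF.matcher(html);
        while (m.find()) {
            try {
                links.add(base.resolve(m.group(1).trim()).toString());
            } catch (IllegalArgumentException e) {
                // Skip hrefs that are not valid URIs (for example, ones containing spaces).
            }
        }
        return links;
    }

    public static void main(String[] args) {
        String html = "<a href=\"/a.html\">A</a> <a class='x' href='http://example.com/b'>B</a>";
        for (String url : extractLinks(html, "http://example.com/index.html")) {
            // In a real run, each URL would be passed on to be scraped in turn.
            System.out.println(url);
        }
    }
}
```

Looping over the returned list and scraping one URL per iteration avoids having to create a separate job per link by hand.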

The Amazing Disappearing Edit Token window!

Upgraded from SS 5.0 to 5.5 and the Edit Token window has disappeared from my Mac G4!

Scraping this site with directories, then search results using Ajax.

We are testing several sites for scraping, and came across this site which seems to be something different. We can't find any guides online for our interns to follow, so we are wondering if you could give some heads-up on this.

We are trying to scrape this: http://www.hotelscombined.com/CountryAll/Argentina.htm