screen-scraper public support
Set url in a scrapeable file?
Hi, I would like to ask if I can change the URL of a scrapeable file.
For example, I scrape a site, www.example.com, which requires login.
But after logging in, the site's URL changes to www21.example.com.
I'm looking for a way to put the number in the scrapeable file's URL field, because this number changes every time you log in to the site.
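One way to think about this is to read the number out of the address you land on after login and reuse it when building later URLs. The sketch below is plain Python, not screen-scraper script; the function names `host_number` and `with_host_number` and the `/home` and `/account` paths are made up for illustration, and it assumes the number always sits directly after `www` in the host name.

```python
import re
from urllib.parse import urlsplit, urlunsplit

def host_number(url):
    """Return the digits that follow 'www' in the host, or '' if there are none."""
    host = urlsplit(url).hostname or ""
    match = re.match(r"www(\d*)\.", host)
    return match.group(1) if match else ""

def with_host_number(url, number):
    """Rewrite a www.* URL so that its host carries the given number."""
    parts = urlsplit(url)
    new_host = re.sub(r"^www\d*\.", f"www{number}.", parts.netloc, count=1)
    return urlunsplit((parts.scheme, new_host, parts.path, parts.query, parts.fragment))

# Hypothetical URLs: the address seen after login and a page to request next.
after_login = "http://www21.example.com/home"
number = host_number(after_login)
next_url = with_host_number("http://www.example.com/account", number)
print(next_url)
```

The same idea should carry over to any tool that lets you store the extracted number in a variable and substitute it into the URL field.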
Command Line Access in OS X
Hi guys. Great program (like you didn't already know that).
screen-scraper bug on Ubuntu?
Hi kind people,
screen-scraper does not work very well on my Ubuntu PC. It is fine when I run a scrape session, but sometimes, after I export a scrape session, screen-scraper becomes uneditable. I can create new patterns and new scrapeable files, but I can't edit anything, such as renaming or coding. All the text fields, such as the script body and the scrape pattern name, are uneditable.
It works again after I restart screen-scraper, but after I export a scrape session or run a scrape session, everything becomes uneditable again.
Waiting for your help, thanks!
Extractor Generated Code Not Working!
I took a site named:
http://www.scottsla.com/guides-books-maps-photos.htm
I then scraped it. I was trying to extract the category name, listings, and the hyperlink. I was using this as a test to get familiar with the code:
I used this as a main extractor:
regular exp to extract dollar amount
I have a token in an extraction pattern that has a lot of garbage in it, and I want to extract the first occurrence of a number in the form
$xxx,xxx
OR
$x,xxx,xxx
OR
$xx,xxx,xxx
But I can't seem to get the regular expression function to work. What's the right regular expression for this?
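A pattern along these lines might do it: a dollar sign, one to three digits, then one or more groups of a comma followed by three digits. The sketch below tests that pattern in Python; the sample string and the name `first_dollar_amount` are invented for illustration. Note that it is slightly broader than the three forms listed, since it also accepts amounts such as $1,000.

```python
import re

# A dollar sign, one to three digits, then one or more ",ddd" groups.
DOLLAR_AMOUNT = re.compile(r"\$\d{1,3}(?:,\d{3})+")

def first_dollar_amount(text):
    """Return the first comma-grouped dollar amount in text, or None."""
    match = DOLLAR_AMOUNT.search(text)
    return match.group(0) if match else None

# Made-up token contents with surrounding garbage.
sample = "Listed 3/2 blah Price: $1,250,000 (was $1,300,000) MLS#123"
print(first_dollar_amount(sample))  # prints $1,250,000
```

Most regular expression engines accept the same syntax, so the pattern itself should be reusable as-is; only the surrounding function is Python-specific.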
How to write an Extractor!
I am trying to see what I need to look at to write an extractor pattern. I did one that I received from here and it worked, but as I looked through the code, I could not figure out where it came from. I was looking for more examples similar to what we would be scraping, with different fields and parameters, and I wanted to know what the extractor pattern would be for this site:
How to change character set when calling server from .Net?
Hi
Just downloaded a trial, and everything seems awesome. I have set up a screen scrape of a site containing the Danish characters "æ", "ø", "å". When I run the scraping session from the interface, the scraped data is returned correctly. However, when I run the session from a .Net application calling the scrape server, "æ", "ø", "å" are returned as "?". What character set is used when calling the scrape server, and how do I change it?
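As background on the symptom only (this is not the fix referred to in the post, and it is Python rather than .Net): a "?" in place of each letter is what typically appears when text is encoded with a character set that cannot represent it. A minimal sketch, assuming ASCII as the incapable charset:

```python
text = "æøå"

# Encoding with a charset that lacks these letters substitutes "?" for each one.
lossy = text.encode("ascii", errors="replace").decode("ascii")
print(lossy)  # prints ???

# Encoding and decoding with the same capable charset keeps them intact.
intact = text.encode("utf-8").decode("utf-8")
print(intact)  # prints æøå

# Decoding UTF-8 bytes with a different charset gives garbled text, not "?".
garbled = text.encode("utf-8").decode("latin-1")
```

So "?" usually points at the encoding step on one side of the connection, while garbled multi-character sequences point at a mismatch between the two sides.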
I figured it out myself using this:
Help with output
I have run into a problem with the way I am currently outputting my data. I am writing the data to CSV, and after each page is scraped it is saved as a separate CSV file named after the page. However, I want one big file instead of multiple separate files. For example, I have multiple files named a.csv, b.csv, c.csv, ... Each file contains a schedule that includes multiple dates, locations, and times. The page title (a, b, c, ...) appears only once on each page.
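One possible approach is to open a single file in append mode after each page and repeat the page title on every row, so the rows stay distinguishable once they share a file. The sketch below is plain Python under assumed names: `append_rows`, the file name `all_pages.csv`, the column headers, and the sample rows are all made up for illustration.

```python
import csv
import os
import tempfile

def append_rows(path, page_title, rows):
    """Append schedule rows to one CSV, tagging each row with its page title."""
    is_new = not os.path.exists(path) or os.path.getsize(path) == 0
    with open(path, "a", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        if is_new:
            # Write the header only once, when the file is first created.
            writer.writerow(["page", "date", "location", "time"])
        for date, location, time in rows:
            writer.writerow([page_title, date, location, time])

# Demo with invented data: call append_rows once per scraped page.
with tempfile.TemporaryDirectory() as folder:
    out = os.path.join(folder, "all_pages.csv")
    append_rows(out, "a", [("2010-01-05", "Hall 1", "09:00")])
    append_rows(out, "b", [("2010-01-06", "Hall 2", "10:30")])
    with open(out, newline="", encoding="utf-8") as handle:
        combined = list(csv.reader(handle))
print(combined)
```

Appending avoids having to hold every page in memory or merge the files afterwards; the cost is one extra column carrying the page title.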
How to invoke the script!
I am just starting with screen scraper but I have a question about how the script runs:
I took this site:
http://www.tours.com/tours_vacations/alaska.htm
I took a suggestion from Tim and used this main extractor pattern:
~@DATARECORD@~
And this sub-extractor pattern:
~@ENTRY_NAME@~
~@DESCRIPTION@~
LOCATION:~@LOCATION@~
DESTINATION(S):~@DESTINATIONS@~
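To get a feel for what a pattern like this captures, it can help to think of each `~@NAME@~` token as a named capture group and everything else as literal text. The sketch below is a rough Python analogy, not how screen-scraper implements tokens: it assumes each token matches up to the end of its line, and `pattern_to_regex` and the sample record are invented for illustration.

```python
import re

TOKEN = re.compile(r"~@(\w+)@~")

def pattern_to_regex(pattern):
    """Turn literal text with ~@NAME@~ tokens into a regex with named groups."""
    pieces, last = [], 0
    for token in TOKEN.finditer(pattern):
        # Text between tokens must match literally.
        pieces.append(re.escape(pattern[last:token.start()]))
        # Assumption: a token captures everything up to the end of the line.
        pieces.append(rf"(?P<{token.group(1)}>[^\n]*)")
        last = token.end()
    pieces.append(re.escape(pattern[last:]))
    return re.compile("".join(pieces))

sub_pattern = "LOCATION:~@LOCATION@~\nDESTINATION(S):~@DESTINATIONS@~"
# Made-up record text in the shape the pattern expects.
record = "LOCATION:Anchorage\nDESTINATION(S):Denali, Kenai Fjords"

fields = pattern_to_regex(sub_pattern).search(record).groupdict()
print(fields)
```

Read this way, the main pattern picks out one record at a time and the sub-extractor pattern pulls the named fields out of each record.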
Commas in the URL replaced by plus
This is the URL of the search result I want to scrape..