screen-scraper public support

Questions and answers regarding the use of screen-scraper. Anyone can post. Monitored occasionally by screen-scraper staff.

Writing the Search Term to a file?

Hi, this is slightly related to my previous post. How can I save the search term along with the data I am scraping?

In the example below, I am trying to save the session variable SEARCH along with the data I am extracting from the web page. However, it just returns a null value.

Thanks,

Bhavesh

Using multiple search terms (extension of Tutorial 7)

Hi,

I want to extend the features of Tutorial 7 and wanted to know if that is possible.

The tutorial shows one search term per line.

Is there a way to add a variable to it that will be used in the outputting of a file? Let me explain.

I want to scrape a website with multiple search terms, but I also want to record the context of the search terms. For example:

The search file would be set up something like this:

IMCL, Imclone
Hsp90, Imclone
MEK, Array
MEK, AstraZeneca
AZD, AstraZeneca
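
A minimal sketch of how a file in this "term, context" layout could be read, assuming each line holds exactly one comma-separated pair. The class and method names (`SearchTermFile`, `Entry`, `parse`) are made up for illustration and are not part of screen-scraper or Tutorial 7. In a screen-scraper script, the two values would presumably be stored as session variables before the scrapeable file is called, so that the context is still available when the output file is written.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of screen-scraper: splits "term, context" lines.
final class SearchTermFile {

    // One parsed line: the term to search for and the context to record with it.
    static final class Entry {
        final String term;
        final String context;

        Entry(String term, String context) {
            this.term = term;
            this.context = context;
        }
    }

    // Parses each non-blank line into a term and a context.
    // A line without a comma yields an empty context.
    static List<Entry> parse(List<String> lines) {
        List<Entry> entries = new ArrayList<>();
        for (String line : lines) {
            if (line.trim().isEmpty()) {
                continue; // skip blank lines
            }
            String[] parts = line.split(",", 2); // split on the first comma only
            String term = parts[0].trim();
            String context = parts.length > 1 ? parts[1].trim() : "";
            entries.add(new Entry(term, context));
        }
        return entries;
    }

    public static void main(String[] args) {
        List<String> lines = Arrays.asList(
                "IMCL, Imclone", "Hsp90, Imclone", "MEK, Array",
                "MEK, AstraZeneca", "AZD, AstraZeneca");
        for (Entry e : parse(lines)) {
            // Tab-separated, the same style of output the tutorial scripts write.
            System.out.println(e.term + "\t" + e.context);
        }
    }
}
```

Splitting on the first comma only means a context that itself contains a comma stays in one piece.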

Multiple Pattern Extractors not working

On a page I have

[quote]---PAGE CONTENT---

---PAGE CONTENT---

---PAGE CONTENT---

---PAGE CONTENT---[/quote]

SS pulls up the wrong page on one of several

Hi,

I am trying to run a scraping session and I have come quite far. In fact, I feel that I am nearly done setting it up.

However, I am having a very strange problem. A search yields 4 pages of results.

Screen-scraper properly brings up pages 1, 2, and 4, but for page 3 it gets a different page.

When I copy and paste the raw URLs into any of several web browsers on different machines, I get the expected result. But SS consistently does not get page 3 correct.

Here are the links:

page 1:

Extractor pattern

I have two conditions that decide whether to continue to the next page.
One is
48

The other one is
class="pagerNotCurrent">Next

If either one finds its pattern, it should call the next_page script.

How should I handle this situation?

I tried using a sub-extractor pattern, but it didn't work out.

Thank you!
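
One way to express the "either condition" rule is a single check that runs once per page and returns true as soon as any marker is found. This is only a sketch in plain Java: `NextPageCheck` and `hasNextPage` are hypothetical names, the page HTML is passed in as a string, and the first marker is taken literally as `48` from the post, so it probably needs to be replaced by the full pattern it was cut from. Calling the next_page script when the check succeeds is left to the scraping session.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of screen-scraper.
final class NextPageCheck {

    // Returns true if the page contains at least one of the marker strings.
    static boolean hasNextPage(String html, List<String> markers) {
        for (String marker : markers) {
            if (html.contains(marker)) {
                return true; // one match is enough, no need to test the rest
            }
        }
        return false;
    }

    public static void main(String[] args) {
        // Markers as they appear in the post; "48" is likely incomplete.
        List<String> markers = Arrays.asList("48", "class=\"pagerNotCurrent\">Next");

        String page = "<a class=\"pagerNotCurrent\">Next</a>";
        if (hasNextPage(page, markers)) {
            System.out.println("would call next_page");
        }
    }
}
```

Because the check fires at most once per page, the next-page script is not called twice when both markers are present.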

Getting past Javascript popups

I'm trying to automate several web tasks. For the current task I want to scrape all the IDs on a page and then submit them to the server. The scraping part goes well, but then a JavaScript popup appears requiring me to click OK. How do I get around this, or interact with it, to pass the data?

What does -1 for start mean in a session?

I am seeing some scraping sessions just kind of hanging and leaving 0-length log files. When I view the list of sessions on that server (using SOAP), the "start" is not a time (like 1176758577171) but -1. I cannot stop or remove the sessions (again, via my SOAP interface).

Any ideas?

Need Help with scraping project!

Hey,

I've successfully scraped about 10 sites in the past. I'm trying to do another, but this new one is really doing me in.

The site I'm trying to scrape is

http://www.walgreens.com/storelocator/find.jsp

I'm trying to get the store addresses for a list of zip codes.

a script to have access to a variable of another scrapeableF

Hello again! :)

One question:
I have two scrapeable files and two different scripts. Each of these scripts needs to access the contents of variables that are defined and populated in the scrapeable file that does NOT call that script. How can I do this?

Using the regular procedure (out.write( dataRecord.get( "USERID" ) + "\t" );)
I get null content in the variable, or an error that it wasn't defined.

Thanks a lot,
Carlos.

1 or many sub-items of an item

Hi Guys,
One item could have been written in only one language, as below:

  • Translations: English

Or it could have been written in 2, 3, 4 or any number of languages. The example below shows two:

  • Translations: English, French

How can I take care of this pattern?

Thanks a lot,
Carlos.
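
A possible approach, sketched in plain Java: capture everything after "Translations:" as one value, then split it on commas, so one language and many languages are handled the same way. The class and method names (`TranslationList`, `languages`) are hypothetical, and the sketch assumes the whole list sits on a single line of text.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of screen-scraper.
final class TranslationList {

    // Captures the rest of the line after "Translations:".
    private static final Pattern LINE = Pattern.compile("Translations:\\s*(.+)");

    // Returns the languages listed after "Translations:", or an empty list if absent.
    static List<String> languages(String text) {
        List<String> result = new ArrayList<>();
        Matcher m = LINE.matcher(text);
        if (!m.find()) {
            return result;
        }
        for (String part : m.group(1).split(",")) {
            String language = part.trim();
            if (!language.isEmpty()) {
                result.add(language);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        System.out.println(languages("Translations: English"));
        System.out.println(languages("Translations: English, French"));
    }
}
```

Splitting after extraction avoids needing one pattern per possible number of languages.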