Scraping "outline" formats
Hi folks,
Is there a way to scrape and write data published in an outline format by using the standard Extractor Pattern interface (i.e., little java)? Here's what I mean (swapped out brackets so the code won't parse):
[td class="day-with-date"]
[span>13[/span]
[span class="calnk"]...
[span class="calnk"]...
[/td]
[td class="day-with-date"]
[span>14[/span]
[span class="calnk"]...
[/td]
etc.
I'd like to scrape the date "13" then scrape the first calnk info and write it including 13 to csv, then scrape the second calnk info and write it including 13 to csv. Then scrape "14" and scrape the calnk info and write it including 14 to csv, etc. Basically a sub-loop within a loop done sequentially.
Thanks
The method you need is
The method you need is scrapeableFile.extractData(), and I think the example on there should get you started.