Strategy for multiple tables
What would be the best strategy to harvest data from the following set of tables?
All the data appears on one page, but the number of table rows may vary.
2009
1st quarter
50
$80,802
$82,500
2nd quarter
55
$80,948
$72,112
3rd quarter
74
$48,495
$65.445
4th quarter
58
$68,455
$86,445
Total:
307
$89,290
$80,000
2008
1st quarter
50
$80,802
$82,500
(and so on, for only three years, max)
Nice enough, except that sometimes the data isn't available for all three years and then the missing quarters are not displayed.
2009
2nd quarter
55
$80,948
$72,112
3rd quarter
74
$48,495
$65.445
Total:
307
$89,290
$80,000
2008
2nd quarter
66
$60,948
$84,112
4th quarter
76
$65,455
$73,339
Total:
307
$89,290
$80,000
etc.
I'd like to be able to write lines to the file that look like this...
2009 1st quarter 50 $80,802 $82,500 (separated by tabs, due to the commas in the dollar amts)
2009 2nd quarter 55 $80,948 $72,112
etc.
Can I use the extraction pattern approach or do I need to go with the 'extractdata' function in a script, or something else?
Any help would be much appreciated.
Thanks much.
code block on forum
Hi,
I'm not sure it you were looking for this specifically, but you can submit code in the forum by using the code tag.
It looks like this "<" code ">" and the end is "<" /code ">" ~please ignore quotes
Hope that helps your posts.
I've been working on it, and
I've been working on it, and I had been going down the path you suggest.
Looking good, now for the tweaking.
Thanks.
You would need to use a fancy
You would need to use a fancy implantation of: http://community.screen-scraper.com/API/extractData
I think you'd need one extractor to get each year, and all the HTML for that year, then call a separate extractor for each quarter. If the number of rows per quarter change, you can make a separate extractor for that too, but from what I see, that would be unnecessary.
Sorry, can't figure out how
Sorry, can't figure out how to attach source html code.
Let me know and I'll include that in the comment.
Thanks.