Separating City, ST ZIP+5
I can figure out how to do the City because it has a comma after it, not another sub-extractor name. But what about when the sub-extractor pattern involves another sub-extractor pattern? Then what? Mine look completely identical... Is that the right way to do it?
1) For the City:
Mailing Add
2)For the State:
Mailing Add
3) For the Zip:
Mailing Add
PS. I ONLY want to scrape the first 5 digits of the zip code if that's possible.
I do this all the time, and what you really need to do is to add regular expressions to your tokens.
If they are
>~@ADDRESS@~, ~@CITY@~ ~@STATE@~ ~@ZIP@~
the RegEx (in order) would be:
[^<>]*
[^<>,]*
[A-Z]{2}
\d{5}
Notice there is no HTML tag after the ~@ZIP@~. With that RegEx in it, it will get the first 5 digits, and the presence/absence of the +4 won't affect it.
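To see how those four expressions behave together outside of screen-scraper, here is a rough Python sketch. It simply swaps each ~@TOKEN@~ for its RegEx as a named group; that is only an approximation of what screen-scraper does internally, not its actual mechanism. The sample lines and the split_address helper are made up for illustration, and the samples assume the text is laid out exactly like the pattern above (comma after the address, spaces between city, state, and zip).

```python
import re

# The extractor pattern above, with each ~@TOKEN@~ replaced by its RegEx
# as a named group. Nothing follows the ZIP group, so a trailing +4 is ignored.
ADDRESS_LINE = re.compile(
    r">(?P<ADDRESS>[^<>]*), (?P<CITY>[^<>,]*) (?P<STATE>[A-Z]{2}) (?P<ZIP>\d{5})"
)

def split_address(html):
    """Hypothetical helper: return the four pieces as a dict, or None if no match."""
    match = ADDRESS_LINE.search(html)
    return match.groupdict() if match else None

# Made-up sample lines: one with a +4 extension, one without.
samples = [
    "<td>123 Main St, Springfield IL 62704-1234</td>",
    "<td>123 Main St, Springfield IL 62704</td>",
]

for sample in samples:
    print(split_address(sample))
```

Both samples should come back with the same five-digit ZIP, which is the point of leaving nothing after the \d{5}.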
Can you link to a page that explains what to do with the answer you just gave me? It's probably simple but I can't find it when I search...
JeffCole,
Jason is suggesting adding those regular expressions to each of the tokens in your extractor pattern text field.
-Scott
GREAT
Thank you, it's working perfectly. This page is helpful too: http://community.screen-scraper.com/helpful_regex