Separating City, ST ZIP+5

I can figure out how to do the City because it has a comma after it, not another sub-extractor name. But what about when the sub-extractor pattern involves another sub-extractor pattern, then what? Mine look completely identical...Is that the right way to do it??

1)For the City:

Mailing Add ~@MAILING_CITY@~,

2)For the State:

Mailing Add ~@MAILING_CITY@~, ~@MAILING_CITY@~ ~@MAILING_ZIP@~<

3) For the Zip:

Mailing Add ~@MAILING_CITY@~, ~@MAILING_CITY@~ ~@MAILING_ZIP@~<

PS. I ONLY want to scrape the first 5 digits of the zip code if that's possible.

I do this all the time, and

I do this all the time, and what you really need to do is to add regular expressions to your tokens.

If they are
>~@ADDRESS@~, ~@CITY@~ ~@STATE@~ ~@ZIP@~
the RegEx (in order) would be:
[^<>]*
[^<>,]*
[A-Z]{2}
\d{5}

Notice there is no HTML tag after the ~@ZIP@~, with that RegEx in it, it will get the first 4 digits, and the presence/absence of the +4 won't affect it.

Can you link to a page that

Can you link to a page that explains what to do with the answer you just gave me? It's probably simple but I can't find it when I search...

JeffCole, Jason is suggesting

JeffCole,

Jason is suggesting adding those regular expressions to each of the tokens in your extractor pattern text field.

-Scott

GREAT

Thank you, it's working perfectly. This page is helpful too: http://community.screen-scraper.com/helpful_regex