You'd want a script like this, and I would run it on the first extractor "before pattern is applied".
You'd want a script like
You'd want a script like this, and I would run it on the first extractor "before pattern is applied".
String s = scrapeableFile.getContentAsString();
s = StringUtils.replacePattern(s, "<p[^>]*>", "<p>");
scrapeableFile.setLastScrapedData(s);
You could tweak the replace to get better what you want.