Package org.apache.tika.eval.core.util
Class ContentTagParser
java.lang.Object
org.apache.tika.eval.core.util.ContentTagParser
-
Constructor Summary
Constructors -
Method Summary
-
Constructor Details
-
ContentTagParser
public ContentTagParser()
-
-
Method Details
-
parseXML
public static ContentTags parseXML(String html, Set<String> uppercaseTagsOfInterest) throws TikaException, IOException, SAXException - Throws:
TikaException
IOException
SAXException
-
parseHTML
public static ContentTags parseHTML(String html, Set<String> uppercaseTagsOfInterest) throws SAXException, IOException - Throws:
SAXException
IOException
-