Package org.apache.tika.parser.html
-
Interface Summary Interface Description HtmlMapper HTML mapper used to make incoming HTML documents easier to handle by Tika clients. -
Class Summary Class Description DataURIScheme DataURISchemeUtil Not thread safe.DefaultHtmlMapper The default HTML mapping rules in Tika.HtmlEncodingDetector Character encoding detector for determining the character encoding of a HTML document based on the potential charset parameter found in a Content-Type http-equiv meta tag somewhere near the beginning.IdentityHtmlMapper Alternative HTML mapping rules that pass the input HTML as-is without any modifications.JSoupParser HTML parser. -
Exception Summary Exception Description DataURISchemeParseException