Class XWPFWordExtractorDecorator
java.lang.Object
org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
org.apache.tika.parser.microsoft.ooxml.XWPFWordExtractorDecorator
All Implemented Interfaces:
OOXMLExtractor
Field Summary
Fields inherited from class org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
config, EMBEDDED_RELATIONSHIPS, extractor
Constructor Summary
Constructors
Constructor | Description
XWPFWordExtractorDecorator(Metadata metadata, ParseContext context, org.apache.poi.xwpf.extractor.XWPFWordExtractor extractor) |
XWPFWordExtractorDecorator(ParseContext context, org.apache.poi.xwpf.extractor.XWPFWordExtractor extractor) | Deprecated.
Method Summary
Modifier and Type | Method | Description
protected void | buildXHTML(XHTMLContentHandler xhtml) | Populates the XHTMLContentHandler object received as parameter.
protected Map<String, EmbeddedPartMetadata> | getEmbeddedPartMetadataMap() |
protected List<org.apache.poi.openxml4j.opc.PackagePart> | getMainDocumentParts() | Includes the main body and anything else that can have an attachment or embedded object.
Methods inherited from class org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
getDocument, getJustFileName, getMetadataExtractor, getXHTML, handleEmbeddedFile, loadLinkedRelationships
Constructor Details

XWPFWordExtractorDecorator
public XWPFWordExtractorDecorator(Metadata metadata, ParseContext context, org.apache.poi.xwpf.extractor.XWPFWordExtractor extractor)

XWPFWordExtractorDecorator
@Deprecated public XWPFWordExtractorDecorator(ParseContext context, org.apache.poi.xwpf.extractor.XWPFWordExtractor extractor)
Parameters:
context
extractor
Method Details

buildXHTML
protected void buildXHTML(XHTMLContentHandler xhtml) throws SAXException, org.apache.xmlbeans.XmlException, IOException
Description copied from class: AbstractOOXMLExtractor
Populates the XHTMLContentHandler object received as parameter.
Specified by:
buildXHTML in class AbstractOOXMLExtractor
Throws:
SAXException
org.apache.xmlbeans.XmlException
IOException
See Also:
XWPFWordExtractor.getText()
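The "Specified by" note for buildXHTML suggests a template-method arrangement: the base class owns the overall output and calls buildXHTML to fill in the document-specific body. The following is a minimal, library-free sketch of that shape, assuming the inherited getXHTML is what drives buildXHTML. The names ExtractorSketch and WordExtractorSketch are hypothetical stand-ins, not Tika classes, and a StringBuilder stands in for XHTMLContentHandler.

```java
import java.util.List;

// Hypothetical stand-in for AbstractOOXMLExtractor: it owns the document
// skeleton and delegates the body to a subclass hook.
abstract class ExtractorSketch {
    // Template method: wraps whatever buildXHTML emits in a <body> element.
    final String getXHTML() {
        StringBuilder xhtml = new StringBuilder("<body>");
        buildXHTML(xhtml);
        return xhtml.append("</body>").toString();
    }

    // Hook that concrete extractors implement (assumed analogue of buildXHTML).
    protected abstract void buildXHTML(StringBuilder xhtml);
}

// Hypothetical stand-in for XWPFWordExtractorDecorator: emits one <p> per paragraph.
class WordExtractorSketch extends ExtractorSketch {
    private final List<String> paragraphs;

    WordExtractorSketch(List<String> paragraphs) {
        this.paragraphs = paragraphs;
    }

    @Override
    protected void buildXHTML(StringBuilder xhtml) {
        // This sketch does no character escaping; it only shows the call structure.
        for (String text : paragraphs) {
            xhtml.append("<p>").append(text).append("</p>");
        }
    }
}
```

Under these assumptions, callers only ever invoke getXHTML on the base type, while the Word-specific traversal lives entirely in the buildXHTML override.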
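getEmbeddedPartMetadataMap
protected Map<String, EmbeddedPartMetadata> getEmbeddedPartMetadataMap()
Overrides:
getEmbeddedPartMetadataMap in class AbstractOOXMLExtractor

getMainDocumentParts
protected List<org.apache.poi.openxml4j.opc.PackagePart> getMainDocumentParts()
Includes the main body and anything else that can have an attachment or embedded object.
Specified by:
getMainDocumentParts in class AbstractOOXMLExtractor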
-
XWPFWordExtractorDecorator(Metadata, ParseContext, XWPFWordExtractor)