Class XSLFPowerPointExtractorDecorator
java.lang.Object
org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
org.apache.tika.parser.microsoft.ooxml.XSLFPowerPointExtractorDecorator
- All Implemented Interfaces:
OOXMLExtractor
-
Field Summary
Fields inherited from class org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
config, EMBEDDED_RELATIONSHIPS, extractor
-
Constructor Summary
Constructors
Constructor, Description
XSLFPowerPointExtractorDecorator(Metadata metadata, ParseContext context, org.apache.poi.xslf.extractor.XSLFExtractor extractor)
-
Method Summary
Modifier and Type, Method, Description
protected void buildXHTML(XHTMLContentHandler xhtml)
Populates the XHTMLContentHandler object received as parameter.
protected List<org.apache.poi.openxml4j.opc.PackagePart> getMainDocumentParts()
In PowerPoint files, slides have items embedded in them, as well as slide drawings that contain the images.
Methods inherited from class org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
getDocument, getEmbeddedPartMetadataMap, getJustFileName, getMetadataExtractor, getXHTML, handleEmbeddedFile, loadLinkedRelationships
-
Constructor Details
-
XSLFPowerPointExtractorDecorator
public XSLFPowerPointExtractorDecorator(Metadata metadata, ParseContext context, org.apache.poi.xslf.extractor.XSLFExtractor extractor)
-
-
Method Details
-
buildXHTML
Description copied from class: AbstractOOXMLExtractor
Populates the XHTMLContentHandler object received as parameter.
- Specified by:
buildXHTML in class AbstractOOXMLExtractor
- Throws:
SAXException
IOException
- See Also:
-
SlideShowExtractor.getText()
-
getMainDocumentParts
protected List<org.apache.poi.openxml4j.opc.PackagePart> getMainDocumentParts() throws TikaException
In PowerPoint files, slides have items embedded in them, as well as slide drawings that contain the images.
- Specified by:
getMainDocumentParts in class AbstractOOXMLExtractor
- Throws:
TikaException
-
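The description of getMainDocumentParts suggests that each slide contributes its own part plus the parts it points to, such as drawings that hold images. The following is a minimal sketch of that collection pattern only, not the Tika implementation: the Part class, the collectMainParts method, and the part names are hypothetical stand-ins, and the real method returns org.apache.poi.openxml4j.opc.PackagePart objects resolved through POI.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

final class MainDocumentPartsSketch {

    /** Hypothetical stand-in for a package part; this is NOT POI's PackagePart. */
    static final class Part {
        final String name;
        final List<Part> related;

        Part(String name, Part... related) {
            this.name = name;
            this.related = Arrays.asList(related);
        }
    }

    /**
     * Assumed behaviour: list each slide, immediately followed by the
     * parts it references (for example, a drawing that holds images).
     */
    static List<String> collectMainParts(List<Part> slides) {
        List<String> names = new ArrayList<>();
        for (Part slide : slides) {
            names.add(slide.name);
            for (Part rel : slide.related) {
                names.add(rel.name);
            }
        }
        return names;
    }

    public static void main(String[] args) {
        // Illustrative part names only.
        Part drawing = new Part("/ppt/drawings/vmlDrawing1.vml");
        Part slide1 = new Part("/ppt/slides/slide1.xml", drawing);
        Part slide2 = new Part("/ppt/slides/slide2.xml");
        System.out.println(collectMainParts(Arrays.asList(slide1, slide2)));
    }
}
```

Returning the slide and its related parts in one flat list means a caller can walk every part that may carry embedded content in a single loop, without knowing how the parts relate to each other.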