public abstract class AbstractOOXMLExtractor extends java.lang.Object implements OOXMLExtractor
buildXHTML(XHTMLContentHandler)
that
populates the XHTMLContentHandler
object received as parameter.Constructor and Description |
---|
AbstractOOXMLExtractor(ParseContext context,
POIXMLTextExtractor extractor) |
Modifier and Type | Method and Description |
---|---|
POIXMLDocument |
getDocument()
Returns the opened document.
|
MetadataExtractor |
getMetadataExtractor()
org.apache.poi.ooxml.extractor.POIXMLTextExtractor#getMetadataTextExtractor() not yet supported
for OOXML by POI. |
void |
getXHTML(org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
Parses the document into a sequence of XHTML SAX events sent to the
given content handler.
|
public AbstractOOXMLExtractor(ParseContext context, POIXMLTextExtractor extractor)
public POIXMLDocument getDocument()
OOXMLExtractor
getDocument
in interface OOXMLExtractor
OOXMLExtractor.getDocument()
public MetadataExtractor getMetadataExtractor()
OOXMLExtractor
org.apache.poi.ooxml.extractor.POIXMLTextExtractor#getMetadataTextExtractor()
not yet supported
for OOXML by POI.getMetadataExtractor
in interface OOXMLExtractor
OOXMLExtractor.getMetadataExtractor()
public void getXHTML(org.xml.sax.ContentHandler handler, Metadata metadata, ParseContext context) throws org.xml.sax.SAXException, XmlException, java.io.IOException, TikaException
OOXMLExtractor
getXHTML
in interface OOXMLExtractor
org.xml.sax.SAXException
XmlException
java.io.IOException
TikaException
OOXMLExtractor.getXHTML(ContentHandler, Metadata, ParseContext)
"Copyright © 2010 - 2020 Adobe Systems Incorporated. All Rights Reserved"