public abstract class AbstractOOXMLExtractor extends java.lang.Object implements OOXMLExtractor
buildXHTML(XHTMLContentHandler) that
populates the XHTMLContentHandler object received as parameter.| Constructor and Description |
|---|
AbstractOOXMLExtractor(ParseContext context,
POIXMLTextExtractor extractor) |
| Modifier and Type | Method and Description |
|---|---|
POIXMLDocument |
getDocument()
Returns the opened document.
|
MetadataExtractor |
getMetadataExtractor()
org.apache.poi.ooxml.extractor.POIXMLTextExtractor#getMetadataTextExtractor() not yet supported
for OOXML by POI. |
void |
getXHTML(org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
Parses the document into a sequence of XHTML SAX events sent to the
given content handler.
|
public AbstractOOXMLExtractor(ParseContext context, POIXMLTextExtractor extractor)
public POIXMLDocument getDocument()
OOXMLExtractorgetDocument in interface OOXMLExtractorOOXMLExtractor.getDocument()public MetadataExtractor getMetadataExtractor()
OOXMLExtractororg.apache.poi.ooxml.extractor.POIXMLTextExtractor#getMetadataTextExtractor() not yet supported
for OOXML by POI.getMetadataExtractor in interface OOXMLExtractorOOXMLExtractor.getMetadataExtractor()public void getXHTML(org.xml.sax.ContentHandler handler,
Metadata metadata,
ParseContext context)
throws org.xml.sax.SAXException,
XmlException,
java.io.IOException,
TikaException
OOXMLExtractorgetXHTML in interface OOXMLExtractororg.xml.sax.SAXExceptionXmlExceptionjava.io.IOExceptionTikaExceptionOOXMLExtractor.getXHTML(ContentHandler, Metadata, ParseContext)"Copyright © 2010 - 2020 Adobe Systems Incorporated. All Rights Reserved"