Class XPSTextExtractor
- java.lang.Object
  - org.apache.poi.extractor.POITextExtractor
    - org.apache.poi.ooxml.extractor.POIXMLTextExtractor
      - org.apache.tika.parser.microsoft.ooxml.xps.XPSTextExtractor

All Implemented Interfaces:
java.io.Closeable, java.lang.AutoCloseable
public class XPSTextExtractor extends POIXMLTextExtractor
Currently mostly a pass-through class that holds the OPCPackage (pkg) and its document properties, keeping the general framework similar to our other POI-integrated extractors.
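Because the class implements java.io.Closeable, an instance can be managed with try-with-resources. The following is a minimal sketch of that pattern only. StandInExtractor is a hypothetical stand-in (not a Tika or POI class), used so that the example compiles without those libraries on the classpath; with the real class, the resource would be created through the XPSTextExtractor(OPCPackage pkg) constructor instead.

```java
import java.io.Closeable;

final class TryWithResourcesSketch {

    /** Hypothetical stand-in for an extractor; it only mimics the Closeable contract. */
    static final class StandInExtractor implements Closeable {
        private boolean closed;

        String getText() {
            if (closed) {
                throw new IllegalStateException("extractor is closed");
            }
            return "sample text";
        }

        @Override
        public void close() {
            closed = true;
        }

        boolean isClosed() {
            return closed;
        }
    }

    /** Reads the text, then lets try-with-resources call close() on the extractor. */
    static String readAndClose(StandInExtractor extractor) {
        try (StandInExtractor e = extractor) {
            return e.getText();
        }
    }
}
```

The text is read inside the try block, and close() runs when the block exits, whether normally or through an exception.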
Constructor Summary
- XPSTextExtractor(OPCPackage pkg)
Method Summary
- POIXMLProperties.CoreProperties getCoreProperties(): Returns the core document properties.
- POIXMLProperties.CustomProperties getCustomProperties(): Returns the custom document properties.
- POIXMLProperties.ExtendedProperties getExtendedProperties(): Returns the extended document properties.
- OPCPackage getPackage(): Returns the opened OPCPackage that contains the document.
- java.lang.String getText(): Retrieves all the text from the document.
Methods inherited from class org.apache.poi.ooxml.extractor.POIXMLTextExtractor
close, getDocument, getMetadataTextExtractor
Methods inherited from class org.apache.poi.extractor.POITextExtractor
setFilesystem
Constructor Detail
XPSTextExtractor
public XPSTextExtractor(OPCPackage pkg) throws OpenXML4JException, XmlException, java.io.IOException
Throws:
- OpenXML4JException
- XmlException
- java.io.IOException
Method Detail
getPackage
public OPCPackage getPackage()
Description copied from class: POIXMLTextExtractor
Returns the opened OPCPackage that contains the document.
Overrides:
- getPackage in class POIXMLTextExtractor
Returns:
- the opened OPCPackage
getText
public java.lang.String getText()
Description copied from class: POITextExtractor
Retrieves all the text from the document. How cells, paragraphs, etc. are separated in the text is implementation specific; see the javadocs for a specific project for details.
Specified by:
- getText in class POITextExtractor
Returns:
- All the text from the document
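To make "all the text from the document" concrete for XPS, here is a rough, JDK-only sketch. It is not Tika's implementation and does not use POI. It assumes only what the XPS format itself provides: the file is a ZIP-based package whose fixed pages are stored as .fpage parts, with text runs carried in the UnicodeString attribute of Glyphs elements. The class and method names (XpsTextSketch, extractText) are hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import javax.xml.XMLConstants;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.SAXException;

/** Hypothetical helper for illustration; not part of Tika or POI. */
final class XpsTextSketch {

    /** Collects the UnicodeString of every Glyphs element, one run per line. */
    static String extractText(InputStream xps)
            throws IOException, ParserConfigurationException, SAXException {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setFeature(XMLConstants.FEATURE_SECURE_PROCESSING, true);
        DocumentBuilder builder = factory.newDocumentBuilder();

        StringBuilder text = new StringBuilder();
        try (ZipInputStream zip = new ZipInputStream(xps)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                if (!entry.getName().endsWith(".fpage")) {
                    continue; // only fixed-page parts carry Glyphs markup
                }
                // Buffer the part so the XML parser cannot close the ZIP stream.
                byte[] page = zip.readAllBytes();
                NodeList glyphs = builder.parse(new ByteArrayInputStream(page))
                        .getElementsByTagName("Glyphs");
                for (int i = 0; i < glyphs.getLength(); i++) {
                    String run = ((Element) glyphs.item(i)).getAttribute("UnicodeString");
                    if (!run.isEmpty()) {
                        text.append(run).append('\n');
                    }
                }
            }
        }
        return text.toString();
    }
}
```

The sketch visits pages in ZIP entry order and ignores page sequencing and layout, so it should be read as an illustration of where the text lives rather than as a substitute for getText().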
getCoreProperties
public POIXMLProperties.CoreProperties getCoreProperties()
Description copied from class: POIXMLTextExtractor
Returns the core document properties.
Overrides:
- getCoreProperties in class POIXMLTextExtractor
Returns:
- the core document properties
getExtendedProperties
public POIXMLProperties.ExtendedProperties getExtendedProperties()
Description copied from class: POIXMLTextExtractor
Returns the extended document properties.
Overrides:
- getExtendedProperties in class POIXMLTextExtractor
Returns:
- the extended document properties
getCustomProperties
public POIXMLProperties.CustomProperties getCustomProperties()
Description copied from class: POIXMLTextExtractor
Returns the custom document properties.
Overrides:
- getCustomProperties in class POIXMLTextExtractor
Returns:
- the custom document properties