Class XPSTextExtractor
- java.lang.Object
 - 
- org.apache.poi.extractor.POITextExtractor
 - 
- org.apache.poi.ooxml.extractor.POIXMLTextExtractor
 - 
- org.apache.tika.parser.microsoft.ooxml.xps.XPSTextExtractor
 
 
 
 
- 
- All Implemented Interfaces:
 java.io.Closeable,java.lang.AutoCloseable
public class XPSTextExtractor extends POIXMLTextExtractor
Currently, mostly a pass-through class to hold pkg and properties and keep the general framework similar to our other POI-integrated extractors. 
- 
- 
Constructor Summary
Constructors Constructor Description XPSTextExtractor(OPCPackage pkg) 
- 
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description POIXMLProperties.CorePropertiesgetCoreProperties()Returns the core document propertiesPOIXMLProperties.CustomPropertiesgetCustomProperties()Returns the custom document propertiesPOIXMLProperties.ExtendedPropertiesgetExtendedProperties()Returns the extended document propertiesOPCPackagegetPackage()Returns the opened OPCPackage that contains the documentjava.lang.StringgetText()Retrieves all the text from the document.- 
Methods inherited from class org.apache.poi.ooxml.extractor.POIXMLTextExtractor
close, getDocument, getMetadataTextExtractor 
- 
Methods inherited from class org.apache.poi.extractor.POITextExtractor
setFilesystem 
 - 
 
 - 
 
- 
- 
Constructor Detail
- 
XPSTextExtractor
public XPSTextExtractor(OPCPackage pkg) throws OpenXML4JException, XmlException, java.io.IOException
- Throws:
 OpenXML4JExceptionXmlExceptionjava.io.IOException
 
 - 
 
- 
Method Detail
- 
getPackage
public OPCPackage getPackage()
Description copied from class:POIXMLTextExtractorReturns the opened OPCPackage that contains the document- Overrides:
 getPackagein classPOIXMLTextExtractor- Returns:
 - the opened OPCPackage
 
 
- 
getText
public java.lang.String getText()
Description copied from class:POITextExtractorRetrieves all the text from the document. How cells, paragraphs etc are separated in the text is implementation specific - see the javadocs for a specific project for details.- Specified by:
 getTextin classPOITextExtractor- Returns:
 - All the text from the document
 
 
- 
getCoreProperties
public POIXMLProperties.CoreProperties getCoreProperties()
Description copied from class:POIXMLTextExtractorReturns the core document properties- Overrides:
 getCorePropertiesin classPOIXMLTextExtractor- Returns:
 - the core document properties
 
 
- 
getExtendedProperties
public POIXMLProperties.ExtendedProperties getExtendedProperties()
Description copied from class:POIXMLTextExtractorReturns the extended document properties- Overrides:
 getExtendedPropertiesin classPOIXMLTextExtractor- Returns:
 - the extended document properties
 
 
- 
getCustomProperties
public POIXMLProperties.CustomProperties getCustomProperties()
Description copied from class:POIXMLTextExtractorReturns the custom document properties- Overrides:
 getCustomPropertiesin classPOIXMLTextExtractor- Returns:
 - the custom document properties
 
 
 - 
 
 -