Package org.apache.poi.hwpf.converter
Class WordToTextConverter
- java.lang.Object
-
- org.apache.poi.hwpf.converter.AbstractWordConverter
-
- org.apache.poi.hwpf.converter.WordToTextConverter
-
@Beta public class WordToTextConverter extends AbstractWordConverter
-
-
Constructor Summary
Constructors Constructor Description WordToTextConverter()
Creates new instance ofWordToTextConverter
.WordToTextConverter(TextDocumentFacade textDocumentFacade)
WordToTextConverter(org.w3c.dom.Document document)
Creates new instance ofWordToTextConverter
.
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description org.w3c.dom.Document
getDocument()
java.lang.String
getText()
static java.lang.String
getText(java.io.File docFile)
static java.lang.String
getText(HWPFDocumentCore wordDocument)
static java.lang.String
getText(DirectoryNode root)
boolean
isOutputSummaryInformation()
static void
main(java.lang.String[] args)
Java main() interface to interact withWordToTextConverter
void
processDocumentPart(HWPFDocumentCore wordDocument, Range range)
void
setOutputSummaryInformation(boolean outputDocumentInformation)
-
Methods inherited from class org.apache.poi.hwpf.converter.AbstractWordConverter
getFontReplacer, getPicturesManager, processDocument, setFontReplacer, setPicturesManager
-
-
-
-
Constructor Detail
-
WordToTextConverter
public WordToTextConverter() throws javax.xml.parsers.ParserConfigurationException
Creates new instance ofWordToTextConverter
. Can be used for output severalHWPFDocument
s into single text document.- Throws:
javax.xml.parsers.ParserConfigurationException
- if an internalDocumentBuilder
cannot be created
-
WordToTextConverter
public WordToTextConverter(org.w3c.dom.Document document)
Creates new instance ofWordToTextConverter
. Can be used for output severalHWPFDocument
s into single text document.- Parameters:
document
- XML DOM Document used as storage for text pieces
-
WordToTextConverter
public WordToTextConverter(TextDocumentFacade textDocumentFacade)
-
-
Method Detail
-
getText
public static java.lang.String getText(DirectoryNode root) throws java.lang.Exception
- Throws:
java.lang.Exception
-
getText
public static java.lang.String getText(java.io.File docFile) throws java.lang.Exception
- Throws:
java.lang.Exception
-
getText
public static java.lang.String getText(HWPFDocumentCore wordDocument) throws java.lang.Exception
- Throws:
java.lang.Exception
-
main
public static void main(java.lang.String[] args) throws java.lang.Exception
Java main() interface to interact withWordToTextConverter
Usage: WordToTextConverter infile outfile
Where infile is an input .doc file ( Word 95-2007) which will be rendered as plain text into outfile- Throws:
java.lang.Exception
-
getDocument
public org.w3c.dom.Document getDocument()
- Specified by:
getDocument
in classAbstractWordConverter
-
getText
public java.lang.String getText() throws java.lang.Exception
- Throws:
java.lang.Exception
-
isOutputSummaryInformation
public boolean isOutputSummaryInformation()
-
processDocumentPart
public void processDocumentPart(HWPFDocumentCore wordDocument, Range range)
-
setOutputSummaryInformation
public void setOutputSummaryInformation(boolean outputDocumentInformation)
-
-