Package org.apache.poi.hwpf.converter
Class WordToTextConverter
- java.lang.Object
-
- org.apache.poi.hwpf.converter.AbstractWordConverter
-
- org.apache.poi.hwpf.converter.WordToTextConverter
-
@Beta public class WordToTextConverter extends AbstractWordConverter
-
-
Constructor Summary
Constructors Constructor Description WordToTextConverter()Creates new instance ofWordToTextConverter.WordToTextConverter(TextDocumentFacade textDocumentFacade)WordToTextConverter(org.w3c.dom.Document document)Creates new instance ofWordToTextConverter.
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description org.w3c.dom.DocumentgetDocument()java.lang.StringgetText()static java.lang.StringgetText(java.io.File docFile)static java.lang.StringgetText(HWPFDocumentCore wordDocument)static java.lang.StringgetText(DirectoryNode root)booleanisOutputSummaryInformation()static voidmain(java.lang.String[] args)Java main() interface to interact withWordToTextConvertervoidprocessDocumentPart(HWPFDocumentCore wordDocument, Range range)voidsetOutputSummaryInformation(boolean outputDocumentInformation)-
Methods inherited from class org.apache.poi.hwpf.converter.AbstractWordConverter
getFontReplacer, getPicturesManager, processDocument, setFontReplacer, setPicturesManager
-
-
-
-
Constructor Detail
-
WordToTextConverter
public WordToTextConverter() throws javax.xml.parsers.ParserConfigurationExceptionCreates new instance ofWordToTextConverter. Can be used for output severalHWPFDocuments into single text document.- Throws:
javax.xml.parsers.ParserConfigurationException- if an internalDocumentBuildercannot be created
-
WordToTextConverter
public WordToTextConverter(org.w3c.dom.Document document)
Creates new instance ofWordToTextConverter. Can be used for output severalHWPFDocuments into single text document.- Parameters:
document- XML DOM Document used as storage for text pieces
-
WordToTextConverter
public WordToTextConverter(TextDocumentFacade textDocumentFacade)
-
-
Method Detail
-
getText
public static java.lang.String getText(DirectoryNode root) throws java.lang.Exception
- Throws:
java.lang.Exception
-
getText
public static java.lang.String getText(java.io.File docFile) throws java.lang.Exception- Throws:
java.lang.Exception
-
getText
public static java.lang.String getText(HWPFDocumentCore wordDocument) throws java.lang.Exception
- Throws:
java.lang.Exception
-
main
public static void main(java.lang.String[] args) throws java.lang.ExceptionJava main() interface to interact withWordToTextConverterUsage: WordToTextConverter infile outfile
Where infile is an input .doc file ( Word 95-2007) which will be rendered as plain text into outfile- Throws:
java.lang.Exception
-
getDocument
public org.w3c.dom.Document getDocument()
- Specified by:
getDocumentin classAbstractWordConverter
-
getText
public java.lang.String getText() throws java.lang.Exception- Throws:
java.lang.Exception
-
isOutputSummaryInformation
public boolean isOutputSummaryInformation()
-
processDocumentPart
public void processDocumentPart(HWPFDocumentCore wordDocument, Range range)
-
setOutputSummaryInformation
public void setOutputSummaryInformation(boolean outputDocumentInformation)
-
-