public final class Word6Extractor extends POIOLE2TextExtractor
WordExtractor which deals properly
with HWPF.document| Constructor and Description |
|---|
Word6Extractor(DirectoryNode dir) |
Word6Extractor(DirectoryNode dir,
POIFSFileSystem fs)
Deprecated.
Use
Word6Extractor(DirectoryNode) instead |
Word6Extractor(HWPFOldDocument doc)
Create a new Word Extractor
|
Word6Extractor(java.io.InputStream is)
Create a new Word Extractor
|
Word6Extractor(POIFSFileSystem fs)
Create a new Word Extractor
|
| Modifier and Type | Method and Description |
|---|---|
java.lang.String[] |
getParagraphText()
Deprecated.
|
java.lang.String |
getText()
Retrieves all the text from the document.
|
getDocSummaryInformation, getDocument, getMetadataTextExtractor, getRoot, getSummaryInformationclose, setFilesystempublic Word6Extractor(java.io.InputStream is)
throws java.io.IOException
is - InputStream containing the word filejava.io.IOExceptionpublic Word6Extractor(POIFSFileSystem fs) throws java.io.IOException
fs - POIFSFileSystem containing the word filejava.io.IOException@Deprecated public Word6Extractor(DirectoryNode dir, POIFSFileSystem fs) throws java.io.IOException
Word6Extractor(DirectoryNode) insteadjava.io.IOExceptionpublic Word6Extractor(DirectoryNode dir) throws java.io.IOException
java.io.IOExceptionpublic Word6Extractor(HWPFOldDocument doc)
doc - The HWPFOldDocument to extract from@Deprecated public java.lang.String[] getParagraphText()
public java.lang.String getText()
POITextExtractorgetText in class POITextExtractorCopyright 2018 The Apache Software Foundation or its licensors, as applicable.