public final class Word6Extractor extends java.lang.Object implements POIOLE2TextExtractor
WordExtractor which deals properly
with HWPF.| Constructor and Description |
|---|
Word6Extractor(DirectoryNode dir) |
Word6Extractor(DirectoryNode dir,
POIFSFileSystem fs)
Deprecated.
Use
Word6Extractor(DirectoryNode) instead |
Word6Extractor(HWPFOldDocument doc)
Create a new Word Extractor
|
Word6Extractor(java.io.InputStream is)
Create a new Word Extractor
|
Word6Extractor(POIFSFileSystem fs)
Create a new Word Extractor
|
| Modifier and Type | Method and Description |
|---|---|
HWPFOldDocument |
getDocument()
Return the underlying POIDocument
|
HWPFOldDocument |
getFilesystem() |
java.lang.String[] |
getParagraphText()
Deprecated.
|
java.lang.String |
getText()
Retrieves all the text from the document.
|
boolean |
isCloseFilesystem() |
void |
setCloseFilesystem(boolean doCloseFilesystem) |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitgetDocSummaryInformation, getMetadataTextExtractor, getRoot, getSummaryInformationclosepublic Word6Extractor(java.io.InputStream is)
throws java.io.IOException
is - InputStream containing the word filejava.io.IOExceptionpublic Word6Extractor(POIFSFileSystem fs) throws java.io.IOException
fs - POIFSFileSystem containing the word filejava.io.IOException@Deprecated public Word6Extractor(DirectoryNode dir, POIFSFileSystem fs) throws java.io.IOException
Word6Extractor(DirectoryNode) insteadjava.io.IOExceptionpublic Word6Extractor(DirectoryNode dir) throws java.io.IOException
java.io.IOExceptionpublic Word6Extractor(HWPFOldDocument doc)
doc - The HWPFOldDocument to extract from@Deprecated public java.lang.String[] getParagraphText()
public java.lang.String getText()
POITextExtractorgetText in interface POITextExtractorpublic HWPFOldDocument getDocument()
POIOLE2TextExtractorgetDocument in interface POIOLE2TextExtractorgetDocument in interface POITextExtractorpublic void setCloseFilesystem(boolean doCloseFilesystem)
setCloseFilesystem in interface POITextExtractordoCloseFilesystem - true (default), if underlying resources/filesystem should be
closed on POITextExtractor.close()public boolean isCloseFilesystem()
isCloseFilesystem in interface POITextExtractortrue, if resources/filesystem should be closed on POITextExtractor.close()public HWPFOldDocument getFilesystem()
getFilesystem in interface POITextExtractorCopyright 2021 The Apache Software Foundation or its licensors, as applicable.