public final class Word6Extractor extends java.lang.Object implements POIOLE2TextExtractor
WordExtractor
which deals properly
with HWPF.Constructor and Description |
---|
Word6Extractor(DirectoryNode dir) |
Word6Extractor(DirectoryNode dir,
POIFSFileSystem fs)
Deprecated.
Use
Word6Extractor(DirectoryNode) instead |
Word6Extractor(HWPFOldDocument doc)
Create a new Word Extractor
|
Word6Extractor(java.io.InputStream is)
Create a new Word Extractor
|
Word6Extractor(POIFSFileSystem fs)
Create a new Word Extractor
|
Modifier and Type | Method and Description |
---|---|
HWPFOldDocument |
getDocument()
Return the underlying POIDocument
|
HWPFOldDocument |
getFilesystem() |
java.lang.String[] |
getParagraphText()
Deprecated.
|
java.lang.String |
getText()
Retrieves all the text from the document.
|
boolean |
isCloseFilesystem() |
void |
setCloseFilesystem(boolean doCloseFilesystem) |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
getDocSummaryInformation, getMetadataTextExtractor, getRoot, getSummaryInformation
close
public Word6Extractor(java.io.InputStream is) throws java.io.IOException
is
- InputStream containing the word filejava.io.IOException
public Word6Extractor(POIFSFileSystem fs) throws java.io.IOException
fs
- POIFSFileSystem containing the word filejava.io.IOException
@Deprecated public Word6Extractor(DirectoryNode dir, POIFSFileSystem fs) throws java.io.IOException
Word6Extractor(DirectoryNode)
insteadjava.io.IOException
public Word6Extractor(DirectoryNode dir) throws java.io.IOException
java.io.IOException
public Word6Extractor(HWPFOldDocument doc)
doc
- The HWPFOldDocument to extract from@Deprecated public java.lang.String[] getParagraphText()
public java.lang.String getText()
POITextExtractor
getText
in interface POITextExtractor
public HWPFOldDocument getDocument()
POIOLE2TextExtractor
getDocument
in interface POIOLE2TextExtractor
getDocument
in interface POITextExtractor
public void setCloseFilesystem(boolean doCloseFilesystem)
setCloseFilesystem
in interface POITextExtractor
doCloseFilesystem
- true
(default), if underlying resources/filesystem should be
closed on POITextExtractor.close()
public boolean isCloseFilesystem()
isCloseFilesystem
in interface POITextExtractor
true
, if resources/filesystem should be closed on POITextExtractor.close()
public HWPFOldDocument getFilesystem()
getFilesystem
in interface POITextExtractor
Copyright 2022 The Apache Software Foundation or its licensors, as applicable.