public final class Word6Extractor extends POIOLE2TextExtractor
See WordExtractor, which deals properly with HWPF documents.
Constructor and Description |
---|
Word6Extractor(DirectoryNode dir) |
Word6Extractor(DirectoryNode dir, POIFSFileSystem fs): Deprecated. Use Word6Extractor(DirectoryNode) instead |
Word6Extractor(HWPFOldDocument doc): Create a new Word Extractor |
Word6Extractor(java.io.InputStream is): Create a new Word Extractor |
Word6Extractor(POIFSFileSystem fs): Create a new Word Extractor |
Modifier and Type | Method and Description |
---|---|
java.lang.String[] | getParagraphText(): Deprecated. |
java.lang.String | getText(): Retrieves all the text from the document. |
Methods inherited from class POIOLE2TextExtractor: getDocSummaryInformation, getDocument, getMetadataTextExtractor, getRoot, getSummaryInformation
Methods inherited from class POITextExtractor: close, setFilesystem
public Word6Extractor(java.io.InputStream is) throws java.io.IOException
Parameters:
is - InputStream containing the word file
Throws:
java.io.IOException
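As a minimal sketch of how the InputStream constructor might be used together with getText(): the example below assumes Apache POI (including the scratchpad/HWPF module) is on the classpath and that the class lives in the org.apache.poi.hwpf.extractor package. The helper class Word6StreamExample and the file name legacy.doc are hypothetical and not part of the library.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

import org.apache.poi.hwpf.extractor.Word6Extractor;

// Hypothetical helper class for illustration; not part of Apache POI.
final class Word6StreamExample {

    // Opens the file as a stream, wraps it in a Word6Extractor,
    // and returns all the text of the document.
    static String extractText(Path path) throws IOException {
        // try-with-resources closes the extractor and then the stream,
        // relying on the close() method inherited from the extractor's superclasses.
        try (InputStream is = Files.newInputStream(path);
             Word6Extractor extractor = new Word6Extractor(is)) {
            return extractor.getText();
        }
    }

    public static void main(String[] args) throws IOException {
        // "legacy.doc" is a placeholder; pass a real Word 6 / Word 95 file instead.
        Path path = Paths.get(args.length > 0 ? args[0] : "legacy.doc");
        System.out.println(extractText(path));
    }
}
```

Both checked exceptions in this sketch are java.io.IOException: one from opening the stream, one from the constructor itself, so a single throws clause covers them.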
public Word6Extractor(POIFSFileSystem fs) throws java.io.IOException
Parameters:
fs - POIFSFileSystem containing the word file
Throws:
java.io.IOException
@Deprecated
public Word6Extractor(DirectoryNode dir, POIFSFileSystem fs) throws java.io.IOException
Deprecated. Use Word6Extractor(DirectoryNode) instead
Throws:
java.io.IOException
public Word6Extractor(DirectoryNode dir) throws java.io.IOException
Throws:
java.io.IOException
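For code that still calls the deprecated two-argument constructor, a migration to the single-argument DirectoryNode form could look roughly like the sketch below. It assumes Apache POI is on the classpath, that POIFSFileSystem and DirectoryNode come from org.apache.poi.poifs.filesystem, and that the document sits at the root of the file system. Word6DirectoryExample is a hypothetical helper, not a POI class.

```java
import java.io.File;
import java.io.IOException;

import org.apache.poi.hwpf.extractor.Word6Extractor;
import org.apache.poi.poifs.filesystem.DirectoryNode;
import org.apache.poi.poifs.filesystem.POIFSFileSystem;

// Hypothetical helper class for illustration; not part of Apache POI.
final class Word6DirectoryExample {

    // Reads the OLE2 container from disk and extracts text from its root directory.
    static String extractText(File file) throws IOException {
        // The file system is closed by try-with-resources once the text is read.
        try (POIFSFileSystem fs = new POIFSFileSystem(file)) {
            DirectoryNode root = fs.getRoot();
            // Non-deprecated form: pass only the DirectoryNode,
            // instead of new Word6Extractor(root, fs).
            Word6Extractor extractor = new Word6Extractor(root);
            return extractor.getText();
        }
    }
}
```

The text is returned before the file system is closed, so the caller never holds a reference to an extractor backed by a closed container.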
public Word6Extractor(HWPFOldDocument doc)
Parameters:
doc - The HWPFOldDocument to extract from

@Deprecated
public java.lang.String[] getParagraphText()
Deprecated.
public java.lang.String getText()
Description copied from class: POITextExtractor
Retrieves all the text from the document.
Specified by:
getText in class POITextExtractor
Copyright 2020 The Apache Software Foundation or its licensors, as applicable.