public abstract class POIOLE2TextExtractor extends POITextExtractor
Direct Known Subclasses: ExcelExtractor, PowerPointExtractor, VisioTextExtractor, WordExtractor
Modifier and Type | Field and Description |
---|---|
protected POIDocument | document: The POIDocument that is open. |
Modifier | Constructor and Description |
---|---|
public | POIOLE2TextExtractor(POIDocument document): Creates a new text extractor for the given document. |
protected | POIOLE2TextExtractor(POIOLE2TextExtractor otherExtractor): Creates a new text extractor, using the same document as another text extractor. |
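The relationship between the two constructors can be sketched with stand-in classes. This is a minimal model, not POI code: `Doc`, `BaseExtractor`, `BodyExtractor`, and `LengthExtractor` are hypothetical names that play the roles of `POIDocument`, `POIOLE2TextExtractor`, and two subclasses. It only aims to show that an extractor built from another extractor ends up holding the same document instance.

```java
// Hypothetical stand-ins: these are NOT the POI classes, only a model of the pattern.
class Doc {                        // plays the role of POIDocument
    final String body;
    Doc(String body) { this.body = body; }
}

abstract class BaseExtractor {     // plays the role of POIOLE2TextExtractor
    protected final Doc document;

    // Mirrors the public constructor: wrap the given document.
    BaseExtractor(Doc document) { this.document = document; }

    // Mirrors the protected constructor: reuse the document of another extractor.
    protected BaseExtractor(BaseExtractor other) { this.document = other.document; }

    Doc getDocument() { return document; }
    abstract String getText();
}

class BodyExtractor extends BaseExtractor {
    BodyExtractor(Doc document) { super(document); }
    @Override String getText() { return document.body; }
}

class LengthExtractor extends BaseExtractor {
    // Built from an existing extractor, so no second document is opened.
    LengthExtractor(BaseExtractor other) { super(other); }
    @Override String getText() { return "length = " + document.body.length(); }
}

class SharedDocumentDemo {
    public static void main(String[] args) {
        BaseExtractor body = new BodyExtractor(new Doc("Hello"));
        BaseExtractor length = new LengthExtractor(body);
        System.out.println(length.getText());                           // length = 5
        System.out.println(body.getDocument() == length.getDocument()); // true
    }
}
```

In this model the second extractor offers a different view of the same data without reopening or copying the document, which is presumably why the corresponding constructor is reserved for subclasses.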
Modifier and Type | Method and Description |
---|---|
DocumentSummaryInformation | getDocSummaryInformation(): Returns the document information metadata for the document. |
POIDocument | getDocument(): Returns the underlying POIDocument. |
POITextExtractor | getMetadataTextExtractor(): Returns an HPSF-powered text extractor for the document properties metadata, such as title and author. |
DirectoryEntry | getRoot(): Returns the underlying DirectoryEntry of this document. |
SummaryInformation | getSummaryInformation(): Returns the summary information metadata for the document. |
Methods inherited from class POITextExtractor: close, getText, setFilesystem
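How a caller might combine `getMetadataTextExtractor()` with `getText()` can be illustrated with a small self-contained model. Everything here is an assumption made for illustration: `PropsDoc`, `ModelExtractor`, `PropertiesExtractor`, `MainTextExtractor`, and the `name = value` output layout are hypothetical and are not taken from POI, and the sample title and author are made-up values.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical stand-ins that model the API shape; they are NOT the POI classes.
class PropsDoc {
    final String body;
    final Map<String, String> properties = new LinkedHashMap<>();
    PropsDoc(String body) { this.body = body; }
}

abstract class ModelExtractor {
    protected final PropsDoc document;

    ModelExtractor(PropsDoc document) { this.document = document; }
    protected ModelExtractor(ModelExtractor other) { this.document = other.document; }

    abstract String getText();

    // Models getMetadataTextExtractor(): a second extractor over the same document.
    ModelExtractor getMetadataTextExtractor() { return new PropertiesExtractor(this); }
}

class PropertiesExtractor extends ModelExtractor {
    PropertiesExtractor(ModelExtractor main) { super(main); }

    // Renders each property on its own line, in insertion order (assumed layout).
    @Override String getText() {
        StringBuilder sb = new StringBuilder();
        for (Map.Entry<String, String> e : document.properties.entrySet()) {
            sb.append(e.getKey()).append(" = ").append(e.getValue()).append('\n');
        }
        return sb.toString();
    }
}

class MainTextExtractor extends ModelExtractor {
    MainTextExtractor(PropsDoc document) { super(document); }
    @Override String getText() { return document.body; }
}

class MetadataDemo {
    // Metadata text first, then the main document text.
    static String describe(ModelExtractor extractor) {
        return extractor.getMetadataTextExtractor().getText() + extractor.getText();
    }

    public static void main(String[] args) {
        PropsDoc doc = new PropsDoc("Quarterly report body");
        doc.properties.put("Title", "Q3 Report");   // sample values only
        doc.properties.put("Author", "A. Writer");
        System.out.print(describe(new MainTextExtractor(doc)));
    }
}
```

The point of the sketch is the call pattern: the metadata extractor is obtained from the main extractor rather than constructed from a separately opened file, so both read from one underlying document.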
protected POIDocument document
public POIOLE2TextExtractor(POIDocument document)
Parameters: document - The POIDocument to use in this extractor.
protected POIOLE2TextExtractor(POIOLE2TextExtractor otherExtractor)
Parameters: otherExtractor - The extractor whose document is to be used.
public DocumentSummaryInformation getDocSummaryInformation()
public SummaryInformation getSummaryInformation()
public POITextExtractor getMetadataTextExtractor()
Specified by: getMetadataTextExtractor in class POITextExtractor
public DirectoryEntry getRoot()
public POIDocument getDocument()
Specified by: getDocument in class POITextExtractor
Copyright 2020 The Apache Software Foundation or its licensors, as applicable.