|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object org.apache.poi.POITextExtractor org.apache.poi.POIOLE2TextExtractor
public abstract class POIOLE2TextExtractor
Common Parent for OLE2 based Text Extractors of POI Documents, such as .doc, .xls You will typically find the implementation of a given format's text extractor under org.apache.poi.[format].extractor .
ExcelExtractor
,
PowerPointExtractor
,
VisioTextExtractor
,
WordExtractor
Field Summary | |
---|---|
protected POIDocument |
document
The POIDocument that's open |
Constructor Summary | |
---|---|
|
POIOLE2TextExtractor(POIDocument document)
Creates a new text extractor for the given document |
protected |
POIOLE2TextExtractor(POIOLE2TextExtractor otherExtractor)
Creates a new text extractor, using the same document as another text extractor. |
Method Summary | |
---|---|
DocumentSummaryInformation |
getDocSummaryInformation()
Returns the document information metadata for the document |
POIDocument |
getDocument()
Return the underlying POIDocument |
POITextExtractor |
getMetadataTextExtractor()
Returns an HPSF powered text extractor for the document properties metadata, such as title and author. |
DirectoryEntry |
getRoot()
Return the underlying DirectoryEntry of this document. |
SummaryInformation |
getSummaryInformation()
Returns the summary information metadata for the document. |
Methods inherited from class org.apache.poi.POITextExtractor |
---|
close, getText, setFilesystem |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
protected POIDocument document
Constructor Detail |
---|
public POIOLE2TextExtractor(POIDocument document)
document
- The POIDocument to use in this extractor.protected POIOLE2TextExtractor(POIOLE2TextExtractor otherExtractor)
otherExtractor
- the extractor which document to be usedMethod Detail |
---|
public DocumentSummaryInformation getDocSummaryInformation()
public SummaryInformation getSummaryInformation()
public POITextExtractor getMetadataTextExtractor()
getMetadataTextExtractor
in class POITextExtractor
public DirectoryEntry getRoot()
public POIDocument getDocument()
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |