public abstract class POITextExtractor
extends java.lang.Object
implements java.io.Closeable
ExcelExtractor
,
PowerPointExtractor
,
VisioTextExtractor
,
WordExtractor
Constructor and Description |
---|
POITextExtractor() |
Modifier and Type | Method and Description |
---|---|
void |
close()
Allows to free resources of the Extractor as soon as
it is not needed any more.
|
abstract java.lang.Object |
getDocument() |
abstract POITextExtractor |
getMetadataTextExtractor()
Returns another text extractor, which is able to
output the textual content of the document
metadata / properties, such as author and title.
|
abstract java.lang.String |
getText()
Retrieves all the text from the document.
|
void |
setFilesystem(java.io.Closeable fs)
Used to ensure file handle cleanup.
|
public abstract java.lang.String getText()
public abstract POITextExtractor getMetadataTextExtractor()
public void setFilesystem(java.io.Closeable fs)
fs
- filesystem to closepublic void close() throws java.io.IOException
close
in interface java.io.Closeable
close
in interface java.lang.AutoCloseable
java.io.IOException
public abstract java.lang.Object getDocument()
Copyright 2020 The Apache Software Foundation or its licensors, as applicable.