public class EventBasedExcelExtractor extends POIOLE2TextExtractor implements ExcelExtractor
ExcelExtractor
, but may not provide
the same richness of formatting.
Returns the textual content of the file, suitable for
indexing by something like Lucene, but not really
intended for display to the user.
To turn an excel file into a CSV or similar, then see the XLS2CSVmra example
document
Constructor and Description |
---|
EventBasedExcelExtractor(DirectoryNode dir) |
EventBasedExcelExtractor(POIFSFileSystem fs) |
Modifier and Type | Method and Description |
---|---|
DocumentSummaryInformation |
getDocSummaryInformation()
Would return the document information metadata for the document,
if we supported it
|
SummaryInformation |
getSummaryInformation()
Would return the summary information metadata for the document,
if we supported it
|
java.lang.String |
getText()
Retreives the text contents of the file
|
void |
setFormulasNotResults(boolean formulasNotResults)
Should we return the formula itself, and not
the result it produces? Default is false
|
void |
setIncludeCellComments(boolean includeComments)
Would control the inclusion of cell comments from the document,
if we supported it
|
void |
setIncludeHeadersFooters(boolean includeHeadersFooters)
Would control the inclusion of headers and footers from the document,
if we supported it
|
void |
setIncludeSheetNames(boolean includeSheetNames)
Should sheet names be included? Default is true
|
getDocument, getMetadataTextExtractor, getRoot
close, setFilesystem
public EventBasedExcelExtractor(DirectoryNode dir)
public EventBasedExcelExtractor(POIFSFileSystem fs)
public DocumentSummaryInformation getDocSummaryInformation()
getDocSummaryInformation
in class POIOLE2TextExtractor
public SummaryInformation getSummaryInformation()
getSummaryInformation
in class POIOLE2TextExtractor
public void setIncludeCellComments(boolean includeComments)
setIncludeCellComments
in interface ExcelExtractor
includeComments
- true
if cell comments should be includedpublic void setIncludeHeadersFooters(boolean includeHeadersFooters)
setIncludeHeadersFooters
in interface ExcelExtractor
includeHeadersFooters
- true
if headers and footers should be includedpublic void setIncludeSheetNames(boolean includeSheetNames)
setIncludeSheetNames
in interface ExcelExtractor
includeSheetNames
- true
if the sheet names should be includedpublic void setFormulasNotResults(boolean formulasNotResults)
setFormulasNotResults
in interface ExcelExtractor
formulasNotResults
- true
if the formula itself is returnedpublic java.lang.String getText()
getText
in interface ExcelExtractor
getText
in class POITextExtractor
Copyright 2020 The Apache Software Foundation or its licensors, as applicable.