public interface ExtractorProvider
| Modifier and Type | Method and Description |
|---|---|
boolean |
accepts(FileMagic fm) |
POITextExtractor |
create(DirectoryNode poifsDir,
java.lang.String password)
Create Extractor from POIFS node
|
POITextExtractor |
create(java.io.File file,
java.lang.String password)
Create Extractor via file
|
POITextExtractor |
create(java.io.InputStream inputStream,
java.lang.String password)
Create Extractor via InputStream
|
default void |
identifyEmbeddedResources(POIOLE2TextExtractor ext,
java.util.List<Entry> dirs,
java.util.List<java.io.InputStream> nonPOIFS)
Returns an array of text extractors, one for each of
the embedded documents in the file (if there are any).
|
boolean accepts(FileMagic fm)
POITextExtractor create(java.io.File file, java.lang.String password) throws java.io.IOException
file - the filepassword - the password or null if not encryptedjava.io.IOException - if file can't be read or parsedPOITextExtractor create(java.io.InputStream inputStream, java.lang.String password) throws java.io.IOException
inputStream - the streampassword - the password or null if not encryptedjava.io.IOException - if stream can't be read or parsedPOITextExtractor create(DirectoryNode poifsDir, java.lang.String password) throws java.io.IOException
poifsDir - the nodepassword - the password or null if not encryptedjava.io.IOException - if node can't be parsedjava.lang.IllegalStateException - if processing fails for some other reason,
e.g. missing JCE Unlimited Strength Jurisdiction Policy files
while handling encrypted files.default void identifyEmbeddedResources(POIOLE2TextExtractor ext, java.util.List<Entry> dirs, java.util.List<java.io.InputStream> nonPOIFS) throws java.io.IOException
POITextExtractor for each embedded file.ext - the extractor holding the directory to start parsingdirs - a list to be filled with directory references holding embeddednonPOIFS - a list to be filled with streams which aren't based on POIFS entriesjava.io.IOException - when the format specific extraction fails because of invalid entiresCopyright 2022 The Apache Software Foundation or its licensors, as applicable.