public interface ExtractorProvider
Modifier and Type | Method and Description |
---|---|
boolean |
accepts(FileMagic fm) |
POITextExtractor |
create(DirectoryNode poifsDir,
java.lang.String password)
Create Extractor from POIFS node
|
POITextExtractor |
create(java.io.File file,
java.lang.String password)
Create Extractor via file
|
POITextExtractor |
create(java.io.InputStream inputStream,
java.lang.String password)
Create Extractor via InputStream
|
default void |
identifyEmbeddedResources(POIOLE2TextExtractor ext,
java.util.List<Entry> dirs,
java.util.List<java.io.InputStream> nonPOIFS)
Returns an array of text extractors, one for each of
the embedded documents in the file (if there are any).
|
boolean accepts(FileMagic fm)
POITextExtractor create(java.io.File file, java.lang.String password) throws java.io.IOException
file
- the filepassword
- the password or null
if not encryptedjava.io.IOException
- if file can't be read or parsedPOITextExtractor create(java.io.InputStream inputStream, java.lang.String password) throws java.io.IOException
inputStream
- the streampassword
- the password or null
if not encryptedjava.io.IOException
- if stream can't be read or parsedPOITextExtractor create(DirectoryNode poifsDir, java.lang.String password) throws java.io.IOException
poifsDir
- the nodepassword
- the password or null
if not encryptedjava.io.IOException
- if node can't be parsedjava.lang.IllegalStateException
- if processing fails for some other reason,
e.g. missing JCE Unlimited Strength Jurisdiction Policy files
while handling encrypted files.default void identifyEmbeddedResources(POIOLE2TextExtractor ext, java.util.List<Entry> dirs, java.util.List<java.io.InputStream> nonPOIFS) throws java.io.IOException
POITextExtractor
for each embedded file.ext
- the extractor holding the directory to start parsingdirs
- a list to be filled with directory references holding embeddednonPOIFS
- a list to be filled with streams which aren't based on POIFS entriesjava.io.IOException
- when the format specific extraction fails because of invalid entiresCopyright 2022 The Apache Software Foundation or its licensors, as applicable.