public class OLE2ScratchpadExtractorFactory
extends java.lang.Object
OLE2ExtractorFactory
and
org.apache.poi.extractor.ExtractorFactory
, which permit the other two to run with
no Scratchpad jar (though without functionality!)
Note - should not be used standalone, always use via the other two classes
Constructor and Description |
---|
OLE2ScratchpadExtractorFactory() |
Modifier and Type | Method and Description |
---|---|
static POITextExtractor |
createExtractor(DirectoryNode poifsDir)
Look for certain entries in the stream, to figure it
out what format is desired
Note - doesn't check for core-supported formats!
Note - doesn't check for OOXML-supported formats
|
static void |
identifyEmbeddedResources(POIOLE2TextExtractor ext,
java.util.List<Entry> dirs,
java.util.List<java.io.InputStream> nonPOIFS)
Returns an array of text extractors, one for each of
the embedded documents in the file (if there are any).
|
public static POITextExtractor createExtractor(DirectoryNode poifsDir) throws java.io.IOException
poifsDir
- the directory node to be inspectedjava.io.IOException
- when the format specific extraction fails because of invalid entirespublic static void identifyEmbeddedResources(POIOLE2TextExtractor ext, java.util.List<Entry> dirs, java.util.List<java.io.InputStream> nonPOIFS) throws java.io.IOException
POITextExtractor
for each embedded file.ext
- the extractor holding the directory to start parsingdirs
- a list to be filled with directory references holding embeddednonPOIFS
- a list to be filled with streams which aren't based on POIFS entriesjava.io.IOException
- when the format specific extraction fails because of invalid entiresCopyright 2020 The Apache Software Foundation or its licensors, as applicable.