Class PDFParser

  • All Implemented Interfaces:
    com.logicaldoc.core.parser.Parser

    public class PDFParser
    extends com.logicaldoc.core.parser.PDFParser
    Extension of standard PDF parser that also uses OCR to read embedded raster images.
    Since:
    2.0.0
    Author:
    Marco Meschieri - LogicalDOC
    • Constructor Detail

      • PDFParser

        public PDFParser()
    • Method Detail

      • internalParse

        public void internalParse​(InputStream input,
                                  String filename,
                                  String encoding,
                                  Locale locale,
                                  String tenant,
                                  com.logicaldoc.core.document.Document document,
                                  String fileVersion,
                                  StringBuilder output)
                           throws com.logicaldoc.core.parser.ParseException
        Overrides:
        internalParse in class com.logicaldoc.core.parser.PDFParser
        Throws:
        com.logicaldoc.core.parser.ParseException