Package com.logicaldoc.core.parser
Class PDFParser
- java.lang.Object
  - com.logicaldoc.core.parser.AbstractParser
    - com.logicaldoc.core.parser.PDFParser
- All Implemented Interfaces:
  Parser
public class PDFParser extends AbstractParser
Text extractor for Portable Document Format (PDF). Parsing is delegated to an external library, PDFBox. Created on 4 November 2003, 18:09.
- Since: 3.5
- Author: Michael Scholz, Alessandro Gasparini - LogicalDOC
Constructor Summary
- PDFParser()
Method Summary
- int countPages(File file, String filename)
  Same as the other Parser.countPages(InputStream, String), but use this when you have a file rather than a stream.
- int countPages(InputStream input, String filename)
  Counts the number of pages of the given binary document.
- void internalParse(InputStream input, String filename, String encoding, Locale locale, String tenant, Document document, String fileVersion, StringBuilder content)
Methods inherited from class com.logicaldoc.core.parser.AbstractParser
parse, parse, parse, parse
Method Detail
internalParse
public void internalParse(InputStream input, String filename, String encoding, Locale locale, String tenant, Document document, String fileVersion, StringBuilder content) throws ParseException
- Throws: ParseException
countPages
public int countPages(File file, String filename)
Description copied from interface: Parser
Same as the other Parser.countPages(InputStream, String), but use this when you have a file rather than a stream.
- Specified by: countPages in interface Parser
- Overrides: countPages in class AbstractParser
- Parameters:
  file - the file
  filename - name of the file
- Returns: the number of pages
countPages
public int countPages(InputStream input, String filename)
Description copied from interface: Parser
Counts the number of pages of the given binary document.
- Specified by: countPages in interface Parser
- Overrides: countPages in class AbstractParser
- Parameters:
  input - binary content from which to extract the text
  filename - name of the file
- Returns: the number of pages
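Example (illustrative only): the sketch below shows one way the two countPages overloads can relate, with the File variant opening a stream and delegating to the InputStream variant. It does not use LogicalDOC or PDFBox. PageCounter, FormFeedCounter, countFromStream and countFromFile are hypothetical names invented for this example, and counting form-feed characters is a toy stand-in for real PDF page counting. Because the documented signatures declare no checked exceptions, the sketch assumes I/O failures are rethrown unchecked; how PDFParser itself handles them is not stated here.

```java
import java.io.ByteArrayInputStream;
import java.io.File;
import java.io.FileInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;

public class CountPagesSketch {

    /** Hypothetical stand-in mirroring the two countPages signatures; NOT the LogicalDOC Parser interface. */
    interface PageCounter {
        int countPages(InputStream input, String filename);

        // File overload: open a stream on the file and delegate to the stream overload.
        default int countPages(File file, String filename) {
            try (InputStream in = new FileInputStream(file)) {
                return countPages(in, filename);
            } catch (IOException e) {
                // Assumption: no checked exception in the signature, so rethrow unchecked.
                throw new UncheckedIOException(e);
            }
        }
    }

    /** Toy counter: starts at one page and adds one per form feed (0x0C). Not a PDF parser. */
    static class FormFeedCounter implements PageCounter {
        @Override
        public int countPages(InputStream input, String filename) {
            int pages = 1;
            try {
                int b;
                while ((b = input.read()) != -1) {
                    if (b == 0x0C) {
                        pages++;
                    }
                }
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
            return pages;
        }
    }

    /** Counts pages when the content is already available as a stream. */
    static int countFromStream(String text) {
        InputStream in = new ByteArrayInputStream(text.getBytes(StandardCharsets.UTF_8));
        return new FormFeedCounter().countPages(in, "sample.txt");
    }

    /** Writes the text to a temporary file, then counts pages through the File overload. */
    static int countFromFile(String text) {
        try {
            File tmp = File.createTempFile("sample", ".txt");
            try {
                Files.write(tmp.toPath(), text.getBytes(StandardCharsets.UTF_8));
                return new FormFeedCounter().countPages(tmp, tmp.getName());
            } finally {
                tmp.delete();
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        String text = "page one\fpage two\fpage three";
        System.out.println(countFromStream(text)); // prints 3
        System.out.println(countFromFile(text));   // prints 3
    }
}
```

Both calls return the same count for the same content, which is the behaviour the description "same as the other, but use this when you have a file rather than a stream" suggests.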