Package com.logicaldoc.core.parser
Class PDFParser
- java.lang.Object
-
- com.logicaldoc.core.parser.AbstractParser
-
- com.logicaldoc.core.parser.PDFParser
-
- All Implemented Interfaces:
Parser
public class PDFParser extends AbstractParser
Text extractor for Portable Document Format (PDF). For parsing uses an external library: PDFBox. Created on 4. November 2003, 18:09- Since:
- 3.5
- Author:
- Michael Scholz, Alessandro Gasparini - LogicalDOC
-
-
Constructor Summary
Constructors Constructor Description PDFParser()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description int
countPages(File file, String filename)
Same as the otherParser.countPages(InputStream, String)
, but use this when you have a file rather than a stream.int
countPages(InputStream input, String filename)
Counts the number of pages of the given binary document.void
internalParse(InputStream input, String filename, String encoding, Locale locale, String tenant, Document document, String fileVersion, StringBuilder content)
-
Methods inherited from class com.logicaldoc.core.parser.AbstractParser
parse, parse, parse, parse
-
-
-
-
Method Detail
-
internalParse
public void internalParse(InputStream input, String filename, String encoding, Locale locale, String tenant, Document document, String fileVersion, StringBuilder content) throws ParseException
- Throws:
ParseException
-
countPages
public int countPages(File file, String filename)
Description copied from interface:Parser
Same as the otherParser.countPages(InputStream, String)
, but use this when you have a file rather than a stream.- Specified by:
countPages
in interfaceParser
- Overrides:
countPages
in classAbstractParser
- Parameters:
file
- the filefilename
- name of the file- Returns:
- the number of pages
-
countPages
public int countPages(InputStream input, String filename)
Description copied from interface:Parser
Counts the number of pages of the given binary document.- Specified by:
countPages
in interfaceParser
- Overrides:
countPages
in classAbstractParser
- Parameters:
input
- binary content from which to extract the textfilename
- name of the file- Returns:
- the number of pages
-
-