Package com.logicaldoc.ocr
Class OCR
- java.lang.Object
  - com.logicaldoc.ocr.OCR
- Direct Known Subclasses:
Advanced, OCRWebService, PowerPDF, Tesseract
public abstract class OCR extends Object
This OCR engine is capable of accurately recognizing characters (letters and numbers).
- Author:
- Alessandro Gasparini
-
Constructor Summary
Constructor | Description
OCR() |
-
Method Summary
Modifier and Type | Method | Description
void | extractPDFText(File pdffile, String lang, String tenant, StringBuilder buffer) | Extracts the text from a PDF file
void | extractText(File imgfile, String lang, String tenant, StringBuilder sb) |
String | getParameter(String name) |
List<String> | getParameterNames() |
Map<String,String> | getParameters() |
int | getResolutionThreshold(String tenant) |
boolean | isAvailable() |
static boolean | isWindows() |
void | loadParameters() |
-
Method Detail
-
loadParameters
public void loadParameters()
-
isAvailable
public boolean isAvailable()
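The page lists several engine subclasses and gives no description for this method. Assuming it reports whether an engine can be used on the current host, a caller might pick the first usable engine as sketched below. `AvailabilitySketchOcr` is a hypothetical stand-in that copies only the documented `isAvailable()` signature; `EngineSelectionSketch`, `firstAvailable` and `stub` are illustrative names, not part of the library.

```java
import java.util.List;
import java.util.Optional;

// Hypothetical stand-in exposing only the documented isAvailable() signature.
abstract class AvailabilitySketchOcr {
    public abstract boolean isAvailable();
}

class EngineSelectionSketch {
    // Returns the first engine that reports itself available, if any.
    static Optional<AvailabilitySketchOcr> firstAvailable(List<AvailabilitySketchOcr> engines) {
        return engines.stream().filter(AvailabilitySketchOcr::isAvailable).findFirst();
    }

    // Illustrative helper: builds an engine with a fixed availability answer.
    static AvailabilitySketchOcr stub(boolean available) {
        return new AvailabilitySketchOcr() {
            @Override
            public boolean isAvailable() {
                return available;
            }
        };
    }

    public static void main(String[] args) {
        AvailabilitySketchOcr missing = stub(false);
        AvailabilitySketchOcr installed = stub(true);
        // The unavailable engine is skipped and the second one is chosen.
        System.out.println(firstAvailable(List.of(missing, installed)).get() == installed);
    }
}
```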
-
extractPDFText
public void extractPDFText(File pdffile, String lang, String tenant, StringBuilder buffer) throws IOException
Extracts the text from a PDF file
- Parameters:
pdffile - the file to OCR
lang - the language in which the document is written
tenant - name of the tenant
buffer - the buffer to store the extracted text
- Throws:
IOException - in case of OCR error
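A minimal sketch of the calling pattern this signature implies: the caller supplies a `StringBuilder`, the method appends the recognized text to it, and failures surface as `IOException`. The real engine classes ship with LogicalDOC, so the example uses a hypothetical stand-in, `SketchOcr`, that copies only the documented signature. `FakeEngine`, its canned output, the file name, the language value "en" and the tenant name "default" are all assumptions made for illustration.

```java
import java.io.File;
import java.io.IOException;

// Hypothetical stand-in mirroring the documented extractPDFText signature.
abstract class SketchOcr {
    public abstract void extractPDFText(File pdffile, String lang, String tenant, StringBuilder buffer)
            throws IOException;
}

// Illustrative engine that appends canned text instead of running real OCR.
class FakeEngine extends SketchOcr {
    @Override
    public void extractPDFText(File pdffile, String lang, String tenant, StringBuilder buffer)
            throws IOException {
        if (pdffile == null) {
            throw new IOException("no file given");
        }
        buffer.append("text of ").append(pdffile.getName()).append(" [").append(lang).append("]");
    }
}

class ExtractPdfTextSketch {
    // Collects the engine's output in a fresh buffer and returns it as a String.
    static String ocrPdf(SketchOcr engine, File pdf, String lang, String tenant) throws IOException {
        StringBuilder buffer = new StringBuilder(); // the method writes its result here
        engine.extractPDFText(pdf, lang, tenant, buffer);
        return buffer.toString();
    }

    public static void main(String[] args) throws IOException {
        System.out.println(ocrPdf(new FakeEngine(), new File("scan.pdf"), "en", "default"));
    }
}
```

Passing the buffer in, rather than returning a `String`, lets a caller accumulate text from several files in one `StringBuilder` if desired.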
-
extractText
public void extractText(File imgfile, String lang, String tenant, StringBuilder sb) throws IOException
- Throws:
IOException
-
getResolutionThreshold
public int getResolutionThreshold(String tenant)
-
isWindows
public static boolean isWindows()
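The page does not say how this check is made. A common technique in Java is to inspect the `os.name` system property, and the sketch below illustrates that approach under this assumption. `OsCheckSketch` and `looksLikeWindows` are hypothetical names, and the real method may work differently.

```java
import java.util.Locale;

class OsCheckSketch {
    // Assumption: the platform is identified from an os.name-style string.
    static boolean looksLikeWindows(String osName) {
        return osName != null && osName.toLowerCase(Locale.ROOT).startsWith("windows");
    }

    // Applies the check to the running JVM's os.name property.
    static boolean isWindows() {
        return looksLikeWindows(System.getProperty("os.name"));
    }

    public static void main(String[] args) {
        System.out.println("Running on Windows: " + isWindows());
    }
}
```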
-