| DXFParser |
This parser reads Autocad DXF files (supported versions are from r12 to
2007).
|
| EbookParser |
Extracts the texts from ebooks in the format azw, azw3 and mobi
|
| EMLParser |
Parses Mozilla .eml files
|
| ImageParser |
This parser reads images (supported formats are: jpg, jfif, bmp, jpeg, wbmp,
png, gif) and extracts text using OCR.
|
| MsgParser |
Text extractor for Microsoft Outlook messages.
|
| OCRUtil |
Utility class for OCR interaction.
|
| OpenXMLParser |
This parser is able to extract text from Office documents
|
| P7MParser |
Parses the signed files .p7m
|
| ParserPlugin |
Provides some initializations needed by parsers
|
| PDFParser |
Extension of standard PDF parser that also uses OCR to read embedded raster
images.
|