DXFParser |
This parser reads Autocad DXF files (supported versions are from r12 to
2007).
|
EbookParser |
Extracts the texts from ebooks in the format azw, azw3 and mobi
|
EMLParser |
Parses Mozilla .eml files
|
ImageParser |
This parser reads images (supported formats are: jpg, jfif, bmp, jpeg, wbmp, png,
gif) and extracts text using OCR.
|
MsgParser |
Text extractor for Microsoft Outlook messages.
|
OCRUtil |
Utility class for OCR interaction.
|
OpenXMLParser |
This parser is able to extract text from Office documents
|
P7MParser |
Parses the signed files .p7m
|
ParserPlugin |
Provides some initializations needed by parsers
|
PDFParser |
Extension of standard PDF parser that also uses OCR to read embedded raster
images.
|