class |
AbiWordParser |
Text extractor for AbiWord documents.
|
class |
AbstractParser |
Abstract implementation of a Parser.
|
class |
CatchAllParser |
Parser that tries to convert the document to PDF and then parse the result.
|
class |
DOCParser |
Parses an MS Word (*.doc, *.dot) file to extract the text it contains.
|
class |
DummyParser |
Parser that doesn't parse anything.
|
class |
EpubParser |
A specialized parser that extracts text from the .epub (e-book) format.
|
class |
HTMLParser |
Text extractor for HyperText Markup Language (HTML).
|
class |
KOfficeParser |
Text extractor for KOffice 1.6 documents.
|
class |
OpenOfficeParser |
Text extractor for OpenOffice/OpenDocument documents.
|
class |
PDFParser |
Text extractor for Portable Document Format (PDF).
|
class |
PPTParser |
Parser for Office 2003 presentations.
|
class |
RarParser |
Class for parsing rar files.
|
class |
RTFParser |
|
class |
SevenZipParser |
Class for parsing 7z files.
|
class |
TarParser |
Class for parsing tar files.
|
class |
TXTParser |
Class for parsing text (*.txt) files.
|
class |
XLSParser |
Parser for Office 2003 worksheets.
|
class |
XMLParser |
Text extractor for XML documents.
|
class |
ZABWParser |
Text extractor for AbiWord compressed documents.
|
class |
ZipParser |
Class for parsing zip files.
|