Class HTMLSAXParser

  • All Implemented Interfaces:
    org.apache.xerces.xni.XMLDocumentHandler, org.apache.xerces.xni.XMLDTDContentModelHandler, org.apache.xerces.xni.XMLDTDHandler, org.apache.xerces.xs.PSVIProvider, Parser, XMLReader

    public class HTMLSAXParser
    extends org.apache.xerces.parsers.AbstractSAXParser
    Helper class for HTML parsing
    • Field Summary

      • Fields inherited from interface org.apache.xerces.xni.XMLDTDContentModelHandler

        OCCURS_ONE_OR_MORE, OCCURS_ZERO_OR_MORE, OCCURS_ZERO_OR_ONE, SEPARATOR_CHOICE, SEPARATOR_SEQUENCE
      • Fields inherited from interface org.apache.xerces.xni.XMLDTDHandler

        CONDITIONAL_IGNORE, CONDITIONAL_INCLUDE
    • Constructor Summary

      Constructors 
      Constructor Description
      HTMLSAXParser()  
      HTMLSAXParser​(org.apache.xerces.xni.parser.XMLParserConfiguration parserConfig)  
    • Method Summary

      All Methods Instance Methods Concrete Methods 
      Modifier and Type Method Description
      void characters​(org.apache.xerces.xni.XMLString xmlString, org.apache.xerces.xni.Augmentations augmentations)  
      String getContents()
      Returns parsed content
      void startDocument​(org.apache.xerces.xni.XMLLocator arg0, String arg1, org.apache.xerces.xni.NamespaceContext arg2, org.apache.xerces.xni.Augmentations arg3)  
      • Methods inherited from class org.apache.xerces.parsers.AbstractSAXParser

        attributeDecl, comment, doctypeDecl, elementDecl, endCDATA, endDocument, endDTD, endElement, endExternalSubset, endGeneralEntity, endParameterEntity, externalEntityDecl, getAttributePSVI, getAttributePSVIByName, getContentHandler, getDTDHandler, getElementPSVI, getEntityResolver, getErrorHandler, getFeature, getProperty, ignorableWhitespace, internalEntityDecl, notationDecl, parse, parse, processingInstruction, reset, setContentHandler, setDocumentHandler, setDTDHandler, setEntityResolver, setErrorHandler, setFeature, setLocale, setProperty, startCDATA, startElement, startExternalSubset, startGeneralEntity, startParameterEntity, unparsedEntityDecl, xmlDecl
      • Methods inherited from class org.apache.xerces.parsers.AbstractXMLDocumentParser

        any, element, empty, emptyElement, endAttlist, endConditional, endContentModel, endGroup, getDocumentSource, getDTDContentModelSource, getDTDSource, ignoredCharacters, occurrence, pcdata, separator, setDocumentSource, setDTDContentModelSource, setDTDSource, startAttlist, startConditional, startContentModel, startDTD, startGroup, textDecl
      • Methods inherited from class org.apache.xerces.parsers.XMLParser

        parse
    • Constructor Detail

      • HTMLSAXParser

        public HTMLSAXParser()
      • HTMLSAXParser

        public HTMLSAXParser​(org.apache.xerces.xni.parser.XMLParserConfiguration parserConfig)
    • Method Detail

      • startDocument

        public void startDocument​(org.apache.xerces.xni.XMLLocator arg0,
                                  String arg1,
                                  org.apache.xerces.xni.NamespaceContext arg2,
                                  org.apache.xerces.xni.Augmentations arg3)
                           throws org.apache.xerces.xni.XNIException
        Specified by:
        startDocument in interface org.apache.xerces.xni.XMLDocumentHandler
        Overrides:
        startDocument in class org.apache.xerces.parsers.AbstractSAXParser
        Throws:
        org.apache.xerces.xni.XNIException
      • characters

        public void characters​(org.apache.xerces.xni.XMLString xmlString,
                               org.apache.xerces.xni.Augmentations augmentations)
                        throws org.apache.xerces.xni.XNIException
        Specified by:
        characters in interface org.apache.xerces.xni.XMLDocumentHandler
        Overrides:
        characters in class org.apache.xerces.parsers.AbstractSAXParser
        Throws:
        org.apache.xerces.xni.XNIException
      • getContents

        public String getContents()
        Returns parsed content
        Returns:
        String Parsed content