Class SimpleXMLParser

java.lang.Object
com.itextpdf.text.xml.simpleparser.SimpleXMLParser

public final class SimpleXMLParser extends Object
A simple XML. This parser is, like the SAX parser, an event based parser, but with much less functionality.

The parser can:

  • It recognizes the encoding used
  • It recognizes all the elements' start tags and end tags
  • It lists attributes, where attribute values can be enclosed in single or double quotes
  • It recognizes the <[CDATA[ ... ]]> construct
  • It recognizes the standard entities: &amp;, &lt;, &gt;, &quot;, and &apos;, as well as numeric entities
  • It maps lines ending in \r\n and \r to \n on input, in accordance with the XML Specification, Section 2.11

  • Method Details

    • parse

      public static void parse(SimpleXMLDocHandler doc, SimpleXMLDocHandlerComment comment, Reader r, boolean html) throws IOException
      Parses the XML document firing the events to the handler.
      Parameters:
      doc - the document handler
      comment - the comment handler
      r - the document. The encoding is already resolved. The reader is not closed
      html -
      Throws:
      IOException - on error
    • parse

      public static void parse(SimpleXMLDocHandler doc, InputStream in) throws IOException
      Parses the XML document firing the events to the handler.
      Parameters:
      doc - the document handler
      in - the document. The encoding is deduced from the stream. The stream is not closed
      Throws:
      IOException - on error
    • parse

      public static void parse(SimpleXMLDocHandler doc, Reader r) throws IOException
      Parameters:
      doc -
      r -
      Throws:
      IOException
    • escapeXML

      @Deprecated public static String escapeXML(String s, boolean onlyASCII)
      Deprecated.
      moved to XMLUtil.escapeXML(String, boolean), left here for the sake of backwards compatibility
      Escapes a string with the appropriated XML codes.
      Parameters:
      s - the string to be escaped
      onlyASCII - codes above 127 will always be escaped with &#nn; if true
      Returns:
      the escaped string