Structured Markup Processing Tools¶
Python supports a variety of modules to work with various forms of structured data markup. This includes modules to work with the Standard Generalized Markup Language (SGML) and the Hypertext Markup Language (HTML), and several interfaces for working with the Extensible Markup Language (XML).
It is important to note that modules in the xml
package require that
there be at least one SAX-compliant XML parser available. The Expat parser is
included with Python, so the xml.parsers.expat
module will always be
available.
The documentation for the xml.dom
and xml.sax
packages are the
definition of the Python bindings for the DOM and SAX interfaces.
html.parser
— Simple HTML and XHTML parserhtml.entities
— Definitions of HTML general entitiesxml.parsers.expat
— Fast XML parsing using Expatxml.dom
— The Document Object Model APIxml.dom.minidom
— Lightweight DOM implementationxml.dom.pulldom
— Support for building partial DOM treesxml.sax
— Support for SAX2 parsersxml.sax.handler
— Base classes for SAX handlersxml.sax.saxutils
— SAX Utilitiesxml.sax.xmlreader
— Interface for XML parsersxml.etree.ElementTree
— The ElementTree XML API