next up previous
Next: Text objects: HTML elements Up: From structured text to Previous: HTML tokens

Analysis: Parsing HTML token streams

Parser: an algorithm (or a programme based on an algorithm)

The parser defines an interpretation function which maps the token stream into an HTML document structure:

PARSER: TOKENSTREAM tex2html_wrap_inline1148 DOCUMENT_STRUCTURE

For the enthusiasts ...
In general, HTML can be defined formally as a context-free (Chomsky Type 2) language, and a TEXT_STRUCTURE can be represented as a tree.



© Dafydd Gibbon Mon Jul 13 18:34:24 MET DST 1998