[
Lists Home |
Date Index |
Thread Index
]
In 2.4, the holy script says:
"Text consists of intermingled character data and markup. Markup takes the
form of start-tags, end-tags, empty-element tags, entity references,
character references, comments, CDATA section delimiters, document type
declarations, processing instructions, XML declarations, text declarations,
and any white space that is at the top level of the document entity (that
is, outside the document element and not inside any other markup)."
Then it says:
"All text that is not markup constitutes the character data of the document."
As an exercise, I tried to make a similar list for character data. Here's
what I came up with:
- Element content
- Attribut values
- CDATA content
- Whitespace within the root element
Did I miss anything?
I wonder about entities. Is the literal entity value character data too? I
guess not, because all entity declarations ought to be considered part of
the document type declaration mentioned in 2.4.
A sentence like "everything outside the document element is considered part
of the markup" would have been helpful, assuming this is correct of course.
Gustaf
|