OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help



   RE: Why no ']]>' in CharData?

[ Lists Home | Date Index | Thread Index ]
  • From: Tony Graham <tgraham@mulberrytech.com>
  • To: "Don Park" <donpark@docuverse.com>
  • Date: Sun, 28 Nov 1999 05:30:08 -0400 (EST)

At 28 Nov 1999 00:41 -0800, Don Park wrote:
 > >Even if you disallow existing encodings so that SML allows only UTF-8
 > >and/or UTF-16, you should still allow numeric character references so
 > >that you can represent any Unicode character even if you're creating
 > >SML data using Windows Notepad.
 > >
 > >If you consider that people won't create SML data and that machines
 > >will, then disallow numeric character references.
 > Tony, numeric character reference is still in.  I was just saying that
 > predefined character references like &lt;, &gt;, and &amp; are there
 > for convenience only since numeric character references are there.

You did say they were in, but John Cowan suggested they should be out.

 > Not too many folks noticed it but I removed the hex numerical character
 > references in the preliminary EBNF.  I prefer hex but didn't like the
 > extra 'x' to get there.  I can go with either hex or decimal but not
 > both.

Take out the decimal and put back the hexadecimal.  I nearly said as
much in my previous post but I thought it was too obvious to mention.

Just about everything about Unicode characters -- from how the Unicode
Standard refers to them to how most computer languages represent them
-- is in terms of hexadecimal code values.  In my experience, even if
the encoding isn't Unicode, most specifications of character sets and
most ways of working with code values use hexadecimal character

The only application that I can think of that absolutely requires
decimal code values instead of hexadecimal is the SGML Declaration
(and I don't think that you're going to surprise us with the SGML
Declaration for SML anytime soon).

Working with hexadecimal code values some of the time and decimal code
values some of the time is a real pain.  When the character set's
documentation uses hexadecimal code values, it's much easier if you
can enter hexadecimal code values in your data.


Tony Graham
Tony Graham                            mailto:tgraham@mulberrytech.com
Mulberry Technologies, Inc.                http://www.mulberrytech.com
17 West Jefferson Street                    Direct Phone: 301/315-9632
Suite 207                                          Phone: 301/315-9631
Rockville, MD  20850                                 Fax: 301/315-8285
  Mulberry Technologies: A Consultancy Specializing in SGML and XML

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To unsubscribe, mailto:majordomo@ic.ac.uk the following message;
unsubscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 2001 XML.org. This site is hosted by OASIS