[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: Blueberry/Unicode/XML

From: Jonathan Borden <jborden@mediaone.net>
To: John Cowan <cowan@mercury.ccil.org>
Date: Wed, 11 Jul 2001 23:41:21 -0400

John Cowan wrote:
>
>
> Jonathan Borden scripsit:
>
> > Aside for perhaps arbitrary (perhaps not :-) decisions about
> what characters
> > ought or ought not be used to name things, what are these "good
> reasons"?
> >
> > I specifically include in "good reasons":
> >
> > 1) useful pieces of code that would break
> > 2) hindrances to the development of useful pieces of code
>
> The main point is that it wouldn't be plain text any more.  If
> XML is just a
> binary format, something that no human being ever looks at, then
> ASCII markup is plenty: you can tag everything x1, x2, x3, ....

Hmmm... I thought that Unicode _was_ plain text, at least it says that it
is. I am not suggesting that we not represent XML as a sequence of Unicode
characters, nor am I suggesting that we allow characters in element names
that are not allowed in text content.

>
> But there are many Unicode characters that are very similar to others,
> such as the halfwidth-fullwidth case that's been talked about already,
> or the 127 (:-)) kinds of stars, or the various kinds of whitespace
> that aren't, and so on.

I don't see the big difference between:

<shrug> O'Hara </shrug>

and

<O'Hara> shrug </O'Hara>

... if 127 kinds of stars pose a problem for humans reading element names,
surely they will pose the same problem for humans reading element content,
no?

-Jonathan

Follow-Ups:
- Re: Blueberry/Unicode/XML
  - From: John Cowan <cowan@mercury.ccil.org>

References:
- Re: Blueberry/Unicode/XML
  - From: John Cowan <cowan@mercury.ccil.org>

Prev by Date: Re: DTD Notation raises a question
Next by Date: Re: Blueberry/Unicode/XML
Previous by thread: Re: Blueberry/Unicode/XML
Next by thread: Re: Blueberry/Unicode/XML
Index(es):
- Date
- Thread