XML referencing (was Re: Blueberry is not "closed")
- From: Joel Rees <rees@server.mediafusion.co.jp>
- To: Tim Bray <tbray@textuality.com>
- Date: Wed, 25 Jul 2001 13:10:37 +0900
Tim Bray further elucidated:
[snipped]
> On the other hand, if they do something like
>
> <?xml version="1.0" unicode="3.1.1" ?>
>
> Then every XML processor in the world will correctly reject it.
> Of course this second option is not viable if the W3C misguidedly
> lets NEL into the S production.
[snipped]
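For anyone who wants to see that rejection happen, here is a minimal sketch
using Python's standard-library parser. The XML 1.0 grammar for the
declaration only allows version, encoding and standalone, so the extra
pseudo-attribute should be a well-formedness error. The helper name
is_accepted and the two sample documents are my own, for illustration only.

```python
import xml.etree.ElementTree as ET

# Two tiny sample documents (hypothetical, for illustration only).
PLAIN = b'<?xml version="1.0"?><doc/>'
EXTENDED = b'<?xml version="1.0" unicode="3.1.1" ?><doc/>'


def is_accepted(document: bytes) -> bool:
    """Return True if the standard-library parser accepts the document."""
    try:
        ET.fromstring(document)
    except ET.ParseError:
        # The parser reported a well-formedness error.
        return False
    return True


if __name__ == "__main__":
    print("plain declaration accepted:   ", is_accepted(PLAIN))
    print("extended declaration accepted:", is_accepted(EXTENDED))
```

This only exercises one parser (expat, via ElementTree); it says nothing
about how other processors report the error.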
Okay, from Unicode 3.0, p. 48 (sec. 3.9, "Special Character Properties"), I
see that NEL is _not_ in the UNICODE list of line boundary control
characters. This seems odd. Why should NExt Line not be a line boundary
control? I guess I need to get back on the public Unicode list and ask.
(Too many lists. I'll just have to start using rules in my mailer.)
I assume Line Separator and Paragraph Separator _are_ recognized as line
boundary controls by XML? No!?
http://www.w3.org/TR/2000/REC-xml-20001006#sec-line-ends
Hmmm.
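As I read that section, the only normalization XML 1.0 does is to turn
CR LF, and any CR not followed by LF, into a single LF. A rough sketch of
that rule, to make it concrete that NEL, Line Separator and Paragraph
Separator pass through untouched (the function name and the constants are
mine, not anything from the spec):

```python
import re

# CR LF, or a lone CR, becomes a single LF. Nothing else is touched.
_XML10_LINE_END = re.compile(r"\r\n|\r")

NEL = "\u0085"  # NEXT LINE
LS = "\u2028"   # LINE SEPARATOR
PS = "\u2029"   # PARAGRAPH SEPARATOR


def normalize_line_ends(text: str) -> str:
    """Apply XML 1.0 style line-end normalization to already-decoded text."""
    return _XML10_LINE_END.sub("\n", text)


if __name__ == "__main__":
    print(repr(normalize_line_ends("a\r\nb\rc\nd")))
    print(repr(normalize_line_ends("a" + NEL + "b" + LS + "c" + PS + "d")))
```

A real parser does this on the input stream before parsing; the sketch
works on a decoded string only to show which characters are affected.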
Add a third parameter that references a character classification override
table for the parser? (I hear the sound of stomach juices sloshing.) No,
let's try this instead: rather than the second parameter being explicitly a
Unicode version reference, it is allowed to reference a character
classification override table, a few of which happen to be universally
known tables matching versions of Unicode.
But can such a thing be done without breaking the SGML connection? And can
it be done cheaply enough? Efficiently enough? Will it require features not
available in common OSes to implement? (The shared static library problem
_has_ been solved, hasn't it?)
Maybe NEL should be set aside. But (IMnsHO) Blueberry should allow any
Unicode line boundary control as a line-separating character. I wonder if
anyone at IBM has evaluated the possibility of slipping a pass into the
process of translating their EBCDIC documents that would map the NEL to an
XML entity or special tag followed by a line feed.
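To make the idea of such a pass concrete, here is a minimal sketch. It
assumes the EBCDIC text has already been decoded to Unicode, so that NEL
shows up as U+0085, and it uses a numeric character reference where I said
"entity". Both function names are hypothetical. Note that this is only
reasonable for NELs in character data; a character reference is not
whitespace as far as markup is concerned, so NELs between attributes or
inside tags would need the plain line feed variant.

```python
NEL = "\u0085"  # NEXT LINE, as it appears after decoding to Unicode


def nel_to_char_ref(text: str) -> str:
    """Keep each NEL as a character reference, followed by a line feed."""
    return text.replace(NEL, "&#x85;\n")


def nel_to_line_feed(text: str) -> str:
    """Lossy alternative: replace each NEL with a plain line feed."""
    return text.replace(NEL, "\n")


if __name__ == "__main__":
    sample = "first line" + NEL + "second line" + NEL
    print(repr(nel_to_char_ref(sample)))
    print(repr(nel_to_line_feed(sample)))
```

The first variant keeps the information that a NEL was there, at the cost
of the receiving application seeing U+0085 in its data; the second simply
throws it away.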
I get the feeling I'm repeating in monologue a discussion that has been had
elsewhere.
Joel Rees
programmer -- rees@mediafusion.co.jp
----------------------------------------------------
To be a tree supporting all information,
giving root to the chaos
and branches to the trivia,
information breathing anew --
This is the aim of Yggdrasill.
============================XML as Best Solution===
Media Fusion Co. ,Ltd. 株式会社メディアフュージョン
Amagasaki TEL 81-6-6415-2560 FAX 81-6-6415-2556
Tokyo TEL 81-3-3516-2566 FAX 81-3-3516-2567
http://www.mediafusion.co.jp
===================================================