xml-dev - Re: [xml-dev] char ref question

To: Manos Batsis <mbatsis@netsmart.gr>
Subject: Re: [xml-dev] char ref question
From: Rick Jelliffe <ricko@allette.com.au>
Date: Wed, 28 Jan 2004 16:07:26 +1100
Cc: XML Dev <xml-dev@lists.xml.org>
In-reply-to: <4016898C.4000804@netsmart.gr>
References: <4016898C.4000804@netsmart.gr>
User-agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.3.1) Gecko/20030428

Manos Batsis wrote:

> Hi list,
>
> Short version: Must character references in attribute values get 
> expanded by an XML parser?
>
> Long version: When a document like
>
> <?xml version="1.0" encoding="iso-8859-1"?>
> <foo bar="&#955;"/>
>
>
> is accessed by an API like SAX on top of an XML parser like piccolo 
> must the exposed attribute value be "&#955;" or "λ" (Greek lambda)?

You could conceivably have a partial parser that does not expand
character references. But then you would have two kinds of strings
floating around (expanded and unexpanded), which could cause
confusion. I guess it would be useful if
 * you wanted to stick to ASCII or 8859-1 encoded strings
 * you were just shovelling characters from input to output
   as fast as possible and you weren't interested in looking at
   the contents at all.
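
For comparison, here is a rough sketch of what a full (non-partial)
parser reports for the document in the question. It assumes Python's
standard xml.sax module with its default expat-based reader, not
piccolo; the names AttrCollector, attribute_values and to_ascii are
made up for illustration. The last helper shows one way to get back to
ASCII-only strings on output if that is what you need.

```python
import xml.sax


class AttrCollector(xml.sax.ContentHandler):
    """Hypothetical handler: records every attribute value it is shown."""

    def __init__(self):
        super().__init__()
        self.values = {}

    def startElement(self, name, attrs):
        # The parser has already expanded character references by the
        # time the attribute values reach this callback.
        for key in attrs.getNames():
            self.values[(name, key)] = attrs.getValue(key)


# The document from the question, as bytes in the declared encoding.
DOC = b'<?xml version="1.0" encoding="iso-8859-1"?>\n<foo bar="&#955;"/>'


def attribute_values(document):
    """Parse the document and return {(element, attribute): value}."""
    handler = AttrCollector()
    xml.sax.parseString(document, handler)
    return handler.values


def to_ascii(text):
    """Re-escape non-ASCII characters as numeric character references."""
    return text.encode("ascii", "xmlcharrefreplace").decode("ascii")


values = attribute_values(DOC)
exposed = values[("foo", "bar")]   # a single character, U+03BB
escaped = to_ascii(exposed)        # back to an ASCII-only string
```

With this setup the application sees one kind of string internally
(real characters), and the character reference form only reappears at
the output boundary.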

There are lots of kinds of partial or lazy parsing possible...

Cheers
Rick Jelliffe
