
   Re: parsing entity values

  • From: Richard Tobin <richard@cogsci.ed.ac.uk>
  • To: "James Tauber" <jtauber@jtauber.com>, "Philip J Grabner" <grabner@interdim.com>, <xml-dev@ic.ac.uk>
  • Date: Mon, 25 Jan 1999 16:12:51 GMT

> ><!ENTITY % ap "&#38;#39;" > ( 38 = "&" , 39 = "'" )
> ><!ENTITY msg "he said %ap;hi!%ap;" >

> Right. The replacement text for ap is
> 
>     &#39;

Yes.

> With msg, the parameter entity is included as part of the replacement text
> and so the replacement text of msg is
> 
>     he said &#39;hi!&#39;

No.

See the table in section 4.4.  We have a parameter entity reference in an
entity value, so it is "included in literal".  4.4.5 says "[the parameter
entity's] replacement text is processed in place of the reference itself
as though it were part of the document at the location the reference was
recognised, except that a single or double quote character [...] will not
terminate the literal".  So the &#39; is processed as if it had occurred
directly in the definition of msg.

You can't see the difference in this case, but if we had:

<!ENTITY % less "&#38;#60;">
<!ENTITY % more "&#38;#62;">
<!ENTITY elt "%less;=%more;">

the replacement text of elt would be 

  <=>

not

  &#60;=&#62;

and should be detected as a syntax error if &elt; occurred in the
body, since "<=" cannot begin a tag.
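
To make the two readings concrete, here is a rough Python sketch of how
the replacement text might be computed.  It is not taken from any real
parser: the function name replacement_text, the params dictionary and
the reprocess_included flag are all made up for illustration, and it
only handles character references and parameter entity references.
With reprocess_included=True the included text is processed as though
it had appeared in the literal itself (the reading of 4.4.5 given
above); with False it is copied in unchanged (the other reading).

```python
import re

# A character reference (decimal or hex) or a parameter entity reference.
_TOKEN = re.compile(r"&#(x[0-9a-fA-F]+|[0-9]+);|%([A-Za-z_:][\w.\-:]*);")


def replacement_text(literal, params, reprocess_included=True):
    """Return the replacement text for an entity value literal.

    params maps parameter entity names to their replacement texts
    (hypothetical structure, assumed to be computed already).
    """
    def expand(match):
        charref, pe_name = match.groups()
        if charref is not None:
            # Character references are expanded exactly once; re.sub does
            # not rescan the character it produces.
            if charref.startswith("x"):
                return chr(int(charref[1:], 16))
            return chr(int(charref))
        included = params[pe_name]
        if reprocess_included:
            # Treat the included text as if it were written in place.
            return replacement_text(included, params, True)
        return included

    return _TOKEN.sub(expand, literal)


# Replacement texts of the parameter entities from the declarations above.
params = {
    "ap": replacement_text("&#38;#39;", {}),
    "less": replacement_text("&#38;#60;", {}),
    "more": replacement_text("&#38;#62;", {}),
}

print(replacement_text("%less;=%more;", params))
print(replacement_text("%less;=%more;", params, reprocess_included=False))
```

Under these assumptions the first call gives the three characters <=>
and the second gives the text with the two character references still
in it, which is exactly the difference at issue.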

Phil suggests that having to keep track of where the quotes are
special makes the parsing quite difficult; I don't think this is true,
though perhaps it depends on how your parser works.  Mine just checks
to see whether it read the quote character from the same entity that it
read the opening quote from.
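
As an illustration of that check (a minimal sketch only, not the code
of any actual parser), one could tag every character with a marker for
the entity it was read from and end the literal only on a quote
carrying the same marker as the opening quote.  The names
chars_with_source and read_literal are invented for this example,
character references are left alone, and there is no guard against a
parameter entity that refers to itself.

```python
import re

_PE_REF = re.compile(r"%([A-Za-z_:][\w.\-:]*);")


def chars_with_source(text, params, source=None):
    """Yield (char, source) pairs, expanding %name; references in place.

    Each inclusion of an entity gets its own fresh marker object, so two
    characters share a marker only if they were read from the same
    instance of the same entity.
    """
    if source is None:
        source = object()
    pos = 0
    for match in _PE_REF.finditer(text):
        for ch in text[pos:match.start()]:
            yield ch, source
        yield from chars_with_source(params[match.group(1)], params, object())
        pos = match.end()
    for ch in text[pos:]:
        yield ch, source


def read_literal(text, params):
    """Read a quoted literal from the start of text and return its body."""
    stream = chars_with_source(text, params)
    open_quote, open_source = next(stream)
    if open_quote not in "\"'":
        raise ValueError("literal must start with a quote character")
    body = []
    for ch, src in stream:
        # Only a matching quote read from the same entity as the opening
        # quote terminates the literal.
        if ch == open_quote and src is open_source:
            return "".join(body)
        body.append(ch)
    raise ValueError("unterminated literal")


# Hypothetical parameter entity whose replacement text is a double quote.
params = {"q": '"'}
print(read_literal('"he said %q;hi%q;" >', params))
```

Here the two double quotes that come from %q; end up in the body of
the literal, and only the quote written directly in the declaration
closes it.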

-- Richard

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@ic.ac.uk the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@ic.ac.uk the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@ic.ac.uk)





 
