"Rick Jelliffe" <firstname.lastname@example.org> wrote:
| If the file is coming in from the outside world, it is clearly a
| mistake if there is a control character in the data, and an entity
| manager which uses the "shunned characters" to strip out the control
| char is not doing the wrong thing. (I would say that for internal
| entities, it is a different matter.)
What's the difference? Aren't both under the control of the entity
manager, and didn't the internal entity reside in a file that came from
the outside world?
| The right thing to do does not come from ISO 8879: it just provides a
| way to label characters with their class.
IMHO, allowing NONSGML characters through via entity references can be a
nasty gotcha. (Not to mention the validation fun with "smart quotes" and
the like sprinkled in HTML pages by bogotic software.) It could be a
problem with the existing APIs though: if they said "btw, here's a NONSGML
character, you know how it came through <wink>", that could be different.
| Ultimately, boils down to whatever SP does.
SP does the right thing per the standard. It doesn't necessarily do the
right thing(s) for the consuming application (unless you integrate it and
use the native API).
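
To make the two behaviours discussed above concrete, here is a rough sketch of an entity-manager-style filter: one policy silently strips suspect control characters, the other passes the text through but tells the caller where they were. This is not SP's API. The function names are hypothetical, and the choice of "suspect" characters (C0 controls other than tab, line feed and carriage return, plus DEL) is an assumption for illustration, not the character classes of any particular SGML declaration.

```python
# Hypothetical sketch: not SP's API; the character set below is an assumption.
ALLOWED_CONTROLS = {0x09, 0x0A, 0x0D}  # tab, line feed, carriage return


def is_suspect(ch: str) -> bool:
    """True for C0 control characters (other than the allowed ones) and DEL."""
    code = ord(ch)
    return (code < 0x20 and code not in ALLOWED_CONTROLS) or code == 0x7F


def strip_suspect(text: str) -> str:
    """Policy 1: silently drop suspect characters before parsing."""
    return "".join(ch for ch in text if not is_suspect(ch))


def report_suspect(text: str):
    """Policy 2: leave the text alone, but report (offset, code point) pairs."""
    found = [(i, ord(ch)) for i, ch in enumerate(text) if is_suspect(ch)]
    return text, found
```

With the second policy the consuming application gets to decide what to do, which is roughly the "btw, here's a NONSGML character" behaviour: `report_suspect("a\x01b")` hands back the untouched text along with the offset and code point of the offending character.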