XML.orgXML.org
FOCUS AREAS |XML-DEV |XML.org DAILY NEWSLINK |REGISTRY |RESOURCES |ABOUT
OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]
Re: [xml-dev] How to handle "newline" characters in an XML parser.

In article <e24752a10612051124j501ffe3i7469c64a94ea4959@mail.gmail.com> you write:

>I'm having some trouble figuring out how to handle "newline"
>characters in XML text files on different platforms. I typically
>ignore all whitespace in the parser,

To conform to the standard, an XML parser must return all whitespace
in content to the application, except that line breaks must be
normalized to a linefeed character.  The idea is that XML applications
don't have to worry about the platform's line-end conventions.  Any of
the following count as a line break: LF, CR LF, and CR not followed by
LF.  So if you get two CRs followed by a LF, you should return two
LFs.  The easiest way to do this is to convert them as you input them,
before parsing.  You can count line numbers at the same time.

-- Richard


[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]


News | XML in Industry | Calendar | XML Registry
Marketplace | Resources | MyXML.org | Sponsors | Privacy Statement

Copyright 1993-2007 XML.org. This site is hosted by OASIS