OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: Just a Little Explanaton for Veering (RE: Blueberry/Unicode/ XML)

John Cowan wrote:
> Perl 5.6 AFAIK is still ASCII-only.

Perl understands UTF-8, though full Unicode support is not yet

From the Perl 5.6.0 perlunicode manpage:

- Identifiers within the Perl script may contain Unicode alphanumeric
characters, including ideographs.

- Character classes in regular expressions match characters instead of
bytes, and match against the character properties specified in the
Unicode properties database. So \w can be used to match an ideograph,
for instance.