[
Lists Home |
Date Index |
Thread Index
]
> The entity for an em-dash in Unicode is ≬
That's not an entity, it's a numeric character reference.
8812 is not an emdash, it's BETWEEN, EM DASH is 8212.
> Viewing the source gives me a strange character set: ти
Looks like that is the utf8 encoding. utf8 is the default encoding for
xml, most characters take more than one byte in that encoding, if you
look at the file in an editor that doesn't understand utf8 you will see
essentially arbitrary characters, as the bytes are mis-interpreted using
latin1 or your operating system's code page.
David
________________________________________________________________________
This e-mail has been scanned for all viruses by Star Internet. The
service is powered by MessageLabs. For more information on a proactive
anti-virus service working around the clock, around the globe, visit:
http://www.star.net.uk
________________________________________________________________________
|