Lists Home |
Date Index |
> > Elliotte Rusty Harold wrote:
> > > It could be worse, though. You could be using C, and trying to decode
> > > UTF-8. :-)
> > ?? It's about 10 lines of code, and has been written lots of
> > times now.
But if your UTF-8 file is a big string, it slows you down for random access:
don't you have to count from the closest known point (unworkable) or maintain
some kind of chop points (e.g. a list of line deltas).
At least in UTF-8 you can just count bytes <0x80 to count characters.