[darcs-users] UTF-16 (was: Default binary masks)

Sean E. Russell ser at germane-software.com
Wed Nov 26 04:08:30 UTC 2003


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On Tuesday 25 November 2003 22:04, Trevor Talbot wrote:
> On a practical note, if it is labeled out of band, then you know that
> the stream is UTF-16BE or UTF-16LE by the same token that you know it
> is UTF-8.  That mitigates the requirement of information from the start
> of the stream, but it does not remove the "byte order" state itself.

Agreed.

> If you know the stream is UTF-16, then you always drop into the stream
> on a double-byte boundary.  You never deal with UTF-16 data on a

... if you can.  That is, assuming you have control over where you enter the 
stream.  I'm concerned about the case where you've lost or missed part of it.  
With UTF-16 you can't just pick it up in the middle.
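
A minimal sketch of that concern, in Python.  The sample strings and the
helper name resync_utf8 are hypothetical, chosen only for illustration: a
UTF-16 stream joined one byte late still decodes, but to the wrong text,
whereas a UTF-8 stream can be realigned by skipping continuation bytes.

```python
# Illustration only: the sample text and offsets are assumptions,
# not anything taken from darcs.

def resync_utf8(data: bytes) -> bytes:
    """Skip leading UTF-8 continuation bytes (bit pattern 10xxxxxx)
    so that the result starts at the beginning of a character."""
    i = 0
    while i < len(data) and (data[i] & 0xC0) == 0x80:
        i += 1
    return data[i:]

# UTF-16BE: every ASCII character becomes 0x00 followed by the ASCII byte.
utf16 = "darcs".encode("utf-16-be")

# Join the stream one byte late (and drop the final byte so that the
# length stays even).  Nothing in the bytes reveals the misalignment,
# so the decoder pairs them up wrongly without raising an error.
misaligned = utf16[1:-1].decode("utf-16-be")

# UTF-8: lead bytes and continuation bytes are distinguishable, so a
# reader that joins mid-character can find the next character boundary.
utf8 = "\u00e9\u20acx".encode("utf-8")      # e-acute, euro sign, 'x'
recovered = resync_utf8(utf8[1:]).decode("utf-8")

print(repr(misaligned))   # four CJK ideographs, not "darc"
print(repr(recovered))    # the euro sign and 'x'; the first char is lost
```

The UTF-8 reader loses at most the one character it landed inside; the
UTF-16 reader has no way to tell from the bytes alone that it is off by one.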

Not that this particular point is relevant for darcs; if the data stream isn't 
complete, the patch is probably unusable, UTF-16 or no.

- -- 
### SER   
### Deutsch|Esperanto|Francaise|Linux|XML|Java|Ruby|Aikido|Dirigibles
### http://www.germane-software.com/~ser  jabber.com:ser  ICQ:83578737 
### GPG: http://www.germane-software.com/~ser/Security/ser_public.gpg
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.1 (GNU/Linux)

iD8DBQE/xCc+P0KxygnleI8RAvnTAJ41eAZIFiZjX9AjyY7grFglmygZmgCcCxrk
UdMMgjkWy0cpBhT3k7QMqIo=
=EFQH
-----END PGP SIGNATURE-----




