[darcs-users] Is darcs utf-8 aware?

Tommy Pettersson ptp at lysator.liu.se
Thu Oct 9 19:39:09 UTC 2008


On Wed, Oct 08, 2008 at 11:47:46AM -0300, Mauricio wrote:
> For instance: can I describe a patch
> as "Versão 2"?

I think UTF-8 is the recommended encoding for patch meta data.

Darcs is mostly encoding-ignorant, so what you put in is what
you get out, bit by bit, but most "strings" are handled as
sequences of bytes, so UTF-16 will not work in many situations.
For example, Hunks consider lines to be sequences of bytes ended
by 0x10, and Replace patches have a similar constraint in
byte-oriented token patterns.

Also, when darcs prints patch contents to the terminal it
assumes 7-bit ASCII by default. There are some environment
variables that can be set to customize this behavior.

Most metadata (e.g., name, author) are hashed by darcs, and
can't be changed, so it will not be possible to change to
another encoding for meta data later on.


-- 
Tommy Pettersson <ptp at lysator.liu.se>


More information about the darcs-users mailing list