[darcs-users] [patch37] Store textual patch metadata encoded in UTF-8

Juliusz Chroboczek Juliusz.Chroboczek at pps.jussieu.fr
Wed Nov 11 17:19:36 UTC 2009


> Nice to hear from you.

Lurking still, just not participating ;-)

>> No.  There's no need to tag.
>> 
>> UTF-8 can be detected automatically with 100% certainty in practice.  If
>> a string correctly decodes as UTF-8, then it's most certainly UTF-8.

> Are you saying that the probability of funny characters occurring only
> within UTF-8 compatible sequences like 110xxxxx 10xxxxx is just so
> absurdly low (especially in practice) that we can get away with
> autodetection?

I additionally claim that it never happens in a text written in
a natural language.

Of course, if you use Darcs patch logs to store arbitrary binary data,
that's your problem.

                                        Juliusz


More information about the darcs-users mailing list