Gwern Branwen <gwern0 at gmail.com> writes: > There isn't any schema I know of. You really just have to parse it > kind of ad-hoc. And as we've seen in the Darcs repo, input isn't recoded into UTF-8, so in *one output document* from changes --xml you can have ISO 8859-1 bytes, UTF-8 bytes, and JIS bytes. Which basically means it's not XML :-(