[Date Prev][Date Next]
[Chronological]
[Thread]
[Top]
Re: Problems with case folding of UTF-8
> Pierangelo Masarati wrote:
> >
> > Can you, Stig and Michael, provide a set of strings that do not
> > work, so that I can try to see what's going on?
>
> Well, Ströder (hopefully properly encoded as ISO-8859-1 in this
> e-mail) is one. The hex-escaped string representation produced by
> Python's UTF-8 Unicode codec is:
>
> 'Str\xc3\xb6der'
>
> Furthermore here are all the german umlauts (each two bytes long):
>
> 'äöüÄÖÜß' ->
>
> '\xc3\xa4\xc3\xb6\xc3\xbc\xc3\x84\xc3\x96\xc3\x9c\xc3\x9f'
I guess you also need to omit the 'x' right? '\c3\a4' and so ...
Pierangelo.