#368128 Text::Unidecode: U+4EB0 less Chinese than U+4EAC

#368128#5
Date:
2006-05-19 21:32:51 UTC
From:
To:
In the man page,
        print unidecode(
           "\x{5317}\x{4EB0}\n"
            # those are the Chinese characters for Beijing

Are you sure?
$ echo U+4EB0 |ascii2uni -qP|iconv -t gb2312|qp-encode
iconv: illegal input sequence at position 0
$ echo U+4EAC |ascii2uni -qP|iconv -t gb2312|qp-encode
=BE=A9

I bet U+4EB0 is a Japanese variant or something.

By the way, perhaps add an example of a quick way to use Text::Unidecode:
perl -MText::Unidecode -C -nwe 'print unidecode $_' file
(Want to not to even say $_, but too bad, it is not optional.)
P.S., perhaps add an option to also output Chinese tone numbers.