On Mon, 17 Sep 2007 21:16:33 +0400 Andrey Chernov <ache_at_nagual.pp.ru> wrote: > On Tue, Sep 18, 2007 at 02:01:00AM +0900, YAMAMOTO, Taku wrote: > > Checking for __mb_cur_max is not enough for certain locales. > > For example, SJIS has following range for JIS X0201 (a.k.a. HALFWIDTH KANA). > > > > /* > > * JIS X201 > > */ > > PUNCT 0xa1-0xa5 > > SPACE 0xa0 > > BLANK 0xa0 > > SPECIAL 0xa1-0xdf > > PHONOGRAM 0xa6-0xdf > > SWIDTH1 0xa0-0xdf > > I don't understand your remark. MSKanji have __mb_cur_max = 2 and so those > ranges are wchar_t ranges. My patch restrict unsigned char ranges only. These characters ARE single byte. The problem is that a byte >= 0x80 does not always mean it composes a multi-byte character in that locale. -- -|-__ YAMAMOTO, Taku | __ < <taku_at_tackymt.homeip.net> - A chicken is an egg's way of producing more eggs. -Received on Wed Sep 19 2007 - 00:12:10 UTC
This archive was generated by hypermail 2.4.0 : Wed May 19 2021 - 11:39:17 UTC