Re: Ctype patch for review

From: Taku YAMAMOTO <taku_at_tackymt.homeip.net>
Date: Wed, 19 Sep 2007 11:12:07 +0900
On Mon, 17 Sep 2007 21:16:33 +0400
Andrey Chernov <ache_at_nagual.pp.ru> wrote:

> On Tue, Sep 18, 2007 at 02:01:00AM +0900, YAMAMOTO, Taku wrote:
> > Checking for __mb_cur_max is not enough for certain locales.
> > For example, SJIS has following range for JIS X0201 (a.k.a. HALFWIDTH KANA).
> > 
> > /*
> >  * JIS X201
> >  */
> > PUNCT           0xa1-0xa5
> > SPACE           0xa0
> > BLANK           0xa0
> > SPECIAL         0xa1-0xdf
> > PHONOGRAM       0xa6-0xdf
> > SWIDTH1         0xa0-0xdf
> 
> I don't understand your remark. MSKanji have __mb_cur_max = 2 and so those 
> ranges are wchar_t ranges. My patch restrict unsigned char ranges only.

These characters ARE single byte.
The problem is that a byte >= 0x80 does not always mean it composes a
multi-byte character in that locale.


-- 
-|-__   YAMAMOTO, Taku
 | __ <     <taku_at_tackymt.homeip.net>

      - A chicken is an egg's way of producing more eggs. -
Received on Wed Sep 19 2007 - 00:12:10 UTC

This archive was generated by hypermail 2.4.0 : Wed May 19 2021 - 11:39:17 UTC