NAME

Char::Class::JISX0221 - Regular Expression Character Classes - JISX0221


DESCRIPTION

Japanese character sub-repertories, defined by JIS X 0221-1:2001 Appendix 1

Appendix 1 (informative) of JIS X 0221-1:2001 defines seven Japanese character sub-repertories:

a) BASIC JAPANESE (基本日本文字集合)

This sub-repertory consists of the graphic characters defined by JIS X 0201, ISO/IEC 646 IRV and JIS X 0208. Its Han characters are those defined in clause 27 (CJK Unified Ideographs) of the main part of ISO/IEC 10646-1 that have a JIS X 0208 source code point. Its non-Han characters are listed in Appendix 1 table 1.

b) JAPANESE NON IDEOGRAPHICS SUPPLEMENT (追加非漢字集合)

This sub-repertory consists of the non-Han characters of JIS X 0212 and additional symbols used in Japanese documents that are not included in JIS X 0201, JIS X 0208 or JIS X 0212. The characters are listed in Appendix 1 table 2.

This sub-repertory is intended to be used with BASIC JAPANESE.

c) JAPANESE IDEOGRAPHICS SUPPLEMENT 1 (追加漢字集合)

This sub-repertory consists of the more frequently used Han characters defined by JIS X 0212. The characters are listed in Appendix 1 table 3.

This sub-repertory is intended to be used with BASIC JAPANESE.

d) JAPANESE IDEOGRAPHICS SUPPLEMENT 2 (補助漢字集合)

This sub-repertory consists of the Han characters defined by JIS X 0212, except those included in JAPANESE IDEOGRAPHICS SUPPLEMENT 1.

This sub-repertory is intended to be used with BASIC JAPANESE and JAPANESE IDEOGRAPHICS SUPPLEMENT 1.

e) JAPANESE IDEOGRAPHICS SUPPLEMENT 3 (その他の漢字集合)

This sub-repertory consists of the Han characters defined in CJK UNIFIED IDEOGRAPHS, except those included in BASIC JAPANESE and JAPANESE IDEOGRAPHICS SUPPLEMENT 1 and 2.

This sub-repertory is intended to be used with BASIC JAPANESE and JAPANESE IDEOGRAPHICS SUPPLEMENT 1 and 2.

f) FULLWIDTH ALPHANUMERICS (互換用全角英数字集合)

This sub-repertory consists of the fullwidth variants of ISO/IEC 646 IRV characters, listed in Appendix 1 table 4. It is intended solely for compatibility with older, customary Japanese character repertories or presentation forms.

This sub-repertory is intended to be used with BASIC JAPANESE and BASIC LATIN (ISO/IEC 10646-1 Appendix A.1).

g) HALFWIDTH KATAKANA (互換用半角片仮名集合)

This sub-repertory consists of the halfwidth variants of the Katakana characters defined by JIS X 0201, listed in Appendix 1 table 5. It is intended solely for compatibility with older, customary Japanese character repertories or presentation forms.

This sub-repertory is intended to be used with BASIC JAPANESE.

Only one character, TILDE (U+007E), is included in both BASIC JAPANESE and JAPANESE NON IDEOGRAPHICS SUPPLEMENT. Every other character belongs to exactly one sub-repertory.
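
The overlap rule for TILDE can be illustrated with a minimal sketch. The two properties below are hypothetical, deliberately incomplete stand-ins and not this module's definitions: the first covers only the ISO/IEC 646 IRV part of BASIC JAPANESE, the second only TILDE. The helper name collections_of is likewise an assumption made for illustration.

```perl
use strict;
use warnings;

# Hypothetical, incomplete stand-ins. A user-defined property is a sub whose
# name starts with "In" or "Is" and which returns hex ranges, one per line,
# written as "start<TAB>end" or as a single code point.
sub InSketchBasicJapanese             { "0021\t007E\n" }  # ISO/IEC 646 IRV part only
sub InSketchNonIdeographicsSupplement { "007E\n" }        # TILDE only

# Return the names of every stand-in collection that contains $char.
sub collections_of {
    my ($char) = @_;
    my @found;
    push @found, 'BasicJapanese'
        if $char =~ /\A\p{InSketchBasicJapanese}\z/;
    push @found, 'NonIdeographicsSupplement'
        if $char =~ /\A\p{InSketchNonIdeographicsSupplement}\z/;
    return @found;
}

print join(', ', collections_of('~')), "\n";  # BasicJapanese, NonIdeographicsSupplement
print join(', ', collections_of('A')), "\n";  # BasicJapanese
```

With the real collections, the same kind of check should report two collections for TILDE and exactly one for any other covered character.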


COLLECTION NAMES

InJISX0221BasicJapanese
InJISX0221FullwidthAlphanumeric
InJISX0221HalfwidthKatakana
InJISX0221JapaneseIdeographicsSupplement1
InJISX0221JapaneseIdeographicsSupplement2
InJISX0221JapaneseIdeographicsSupplement3
InJISX0221JapaneseNonIdeographicsSupplement


EXAMPLE

 use Char::Class::JISX0221;
 if ($s =~ /\p{InJISX0221FullwidthAlphanumeric}/) {
   print "Match!\n";
 }
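
Collections can also be combined inside a bracketed character class. The following is a self-contained sketch that does not load this module: it defines two hypothetical stand-in properties so that it runs on its own. The ranges are assumptions (U+FF01..U+FF5E for the fullwidth forms of the ISO/IEC 646 graphic characters, U+FF61..U+FF9F for the halfwidth forms of JIS X 0201 katakana) and may differ from Appendix 1 tables 4 and 5. The helper name has_compat_char is also hypothetical.

```perl
use strict;
use warnings;

# Hypothetical stand-ins, NOT the module's definitions; ranges are assumed.
sub InSketchFullwidthAlphanumeric { "FF01\tFF5E\n" }
sub InSketchHalfwidthKatakana     { "FF61\tFF9F\n" }

# Return 1 if $text contains at least one compatibility character
# from either stand-in collection, otherwise 0.
sub has_compat_char {
    my ($text) = @_;
    return $text =~ /[\p{InSketchFullwidthAlphanumeric}\p{InSketchHalfwidthKatakana}]/
        ? 1 : 0;
}

print has_compat_char("ABC"), "\n";         # 0: plain ASCII only
print has_compat_char("\x{FF21}BC"), "\n";  # 1: FULLWIDTH LATIN CAPITAL LETTER A
print has_compat_char("\x{FF76}"), "\n";    # 1: HALFWIDTH KATAKANA LETTER KA
```

A check of this shape could be used to flag text that still relies on the two compatibility sub-repertories.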


SEE ALSO

ISO/IEC 10646-1:1993, ISO/IEC JTC1, 1993. Obsoleted by ISO/IEC 10646-1:2000.

ISO/IEC 10646-1:2000, ISO/IEC JTC1, 2000. Revision of ISO/IEC 10646-1:1993.

JIS X 0221-1995, Japanese Industrial Standards Committee (JISC <http://www.jisc.go.jp/>), 1995. Obsoleted by JIS X 0221-1:2001.

JIS X 0221-1:2001, Japanese Industrial Standards Committee (JISC <http://www.jisc.go.jp/>), 2001. JIS version of ISO/IEC 10646-1:2000. Revision of JIS X 0221-1995.

ISO/IEC 646:1991, JIS X 0201:1997, JIS X 0208:1997, JIS X 0212-1990, JIS X 0213:2000 are also related.

the Char::Class::UCS manpage

the Char::Class::RFC1815 manpage


LICENSE

Copyright 2008 Wakaba <w@suika.fam.cx>

This library and the library generated by it are free software; you can redistribute them and/or modify them under the same terms as Perl itself.