| 10 |
pairs of same characters dupplicately, with |
pairs of same characters dupplicately, with |
| 11 |
halfwidth/fullwidth property. |
halfwidth/fullwidth property. |
| 12 |
|
|
| 13 |
Even this situation is undesirable, we can't ignore |
Although this situation is undesirable, we can't ignore |
| 14 |
such historical implementions and latest JIS coded |
such historical implementions and latest JIS coded |
| 15 |
character set standards admire such dupulicate |
character set standards admire such dupulicate |
| 16 |
encoding to and only to "keep compatibility with |
encoding to and only to "keep compatibility with |
| 26 |
|
|
| 27 |
{ |
{ |
| 28 |
Name: |
Name: |
| 29 |
|
iso-2022-jp-fullwidth |
| 30 |
|
Encode: |
| 31 |
|
use:Encode::ISO2022::SevenBit |
| 32 |
|
=>ucs_to_ascii ucs_to_jisx0208_1998_irv ucs_to_jisx0208_1978_to_irv ucs_to_jisx0201_latin |
| 33 |
|
->iso-2022-jp |
| 34 |
|
Decode: |
| 35 |
|
use:Encode::ISO2022::SevenBit |
| 36 |
|
<-iso-2022-jp |
| 37 |
|
<=jisx0208_1983_irv_to_ucs jisx0208_1978_irv_to_ucs jisx0201_latin_to_ucs jisx0201_katakana_to_ucs jisx0212_1990_irv_to_ucs jisx0213_2000_1_irv_to_ucs jisx0213_2000_2_to_ucs jisx0208_1997_irv_to_ucs |
| 38 |
|
Description: |
| 39 |
|
ISO/IEC 2022 based 7-bit encoding for Japanese, |
| 40 |
|
ASCII + JIS X 0201 + JIS X 0208-1978 + JIS X 0208-1983. |
| 41 |
|
Some characters defined in JIS X 0208 are mapped to FULLWIDTH |
| 42 |
|
area of UCS as specified in JIS X 0208:1997. |
| 43 |
|
|
| 44 |
|
This encoding is a "compatible" version of |
| 45 |
|
C<iso-2022-jp> defined in Encode::ISO2022::SevenBit. |
| 46 |
|
|
| 47 |
|
When decoding, mapping tables from coded character |
| 48 |
|
sets listed below to UCS are also loaded to restore |
| 49 |
|
incorrectly labeled data. |
| 50 |
|
|
| 51 |
|
JIS X 0201 Katakana coded character set, |
| 52 |
|
JIS X 0212-1990, JIS X 0213:2000 |
| 53 |
|
|
| 54 |
|
Note that for Windows user, Encode::ISO2022::CP932 |
| 55 |
|
may be useful to try to restore broken "ISO-2022-JP" |
| 56 |
|
data. |
| 57 |
|
} |
| 58 |
|
|
| 59 |
|
{ |
| 60 |
|
Name: |
| 61 |
iso-2022-jp-3-fullwidth |
iso-2022-jp-3-fullwidth |
| 62 |
Encode: |
Encode: |
| 63 |
use:Encode::ISO2022::SevenBit |
use:Encode::ISO2022::SevenBit |
| 70 |
Description: |
Description: |
| 71 |
ISO/IEC 2022 based 7-bit encoding for Japanese, |
ISO/IEC 2022 based 7-bit encoding for Japanese, |
| 72 |
ISO/IEC 646 IRV + JIS X 0213:2000. Some characters |
ISO/IEC 646 IRV + JIS X 0213:2000. Some characters |
| 73 |
defined in JIS X 0213 are mapped to FULLWIDTH or |
defined in JIS X 0213 are mapped to FULLWIDTH |
| 74 |
HALFWIDTH area of UCS as specified in JIS X 0213:2000. |
area of UCS as specified in JIS X 0213:2000. |
| 75 |
|
|
| 76 |
This encoding is a "compatible" version of |
This encoding is a "compatible" version of |
| 77 |
C<iso-2022-jp-3> defined in Encode::ISO2022::SevenBit. |
C<iso-2022-jp-3> defined in Encode::ISO2022::SevenBit. |
| 136 |
} |
} |
| 137 |
|
|
| 138 |
POD:SEE ALSO: |
POD:SEE ALSO: |
| 139 |
JIS X 0221-1995, "Universal multi-octet coded character |
%%ReferenceJISX0212_1995%% |
| 140 |
set (UCS)", Japan Industrial Standards Committee |
|
| 141 |
<http://www.jisc.go.jp/>, 1995. IDT with ISO/IEC 10646-1:1993 |
%%ReferenceJISX0208_1997%% |
| 142 |
but three additional appendixes. |
|
| 143 |
|
%%ReferenceJISX0213_2000%% |
| 144 |
JIS X 0208:1997, "7-bit and 8-bit double byte coded |
|
| 145 |
Kanji set for information interchange", Japan |
%%ReferenceRFC1468%% |
|
Industrial Standards Committee <http://www.jisc.go.jp/>, |
|
|
1997. |
|
|
|
|
|
JIS X 0213:2000, "7-bit and 8-bit double byte extended |
|
|
coded Kanji set for information interchange", Japan |
|
|
Industrial Standards Committee <http://www.jisc.go.jp/>, |
|
|
2000. |
|
| 146 |
|
|
| 147 |
L<Encode::ISO2022::EightBit> |
L<Encode::ISO2022::EightBit> |
| 148 |
|
|
|
POD:TO DO: |
|
|
Support of C<iso-2022-jp-fullwidth> |
|
|
|
|
| 149 |
POD:LICENSE: |
POD:LICENSE: |
| 150 |
Copyright %%YEAR%% Wakaba <w@suika.fam.cx> |
Copyright %%YEAR%% Wakaba <w@suika.fam.cx> |
| 151 |
|
|