#?PESRC/1.0
Name:
ISO2022::JIS
ShortDescription:
The Encode module for 7-bit ISO/IEC 2022
based coding systems defined by JISes
Description:
This module defines conversion between Perl's internal
representation and 7-bit ISO/IEC 2022 based coding systems
defined in JIS (Japanese Industrial Standards) standards.

Note that frequently used JIS coding systems
are included in other modules.  For instance,
C<ISO-2022-JP-3> (defined by JIS X 0213:2000) is
included in Encode::ISO2022::JUNET.

{
Name:
jisx0201-1997-latin-7bit
Alias:
JIS_C6220-1969-ro iso-ir-14 ir14 jp ISO646-JP 646-jp csISO14JISC6220ro
Cversion:
C:bit=7
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters
C:designate:G94:B=-1
C:designate:*:default=-1
C:option:{undef_char}=["\x3F", {type => 'G94', charset => 'J'}]
Encode:Prepare:
C:GR=undef
C:G1=G96:~
C:option:{reset}->{Gdesignation}=0
C:option:{reset}->{Ginvoke}=0
Encode:
=>ucs_to_jisx0201_latin
->iso2022:C
Decode:Prepare:
C:G1=G94:J
C:G2=G94:J
C:G3=G94:J
Decode:
<-iso2022:C
<=jisx0201_latin_to_ucs
Description:
The 7-bit code for Latin letters (JIS X 0201:1997 6.1).
}

{
Name:
jisx0201-1997-katakana-7bit
Alias:
JIS_C6220-1969-jp JIS_C6220-1969 iso-ir-13 ir13 katakana x0201-7 csISO13JISC6220jp
Cversion:
C:bit=7
C:G0=G94:I ## JIS X 0201:1997 Graphic character set for Katakana
C:designate:G94:B=-1
C:designate:*:default=-1
C:option:{undef_char}=["\x25", {type => 'G94', charset => 'I'}]
Encode:Prepare:
C:GR=undef
C:G1=G96:~
C:option:{reset}->{Gdesignation}=0
C:option:{reset}->{Ginvoke}=0
Encode:
=>ucs_to_jisx0201_katakana
->iso2022:C
Decode:Prepare:
C:G1=G94:I
C:G2=G94:I
C:G3=G94:I
Decode:
<-iso2022:C
<=jisx0201_katakana_to_ucs
Description:
The 7-bit code for Katakana (JIS X 0201:1997 6.2).
}

{
Name:
jisx0201-1997-latin-katakana-7bit
Cversion:
C:bit=7
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters
C:G1=G94:I ## JIS X 0201:1997 Graphic character set for Katakana
C:designate:G94:B=-1
C:designate:*:default=-1
C:option:{undef_char}=["\x3F", {type => 'G94', charset => 'J'}]
Encode:Prepare:
C:GL=undef
C:GR=undef
C:option:{Ginvoke_to_left}=[1,1,1,1]
C:option:{reset}->{Gdesignation}=0
C:option:{reset}->{Ginvoke}=1 ## JIS X 0201:1997 doesn't have this limitation.
Encode:
=>ucs_to_jisx0201_latin ucs_to_jisx0201_katakana
->iso2022:C
Decode:
<-iso2022:C
<=jisx0201_latin_to_ucs jisx0201_katakana_to_ucs
Description:
The 7-bit code for Latin and Katakana (JIS X 0201:1997 6.3).

See also the description of C<jisx0201-1997-katakana-latin-7bit>.
}

{
Name:
jisx0201-1997-katakana-latin-7bit
Cversion:
C:bit=7
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters
C:G1=G94:I ## JIS X 0201:1997 Graphic character set for Katakana
C:designate:G94:B=-1
C:designate:*:default=-1
C:option:{undef_char}=["\x3F", {type => 'G94', charset => 'J'}]
Encode:Prepare:
C:GL=undef
C:GR=undef
C:option:{Ginvoke_to_left}=[1,1,1,1]
C:option:{reset}->{Gdesignation}=0
C:option:{reset}->{Ginvoke}=1 ## JIS X 0201:1997 doesn't have this limitation.
Encode:
=>ucs_to_jisx0201_latin ucs_to_jisx0201_katakana
->iso2022:C
Decode:Prepare:
C:GL=G1 ## By default, G1=Katakana is invoked
Decode:
<-iso2022:C
<=jisx0201_latin_to_ucs jisx0201_katakana_to_ucs
Description:
The 7-bit code for Latin and Katakana (JIS X 0201:1997 6.3).

JIS X 0201:1997 does not define whether G0 or G1 is
invoked to GL at the start of information interchange
(although it recommends that G0=Latin be invoked).

In this module, C<jisx0201-1997-latin-katakana-7bit>
assumes that G0=Latin is initially invoked to GL, and
C<jisx0201-1997-katakana-latin-7bit> assumes that G1=Katakana
is.  On encoding, to avoid this ambiguity, GL is treated
as undefined, so that C<SI> or C<SO> is output before the
first G0/G1 character in both coding systems.
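
The shift behaviour described above can be illustrated with a small Python sketch. It is not this module's implementation: the function names C<encode_7bit> and C<decode_7bit> are hypothetical, and the sketch assumes the usual mapping of JIS X 0201 Katakana 0x21..0x5F to U+FF61..U+FF9F, of 0x5C to U+00A5 and of 0x7E to U+203E. It shows that a stream starting with an explicit shift decodes identically whichever set the decoder assumes is initially invoked.

```python
SO, SI = 0x0E, 0x0F  # SO invokes G1 to GL, SI invokes G0 to GL


def encode_7bit(text: str) -> bytes:
    """Encode Latin/Katakana text; GL starts undefined (assumption: mirrors GL=undef)."""
    gl = None
    out = bytearray()
    for ch in text:
        cp = ord(ch)
        if 0xFF61 <= cp <= 0xFF9F:            # halfwidth Katakana goes to G1
            want, byte = "G1", cp - 0xFF61 + 0x21
        elif ch == "\u00a5":                  # YEN SIGN sits at 0x5C
            want, byte = "G0", 0x5C
        elif ch == "\u203e":                  # OVERLINE sits at 0x7E
            want, byte = "G0", 0x7E
        elif 0x21 <= cp <= 0x7E and cp not in (0x5C, 0x7E):
            want, byte = "G0", cp
        elif cp <= 0x20 or cp == 0x7F:        # controls, SPACE, DELETE pass through
            out.append(cp)
            continue
        else:
            raise ValueError(f"not encodable: U+{cp:04X}")
        if gl != want:                        # the first graphic always emits a shift
            out.append(SO if want == "G1" else SI)
            gl = want
        out.append(byte)
    return bytes(out)


def decode_7bit(data: bytes, initial_gl: str) -> str:
    """Decode with either "G0" or "G1" assumed to be invoked initially."""
    gl = initial_gl
    out = []
    for b in data:
        if b > 0x7F:
            raise ValueError("not a 7-bit code")
        if b == SO:
            gl = "G1"
        elif b == SI:
            gl = "G0"
        elif 0x21 <= b <= 0x7E and gl == "G1":
            if b > 0x5F:
                raise ValueError(f"unassigned Katakana position: 0x{b:02X}")
            out.append(chr(0xFF61 + b - 0x21))
        elif b == 0x5C:
            out.append("\u00a5")
        elif b == 0x7E:
            out.append("\u203e")
        else:
            out.append(chr(b))
    return "".join(out)


encoded = encode_7bit("A\uff71")
same = decode_7bit(encoded, "G0") == decode_7bit(encoded, "G1")
```

Without the leading shift, the single byte 0x31 would decode as a digit under one assumption and as a Katakana letter under the other, which is the ambiguity the two coding-system names distinguish.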
}

{
Name:
jisx0201-1997-latin-latin-8bit
Alias:
JIS_X0201 X0201 csHalfWidthKatakana kana8
Cversion:
C:bit=8
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters
C:G1=G94:I ## JIS X 0201:1997 Graphic character set for Katakana
Encode:Prepare:
C:designate:*:default=-1
C:designate:G94:B=-1
C:option:{C1invoke_to_right}=1
C:option:{reset}->{Gdesignation}=0
C:option:{reset}->{Ginvoke}=0
C:option:{undef_char}=["\x3F", {type => 'G94', charset => 'J'}]
Encode:
=>ucs_to_jisx0201_latin ucs_to_jisx0201_katakana
->iso2022:C
Decode:
<-iso2022:C
<=jisx0201_latin_to_ucs jisx0201_katakana_to_ucs
Description:
The 8-bit code for Latin and Katakana (JIS X 0201:1997 6.4).
}

{
Name:
jisx0208-1997-kanji-7bit
Cversion:
C:bit=7
C:G0=G94n:B@ ## JIS X 0208:1997
Encode:Prepare:
C:GR=undef
C:G1=G96:~
C:designate:G94:B=-1
C:designate:*:default=-1
C:option:{reset}->{Gdesignation}=0
C:option:{reset}->{Ginvoke}=0
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => '@'}]
Encode:
=>ucs_to_jisx0208_1997
->iso2022:C
Decode:
<-iso2022:C
<=jisx0208_1997_to_ucs
Description:
The 7-bit code for Kanji (JIS X 0208:1997 7.1.1).
}

{
Name:
jisx0208-1997-kanji-8bit
Cversion:
C:bit=8
C:G0=G94n:B@ ## JIS X 0208:1997
Encode:Prepare:
C:GR=undef
C:G1=G96:~
C:designate:G94:B=-1
C:designate:*:default=-1
C:option:{C1invoke_to_right}=1
C:option:{reset}->{Gdesignation}=0
C:option:{reset}->{Ginvoke}=0
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => '@'}]
Encode:
=>ucs_to_jisx0208_1997
->iso2022:C
Decode:
<-iso2022:C
<=jisx0208_1997_to_ucs
Description:
The 8-bit code for Kanji (JIS X 0208:1997 7.1.2).
}

{
Name:
jisx0208-1997-irv-kanji-7bit
Cversion:
C:bit=7
C:G0=G94:B ## ISO/IEC 646:1991 IRV
C:G1=G94n:B@ ## JIS X 0208:1997
Encode:Prepare:
C:GL=undef
C:GR=undef
C:designate:*:default=-1
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => '@'}]
C:option:{Ginvoke_to_left}=[1,1,1,1]
C:option:{reset}->{Gdesignation}=0
C:option:{reset}->{Ginvoke}=1 ## JIS X 0208:1997 does not have this limitation.
Encode:
=>ucs_to_ascii ucs_to_jisx0208_1997
->iso2022:C
Decode:
<-iso2022:C
<=jisx0208_1997_to_ucs
Description:
The 7-bit code for IRV and Kanji (JIS X 0208:1997 7.2.1).
}

{
Name:
jisx0208-1997-kanji-irv-7bit
Cversion:
C:bit=7
C:G0=G94:B ## ISO/IEC 646:1991 IRV
C:G1=G94n:B@ ## JIS X 0208:1997
Encode:Prepare:
C:GL=undef
C:GR=undef
C:designate:*:default=-1
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => '@'}]
C:option:{Ginvoke_to_left}=[1,1,1,1]
C:option:{reset}->{Gdesignation}=0
C:option:{reset}->{Ginvoke}=1 ## JIS X 0208:1997 does not have this limitation.
Encode:
=>ucs_to_ascii ucs_to_jisx0208_1997
->iso2022:C
Decode:Prepare:
C:GL=G1
Decode:
<-iso2022:C
<=jisx0208_1997_to_ucs
Description:
The 7-bit code for IRV and Kanji (JIS X 0208:1997 7.2.1).

This coding system is the same as C<jisx0208-1997-irv-kanji-7bit>
except that it starts with the Kanji set invoked.  See the description of
C<jisx0201-1997-katakana-latin-7bit>.
}

{
Name:
jisx0208-1997-irv-kanji-8bit
Cversion:
C:bit=8
C:G0=G94:B ## ISO/IEC 646:1991 IRV
C:G1=G94n:B@ ## JIS X 0208:1997
Encode:Prepare:
C:GL=undef
C:GR=undef
C:designate:*:default=-1
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => '@'}]
C:option:{C1invoke_to_right}=1
C:option:{reset}->{Gdesignation}=0
C:option:{reset}->{Ginvoke}=0
Encode:
=>ucs_to_ascii ucs_to_jisx0208_1997
->iso2022:C
Decode:
<-iso2022:C
<=jisx0208_1997_to_ucs
Description:
The 8-bit code for IRV and Kanji (JIS X 0208:1997 7.2.2).

Note that this coding system can be regarded as
a subset of C<euc-jp>.  For historical reasons,
a considerable number of Japanese EUC applications
did not support the G2 and G3 sets (and some still
do not).  This coding system can be used for
information interchange with such implementations.
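
The subset relationship can be checked with a short Python sketch. It relies only on Python's standard C<euc_jp> codec, not on this module; the helper name C<gl_to_gr> is hypothetical. JIS X 0208 code positions are moved from GL to GR by setting the high bit of each byte, and as long as the single-shift bytes 0x8E and 0x8F (which select G2 and G3 in EUC-JP) never occur, the result is also valid EUC-JP.

```python
def gl_to_gr(data: bytes) -> bytes:
    """Set the high bit of each byte so GL code positions move to GR."""
    return bytes(b | 0x80 for b in data)


# Two JIS X 0208 characters in 7-bit (GL) form: kuten 20-33 and 27-90.
kanji_gl = bytes([0x34, 0x41, 0x3B, 0x7A])

# G0 (IRV) bytes stay in GL, G1 (Kanji) bytes go to GR.
message = b"Kanji: " + gl_to_gr(kanji_gl)

# No SS2 (0x8E) or SS3 (0x8F) appears, so G2 and G3 are never needed.
uses_single_shift = any(b in (0x8E, 0x8F) for b in message)

# A full EUC-JP decoder accepts the same byte string unchanged.
text = message.decode("euc_jp")
```

A receiver that implements only G0 and G1 can therefore read such a stream, while a complete EUC-JP implementation reads it as ordinary EUC-JP.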
}

{
Name:
jisx0208-1997-latin-kanji-7bit
Cversion:
C:bit=7
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters
C:G1=G94n:B@ ## JIS X 0208:1997
Encode:Prepare:
C:GL=undef
C:GR=undef
C:designate:G94:B=-1
C:designate:*:default=-1
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => '@'}]
C:option:{Ginvoke_to_left}=[1,1,1,1]
C:option:{reset}->{Gdesignation}=0
C:option:{reset}->{Ginvoke}=1 ## JIS X 0208:1997 does not have this limitation.
Encode:
=>ucs_to_jisx0201_latin ucs_to_jisx0208_1997
->iso2022:C
Decode:
<-iso2022:C
<=jisx0201_latin_to_ucs jisx0208_1997_to_ucs
Description:
The 7-bit code for Latin and Kanji (JIS X 0208:1997 7.3.1).
}

{
Name:
jisx0208-1997-kanji-latin-7bit
Cversion:
C:bit=7
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters
C:G1=G94n:B@ ## JIS X 0208:1997
Encode:Prepare:
C:GL=undef
C:GR=undef
C:designate:G94:B=-1
C:designate:*:default=-1
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => '@'}]
C:option:{Ginvoke_to_left}=[1,1,1,1]
C:option:{reset}->{Gdesignation}=0
C:option:{reset}->{Ginvoke}=1 ## JIS X 0208:1997 does not have this limitation.
Encode:
=>ucs_to_jisx0201_latin ucs_to_jisx0208_1997
->iso2022:C
Decode:Prepare:
C:GL=G1
Decode:
<-iso2022:C
<=jisx0201_latin_to_ucs jisx0208_1997_to_ucs
Description:
The 7-bit code for Latin and Kanji (JIS X 0208:1997 7.3.1).

This coding system is the same as C<jisx0208-1997-latin-kanji-7bit>
except that it starts with the Kanji set invoked.  See the description of
C<jisx0201-1997-katakana-latin-7bit>.
}

{
Name:
jisx0208-1997-latin-kanji-8bit
Cversion:
C:bit=8
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters
C:G1=G94n:B@ ## JIS X 0208:1997
Encode:Prepare:
C:GL=undef
C:GR=undef
C:designate:G94:B=-1
C:designate:*:default=-1
C:option:{C1invoke_to_right}=1
C:option:{reset}->{Gdesignation}=0
C:option:{reset}->{Ginvoke}=0
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => '@'}]
Encode:
=>ucs_to_jisx0201_latin ucs_to_jisx0208_1997
->iso2022:C
Decode:
<-iso2022:C
<=jisx0201_latin_to_ucs jisx0208_1997_to_ucs
Description:
The 8-bit code for Latin and Kanji (JIS X 0208:1997 7.3.2).
}

{
Name:
jisx0213-2000-kanji-7bit
Cversion:
C:bit=7
C:G0=G94n:O ## JIS X 0213:2000 plane 1
C:G1=G94n:P ## JIS X 0213:2000 plane 2
Encode:Prepare:
C:GR=undef
C:designate:G94:B=-1
C:designate:*:default=-1
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'O'}]
C:option:{Ginvoke_to_left}=[1,1,1,1]
C:option:{reset}->{Gdesignation}=0
C:option:{reset}->{Ginvoke}=1 ## JIS X 0213:2000 does not have this limitation.
Encode:
=>ucs_to_jisx0213_2000_1 ucs_to_jisx0213_2000_2
->iso2022:C
Decode:
<-iso2022:C
<=jisx0213_2000_1_to_ucs jisx0213_2000_2_to_ucs
Description:
The 7-bit code for Kanji (JIS X 0213:2000 7.1.1).
}

{
Name:
jisx0213-2000-kanji-8bit
Cversion:
C:bit=8
C:G0=G94n:O ## JIS X 0213:2000 plane 1
C:G1=G94n:P ## JIS X 0213:2000 plane 2
Encode:Prepare:
C:GR=undef
C:designate:G94:B=-1
C:designate:*:default=-1
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'O'}]
C:option:{C1invoke_to_right}=1
C:option:{reset}->{Gdesignation}=0
C:option:{reset}->{Ginvoke}=0
Encode:
=>ucs_to_jisx0213_2000_1 ucs_to_jisx0213_2000_2
->iso2022:C
Decode:
<-iso2022:C
<=jisx0213_2000_1_to_ucs jisx0213_2000_2_to_ucs
Description:
The 8-bit code for Kanji (JIS X 0213:2000 7.1.2).
}

{
Name:
jisx0213-2000-irv-kanji-7bit
Cversion:
C:bit=7
C:G0=G94:B ## ISO/IEC 646:1991 IRV
C:G1=G94n:O ## JIS X 0213:2000 plane 1
C:G3=G94n:P ## JIS X 0213:2000 plane 2
Encode:Prepare:
C:GR=undef
C:designate:*:default=-1
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'O'}]
C:option:{Ginvoke_to_left}=[1,1,1,1]
C:option:{Ginvoked_by_single_shift}=[0,0,1,1]
C:option:{reset}->{Gdesignation}=0
C:option:{reset}->{Ginvoke}=1 ## JIS X 0213:2000 does not have this limitation.
Encode:
=>ucs_to_ascii ucs_to_jisx0213_2000_1 ucs_to_jisx0213_2000_2
->iso2022:C
Decode:
<-iso2022:C
<=jisx0213_2000_1_to_ucs jisx0213_2000_2_to_ucs
Description:
The 7-bit code for IRV and Kanji (JIS X 0213:2000 7.2.1).
}

{
Name:
jisx0213-2000-irv-kanji-8bit
Cversion:
C:bit=8
C:G0=G94:B ## ISO/IEC 646:1991 IRV
C:G1=G94n:O ## JIS X 0213:2000 plane 1
C:G3=G94n:P ## JIS X 0213:2000 plane 2
Encode:Prepare:
C:GR=undef
C:designate:*:default=-1
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'O'}]
C:option:{Ginvoke_to_left}=[1,0,0,0]
C:option:{Ginvoked_by_single_shift}=[0,0,1,1]
C:option:{C1invoke_to_right}=1
C:option:{reset}->{Gdesignation}=0
C:option:{reset}->{Ginvoke}=1 ## JIS X 0213:2000 does not have this limitation.
Encode:
=>ucs_to_ascii ucs_to_jisx0213_2000_1 ucs_to_jisx0213_2000_2
->iso2022:C
Decode:
<-iso2022:C
<=jisx0213_2000_1_to_ucs jisx0213_2000_2_to_ucs
Description:
The 8-bit code for IRV and Kanji (JIS X 0213:2000 7.2.2).
}

{
Name:
jisx0213-2000-latin-kanji-7bit
Cversion:
C:bit=7
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters
C:G1=G94n:O ## JIS X 0213:2000 plane 1
C:G3=G94n:P ## JIS X 0213:2000 plane 2
Encode:Prepare:
C:GR=undef
C:designate:G94:B=-1
C:designate:*:default=-1
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'O'}]
C:option:{Ginvoke_to_left}=[1,1,1,1]
C:option:{Ginvoked_by_single_shift}=[0,0,1,1]
C:option:{reset}->{Gdesignation}=0
C:option:{reset}->{Ginvoke}=1 ## JIS X 0213:2000 does not have this limitation.
Encode:
=>ucs_to_jisx0201_latin ucs_to_jisx0213_2000_1 ucs_to_jisx0213_2000_2
->iso2022:C
Decode:
<-iso2022:C
<=jisx0201_latin_to_ucs jisx0213_2000_1_to_ucs jisx0213_2000_2_to_ucs
Description:
The 7-bit code for Latin and Kanji (JIS X 0213:2000 7.3.1).
}

{
Name:
jisx0213-2000-latin-kanji-8bit
Cversion:
C:bit=8
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters
C:G1=G94n:O ## JIS X 0213:2000 plane 1
C:G3=G94n:P ## JIS X 0213:2000 plane 2
Encode:Prepare:
C:GR=undef
C:designate:G94:B=-1
C:designate:*:default=-1
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'O'}]
C:option:{Ginvoke_to_left}=[1,0,0,0]
C:option:{Ginvoked_by_single_shift}=[0,0,1,1]
C:option:{C1invoke_to_right}=1
C:option:{reset}->{Gdesignation}=0
C:option:{reset}->{Ginvoke}=1 ## JIS X 0213:2000 does not have this limitation.
Encode:
=>ucs_to_jisx0201_latin ucs_to_jisx0213_2000_1 ucs_to_jisx0213_2000_2
->iso2022:C
Decode:
<-iso2022:C
<=jisx0201_latin_to_ucs jisx0213_2000_1_to_ucs jisx0213_2000_2_to_ucs
Description:
The 8-bit code for Latin and Kanji (JIS X 0213:2000 7.3.2).
}

{
Name:
jisx4001-text-7bit
Cversion:
C:bit=7
C:G0=G94n:B ## JIS X 0208-1983
Encode:Prepare:
C:GR=undef
C:C1=C1:~
C:G0=G0:~
C:designate:G94:B=-1
C:designate:*:default=-1
C:designate:G94:J=0 ## JIS X 0201-1976 Roman set
C:designate:G94n:B=0 ## JIS X 0208-1983
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B'}]
C:option:{reset}->{Gdesignation}='J' ## JIS X 4001 does not have this limitation.
Encode:
=>ucs_to_jisx0201_latin ucs_to_jisx0208_1983
->iso2022:C
Decode:
<-iso2022:C
<=jisx0201_latin_to_ucs jisx0208_1983_to_ucs
Description:
JIS X 4001 text (7-bit code, JIS X 4001-1989 6)
}

{
Name:
jisx4001-text-8bit
Cversion:
C:bit=8
C:G0=G94n:B ## JIS X 0208-1983
Encode:Prepare:
C:GR=undef
C:C1=C1:~
C:G0=G0:~
C:designate:G94:B=-1
C:designate:*:default=-1
C:designate:G94:J=0 ## JIS X 0201-1976 Roman set
C:designate:G94n:B=0 ## JIS X 0208-1983
C:option:{C1invoke_to_right}=1
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B'}]
C:option:{reset}->{Gdesignation}='J' ## JIS X 4001 does not have this limitation.
Encode:
=>ucs_to_jisx0201_latin ucs_to_jisx0208_1983
->iso2022:C
Decode:
<-iso2022:C
<=jisx0201_latin_to_ucs jisx0208_1983_to_ucs
Description:
JIS X 4001 text (8-bit code, JIS X 4001-1989 6)
}

POD:ENCODING:POSTAMBLE:
Note that although other JISes such as JIS X 0212 and JIS X 9010
define ISO/IEC 2022-conforming coded character sets,
these standards do not define complete coding systems (they define
the sets only for use in an ISO/IEC 2022 environment), so this module
does not include those coded character sets.  (IETF RFC 1345
and IANAREG give charset names to coded character sets
consisting of such standards, but those names are defined by RFC 1345,
not by JIS.  Such coded character sets should be implemented
in Encode::ISO2022::RFC1345.)

POD:SEE ALSO:
%%ReferenceJISX0201_1997%%

%%ReferenceJISX0208_1997%%

%%ReferenceJISX0213_2000%%

JIS X 4001-1989, "File Specification for Japanese Documents
interchange (Basic Type)", Japanese Industrial Standards
Committee (JISC) <http://www.jisc.go.jp/>, 1989.

L<Encode::ISO2022::JUNET>, L<Encode::ISO2022::EUCJA>

POD:LICENSE:
Copyright %%YEAR%% Wakaba <w@suika.fam.cx>

%%PerlLicense%%