1 |
#?PESRC/1.0 |
2 |
Name: |
3 |
ISO2022::JIS |
4 |
ShortDescription: |
5 |
The Encode module for 7-bit ISO/IEC 2022 |
6 |
based coding systems defined by JISes |
7 |
Description: |
8 |
This module defines convertion between perl internal |
9 |
representation and 7-bit ISO/IEC 2022 based coding systems] |
10 |
defined in standards of JIS (Japan Industrial Standards). |
11 |
|
12 |
Note that frequently used coding systems of JIS |
13 |
are included in other modules. For instance, |
14 |
C<ISO-2022-JP-3> (defined by JIS X 0213:2000) is |
15 |
included in Encode::ISO2022::JUNET. |
16 |
|
17 |
{ |
18 |
Name: |
19 |
jisx0201-1997-latin-7bit |
20 |
Alias: |
21 |
JIS_C6220-1969-ro iso-ir-14 ir14 jp ISO646-JP 646-jp csISO14JISC6220ro |
22 |
Cversion: |
23 |
C:bit=7 |
24 |
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters |
25 |
C:designate:G94:B=-1 |
26 |
C:designate:*:default=-1 |
27 |
C:option:{undef_char}=["\x3F", {type => 'G94', charset => 'J'}] |
28 |
Encode:Prepare: |
29 |
C:GR=undef |
30 |
C:G1=G96:~ |
31 |
C:option:{reset}->{Gdesignation}=0 |
32 |
C:option:{reset}->{Ginvoke}=0 |
33 |
Encode: |
34 |
=>ucs_to_jisx0201_latin |
35 |
->iso2022:C |
36 |
Decode:Prepare: |
37 |
C:G1=G94:J |
38 |
C:G2=G94:J |
39 |
C:G3=G94:J |
40 |
Decode: |
41 |
<-iso2022:C |
42 |
<=jisx0201_latin_to_ucs |
43 |
Description: |
44 |
The 7-bit code for Latin letters (JIS X 0201:1997 6.1). |
45 |
} |
46 |
|
47 |
{ |
48 |
Name: |
49 |
jisx0201-1997-katakana-7bit |
50 |
Alias: |
51 |
JIS_C6220-1969-jp JIS_C6220-1969 iso-ir-13 ir13 katakana x0201-7 csISO13JISC6220jp |
52 |
Cversion: |
53 |
C:bit=7 |
54 |
C:G0=G94:I ## JIS X 0201:1997 Graphic character set for Katakana |
55 |
C:designate:G94:B=-1 |
56 |
C:designate:*:default=-1 |
57 |
C:option:{undef_char}=["\x25", {type => 'G94', charset => 'I'}] |
58 |
Encode:Prepare: |
59 |
C:GR=undef |
60 |
C:G1=G96:~ |
61 |
C:option:{reset}->{Gdesignation}=0 |
62 |
C:option:{reset}->{Ginvoke}=0 |
63 |
Encode: |
64 |
=>ucs_to_jisx0201_katakana |
65 |
->iso2022:C |
66 |
Decode:Prepare: |
67 |
C:G1=G94:I |
68 |
C:G2=G94:I |
69 |
C:G3=G94:I |
70 |
Decode: |
71 |
<-iso2022:C |
72 |
<=jisx0201_katakana_to_ucs |
73 |
Description: |
74 |
The 7-bit code for Katakana (JIS X 0201:1997 6.2). |
75 |
} |
76 |
|
77 |
{ |
78 |
Name: |
79 |
jisx0201-1997-latin-katakana-7bit |
80 |
Cversion: |
81 |
C:bit=7 |
82 |
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters |
83 |
C:G1=G94:I ## JIS X 0201:1997 Graphic character set for Katakana |
84 |
C:designate:G94:B=-1 |
85 |
C:designate:*:default=-1 |
86 |
C:option:{undef_char}=["\x3F", {type => 'G94', charset => 'J'}] |
87 |
Encode:Prepare: |
88 |
C:GL=undef |
89 |
C:GR=undef |
90 |
C:option:{Ginvoke_to_left}=[1,1,1,1] |
91 |
C:option:{reset}->{Gdesignation}=0 |
92 |
C:option:{reset}->{Ginvoke}=1 ## JIS X 0201:1997 doesn't have this limitation. |
93 |
Encode: |
94 |
=>ucs_to_jisx0201_latin ucs_to_jisx0201_katakana |
95 |
->iso2022:C |
96 |
Decode: |
97 |
<-iso2022:C |
98 |
<=jisx0201_latin_to_ucs jisx0201_katakana_to_ucs |
99 |
Description: |
100 |
The 7-bit code for Latin and Katakana (JIS X 0201:1997 6.3). |
101 |
|
102 |
See also the description of C<jisx0201-1997-katakana-latin-7bit>. |
103 |
} |
104 |
|
105 |
{ |
106 |
Name: |
107 |
jisx0201-1997-katakana-latin-7bit |
108 |
Cversion: |
109 |
C:bit=7 |
110 |
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters |
111 |
C:G1=G94:I ## JIS X 0201:1997 Graphic character set for Katakana |
112 |
C:designate:G94:B=-1 |
113 |
C:designate:*:default=-1 |
114 |
C:option:{undef_char}=["\x3F", {type => 'G94', charset => 'J'}] |
115 |
Encode:Prepare: |
116 |
C:GL=undef |
117 |
C:GR=undef |
118 |
C:option:{Ginvoke_to_left}=[1,1,1,1] |
119 |
C:option:{reset}->{Gdesignation}=0 |
120 |
C:option:{reset}->{Ginvoke}=1 ## JIS X 0201:1997 doesn't have this limitation. |
121 |
Encode: |
122 |
=>ucs_to_jisx0201_latin ucs_to_jisx0201_katakana |
123 |
->iso2022:C |
124 |
Decode:Prepare: |
125 |
C:GL=G1 ## In default, G1=Katakana is invoked |
126 |
Decode: |
127 |
<-iso2022:C |
128 |
<=jisx0201_latin_to_ucs jisx0201_katakana_to_ucs |
129 |
Description: |
130 |
The 7-bit code for Latin and Katakana (JIS X 0201:1997 6.3). |
131 |
|
132 |
JIS X 0201:1997 does not define whether G0 or G1 is |
133 |
invoked to GL at the initial status of information interchange. |
134 |
(But recommends G0=Latin should be invoked.) |
135 |
|
136 |
In this module, C<jisx0201-1997-latin-katakana-7bit> |
137 |
regards that G0=Latin is invoked to GL and |
138 |
C<jisx0201-1997-katakana-latin-7bit> that G1=Katakana |
139 |
is. Note that on encoding, getting rid of this |
140 |
ambiguity, GL is regarded as undefined so that |
141 |
C<SI> or C<SO> is outputed before the first G0/G1 letter |
142 |
in both coding systems. |
143 |
} |
144 |
|
145 |
{ |
146 |
Name: |
147 |
jisx0201-1997-latin-latin-8bit |
148 |
Alias: |
149 |
JIS_X0201 X0201 csHalfWidthKatakana kana8 |
150 |
Cversion: |
151 |
C:bit=8 |
152 |
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters |
153 |
C:G1=G94:I ## JIS X 0201:1997 Graphic character set for Katakana |
154 |
Encode:Prepare: |
155 |
C:designate:*:default=-1 |
156 |
C:designate:G94:B=-1 |
157 |
C:option:{C1invoke_to_right}=1 |
158 |
C:option:{reset}->{Gdesignation}=0 |
159 |
C:option:{reset}->{Ginvoke}=0 |
160 |
C:option:{undef_char}=["\x3F", {type => 'G94', charset => 'J'}] |
161 |
Encode: |
162 |
=>ucs_to_jisx0201_latin ucs_to_jisx0201_katakana |
163 |
->iso2022:C |
164 |
Decode: |
165 |
<-iso2022:C |
166 |
<=jisx0201_latin_to_ucs jisx0201_katakana_to_ucs |
167 |
Description: |
168 |
The 8-bit code for Latin and Katakana (JIS X 0201:1997 6.4). |
169 |
} |
170 |
|
171 |
{ |
172 |
Name: |
173 |
jisx0208-1997-kanji-7bit |
174 |
Cversion: |
175 |
C:bit=7 |
176 |
C:G0=G94n:B@ ## JIS X 0208:1997 |
177 |
Encode:Prepare: |
178 |
C:GR=undef |
179 |
C:G1=G96:~ |
180 |
C:designate:G94:B=-1 |
181 |
C:designate:*:default=-1 |
182 |
C:option:{reset}->{Gdesignation}=0 |
183 |
C:option:{reset}->{Ginvoke}=0 |
184 |
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => '@'}] |
185 |
Encode: |
186 |
=>ucs_to_jisx0208_1997 |
187 |
->iso2022:C |
188 |
Decode: |
189 |
<-iso2022:C |
190 |
<=jisx0208_1997_to_ucs |
191 |
Description: |
192 |
The 7-bit code for Kanji (JIS X 0208:1997 7.1.1). |
193 |
} |
194 |
|
195 |
{ |
196 |
Name: |
197 |
jisx0208-1997-kanji-8bit |
198 |
Cversion: |
199 |
C:bit=8 |
200 |
C:G0=G94n:B@ ## JIS X 0208:1997 |
201 |
Encode:Prepare: |
202 |
C:GR=undef |
203 |
C:G1=G96:~ |
204 |
C:designate:G94:B=-1 |
205 |
C:designate:*:default=-1 |
206 |
C:option:{C1invoke_to_right}=1 |
207 |
C:option:{reset}->{Gdesignation}=0 |
208 |
C:option:{reset}->{Ginvoke}=0 |
209 |
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => '@'}] |
210 |
Encode: |
211 |
=>ucs_to_jisx0208_1997 |
212 |
->iso2022:C |
213 |
Decode: |
214 |
<-iso2022:C |
215 |
<=jisx0208_1997_to_ucs |
216 |
Description: |
217 |
The 8-bit code for Kanji (JIS X 0208:1997 7.1.2). |
218 |
} |
219 |
|
220 |
{ |
221 |
Name: |
222 |
jisx0208-1997-irv-kanji-7bit |
223 |
Cversion: |
224 |
C:bit=7 |
225 |
C:G0=G94:B ## ISO/IEC 646:1991 IRV |
226 |
C:G1=G94n:B@ ## JIS X 0208:1997 |
227 |
Encode:Prepare: |
228 |
C:GL=undef |
229 |
C:GR=undef |
230 |
C:designate:*:default=-1 |
231 |
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => '@'}] |
232 |
C:option:{Ginvoke_to_left}=[1,1,1,1] |
233 |
C:option:{reset}->{Gdesignation}=0 |
234 |
C:option:{reset}->{Ginvoke}=1 ## JIS X 0208:1997 does not have this limitation. |
235 |
Encode: |
236 |
=>ucs_to_ascii ucs_to_jisx0208_1997 |
237 |
->iso2022:C |
238 |
Decode: |
239 |
<-iso2022:C |
240 |
<=jisx0208_1997_to_ucs |
241 |
Description: |
242 |
The 7-bit code for IRV and Kanji (JIS X 0208:1997 7.2.1). |
243 |
} |
244 |
|
245 |
{ |
246 |
Name: |
247 |
jisx0208-1997-kanji-irv-7bit |
248 |
Cversion: |
249 |
C:bit=7 |
250 |
C:G0=G94:B ## ISO/IEC 646:1991 IRV |
251 |
C:G1=G94n:B@ ## JIS X 0208:1997 |
252 |
Encode:Prepare: |
253 |
C:GL=undef |
254 |
C:GR=undef |
255 |
C:designate:*:default=-1 |
256 |
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => '@'}] |
257 |
C:option:{Ginvoke_to_left}=[1,1,1,1] |
258 |
C:option:{reset}->{Gdesignation}=0 |
259 |
C:option:{reset}->{Ginvoke}=1 ## JIS X 0208:1997 does not have this limitation. |
260 |
Encode: |
261 |
=>ucs_to_ascii ucs_to_jisx0208_1997 |
262 |
->iso2022:C |
263 |
Decode:Prepare: |
264 |
C:GL=G1 |
265 |
Decode: |
266 |
<-iso2022:C |
267 |
<=jisx0208_1997_to_ucs |
268 |
Description: |
269 |
The 7-bit code for IRV and Kanji (JIS X 0208:1997 7.2.1). |
270 |
|
271 |
This coding system is same as C<jisx0208-1997-irv-kanji-7bit> |
272 |
but start with Kanji set. See description of |
273 |
C<jisx0201-1997-katakana-latin-7bit>. |
274 |
} |
275 |
|
276 |
{ |
277 |
Name: |
278 |
jisx0208-1997-irv-kanji-8bit |
279 |
Cversion: |
280 |
C:bit=8 |
281 |
C:G0=G94:B ## ISO/IEC 646:1991 IRV |
282 |
C:G1=G94n:B@ ## JIS X 0208:1997 |
283 |
Encode:Prepare: |
284 |
C:GL=undef |
285 |
C:GR=undef |
286 |
C:designate:*:default=-1 |
287 |
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => '@'}] |
288 |
C:option:{C1invoke_to_right}=1 |
289 |
C:option:{reset}->{Gdesignation}=0 |
290 |
C:option:{reset}->{Ginvoke}=0 |
291 |
Encode: |
292 |
=>ucs_to_ascii ucs_to_jisx0208_1997 |
293 |
->iso2022:C |
294 |
Decode: |
295 |
<-iso2022:C |
296 |
<=jisx0208_1997_to_ucs |
297 |
Description: |
298 |
The 8-bit code for IRV and Kanji (JIS X 0208:1997 7.2.2). |
299 |
|
300 |
Note that this coding system is considerable as |
301 |
a subset of C<euc-jp>. For the histrical reason, |
302 |
not a small number of Japanese EUCed applications |
303 |
did not support G2 and G3 sets (and some do not |
304 |
even now). This coding system can be used for |
305 |
information interchanges with such implementions. |
306 |
} |
307 |
|
308 |
{ |
309 |
Name: |
310 |
jisx0208-1997-latin-kanji-7bit |
311 |
Cversion: |
312 |
C:bit=7 |
313 |
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters |
314 |
C:G1=G94n:B@ ## JIS X 0208:1997 |
315 |
Encode:Prepare: |
316 |
C:GL=undef |
317 |
C:GR=undef |
318 |
C:designate:G94:B=-1 |
319 |
C:designate:*:default=-1 |
320 |
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => '@'}] |
321 |
C:option:{Ginvoke_to_left}=[1,1,1,1] |
322 |
C:option:{reset}->{Gdesignation}=0 |
323 |
C:option:{reset}->{Ginvoke}=1 ## JIS X 0208:1997 does not have this limitation. |
324 |
Encode: |
325 |
=>ucs_to_jisx0201_latin ucs_to_jisx0208_1997 |
326 |
->iso2022:C |
327 |
Decode: |
328 |
<-iso2022:C |
329 |
<=jisx0201_latin_to_ucs jisx0208_1997_to_ucs |
330 |
Description: |
331 |
The 7-bit code for Latin and Kanji (JIS X 0208:1997 7.3.1). |
332 |
} |
333 |
|
334 |
{ |
335 |
Name: |
336 |
jisx0208-1997-kanji-latin-7bit |
337 |
Cversion: |
338 |
C:bit=7 |
339 |
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters |
340 |
C:G1=G94n:B@ ## JIS X 0208:1997 |
341 |
Encode:Prepare: |
342 |
C:GL=undef |
343 |
C:GR=undef |
344 |
C:designate:G94:B=-1 |
345 |
C:designate:*:default=-1 |
346 |
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => '@'}] |
347 |
C:option:{Ginvoke_to_left}=[1,1,1,1] |
348 |
C:option:{reset}->{Gdesignation}=0 |
349 |
C:option:{reset}->{Ginvoke}=1 ## JIS X 0208:1997 does not have this limitation. |
350 |
Encode: |
351 |
=>ucs_to_jisx0201_latin ucs_to_jisx0208_1997 |
352 |
->iso2022:C |
353 |
Decode: |
354 |
C:GL=G1 |
355 |
<-iso2022:C |
356 |
<=jisx0201_latin_to_ucs jisx0208_1997_to_ucs |
357 |
Description: |
358 |
The 7-bit code for Latin and Kanji (JIS X 0208:1997 7.3.1). |
359 |
|
360 |
This coding system is same as C<jisx0208-1997-latin-kanji-7bit> |
361 |
but start with Kanji set. See description of |
362 |
C<jisx0201-1997-katakana-latin-7bit>. |
363 |
} |
364 |
|
365 |
{ |
366 |
Name: |
367 |
jisx0208-1997-latin-kanji-8bit |
368 |
Cversion: |
369 |
C:bit=8 |
370 |
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters |
371 |
C:G1=G94n:B@ ## JIS X 0208:1997 |
372 |
Encode:Prepare: |
373 |
C:GL=undef |
374 |
C:GR=undef |
375 |
C:designate:G94:B=-1 |
376 |
C:designate:*:default=-1 |
377 |
C:option:{C1invoke_to_right}=1 |
378 |
C:option:{reset}->{Gdesignation}=0 |
379 |
C:option:{reset}->{Ginvoke}=0 |
380 |
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => '@'}] |
381 |
Encode: |
382 |
=>ucs_to_jisx0201_latin ucs_to_jisx0208_1997 |
383 |
->iso2022:C |
384 |
Decode: |
385 |
<-iso2022:C |
386 |
<=jisx0201_latin_to_ucs jisx0208_1997_to_ucs |
387 |
Description: |
388 |
The 8-bit code for Latin and Kanji (JIS X 0208:1997 7.2.2). |
389 |
} |
390 |
|
391 |
{ |
392 |
Name: |
393 |
jisx0213-2000-kanji-7bit |
394 |
Cversion: |
395 |
C:bit=7 |
396 |
C:G0=G94n:O ## JIS X 0213:2000 plane 1 |
397 |
C:G1=G94n:P ## JIS X 0213:2000 plane 2 |
398 |
Encode:Prepare: |
399 |
C:GR=undef |
400 |
C:designate:G94:B=-1 |
401 |
C:designate:*:default=-1 |
402 |
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'O'}] |
403 |
C:option:{Ginvoke_to_left}=[1,1,1,1] |
404 |
C:option:{reset}->{Gdesignation}=0 |
405 |
C:option:{reset}->{Ginvoke}=1 ## JIS X 0213:2000 does not have this limitation. |
406 |
Encode: |
407 |
=>ucs_to_jisx0213_2000_1 ucs_to_jisx0213_2000_2 |
408 |
->iso2022:C |
409 |
Decode: |
410 |
<-iso2022:C |
411 |
<=jisx0213_2000_1_to_ucs jisx0213_2000_2_to_ucs |
412 |
Description: |
413 |
The 7-bit code for Kanji (JIS X 0213:2000 7.1.1). |
414 |
} |
415 |
|
416 |
{ |
417 |
Name: |
418 |
jisx0213-2000-kanji-8bit |
419 |
Cversion: |
420 |
C:bit=8 |
421 |
C:G0=G94n:O ## JIS X 0213:2000 plane 1 |
422 |
C:G1=G94n:P ## JIS X 0213:2000 plane 2 |
423 |
Encode:Prepare: |
424 |
C:GR=undef |
425 |
C:designate:G94:B=-1 |
426 |
C:designate:*:default=-1 |
427 |
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'O'}] |
428 |
C:option:{C1invoke_to_right}=1 |
429 |
C:option:{reset}->{Gdesignation}=0 |
430 |
C:option:{reset}->{Ginvoke}=0 |
431 |
Encode: |
432 |
=>ucs_to_jisx0213_2000_1 ucs_to_jisx0213_2000_2 |
433 |
->iso2022:C |
434 |
Decode: |
435 |
<-iso2022:C |
436 |
<=jisx0213_2000_1_to_ucs jisx0213_2000_2_to_ucs |
437 |
Description: |
438 |
The 8-bit code for Kanji (JIS X 0213:2000 7.1.2). |
439 |
} |
440 |
|
441 |
{ |
442 |
Name: |
443 |
jisx0213-2000-irv-kanji-7bit |
444 |
Cversion: |
445 |
C:bit=7 |
446 |
C:G0=G94:B ## ISO/IEC 646:1991 IRV |
447 |
C:G1=G94n:O ## JIS X 0213:2000 plane 1 |
448 |
C:G3=G94n:P ## JIS X 0213:2000 plane 2 |
449 |
Encode:Prepare: |
450 |
C:GR=undef |
451 |
C:designate:*:default=-1 |
452 |
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'O'}] |
453 |
C:option:{Ginvoke_to_left}=[1,1,1,1] |
454 |
C:option:{Ginvoked_by_single_shift}=[0,0,1,1] |
455 |
C:option:{reset}->{Gdesignation}=0 |
456 |
C:option:{reset}->{Ginvoke}=1 ## JIS X 0213:2000 does not have this limitation. |
457 |
Encode: |
458 |
=>ucs_to_ascii ucs_to_jisx0213_2000_1 ucs_to_jisx0213_2000_2 |
459 |
->iso2022:C |
460 |
Decode: |
461 |
<-iso2022:C |
462 |
<=jisx0213_2000_1_to_ucs jisx0213_2000_2_to_ucs |
463 |
Description: |
464 |
The 7-bit code for IRV and Kanji (JIS X 0213:2000 7.2.1). |
465 |
} |
466 |
|
467 |
{ |
468 |
Name: |
469 |
jisx0213-2000-irv-kanji-8bit |
470 |
Cversion: |
471 |
C:bit=8 |
472 |
C:G0=G94:B ## ISO/IEC 646:1991 IRV |
473 |
C:G1=G94n:O ## JIS X 0213:2000 plane 1 |
474 |
C:G3=G94n:P ## JIS X 0213:2000 plane 2 |
475 |
Encode:Prepare: |
476 |
C:GR=undef |
477 |
C:designate:*:default=-1 |
478 |
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'O'}] |
479 |
C:option:{Ginvoke_to_left}=[1,0,0,0] |
480 |
C:option:{Ginvoked_by_single_shift}=[0,0,1,1] |
481 |
C:option:{C1invoke_to_right}=1 |
482 |
C:option:{reset}->{Gdesignation}=0 |
483 |
C:option:{reset}->{Ginvoke}=1 ## JIS X 0213:2000 does not have this limitation. |
484 |
Encode: |
485 |
=>ucs_to_ascii ucs_to_jisx0213_2000_1 ucs_to_jisx0213_2000_2 |
486 |
->iso2022:C |
487 |
Decode: |
488 |
<-iso2022:C |
489 |
<=jisx0213_2000_1_to_ucs jisx0213_2000_2_to_ucs |
490 |
Description: |
491 |
The 7-bit code for IRV and Kanji (JIS X 0213:2000 7.2.1). |
492 |
} |
493 |
|
494 |
{ |
495 |
Name: |
496 |
jisx0213-2000-latin-kanji-7bit |
497 |
Cversion: |
498 |
C:bit=7 |
499 |
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters |
500 |
C:G1=G94n:O ## JIS X 0213:2000 plane 1 |
501 |
C:G3=G94n:P ## JIS X 0213:2000 plane 2 |
502 |
Encode:Prepare: |
503 |
C:GR=undef |
504 |
C:designate:G94:B=-1 |
505 |
C:designate:*:default=-1 |
506 |
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'O'}] |
507 |
C:option:{Ginvoke_to_left}=[1,1,1,1] |
508 |
C:option:{Ginvoked_by_single_shift}=[0,0,1,1] |
509 |
C:option:{reset}->{Gdesignation}=0 |
510 |
C:option:{reset}->{Ginvoke}=1 ## JIS X 0213:2000 does not have this limitation. |
511 |
Encode: |
512 |
=>ucs_to_jisx0201_latin ucs_to_jisx0213_2000_1 ucs_to_jisx0213_2000_2 |
513 |
->iso2022:C |
514 |
Decode: |
515 |
<-iso2022:C |
516 |
<=jisx0201_latin_to_ucs jisx0213_2000_1_to_ucs jisx0213_2000_2_to_ucs |
517 |
Description: |
518 |
The 7-bit code for Latin and Kanji (JIS X 0213:2000 7.3.1). |
519 |
} |
520 |
|
521 |
{ |
522 |
Name: |
523 |
jisx0213-2000-latin-kanji-8bit |
524 |
Cversion: |
525 |
C:bit=8 |
526 |
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters |
527 |
C:G1=G94n:O ## JIS X 0213:2000 plane 1 |
528 |
C:G3=G94n:P ## JIS X 0213:2000 plane 2 |
529 |
Encode:Prepare: |
530 |
C:GR=undef |
531 |
C:designate:G94:B=-1 |
532 |
C:designate:*:default=-1 |
533 |
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'O'}] |
534 |
C:option:{Ginvoke_to_left}=[1,0,0,0] |
535 |
C:option:{Ginvoked_by_single_shift}=[0,0,1,1] |
536 |
C:option:{C1invoke_to_right}=1 |
537 |
C:option:{reset}->{Gdesignation}=0 |
538 |
C:option:{reset}->{Ginvoke}=1 ## JIS X 0213:2000 does not have this limitation. |
539 |
Encode: |
540 |
=>ucs_to_jisx0201_latin ucs_to_jisx0213_2000_1 ucs_to_jisx0213_2000_2 |
541 |
->iso2022:C |
542 |
Decode: |
543 |
<-iso2022:C |
544 |
<=jisx0201_latin_to_ucs jisx0213_2000_1_to_ucs jisx0213_2000_2_to_ucs |
545 |
Description: |
546 |
The 8-bit code for Latin and Kanji (JIS X 0213:2000 7.2.2). |
547 |
} |
548 |
|
549 |
{ |
550 |
Name: |
551 |
jisx4001-text-7bit |
552 |
Cversion: |
553 |
C:bit=7 |
554 |
C:G0=G94n:B ## JIS X 0208-1983 |
555 |
Encode:Prepare: |
556 |
C:GR=undef |
557 |
C:C1=C1:~ |
558 |
C:G0=G0:~ |
559 |
C:designate:G94:B=-1 |
560 |
C:designate:*:default=-1 |
561 |
C:designate:G94:J=0 ## JIS X 0201-1976 Roman set |
562 |
C:designate:G94n:B=0 ## JIS X 0208-1983 |
563 |
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B'}] |
564 |
C:option:{reset}->{Gdesignation}='J' ## JIS X 4001 does not have this limitation. |
565 |
Encode: |
566 |
=>ucs_to_jisx0201_latin ucs_to_jisx0208_1983 |
567 |
->iso2022:C |
568 |
Decode: |
569 |
<-iso2022:C |
570 |
<=jisx0201_latin_to_ucs jisx0208_1983_to_ucs |
571 |
Description: |
572 |
JIS X 4001 text (7-bit code, JIS X 4001-1989 6) |
573 |
} |
574 |
|
575 |
{ |
576 |
Name: |
577 |
jisx4001-text-8bit |
578 |
Cversion: |
579 |
C:bit=8 |
580 |
C:G0=G94n:B ## JIS X 0208-1983 |
581 |
Encode:Prepare: |
582 |
C:GR=undef |
583 |
C:C1=C1:~ |
584 |
C:G0=G0:~ |
585 |
C:designate:G94:B=-1 |
586 |
C:designate:*:default=-1 |
587 |
C:designate:G94:J=0 ## JIS X 0201-1976 Roman set |
588 |
C:designate:G94n:B=0 ## JIS X 0208-1983 |
589 |
C:option:{C1invoke_to_right}=1 |
590 |
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B'}] |
591 |
C:option:{reset}->{Gdesignation}='J' ## JIS X 4001 does not have this limitation. |
592 |
Encode: |
593 |
=>ucs_to_jisx0201_latin ucs_to_jisx0208_1983 |
594 |
->iso2022:C |
595 |
Decode: |
596 |
<-iso2022:C |
597 |
<=jisx0201_latin_to_ucs jisx0208_1983_to_ucs |
598 |
Description: |
599 |
JIS X 4001 text (8-bit code, JIS X 4001-1989 6) |
600 |
} |
601 |
|
602 |
POD:ENCODING:POSTAMBLE: |
603 |
Note that although other JISes such as JIS X 0212 and JIS X 9010 |
604 |
define ISO/IEC 2022-comfprming coded character sets, |
605 |
these standards do not define complete coding system (but define |
606 |
as used on ISO/IEC 2022 environment), so this module |
607 |
does not include those coded character sets. (IETF RFC 1345 |
608 |
and IANAREG give charset name to coded character sets |
609 |
consist of such standards. But those are defined by RFC 1345, |
610 |
not by JIS. Such coded character sets should be implemented |
611 |
in Encode::ISO2022::RFC1345.) |
612 |
|
613 |
POD:SEE ALSO: |
614 |
%%ReferenceJISX0201_1997%% |
615 |
|
616 |
%%ReferenceJISX0208_1997%% |
617 |
|
618 |
%%ReferenceJISX0213_2000%% |
619 |
|
620 |
JIS X 4001-1989, "File Specification for Japanese Documents |
621 |
interchange (Basic Type)", Japan Industrial Standarad |
622 |
Committee (JISC) <http://www.jisc.go.jp/>, 1989. |
623 |
|
624 |
L<Encode::ISO2022::JUNET>, L<Encode::ISO2022::EUCJA> |
625 |
|
626 |
POD:LICENSE: |
627 |
Copyright %%YEAR%% Wakaba <w@suika.fam.cx> |
628 |
|
629 |
%%PerlLicense%% |