1 |
wakaba |
1.1 |
#?PESRC/1.0 |
2 |
|
|
Name: |
3 |
|
|
ISO2022::JIS |
4 |
|
|
ShortDescription: |
5 |
|
|
An Encode module of 7-bit ISO/IEC 2022 |
6 |
|
|
based coding systems defined by JISes |
7 |
|
|
Description: |
8 |
|
|
This module defines convertion between perl internal |
9 |
|
|
representation and coding systems defined in standards |
10 |
|
|
of JIS (Japan Industrial Standards). |
11 |
|
|
|
12 |
|
|
Note that frequently used coding systems of JIS |
13 |
|
|
are included in other modules. For instance, |
14 |
|
|
C<ISO-2022-JP-3> (defined by JIS X 0213:2000) is |
15 |
|
|
included in Encode::ISO2022::JUNET. |
16 |
|
|
|
17 |
|
|
{ |
18 |
|
|
Name: |
19 |
|
|
jisx0201-1997-latin-7bit |
20 |
|
|
Alias: |
21 |
|
|
JIS_C6220-1969-ro iso-ir-14 ir14 jp ISO646-JP 646-jp csISO14JISC6220ro |
22 |
|
|
Cversion: |
23 |
|
|
C:bit=7 |
24 |
|
|
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters |
25 |
|
|
C:designate:*:default=-1 |
26 |
|
|
C:option:{undef_char}=["\x3F", {type => 'G94', charset => 'J'}] |
27 |
|
|
Encode: |
28 |
|
|
C:GR=undef |
29 |
|
|
C:G1=G96:~ |
30 |
|
|
C:option:{reset}->{Gdesignation}=0 |
31 |
|
|
C:option:{reset}->{Ginvoke}=0 |
32 |
|
|
=>ucs_to_jisx0201_latin |
33 |
|
|
->iso2022:C |
34 |
|
|
Decode: |
35 |
|
|
C:G1=G94:J |
36 |
|
|
C:G2=G94:J |
37 |
|
|
C:G3=G94:J |
38 |
|
|
<-iso2022:C |
39 |
|
|
<=jisx0201_latin_to_ucs |
40 |
|
|
Description: |
41 |
|
|
The 7-bit code for Latin letters (JIS X 0201:1997 6.1). |
42 |
|
|
} |
43 |
|
|
|
44 |
|
|
{ |
45 |
|
|
Name: |
46 |
|
|
jisx0201-1997-katakana-7bit |
47 |
|
|
Alias: |
48 |
|
|
JIS_C6220-1969-jp JIS_C6220-1969 iso-ir-13 ir13 katakana x0201-7 csISO13JISC6220jp |
49 |
|
|
Cversion: |
50 |
|
|
C:bit=7 |
51 |
|
|
C:G0=G94:I ## JIS X 0201:1997 Graphic character set for Katakana |
52 |
|
|
C:designate:*:default=-1 |
53 |
|
|
C:option:{undef_char}=["\x25", {type => 'G94', charset => 'I'}] |
54 |
|
|
Encode: |
55 |
|
|
C:GR=undef |
56 |
|
|
C:G1=G96:~ |
57 |
|
|
C:option:{reset}->{Gdesignation}=0 |
58 |
|
|
C:option:{reset}->{Ginvoke}=0 |
59 |
|
|
=>ucs_to_jisx0201_katakana |
60 |
|
|
->iso2022:C |
61 |
|
|
Decode: |
62 |
|
|
C:G1=G94:I |
63 |
|
|
C:G2=G94:I |
64 |
|
|
C:G3=G94:I |
65 |
|
|
<-iso2022:C |
66 |
|
|
<=jisx0201_katakana_to_ucs |
67 |
|
|
Description: |
68 |
|
|
The 7-bit code for Katakana (JIS X 0201:1997 6.2). |
69 |
|
|
} |
70 |
|
|
|
71 |
|
|
{ |
72 |
|
|
Name: |
73 |
|
|
jisx0201-1997-latin-katakana-7bit |
74 |
|
|
Cversion: |
75 |
|
|
C:bit=7 |
76 |
|
|
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters |
77 |
|
|
C:G1=G94:I ## JIS X 0201:1997 Graphic character set for Katakana |
78 |
|
|
C:designate:*:default=-1 |
79 |
|
|
C:option:{undef_char}=["\x3F", {type => 'G94', charset => 'J'}] |
80 |
|
|
Encode: |
81 |
|
|
C:GL=undef |
82 |
|
|
C:GR=undef |
83 |
|
|
C:option:{Ginvoke_to_left}=[1,1,1,1] |
84 |
|
|
C:option:{reset}->{Gdesignation}=0 |
85 |
|
|
C:option:{reset}->{Ginvoke}=1 ## JIS X 0201:1997 doesn't have this limitation. |
86 |
|
|
=>ucs_to_jisx0201_latin ucs_to_jisx0201_katakana |
87 |
|
|
->iso2022:C |
88 |
|
|
Decode: |
89 |
|
|
<-iso2022:C |
90 |
|
|
<=jisx0201_latin_to_ucs jisx0201_katakana_to_ucs |
91 |
|
|
Description: |
92 |
|
|
The 7-bit code for Latin and Katakana (JIS X 0201:1997 6.3). |
93 |
|
|
|
94 |
|
|
See also the description of C<jisx0201-1997-katakana-latin-7bit>. |
95 |
|
|
} |
96 |
|
|
|
97 |
|
|
{ |
98 |
|
|
Name: |
99 |
|
|
jisx0201-1997-katakana-latin-7bit |
100 |
|
|
Cversion: |
101 |
|
|
C:bit=7 |
102 |
|
|
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters |
103 |
|
|
C:G1=G94:I ## JIS X 0201:1997 Graphic character set for Katakana |
104 |
|
|
C:designate:*:default=-1 |
105 |
|
|
C:option:{undef_char}=["\x3F", {type => 'G94', charset => 'J'}] |
106 |
|
|
Encode: |
107 |
|
|
C:GL=undef |
108 |
|
|
C:GR=undef |
109 |
|
|
C:option:{Ginvoke_to_left}=[1,1,1,1] |
110 |
|
|
C:option:{reset}->{Gdesignation}=0 |
111 |
|
|
C:option:{reset}->{Ginvoke}=1 ## JIS X 0201:1997 doesn't have this limitation. |
112 |
|
|
=>ucs_to_jisx0201_latin ucs_to_jisx0201_katakana |
113 |
|
|
->iso2022:C |
114 |
|
|
Decode: |
115 |
|
|
C:GL=G1 ## In default, G1=Katakana is invoked |
116 |
|
|
<-iso2022:C |
117 |
|
|
<=jisx0201_latin_to_ucs jisx0201_katakana_to_ucs |
118 |
|
|
Description: |
119 |
|
|
The 7-bit code for Latin and Katakana (JIS X 0201:1997 6.3). |
120 |
|
|
|
121 |
|
|
JIS X 0201:1997 does not define whether G0 or G1 is |
122 |
|
|
invoked to GL at the initial status of information interchange. |
123 |
|
|
(But recommends G0=Latin should be invoked.) |
124 |
|
|
|
125 |
|
|
In this module, C<jisx0201-1997-latin-katakana-7bit> |
126 |
|
|
regards that G0=Latin is invoked to GL and |
127 |
|
|
C<jisx0201-1997-katakana-latin-7bit> that G1=Katakana |
128 |
|
|
is. Note that on encoding, getting rid of this |
129 |
|
|
ambiguity, GL is regarded as undefined so that |
130 |
|
|
C<SI> or C<SO> is outputed before the first G0/G1 letter |
131 |
|
|
in both coding systems. |
132 |
|
|
} |
133 |
|
|
|
134 |
|
|
{ |
135 |
|
|
Name: |
136 |
|
|
jisx0201-1997-latin-latin-8bit |
137 |
|
|
Alias: |
138 |
|
|
JIS_X0201 X0201 csHalfWidthKatakana kana8 |
139 |
|
|
Cversion: |
140 |
|
|
C:bit=8 |
141 |
|
|
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters |
142 |
|
|
C:G1=G94:I ## JIS X 0201:1997 Graphic character set for Katakana |
143 |
|
|
Encode: |
144 |
|
|
C:designate:*:default=-1 |
145 |
|
|
C:option:{C1invoke_to_right}=1 |
146 |
|
|
C:option:{reset}->{Gdesignation}=0 |
147 |
|
|
C:option:{reset}->{Ginvoke}=0 |
148 |
|
|
C:option:{undef_char}=["\x3F", {type => 'G94', charset => 'J'}] |
149 |
|
|
=>ucs_to_jisx0201_latin ucs_to_jisx0201_katakana |
150 |
|
|
->iso2022:C |
151 |
|
|
Decode: |
152 |
|
|
<-iso2022:C |
153 |
|
|
<=jisx0201_latin_to_ucs jisx0201_katakana_to_ucs |
154 |
|
|
Description: |
155 |
|
|
The 8-bit code for Latin and Katakana (JIS X 0201:1997 6.4). |
156 |
|
|
} |
157 |
|
|
|
158 |
|
|
{ |
159 |
|
|
Name: |
160 |
|
|
jisx0208-1997-kanji-7bit |
161 |
|
|
Cversion: |
162 |
|
|
C:bit=7 |
163 |
|
|
C:G0=G94n:B@ ## JIS X 0208:1997 |
164 |
|
|
Encode: |
165 |
|
|
C:GR=undef |
166 |
|
|
C:G1=G96:~ |
167 |
|
|
C:designate:*:default=-1 |
168 |
|
|
C:option:{reset}->{Gdesignation}=0 |
169 |
|
|
C:option:{reset}->{Ginvoke}=0 |
170 |
|
|
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => 'B@'}] |
171 |
|
|
=>ucs_to_jisx0208_1997 |
172 |
|
|
->iso2022:C |
173 |
|
|
Decode: |
174 |
|
|
<-iso2022:C |
175 |
|
|
<=jisx0208_1997_to_ucs |
176 |
|
|
Description: |
177 |
|
|
The 7-bit code for Kanji (JIS X 0208:1997 7.1.1). |
178 |
|
|
} |
179 |
|
|
|
180 |
|
|
{ |
181 |
|
|
Name: |
182 |
|
|
jisx0208-1997-kanji-8bit |
183 |
|
|
Cversion: |
184 |
|
|
C:bit=8 |
185 |
|
|
C:G0=G94n:B@ ## JIS X 0208:1997 |
186 |
|
|
Encode: |
187 |
|
|
C:GR=undef |
188 |
|
|
C:G1=G96:~ |
189 |
|
|
C:designate:*:default=-1 |
190 |
|
|
C:option:{C1invoke_to_right}=1 |
191 |
|
|
C:option:{reset}->{Gdesignation}=0 |
192 |
|
|
C:option:{reset}->{Ginvoke}=0 |
193 |
|
|
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => 'B@'}] |
194 |
|
|
=>ucs_to_jisx0208_1997 |
195 |
|
|
->iso2022:C |
196 |
|
|
Decode: |
197 |
|
|
<-iso2022:C |
198 |
|
|
<=jisx0208_1997_to_ucs |
199 |
|
|
Description: |
200 |
|
|
The 8-bit code for Kanji (JIS X 0208:1997 7.1.2). |
201 |
|
|
} |
202 |
|
|
|
203 |
|
|
{ |
204 |
|
|
Name: |
205 |
|
|
jisx0208-1997-irv-kanji-7bit |
206 |
|
|
Cversion: |
207 |
|
|
C:bit=7 |
208 |
|
|
C:G0=G94:B ## ISO/IEC 646:1991 IRV |
209 |
|
|
C:G1=G94n:B@ ## JIS X 0208:1997 |
210 |
|
|
Encode: |
211 |
|
|
C:GL=undef |
212 |
|
|
C:GR=undef |
213 |
|
|
C:designate:*:default=-1 |
214 |
|
|
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => 'B@'}] |
215 |
|
|
C:option:{Ginvoke_to_left}=[1,1,1,1] |
216 |
|
|
C:option:{reset}->{Gdesignation}=0 |
217 |
|
|
C:option:{reset}->{Ginvoke}=1 ## JIS X 0208:1997 does not have this limitation. |
218 |
|
|
=>ucs_to_ascii ucs_to_jisx0208_1997 |
219 |
|
|
->iso2022:C |
220 |
|
|
Decode: |
221 |
|
|
<-iso2022:C |
222 |
|
|
<=jisx0208_1997_to_ucs |
223 |
|
|
Description: |
224 |
|
|
The 7-bit code for IRV and Kanji (JIS X 0208:1997 7.2.1). |
225 |
|
|
} |
226 |
|
|
|
227 |
|
|
{ |
228 |
|
|
Name: |
229 |
|
|
jisx0208-1997-kanji-irv-7bit |
230 |
|
|
Cversion: |
231 |
|
|
C:bit=7 |
232 |
|
|
C:G0=G94:B ## ISO/IEC 646:1991 IRV |
233 |
|
|
C:G1=G94n:B@ ## JIS X 0208:1997 |
234 |
|
|
Encode: |
235 |
|
|
C:GL=undef |
236 |
|
|
C:GR=undef |
237 |
|
|
C:designate:*:default=-1 |
238 |
|
|
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => 'B@'}] |
239 |
|
|
C:option:{Ginvoke_to_left}=[1,1,1,1] |
240 |
|
|
C:option:{reset}->{Gdesignation}=0 |
241 |
|
|
C:option:{reset}->{Ginvoke}=1 ## JIS X 0208:1997 does not have this limitation. |
242 |
|
|
=>ucs_to_ascii ucs_to_jisx0208_1997 |
243 |
|
|
->iso2022:C |
244 |
|
|
Decode: |
245 |
|
|
C:GL=G1 |
246 |
|
|
<-iso2022:C |
247 |
|
|
<=jisx0208_1997_to_ucs |
248 |
|
|
Description: |
249 |
|
|
The 7-bit code for IRV and Kanji (JIS X 0208:1997 7.2.1). |
250 |
|
|
|
251 |
|
|
This coding system is same as C<jisx0208-1997-irv-kanji-7bit> |
252 |
|
|
but start with Kanji set. See description of |
253 |
|
|
C<jisx0201-1997-katakana-latin-7bit>. |
254 |
|
|
} |
255 |
|
|
|
256 |
|
|
{ |
257 |
|
|
Name: |
258 |
|
|
jisx0208-1997-irv-kanji-8bit |
259 |
|
|
Cversion: |
260 |
|
|
C:bit=8 |
261 |
|
|
C:G0=G94:B ## ISO/IEC 646:1991 IRV |
262 |
|
|
C:G1=G94n:B@ ## JIS X 0208:1997 |
263 |
|
|
Encode: |
264 |
|
|
C:GL=undef |
265 |
|
|
C:GR=undef |
266 |
|
|
C:designate:*:default=-1 |
267 |
|
|
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => 'B@'}] |
268 |
|
|
C:option:{C1invoke_to_right}=1 |
269 |
|
|
C:option:{reset}->{Gdesignation}=0 |
270 |
|
|
C:option:{reset}->{Ginvoke}=0 |
271 |
|
|
=>ucs_to_ascii ucs_to_jisx0208_1997 |
272 |
|
|
->iso2022:C |
273 |
|
|
Decode: |
274 |
|
|
<-iso2022:C |
275 |
|
|
<=jisx0208_1997_to_ucs |
276 |
|
|
Description: |
277 |
|
|
The 8-bit code for IRV and Kanji (JIS X 0208:1997 7.2.2). |
278 |
|
|
|
279 |
|
|
Note that this coding system is considerable as |
280 |
|
|
a subset of C<euc-jp>. For the histrical reason, |
281 |
|
|
not a small number of Japanese EUCed applications |
282 |
|
|
did not support G2 and G3 sets (and some do not |
283 |
|
|
even now). This coding system can be used for |
284 |
|
|
information interchanges with such implementions. |
285 |
|
|
} |
286 |
|
|
|
287 |
|
|
{ |
288 |
|
|
Name: |
289 |
|
|
jisx0208-1997-latin-kanji-7bit |
290 |
|
|
Cversion: |
291 |
|
|
C:bit=7 |
292 |
|
|
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters |
293 |
|
|
C:G1=G94n:B@ ## JIS X 0208:1997 |
294 |
|
|
Encode: |
295 |
|
|
C:GL=undef |
296 |
|
|
C:GR=undef |
297 |
|
|
C:designate:*:default=-1 |
298 |
|
|
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => 'B@'}] |
299 |
|
|
C:option:{Ginvoke_to_left}=[1,1,1,1] |
300 |
|
|
C:option:{reset}->{Gdesignation}=0 |
301 |
|
|
C:option:{reset}->{Ginvoke}=1 ## JIS X 0208:1997 does not have this limitation. |
302 |
|
|
=>ucs_to_jisx0201_latin ucs_to_jisx0208_1997 |
303 |
|
|
->iso2022:C |
304 |
|
|
Decode: |
305 |
|
|
<-iso2022:C |
306 |
|
|
<=jisx0201_latin_to_ucs jisx0208_1997_to_ucs |
307 |
|
|
Description: |
308 |
|
|
The 7-bit code for Latin and Kanji (JIS X 0208:1997 7.3.1). |
309 |
|
|
} |
310 |
|
|
|
311 |
|
|
{ |
312 |
|
|
Name: |
313 |
|
|
jisx0208-1997-kanji-latin-7bit |
314 |
|
|
Cversion: |
315 |
|
|
C:bit=7 |
316 |
|
|
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters |
317 |
|
|
C:G1=G94n:B@ ## JIS X 0208:1997 |
318 |
|
|
Encode: |
319 |
|
|
C:GL=undef |
320 |
|
|
C:GR=undef |
321 |
|
|
C:designate:*:default=-1 |
322 |
|
|
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => 'B@'}] |
323 |
|
|
C:option:{Ginvoke_to_left}=[1,1,1,1] |
324 |
|
|
C:option:{reset}->{Gdesignation}=0 |
325 |
|
|
C:option:{reset}->{Ginvoke}=1 ## JIS X 0208:1997 does not have this limitation. |
326 |
|
|
=>ucs_to_jisx0201_latin ucs_to_jisx0208_1997 |
327 |
|
|
->iso2022:C |
328 |
|
|
Decode: |
329 |
|
|
C:GL=G1 |
330 |
|
|
<-iso2022:C |
331 |
|
|
<=jisx0201_latin_to_ucs jisx0208_1997_to_ucs |
332 |
|
|
Description: |
333 |
|
|
The 7-bit code for Latin and Kanji (JIS X 0208:1997 7.3.1). |
334 |
|
|
|
335 |
|
|
This coding system is same as C<jisx0208-1997-latin-kanji-7bit> |
336 |
|
|
but start with Kanji set. See description of |
337 |
|
|
C<jisx0201-1997-katakana-latin-7bit>. |
338 |
|
|
} |
339 |
|
|
|
340 |
|
|
{ |
341 |
|
|
Name: |
342 |
|
|
jisx0208-1997-latin-kanji-8bit |
343 |
|
|
Cversion: |
344 |
|
|
C:bit=8 |
345 |
|
|
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters |
346 |
|
|
C:G1=G94n:B@ ## JIS X 0208:1997 |
347 |
|
|
Encode: |
348 |
|
|
C:GL=undef |
349 |
|
|
C:GR=undef |
350 |
|
|
C:designate:*:default=-1 |
351 |
|
|
C:option:{C1invoke_to_right}=1 |
352 |
|
|
C:option:{reset}->{Gdesignation}=0 |
353 |
|
|
C:option:{reset}->{Ginvoke}=0 |
354 |
|
|
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => 'B@'}] |
355 |
|
|
=>ucs_to_jisx0201_latin ucs_to_jisx0208_1997 |
356 |
|
|
->iso2022:C |
357 |
|
|
Decode: |
358 |
|
|
<-iso2022:C |
359 |
|
|
<=jisx0201_latin_to_ucs jisx0208_1997_to_ucs |
360 |
|
|
Description: |
361 |
|
|
The 8-bit code for Latin and Kanji (JIS X 0208:1997 7.2.2). |
362 |
|
|
} |
363 |
|
|
|
364 |
|
|
{ |
365 |
|
|
Name: |
366 |
|
|
jisx0213-2000-kanji-7bit |
367 |
|
|
Cversion: |
368 |
|
|
C:bit=7 |
369 |
|
|
C:G0=G94n:O ## JIS X 0213:2000 plane 1 |
370 |
|
|
C:G1=G94n:P ## JIS X 0213:2000 plane 2 |
371 |
|
|
Encode: |
372 |
|
|
C:GR=undef |
373 |
|
|
C:designate:*:default=-1 |
374 |
|
|
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => 'O'}] |
375 |
|
|
C:option:{Ginvoke_to_left}=[1,1,1,1] |
376 |
|
|
C:option:{reset}->{Gdesignation}=0 |
377 |
|
|
C:option:{reset}->{Ginvoke}=1 ## JIS X 0213:2000 does not have this limitation. |
378 |
|
|
=>ucs_to_jisx0213_2000_1 ucs_to_jisx0213_2000_2 |
379 |
|
|
->iso2022:C |
380 |
|
|
Decode: |
381 |
|
|
<-iso2022:C |
382 |
|
|
<=jisx0213_2000_1_to_ucs jisx0213_2000_2_to_ucs |
383 |
|
|
Description: |
384 |
|
|
The 7-bit code for Kanji (JIS X 0213:2000 7.1.1). |
385 |
|
|
} |
386 |
|
|
|
387 |
|
|
{ |
388 |
|
|
Name: |
389 |
|
|
jisx0213-2000-kanji-8bit |
390 |
|
|
Cversion: |
391 |
|
|
C:bit=8 |
392 |
|
|
C:G0=G94n:O ## JIS X 0213:2000 plane 1 |
393 |
|
|
C:G1=G94n:P ## JIS X 0213:2000 plane 2 |
394 |
|
|
Encode: |
395 |
|
|
C:GR=undef |
396 |
|
|
C:designate:*:default=-1 |
397 |
|
|
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => 'O'}] |
398 |
|
|
C:option:{C1invoke_to_right}=1 |
399 |
|
|
C:option:{reset}->{Gdesignation}=0 |
400 |
|
|
C:option:{reset}->{Ginvoke}=0 |
401 |
|
|
=>ucs_to_jisx0213_2000_1 ucs_to_jisx0213_2000_2 |
402 |
|
|
->iso2022:C |
403 |
|
|
Decode: |
404 |
|
|
<-iso2022:C |
405 |
|
|
<=jisx0213_2000_1_to_ucs jisx0213_2000_2_to_ucs |
406 |
|
|
Description: |
407 |
|
|
The 8-bit code for Kanji (JIS X 0213:2000 7.1.2). |
408 |
|
|
} |
409 |
|
|
|
410 |
|
|
{ |
411 |
|
|
Name: |
412 |
|
|
jisx0213-2000-irv-kanji-7bit |
413 |
|
|
Cversion: |
414 |
|
|
C:bit=7 |
415 |
|
|
C:G0=G94:B ## ISO/IEC 646:1991 IRV |
416 |
|
|
C:G1=G94n:O ## JIS X 0213:2000 plane 1 |
417 |
|
|
C:G3=G94n:P ## JIS X 0213:2000 plane 2 |
418 |
|
|
Encode: |
419 |
|
|
C:GR=undef |
420 |
|
|
C:designate:*:default=-1 |
421 |
|
|
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => 'O'}] |
422 |
|
|
C:option:{Ginvoke_to_left}=[1,1,1,1] |
423 |
|
|
C:option:{Ginvoked_by_single_shift}=[0,0,1,1] |
424 |
|
|
C:option:{reset}->{Gdesignation}=0 |
425 |
|
|
C:option:{reset}->{Ginvoke}=1 ## JIS X 0213:2000 does not have this limitation. |
426 |
|
|
=>ucs_to_ascii ucs_to_jisx0213_2000_1 ucs_to_jisx0213_2000_2 |
427 |
|
|
->iso2022:C |
428 |
|
|
Decode: |
429 |
|
|
<-iso2022:C |
430 |
|
|
<=jisx0213_2000_1_to_ucs jisx0213_2000_2_to_ucs |
431 |
|
|
Description: |
432 |
|
|
The 7-bit code for IRV and Kanji (JIS X 0213:2000 7.2.1). |
433 |
|
|
} |
434 |
|
|
|
435 |
|
|
{ |
436 |
|
|
Name: |
437 |
|
|
jisx0213-2000-irv-kanji-8bit |
438 |
|
|
Cversion: |
439 |
|
|
C:bit=8 |
440 |
|
|
C:G0=G94:B ## ISO/IEC 646:1991 IRV |
441 |
|
|
C:G1=G94n:O ## JIS X 0213:2000 plane 1 |
442 |
|
|
C:G3=G94n:P ## JIS X 0213:2000 plane 2 |
443 |
|
|
Encode: |
444 |
|
|
C:GR=undef |
445 |
|
|
C:designate:*:default=-1 |
446 |
|
|
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => 'O'}] |
447 |
|
|
C:option:{Ginvoke_to_left}=[1,0,0,0] |
448 |
|
|
C:option:{Ginvoked_by_single_shift}=[0,0,1,1] |
449 |
|
|
C:option:{C1invoke_to_right}=1 |
450 |
|
|
C:option:{reset}->{Gdesignation}=0 |
451 |
|
|
C:option:{reset}->{Ginvoke}=1 ## JIS X 0213:2000 does not have this limitation. |
452 |
|
|
=>ucs_to_ascii ucs_to_jisx0213_2000_1 ucs_to_jisx0213_2000_2 |
453 |
|
|
->iso2022:C |
454 |
|
|
Decode: |
455 |
|
|
<-iso2022:C |
456 |
|
|
<=jisx0213_2000_1_to_ucs jisx0213_2000_2_to_ucs |
457 |
|
|
Description: |
458 |
|
|
The 7-bit code for IRV and Kanji (JIS X 0213:2000 7.2.1). |
459 |
|
|
} |
460 |
|
|
|
461 |
|
|
{ |
462 |
|
|
Name: |
463 |
|
|
jisx0213-2000-latin-kanji-7bit |
464 |
|
|
Cversion: |
465 |
|
|
C:bit=7 |
466 |
|
|
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters |
467 |
|
|
C:G1=G94n:O ## JIS X 0213:2000 plane 1 |
468 |
|
|
C:G3=G94n:P ## JIS X 0213:2000 plane 2 |
469 |
|
|
Encode: |
470 |
|
|
C:GR=undef |
471 |
|
|
C:designate:*:default=-1 |
472 |
|
|
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => 'O'}] |
473 |
|
|
C:option:{Ginvoke_to_left}=[1,1,1,1] |
474 |
|
|
C:option:{Ginvoked_by_single_shift}=[0,0,1,1] |
475 |
|
|
C:option:{reset}->{Gdesignation}=0 |
476 |
|
|
C:option:{reset}->{Ginvoke}=1 ## JIS X 0213:2000 does not have this limitation. |
477 |
|
|
=>ucs_to_jisx0201_latin ucs_to_jisx0213_2000_1 ucs_to_jisx0213_2000_2 |
478 |
|
|
->iso2022:C |
479 |
|
|
Decode: |
480 |
|
|
<-iso2022:C |
481 |
|
|
<=jisx0201_latin_to_ucs jisx0213_2000_1_to_ucs jisx0213_2000_2_to_ucs |
482 |
|
|
Description: |
483 |
|
|
The 7-bit code for Latin and Kanji (JIS X 0213:2000 7.3.1). |
484 |
|
|
} |
485 |
|
|
|
486 |
|
|
{ |
487 |
|
|
Name: |
488 |
|
|
jisx0213-2000-latin-kanji-8bit |
489 |
|
|
Cversion: |
490 |
|
|
C:bit=8 |
491 |
|
|
C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters |
492 |
|
|
C:G1=G94n:O ## JIS X 0213:2000 plane 1 |
493 |
|
|
C:G3=G94n:P ## JIS X 0213:2000 plane 2 |
494 |
|
|
Encode: |
495 |
|
|
C:GR=undef |
496 |
|
|
C:designate:*:default=-1 |
497 |
|
|
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => 'O'}] |
498 |
|
|
C:option:{Ginvoke_to_left}=[1,0,0,0] |
499 |
|
|
C:option:{Ginvoked_by_single_shift}=[0,0,1,1] |
500 |
|
|
C:option:{C1invoke_to_right}=1 |
501 |
|
|
C:option:{reset}->{Gdesignation}=0 |
502 |
|
|
C:option:{reset}->{Ginvoke}=1 ## JIS X 0213:2000 does not have this limitation. |
503 |
|
|
=>ucs_to_jisx0201_latin ucs_to_jisx0213_2000_1 ucs_to_jisx0213_2000_2 |
504 |
|
|
->iso2022:C |
505 |
|
|
Decode: |
506 |
|
|
<-iso2022:C |
507 |
|
|
<=jisx0201_latin_to_ucs jisx0213_2000_1_to_ucs jisx0213_2000_2_to_ucs |
508 |
|
|
Description: |
509 |
|
|
The 8-bit code for Latin and Kanji (JIS X 0213:2000 7.2.2). |
510 |
|
|
} |
511 |
|
|
|
512 |
|
|
{ |
513 |
|
|
Name: |
514 |
|
|
jisx4001-text-7bit |
515 |
|
|
Cversion: |
516 |
|
|
C:bit=7 |
517 |
|
|
C:G0=G94n:B ## JIS X 0208-1983 |
518 |
|
|
Encode: |
519 |
|
|
C:GR=undef |
520 |
|
|
C:C1=C1:~ |
521 |
|
|
C:G0=G0:~ |
522 |
|
|
C:designate:*:default=-1 |
523 |
|
|
C:designate:G94:J=0 ## JIS X 0201-1976 Roman set |
524 |
|
|
C:designate:G94n:B=0 ## JIS X 0208-1983 |
525 |
|
|
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => 'B'}] |
526 |
|
|
C:option:{reset}->{Gdesignation}='J' ## JIS X 4001 does not have this limitation. |
527 |
|
|
=>ucs_to_jisx0201_latin ucs_to_jisx0208_1983 |
528 |
|
|
->iso2022:C |
529 |
|
|
Decode: |
530 |
|
|
<-iso2022:C |
531 |
|
|
<=jisx0201_latin_to_ucs jisx0208_1983_to_ucs |
532 |
|
|
Description: |
533 |
|
|
JIS X 4001 text (7-bit code, JIS X 4001-1989 6) |
534 |
|
|
} |
535 |
|
|
|
536 |
|
|
{ |
537 |
|
|
Name: |
538 |
|
|
jisx4001-text-8bit |
539 |
|
|
Cversion: |
540 |
|
|
C:bit=8 |
541 |
|
|
C:G0=G94n:B ## JIS X 0208-1983 |
542 |
|
|
Encode: |
543 |
|
|
C:GR=undef |
544 |
|
|
C:C1=C1:~ |
545 |
|
|
C:G0=G0:~ |
546 |
|
|
C:designate:*:default=-1 |
547 |
|
|
C:designate:G94:J=0 ## JIS X 0201-1976 Roman set |
548 |
|
|
C:designate:G94n:B=0 ## JIS X 0208-1983 |
549 |
|
|
C:option:{C1invoke_to_right}=1 |
550 |
|
|
C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => 'B'}] |
551 |
|
|
C:option:{reset}->{Gdesignation}='J' ## JIS X 4001 does not have this limitation. |
552 |
|
|
=>ucs_to_jisx0201_latin ucs_to_jisx0208_1983 |
553 |
|
|
->iso2022:C |
554 |
|
|
Decode: |
555 |
|
|
<-iso2022:C |
556 |
|
|
<=jisx0201_latin_to_ucs jisx0208_1983_to_ucs |
557 |
|
|
Description: |
558 |
|
|
JIS X 4001 text (8-bit code, JIS X 4001-1989 6) |
559 |
|
|
} |
560 |
|
|
|
561 |
|
|
POD:ENCODING:POSTAMBLE: |
562 |
|
|
Note that although other JISes such as JIS X 0212 and JIS X 9010 |
563 |
|
|
define ISO/IEC 2022-comfprming coded character sets, |
564 |
|
|
these standards do not define complete coding system (but define |
565 |
|
|
as used on ISO/IEC 2022 environment), so this module |
566 |
|
|
does not include those coded character sets. (IETF RFC 1345 |
567 |
|
|
and IANAREG give charset name to coded character sets |
568 |
|
|
consist of such standards. But those are defined by RFC 1345, |
569 |
|
|
not by JIS. Such coded character sets should be implemented |
570 |
|
|
in Encode::ISO2022::RFC1345.) |
571 |
|
|
|
572 |
|
|
POD:SEE ALSO: |
573 |
|
|
%%ReferenceJISX0201_1997%% |
574 |
|
|
|
575 |
|
|
%%ReferenceJISX0208_1997%% |
576 |
|
|
|
577 |
|
|
%%ReferenceJISX0213_2000%% |
578 |
|
|
|
579 |
|
|
JIS X 4001-1989, "File Specification for Japanese Documents |
580 |
|
|
interchange (Basic Type)", Japan Industrial Standarad |
581 |
|
|
Committee (JISC) <http://www.jisc.go.jp/>, 1989. |
582 |
|
|
|
583 |
|
|
L<Encode::ISO2022::JUNET>, L<Encode::ISO2022::EUCJA> |
584 |
|
|
|
585 |
|
|
POD:LICENSE: |
586 |
|
|
Copyright %%YEAR%% Wakaba <w@suika.fam.cx> |
587 |
|
|
|
588 |
|
|
%%PerlLicense%% |