/[suikacvs]/perl/lib/Encode/ISO2022/JIS.esr
Suika

Contents of /perl/lib/Encode/ISO2022/JIS.esr

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.2 - (hide annotations) (download)
Sat Dec 14 10:58:42 2002 UTC (21 years, 11 months ago) by wakaba
Branch: MAIN
Changes since 1.1: +82 -41 lines
*** empty log message ***

1 wakaba 1.1 #?PESRC/1.0
2     Name:
3     ISO2022::JIS
4     ShortDescription:
5     An Encode module of 7-bit ISO/IEC 2022
6     based coding systems defined by JISes
7     Description:
8     This module defines convertion between perl internal
9     representation and coding systems defined in standards
10     of JIS (Japan Industrial Standards).
11    
12     Note that frequently used coding systems of JIS
13     are included in other modules. For instance,
14     C<ISO-2022-JP-3> (defined by JIS X 0213:2000) is
15     included in Encode::ISO2022::JUNET.
16    
17     {
18     Name:
19     jisx0201-1997-latin-7bit
20     Alias:
21     JIS_C6220-1969-ro iso-ir-14 ir14 jp ISO646-JP 646-jp csISO14JISC6220ro
22     Cversion:
23     C:bit=7
24     C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters
25 wakaba 1.2 C:designate:G94:B=-1
26 wakaba 1.1 C:designate:*:default=-1
27     C:option:{undef_char}=["\x3F", {type => 'G94', charset => 'J'}]
28 wakaba 1.2 Encode:Prepare:
29 wakaba 1.1 C:GR=undef
30     C:G1=G96:~
31     C:option:{reset}->{Gdesignation}=0
32     C:option:{reset}->{Ginvoke}=0
33 wakaba 1.2 Encode:
34 wakaba 1.1 =>ucs_to_jisx0201_latin
35     ->iso2022:C
36 wakaba 1.2 Decode:Prepare:
37 wakaba 1.1 C:G1=G94:J
38     C:G2=G94:J
39     C:G3=G94:J
40 wakaba 1.2 Decode:
41 wakaba 1.1 <-iso2022:C
42     <=jisx0201_latin_to_ucs
43     Description:
44     The 7-bit code for Latin letters (JIS X 0201:1997 6.1).
45     }
46    
47     {
48     Name:
49     jisx0201-1997-katakana-7bit
50     Alias:
51     JIS_C6220-1969-jp JIS_C6220-1969 iso-ir-13 ir13 katakana x0201-7 csISO13JISC6220jp
52     Cversion:
53     C:bit=7
54     C:G0=G94:I ## JIS X 0201:1997 Graphic character set for Katakana
55 wakaba 1.2 C:designate:G94:B=-1
56 wakaba 1.1 C:designate:*:default=-1
57     C:option:{undef_char}=["\x25", {type => 'G94', charset => 'I'}]
58 wakaba 1.2 Encode:Prepare:
59 wakaba 1.1 C:GR=undef
60     C:G1=G96:~
61     C:option:{reset}->{Gdesignation}=0
62     C:option:{reset}->{Ginvoke}=0
63 wakaba 1.2 Encode:
64 wakaba 1.1 =>ucs_to_jisx0201_katakana
65     ->iso2022:C
66 wakaba 1.2 Decode:Prepare:
67 wakaba 1.1 C:G1=G94:I
68     C:G2=G94:I
69     C:G3=G94:I
70 wakaba 1.2 Decode:
71 wakaba 1.1 <-iso2022:C
72     <=jisx0201_katakana_to_ucs
73     Description:
74     The 7-bit code for Katakana (JIS X 0201:1997 6.2).
75     }
76    
77     {
78     Name:
79     jisx0201-1997-latin-katakana-7bit
80     Cversion:
81     C:bit=7
82     C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters
83     C:G1=G94:I ## JIS X 0201:1997 Graphic character set for Katakana
84 wakaba 1.2 C:designate:G94:B=-1
85 wakaba 1.1 C:designate:*:default=-1
86     C:option:{undef_char}=["\x3F", {type => 'G94', charset => 'J'}]
87 wakaba 1.2 Encode:Prepare:
88 wakaba 1.1 C:GL=undef
89     C:GR=undef
90     C:option:{Ginvoke_to_left}=[1,1,1,1]
91     C:option:{reset}->{Gdesignation}=0
92     C:option:{reset}->{Ginvoke}=1 ## JIS X 0201:1997 doesn't have this limitation.
93 wakaba 1.2 Encode:
94 wakaba 1.1 =>ucs_to_jisx0201_latin ucs_to_jisx0201_katakana
95     ->iso2022:C
96     Decode:
97     <-iso2022:C
98     <=jisx0201_latin_to_ucs jisx0201_katakana_to_ucs
99     Description:
100     The 7-bit code for Latin and Katakana (JIS X 0201:1997 6.3).
101    
102     See also the description of C<jisx0201-1997-katakana-latin-7bit>.
103     }
104    
105     {
106     Name:
107     jisx0201-1997-katakana-latin-7bit
108     Cversion:
109     C:bit=7
110     C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters
111     C:G1=G94:I ## JIS X 0201:1997 Graphic character set for Katakana
112 wakaba 1.2 C:designate:G94:B=-1
113 wakaba 1.1 C:designate:*:default=-1
114     C:option:{undef_char}=["\x3F", {type => 'G94', charset => 'J'}]
115 wakaba 1.2 Encode:Prepare:
116 wakaba 1.1 C:GL=undef
117     C:GR=undef
118     C:option:{Ginvoke_to_left}=[1,1,1,1]
119     C:option:{reset}->{Gdesignation}=0
120     C:option:{reset}->{Ginvoke}=1 ## JIS X 0201:1997 doesn't have this limitation.
121 wakaba 1.2 Encode:
122 wakaba 1.1 =>ucs_to_jisx0201_latin ucs_to_jisx0201_katakana
123     ->iso2022:C
124 wakaba 1.2 Decode:Prepare:
125     C:GL=G1 ## In default, G1=Katakana is invoked
126 wakaba 1.1 Decode:
127     <-iso2022:C
128     <=jisx0201_latin_to_ucs jisx0201_katakana_to_ucs
129     Description:
130     The 7-bit code for Latin and Katakana (JIS X 0201:1997 6.3).
131    
132     JIS X 0201:1997 does not define whether G0 or G1 is
133     invoked to GL at the initial status of information interchange.
134     (But recommends G0=Latin should be invoked.)
135    
136     In this module, C<jisx0201-1997-latin-katakana-7bit>
137     regards that G0=Latin is invoked to GL and
138     C<jisx0201-1997-katakana-latin-7bit> that G1=Katakana
139     is. Note that on encoding, getting rid of this
140     ambiguity, GL is regarded as undefined so that
141     C<SI> or C<SO> is outputed before the first G0/G1 letter
142     in both coding systems.
143     }
144    
145     {
146     Name:
147     jisx0201-1997-latin-latin-8bit
148     Alias:
149     JIS_X0201 X0201 csHalfWidthKatakana kana8
150     Cversion:
151     C:bit=8
152     C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters
153     C:G1=G94:I ## JIS X 0201:1997 Graphic character set for Katakana
154 wakaba 1.2 Encode:Prepare:
155 wakaba 1.1 C:designate:*:default=-1
156 wakaba 1.2 C:designate:G94:B=-1
157 wakaba 1.1 C:option:{C1invoke_to_right}=1
158     C:option:{reset}->{Gdesignation}=0
159     C:option:{reset}->{Ginvoke}=0
160     C:option:{undef_char}=["\x3F", {type => 'G94', charset => 'J'}]
161 wakaba 1.2 Encode:
162 wakaba 1.1 =>ucs_to_jisx0201_latin ucs_to_jisx0201_katakana
163     ->iso2022:C
164     Decode:
165     <-iso2022:C
166     <=jisx0201_latin_to_ucs jisx0201_katakana_to_ucs
167     Description:
168     The 8-bit code for Latin and Katakana (JIS X 0201:1997 6.4).
169     }
170    
171     {
172     Name:
173     jisx0208-1997-kanji-7bit
174     Cversion:
175     C:bit=7
176     C:G0=G94n:B@ ## JIS X 0208:1997
177 wakaba 1.2 Encode:Prepare:
178 wakaba 1.1 C:GR=undef
179     C:G1=G96:~
180 wakaba 1.2 C:designate:G94:B=-1
181 wakaba 1.1 C:designate:*:default=-1
182     C:option:{reset}->{Gdesignation}=0
183     C:option:{reset}->{Ginvoke}=0
184 wakaba 1.2 C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => '@'}]
185     Encode:
186 wakaba 1.1 =>ucs_to_jisx0208_1997
187     ->iso2022:C
188     Decode:
189     <-iso2022:C
190     <=jisx0208_1997_to_ucs
191     Description:
192     The 7-bit code for Kanji (JIS X 0208:1997 7.1.1).
193     }
194    
195     {
196     Name:
197     jisx0208-1997-kanji-8bit
198     Cversion:
199     C:bit=8
200     C:G0=G94n:B@ ## JIS X 0208:1997
201 wakaba 1.2 Encode:Prepare:
202 wakaba 1.1 C:GR=undef
203     C:G1=G96:~
204 wakaba 1.2 C:designate:G94:B=-1
205 wakaba 1.1 C:designate:*:default=-1
206     C:option:{C1invoke_to_right}=1
207     C:option:{reset}->{Gdesignation}=0
208     C:option:{reset}->{Ginvoke}=0
209 wakaba 1.2 C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => '@'}]
210     Encode:
211 wakaba 1.1 =>ucs_to_jisx0208_1997
212     ->iso2022:C
213     Decode:
214     <-iso2022:C
215     <=jisx0208_1997_to_ucs
216     Description:
217     The 8-bit code for Kanji (JIS X 0208:1997 7.1.2).
218     }
219    
220     {
221     Name:
222     jisx0208-1997-irv-kanji-7bit
223     Cversion:
224     C:bit=7
225     C:G0=G94:B ## ISO/IEC 646:1991 IRV
226     C:G1=G94n:B@ ## JIS X 0208:1997
227 wakaba 1.2 Encode:Prepare:
228 wakaba 1.1 C:GL=undef
229     C:GR=undef
230     C:designate:*:default=-1
231 wakaba 1.2 C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => '@'}]
232 wakaba 1.1 C:option:{Ginvoke_to_left}=[1,1,1,1]
233     C:option:{reset}->{Gdesignation}=0
234     C:option:{reset}->{Ginvoke}=1 ## JIS X 0208:1997 does not have this limitation.
235 wakaba 1.2 Encode:
236 wakaba 1.1 =>ucs_to_ascii ucs_to_jisx0208_1997
237     ->iso2022:C
238     Decode:
239     <-iso2022:C
240     <=jisx0208_1997_to_ucs
241     Description:
242     The 7-bit code for IRV and Kanji (JIS X 0208:1997 7.2.1).
243     }
244    
245     {
246     Name:
247     jisx0208-1997-kanji-irv-7bit
248     Cversion:
249     C:bit=7
250     C:G0=G94:B ## ISO/IEC 646:1991 IRV
251     C:G1=G94n:B@ ## JIS X 0208:1997
252 wakaba 1.2 Encode:Prepare:
253 wakaba 1.1 C:GL=undef
254     C:GR=undef
255     C:designate:*:default=-1
256 wakaba 1.2 C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => '@'}]
257 wakaba 1.1 C:option:{Ginvoke_to_left}=[1,1,1,1]
258     C:option:{reset}->{Gdesignation}=0
259     C:option:{reset}->{Ginvoke}=1 ## JIS X 0208:1997 does not have this limitation.
260 wakaba 1.2 Encode:
261 wakaba 1.1 =>ucs_to_ascii ucs_to_jisx0208_1997
262     ->iso2022:C
263 wakaba 1.2 Decode:Prepare:
264     C:GL=G1
265 wakaba 1.1 Decode:
266     <-iso2022:C
267     <=jisx0208_1997_to_ucs
268     Description:
269     The 7-bit code for IRV and Kanji (JIS X 0208:1997 7.2.1).
270    
271     This coding system is same as C<jisx0208-1997-irv-kanji-7bit>
272     but start with Kanji set. See description of
273     C<jisx0201-1997-katakana-latin-7bit>.
274     }
275    
276     {
277     Name:
278     jisx0208-1997-irv-kanji-8bit
279     Cversion:
280     C:bit=8
281     C:G0=G94:B ## ISO/IEC 646:1991 IRV
282     C:G1=G94n:B@ ## JIS X 0208:1997
283 wakaba 1.2 Encode:Prepare:
284 wakaba 1.1 C:GL=undef
285     C:GR=undef
286     C:designate:*:default=-1
287 wakaba 1.2 C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => '@'}]
288 wakaba 1.1 C:option:{C1invoke_to_right}=1
289     C:option:{reset}->{Gdesignation}=0
290     C:option:{reset}->{Ginvoke}=0
291 wakaba 1.2 Encode:
292 wakaba 1.1 =>ucs_to_ascii ucs_to_jisx0208_1997
293     ->iso2022:C
294     Decode:
295     <-iso2022:C
296     <=jisx0208_1997_to_ucs
297     Description:
298     The 8-bit code for IRV and Kanji (JIS X 0208:1997 7.2.2).
299    
300     Note that this coding system is considerable as
301     a subset of C<euc-jp>. For the histrical reason,
302     not a small number of Japanese EUCed applications
303     did not support G2 and G3 sets (and some do not
304     even now). This coding system can be used for
305     information interchanges with such implementions.
306     }
307    
308     {
309     Name:
310     jisx0208-1997-latin-kanji-7bit
311     Cversion:
312     C:bit=7
313     C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters
314     C:G1=G94n:B@ ## JIS X 0208:1997
315 wakaba 1.2 Encode:Prepare:
316 wakaba 1.1 C:GL=undef
317     C:GR=undef
318 wakaba 1.2 C:designate:G94:B=-1
319 wakaba 1.1 C:designate:*:default=-1
320 wakaba 1.2 C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => '@'}]
321 wakaba 1.1 C:option:{Ginvoke_to_left}=[1,1,1,1]
322     C:option:{reset}->{Gdesignation}=0
323     C:option:{reset}->{Ginvoke}=1 ## JIS X 0208:1997 does not have this limitation.
324 wakaba 1.2 Encode:
325 wakaba 1.1 =>ucs_to_jisx0201_latin ucs_to_jisx0208_1997
326     ->iso2022:C
327     Decode:
328     <-iso2022:C
329     <=jisx0201_latin_to_ucs jisx0208_1997_to_ucs
330     Description:
331     The 7-bit code for Latin and Kanji (JIS X 0208:1997 7.3.1).
332     }
333    
334     {
335     Name:
336     jisx0208-1997-kanji-latin-7bit
337     Cversion:
338     C:bit=7
339     C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters
340     C:G1=G94n:B@ ## JIS X 0208:1997
341 wakaba 1.2 Encode:Prepare:
342 wakaba 1.1 C:GL=undef
343     C:GR=undef
344 wakaba 1.2 C:designate:G94:B=-1
345 wakaba 1.1 C:designate:*:default=-1
346 wakaba 1.2 C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => '@'}]
347 wakaba 1.1 C:option:{Ginvoke_to_left}=[1,1,1,1]
348     C:option:{reset}->{Gdesignation}=0
349     C:option:{reset}->{Ginvoke}=1 ## JIS X 0208:1997 does not have this limitation.
350 wakaba 1.2 Encode:
351 wakaba 1.1 =>ucs_to_jisx0201_latin ucs_to_jisx0208_1997
352     ->iso2022:C
353     Decode:
354     C:GL=G1
355     <-iso2022:C
356     <=jisx0201_latin_to_ucs jisx0208_1997_to_ucs
357     Description:
358     The 7-bit code for Latin and Kanji (JIS X 0208:1997 7.3.1).
359    
360     This coding system is same as C<jisx0208-1997-latin-kanji-7bit>
361     but start with Kanji set. See description of
362     C<jisx0201-1997-katakana-latin-7bit>.
363     }
364    
365     {
366     Name:
367     jisx0208-1997-latin-kanji-8bit
368     Cversion:
369     C:bit=8
370     C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters
371     C:G1=G94n:B@ ## JIS X 0208:1997
372 wakaba 1.2 Encode:Prepare:
373 wakaba 1.1 C:GL=undef
374     C:GR=undef
375 wakaba 1.2 C:designate:G94:B=-1
376 wakaba 1.1 C:designate:*:default=-1
377     C:option:{C1invoke_to_right}=1
378     C:option:{reset}->{Gdesignation}=0
379     C:option:{reset}->{Ginvoke}=0
380 wakaba 1.2 C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B', revision => '@'}]
381     Encode:
382 wakaba 1.1 =>ucs_to_jisx0201_latin ucs_to_jisx0208_1997
383     ->iso2022:C
384     Decode:
385     <-iso2022:C
386     <=jisx0201_latin_to_ucs jisx0208_1997_to_ucs
387     Description:
388     The 8-bit code for Latin and Kanji (JIS X 0208:1997 7.2.2).
389     }
390    
391     {
392     Name:
393     jisx0213-2000-kanji-7bit
394     Cversion:
395     C:bit=7
396     C:G0=G94n:O ## JIS X 0213:2000 plane 1
397     C:G1=G94n:P ## JIS X 0213:2000 plane 2
398 wakaba 1.2 Encode:Prepare:
399 wakaba 1.1 C:GR=undef
400 wakaba 1.2 C:designate:G94:B=-1
401 wakaba 1.1 C:designate:*:default=-1
402 wakaba 1.2 C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'O'}]
403 wakaba 1.1 C:option:{Ginvoke_to_left}=[1,1,1,1]
404     C:option:{reset}->{Gdesignation}=0
405     C:option:{reset}->{Ginvoke}=1 ## JIS X 0213:2000 does not have this limitation.
406 wakaba 1.2 Encode:
407 wakaba 1.1 =>ucs_to_jisx0213_2000_1 ucs_to_jisx0213_2000_2
408     ->iso2022:C
409     Decode:
410     <-iso2022:C
411     <=jisx0213_2000_1_to_ucs jisx0213_2000_2_to_ucs
412     Description:
413     The 7-bit code for Kanji (JIS X 0213:2000 7.1.1).
414     }
415    
416     {
417     Name:
418     jisx0213-2000-kanji-8bit
419     Cversion:
420     C:bit=8
421     C:G0=G94n:O ## JIS X 0213:2000 plane 1
422     C:G1=G94n:P ## JIS X 0213:2000 plane 2
423 wakaba 1.2 Encode:Prepare:
424 wakaba 1.1 C:GR=undef
425 wakaba 1.2 C:designate:G94:B=-1
426 wakaba 1.1 C:designate:*:default=-1
427 wakaba 1.2 C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'O'}]
428 wakaba 1.1 C:option:{C1invoke_to_right}=1
429     C:option:{reset}->{Gdesignation}=0
430     C:option:{reset}->{Ginvoke}=0
431 wakaba 1.2 Encode:
432 wakaba 1.1 =>ucs_to_jisx0213_2000_1 ucs_to_jisx0213_2000_2
433     ->iso2022:C
434     Decode:
435     <-iso2022:C
436     <=jisx0213_2000_1_to_ucs jisx0213_2000_2_to_ucs
437     Description:
438     The 8-bit code for Kanji (JIS X 0213:2000 7.1.2).
439     }
440    
441     {
442     Name:
443     jisx0213-2000-irv-kanji-7bit
444     Cversion:
445     C:bit=7
446     C:G0=G94:B ## ISO/IEC 646:1991 IRV
447     C:G1=G94n:O ## JIS X 0213:2000 plane 1
448     C:G3=G94n:P ## JIS X 0213:2000 plane 2
449 wakaba 1.2 Encode:Prepare:
450 wakaba 1.1 C:GR=undef
451     C:designate:*:default=-1
452 wakaba 1.2 C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'O'}]
453 wakaba 1.1 C:option:{Ginvoke_to_left}=[1,1,1,1]
454     C:option:{Ginvoked_by_single_shift}=[0,0,1,1]
455     C:option:{reset}->{Gdesignation}=0
456     C:option:{reset}->{Ginvoke}=1 ## JIS X 0213:2000 does not have this limitation.
457 wakaba 1.2 Encode:
458 wakaba 1.1 =>ucs_to_ascii ucs_to_jisx0213_2000_1 ucs_to_jisx0213_2000_2
459     ->iso2022:C
460     Decode:
461     <-iso2022:C
462     <=jisx0213_2000_1_to_ucs jisx0213_2000_2_to_ucs
463     Description:
464     The 7-bit code for IRV and Kanji (JIS X 0213:2000 7.2.1).
465     }
466    
467     {
468     Name:
469     jisx0213-2000-irv-kanji-8bit
470     Cversion:
471     C:bit=8
472     C:G0=G94:B ## ISO/IEC 646:1991 IRV
473     C:G1=G94n:O ## JIS X 0213:2000 plane 1
474     C:G3=G94n:P ## JIS X 0213:2000 plane 2
475 wakaba 1.2 Encode:Prepare:
476 wakaba 1.1 C:GR=undef
477     C:designate:*:default=-1
478 wakaba 1.2 C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'O'}]
479 wakaba 1.1 C:option:{Ginvoke_to_left}=[1,0,0,0]
480     C:option:{Ginvoked_by_single_shift}=[0,0,1,1]
481     C:option:{C1invoke_to_right}=1
482     C:option:{reset}->{Gdesignation}=0
483     C:option:{reset}->{Ginvoke}=1 ## JIS X 0213:2000 does not have this limitation.
484 wakaba 1.2 Encode:
485 wakaba 1.1 =>ucs_to_ascii ucs_to_jisx0213_2000_1 ucs_to_jisx0213_2000_2
486     ->iso2022:C
487     Decode:
488     <-iso2022:C
489     <=jisx0213_2000_1_to_ucs jisx0213_2000_2_to_ucs
490     Description:
491     The 7-bit code for IRV and Kanji (JIS X 0213:2000 7.2.1).
492     }
493    
494     {
495     Name:
496     jisx0213-2000-latin-kanji-7bit
497     Cversion:
498     C:bit=7
499     C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters
500     C:G1=G94n:O ## JIS X 0213:2000 plane 1
501     C:G3=G94n:P ## JIS X 0213:2000 plane 2
502 wakaba 1.2 Encode:Prepare:
503 wakaba 1.1 C:GR=undef
504 wakaba 1.2 C:designate:G94:B=-1
505 wakaba 1.1 C:designate:*:default=-1
506 wakaba 1.2 C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'O'}]
507 wakaba 1.1 C:option:{Ginvoke_to_left}=[1,1,1,1]
508     C:option:{Ginvoked_by_single_shift}=[0,0,1,1]
509     C:option:{reset}->{Gdesignation}=0
510     C:option:{reset}->{Ginvoke}=1 ## JIS X 0213:2000 does not have this limitation.
511 wakaba 1.2 Encode:
512 wakaba 1.1 =>ucs_to_jisx0201_latin ucs_to_jisx0213_2000_1 ucs_to_jisx0213_2000_2
513     ->iso2022:C
514     Decode:
515     <-iso2022:C
516     <=jisx0201_latin_to_ucs jisx0213_2000_1_to_ucs jisx0213_2000_2_to_ucs
517     Description:
518     The 7-bit code for Latin and Kanji (JIS X 0213:2000 7.3.1).
519     }
520    
521     {
522     Name:
523     jisx0213-2000-latin-kanji-8bit
524     Cversion:
525     C:bit=8
526     C:G0=G94:J ## JIS X 0201:1997 Graphic character set for Latin letters
527     C:G1=G94n:O ## JIS X 0213:2000 plane 1
528     C:G3=G94n:P ## JIS X 0213:2000 plane 2
529 wakaba 1.2 Encode:Prepare:
530 wakaba 1.1 C:GR=undef
531 wakaba 1.2 C:designate:G94:B=-1
532 wakaba 1.1 C:designate:*:default=-1
533 wakaba 1.2 C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'O'}]
534 wakaba 1.1 C:option:{Ginvoke_to_left}=[1,0,0,0]
535     C:option:{Ginvoked_by_single_shift}=[0,0,1,1]
536     C:option:{C1invoke_to_right}=1
537     C:option:{reset}->{Gdesignation}=0
538     C:option:{reset}->{Ginvoke}=1 ## JIS X 0213:2000 does not have this limitation.
539 wakaba 1.2 Encode:
540 wakaba 1.1 =>ucs_to_jisx0201_latin ucs_to_jisx0213_2000_1 ucs_to_jisx0213_2000_2
541     ->iso2022:C
542     Decode:
543     <-iso2022:C
544     <=jisx0201_latin_to_ucs jisx0213_2000_1_to_ucs jisx0213_2000_2_to_ucs
545     Description:
546     The 8-bit code for Latin and Kanji (JIS X 0213:2000 7.2.2).
547     }
548    
549     {
550     Name:
551     jisx4001-text-7bit
552     Cversion:
553     C:bit=7
554     C:G0=G94n:B ## JIS X 0208-1983
555 wakaba 1.2 Encode:Prepare:
556 wakaba 1.1 C:GR=undef
557     C:C1=C1:~
558     C:G0=G0:~
559 wakaba 1.2 C:designate:G94:B=-1
560 wakaba 1.1 C:designate:*:default=-1
561     C:designate:G94:J=0 ## JIS X 0201-1976 Roman set
562     C:designate:G94n:B=0 ## JIS X 0208-1983
563 wakaba 1.2 C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B'}]
564 wakaba 1.1 C:option:{reset}->{Gdesignation}='J' ## JIS X 4001 does not have this limitation.
565 wakaba 1.2 Encode:
566 wakaba 1.1 =>ucs_to_jisx0201_latin ucs_to_jisx0208_1983
567     ->iso2022:C
568     Decode:
569     <-iso2022:C
570     <=jisx0201_latin_to_ucs jisx0208_1983_to_ucs
571     Description:
572     JIS X 4001 text (7-bit code, JIS X 4001-1989 6)
573     }
574    
575     {
576     Name:
577     jisx4001-text-8bit
578     Cversion:
579     C:bit=8
580     C:G0=G94n:B ## JIS X 0208-1983
581 wakaba 1.2 Encode:Prepare:
582 wakaba 1.1 C:GR=undef
583     C:C1=C1:~
584     C:G0=G0:~
585 wakaba 1.2 C:designate:G94:B=-1
586 wakaba 1.1 C:designate:*:default=-1
587     C:designate:G94:J=0 ## JIS X 0201-1976 Roman set
588     C:designate:G94n:B=0 ## JIS X 0208-1983
589     C:option:{C1invoke_to_right}=1
590 wakaba 1.2 C:option:{undef_char}=["\x22\x2E", {type => 'G94n', charset => 'B'}]
591 wakaba 1.1 C:option:{reset}->{Gdesignation}='J' ## JIS X 4001 does not have this limitation.
592 wakaba 1.2 Encode:
593 wakaba 1.1 =>ucs_to_jisx0201_latin ucs_to_jisx0208_1983
594     ->iso2022:C
595     Decode:
596     <-iso2022:C
597     <=jisx0201_latin_to_ucs jisx0208_1983_to_ucs
598     Description:
599     JIS X 4001 text (8-bit code, JIS X 4001-1989 6)
600     }
601    
602     POD:ENCODING:POSTAMBLE:
603     Note that although other JISes such as JIS X 0212 and JIS X 9010
604     define ISO/IEC 2022-comfprming coded character sets,
605     these standards do not define complete coding system (but define
606     as used on ISO/IEC 2022 environment), so this module
607     does not include those coded character sets. (IETF RFC 1345
608     and IANAREG give charset name to coded character sets
609     consist of such standards. But those are defined by RFC 1345,
610     not by JIS. Such coded character sets should be implemented
611     in Encode::ISO2022::RFC1345.)
612    
613     POD:SEE ALSO:
614     %%ReferenceJISX0201_1997%%
615    
616     %%ReferenceJISX0208_1997%%
617    
618     %%ReferenceJISX0213_2000%%
619    
620     JIS X 4001-1989, "File Specification for Japanese Documents
621     interchange (Basic Type)", Japan Industrial Standarad
622     Committee (JISC) <http://www.jisc.go.jp/>, 1989.
623    
624     L<Encode::ISO2022::JUNET>, L<Encode::ISO2022::EUCJA>
625    
626     POD:LICENSE:
627     Copyright %%YEAR%% Wakaba <w@suika.fam.cx>
628    
629     %%PerlLicense%%

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24