Contents of /markup/html/whatpm/Whatpm/Charset/ChangeLog


Revision 1.17
Sun Sep 14 06:32:49 2008 UTC by wakaba
Branch: MAIN
Changes since 1.16: +5 -0 lines
++ whatpm/Whatpm/ChangeLog	14 Sep 2008 06:32:02 -0000
	* HTML.pm.src ($char_onerror): Give the character decoder's |line|
	and |column| a higher priority than those set by the
	tokenizer's input handler.
	($self->{read_until}): Exclude U+FFFD (but this might
	not be necessary, since now we do line/column fixup in
	the character decode handle).

2008-09-14  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/Charset/ChangeLog	14 Sep 2008 06:32:40 -0000
	* DecodeHandle.pm: EUCJP class reimplemented using |read|-centric
	model.

2008-09-14  Wakaba  <wakaba@suika.fam.cx>

1 wakaba 1.14 2008-09-14 Wakaba <wakaba@suika.fam.cx>
2    
3 wakaba 1.17 * DecodeHandle.pm: EUCJP class reimplemented using |read|-centric
4     model.
5    
6     2008-09-14 Wakaba <wakaba@suika.fam.cx>
7    
8 wakaba 1.16 * DecodeHandle.pm: Typo fixed.
9    
10     2008-09-14 Wakaba <wakaba@suika.fam.cx>
11    
12 wakaba 1.15 * DecodeHandle.pm, UnicodeChecker.pm: Change |manakai_getc_until|
13     to |manakai_read_until| to reduce the number of string copies.
14    
15     2008-09-14 Wakaba <wakaba@suika.fam.cx>
16    
17 wakaba 1.14 * DecodeHandle.pm (CharString): New class.
18     (Encode read): Don't remove read string from |{char_buffer}|,
19     to decrease the number of string operations and to enable
20     |manakai_getc_until| ungetc'ing without any string operation.
21     (manakai_getc_until): New method.
22    
23     * UnicodeChecker.pm (getc): Don't |read| more than one
24     character, to prevent characters from being buffered,
25     so that mixing |getc| and |manakai_getc_until|
26     calls does not break the result.
27    
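The buffering change in the entry above (keep consumed text in the buffer so that ungetc needs no string operation) can be illustrated with a minimal Python sketch. This is not the Perl code from DecodeHandle.pm; the class name `CharBuffer` and its methods are hypothetical, and the sketch assumes the whole input is already decoded into a single string.

```python
class CharBuffer:
    """Hypothetical illustration: consume by advancing an offset, not by slicing."""

    def __init__(self, text):
        self.text = text  # decoded characters; never truncated when read
        self.pos = 0      # index of the next unread character

    def getc(self):
        # Return the next character, or None at end of input.
        if self.pos >= len(self.text):
            return None
        ch = self.text[self.pos]
        self.pos += 1
        return ch

    def ungetc(self):
        # Undo the last getc by moving the offset back; no string is rebuilt.
        if self.pos > 0:
            self.pos -= 1

    def read_until(self, stop_chars):
        # Return the run of characters before the first stop character.
        # The stop character itself stays unread.
        start = self.pos
        while self.pos < len(self.text) and self.text[self.pos] not in stop_chars:
            self.pos += 1
        return self.text[start:self.pos]


buf = CharBuffer("abc<def")
run = buf.read_until("<")   # the run before the delimiter
delim = buf.getc()          # the delimiter itself
buf.ungetc()                # push it back without copying any string
```

Because reads only move `pos`, calls to `getc` and `read_until` can be mixed freely and always observe the same position.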
28 wakaba 1.11 2008-09-11 Wakaba <wakaba@suika.fam.cx>
29    
30 wakaba 1.13 * UnicodeChecker.pm, DecodeHandle.pm: Try to reduce the
31     number of string copies and method calls, first round.
32    
33     2008-09-11 Wakaba <wakaba@suika.fam.cx>
34    
35 wakaba 1.12 * UnicodeChecker.pm, DecodeHandle.pm: Tentative support
36     for |read| method.
37    
38     2008-09-11 Wakaba <wakaba@suika.fam.cx>
39    
40 wakaba 1.11 * UnicodeChecker.pm: New module.
41    
42 wakaba 1.10 2008-09-10 Wakaba <wakaba@suika.fam.cx>
43    
44     * DecodeHandle.pm: Set error levels.
45    
46     * WebLatin1.pm: Support for |us-ascii| and |iso-8859-5|
47     charsets (this module is no longer for Latin1, but for Latin*
48     encodings).
49    
50     * WebThai.pm: Support for |tis-620| charset.
51    
52 wakaba 1.7 2008-05-18 Wakaba <wakaba@suika.fam.cx>
53    
54 wakaba 1.9 * WebThai.pm, WebLatin1.pm: New modules.
55    
56     * DecodeHandle.pm: Support for fallback character mappings,
57     used for WebLatin1 and WebThai encodings.
58    
59     2008-05-18 Wakaba <wakaba@suika.fam.cx>
60    
61 wakaba 1.8 * DecodeHandle.pm (ShiftJIS): \xF0-\xFC should be considered
62     as part of double-byte characters for more user-friendly error
63     recovery.
64    
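As a rough illustration of the ShiftJIS entry above: if bytes 0xF0-0xFC are treated as lead bytes, an unassigned two-byte sequence is consumed as one unit, so it can be reported as a single error instead of an error followed by a stray character. The Python sketch below is an assumption-laden simplification, not the DecodeHandle.pm logic; the function name `split_sjis_units` is hypothetical and trail bytes are deliberately not validated.

```python
def split_sjis_units(data: bytes):
    """Split Shift_JIS bytes into one-byte and two-byte units (sketch only)."""
    units = []
    i = 0
    while i < len(data):
        b = data[i]
        # Lead-byte ranges, with 0xF0-0xFC included as the entry describes.
        is_lead = 0x81 <= b <= 0x9F or 0xE0 <= b <= 0xFC
        if is_lead and i + 1 < len(data):
            units.append(data[i:i + 2])  # lead byte plus the following byte
            i += 2
        else:
            units.append(data[i:i + 1])  # single byte (or truncated lead byte)
            i += 1
    return units


units = split_sjis_units(b"A\xF0\x40B")
```

Here the sequence `\xF0\x40` is kept together as one unit, so the `B` that follows is still decoded as an ordinary character.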
65     2008-05-18 Wakaba <wakaba@suika.fam.cx>
66    
67 wakaba 1.7 * DecodeHandle.pm (getc): Ignore BOM if BOM pattern is specified.
68    
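The BOM handling in the entry above might look roughly like the following Python sketch: when a BOM byte pattern is supplied and the input starts with it, the pattern is skipped before decoding. The helper name `skip_bom` is hypothetical, and the real Perl |getc| works on a stream rather than a complete byte string.

```python
import codecs
from typing import Optional


def skip_bom(data: bytes, bom: Optional[bytes]) -> bytes:
    """Drop a leading BOM only when a BOM pattern was specified (sketch only)."""
    if bom and data.startswith(bom):
        return data[len(bom):]
    return data


payload = skip_bom(codecs.BOM_UTF8 + b"abc", codecs.BOM_UTF8)
```

If no pattern is given, the bytes are passed through unchanged, so a BOM-like prefix is left for the decoder to handle.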
69 wakaba 1.6 2008-05-17 Wakaba <wakaba@suika.fam.cx>
70    
71     * DecodeHandle.pm (ByteBuffer): New class.
72    
73 wakaba 1.5 2008-02-10 Wakaba <wakaba@suika.fam.cx>
74    
75     * CharDet.pm (detect): Catch an error in detection function.
76    
77 wakaba 1.4 2007-11-19 Wakaba <wakaba@suika.fam.cx>
78    
79     * Makefile: New file.
80    
81     * UniversalCharDet.pm, UniversalCharDet.pod: New files.
82    
83 wakaba 1.2 2007-07-15 Wakaba <wakaba@suika.fam.cx>
84    
85 wakaba 1.3 * DecodeHandle.pm (create_decode_handle): Set canonical
86     or specified name to |input_encoding| attribute.
87     (uri_to_name): Reimplemented.
88     (input_encoding): Return charset name returned
89     by |uri_to_name| if available.
90     ($CharsetDef): Property |xml_name| now contains
91     only names defined in XML specifications.
92    
93     2007-07-15 Wakaba <wakaba@suika.fam.cx>
94    
95 wakaba 1.2 * DecodeHandle.pm: New Perl module (created
96     from manakai's |Encode.dis|).
97    
98     2007-07-15 Wakaba <wakaba@suika.fam.cx>
99    
100     * ChangeLog: New file.
101    
