/[suikacvs]/markup/html/whatpm/Whatpm/Charset/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/Charset/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.22 - (hide annotations) (download)
Mon Sep 15 07:19:03 2008 UTC (16 years, 8 months ago) by wakaba
Branch: MAIN
Changes since 1.21: +8 -0 lines
++ whatpm/Whatpm/ChangeLog	15 Sep 2008 07:17:34 -0000
	* HTML.pm.src: Remove checking for control character, surrogate
	pair, or noncharacter code points and non-Unicode code
	points (they should be handled by Whatpm::Charset::UnicodeChecker).
	(parse_char_stream): Support for the |$get_wrapper| argument and
	character stream error handlers.

2008-09-15  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/Charset/ChangeLog	15 Sep 2008 07:18:45 -0000
	* DecodeHandle.pm (onerror): Return |undef| if no explicit value
	is set.

	* UnicodeChecker.pm: Support for HTML5 parse errors.
	(onerror): Return |undef| if no explicit value is set.

2008-09-15  Wakaba  <wakaba@suika.fam.cx>

1 wakaba 1.21 2008-09-15 Wakaba <wakaba@suika.fam.cx>
2    
3 wakaba 1.22 * DecodeHandle.pm (onerror): Return |undef| if no explicit value
4     is set.
5    
6     * UnicodeChecker.pm: Support for HTML5 parse errors.
7     (onerror): Return |undef| if no explicit value is set.
8    
9     2008-09-15 Wakaba <wakaba@suika.fam.cx>
10    
11 wakaba 1.21 * UnicodeChecker.pm: Use hash for better performance.
12    
13 wakaba 1.14 2008-09-14 Wakaba <wakaba@suika.fam.cx>
14    
15 wakaba 1.20 * DecodeHandle.pm (CharString onerror): New method.
16    
17     * UnicodeString.pm (read): New.
18     (getc): Removed.
19     (manakai_read_until): Checking operation implemented.
20    
21     2008-09-14 Wakaba <wakaba@suika.fam.cx>
22    
23 wakaba 1.19 * DecodeHandle.pm: Merge the ShiftJIS class into the Encode class.
24    
25     2008-09-14 Wakaba <wakaba@suika.fam.cx>
26    
27 wakaba 1.18 * DecodeHandle.pm: Merge the EUCJP class into the Encode class.
28    
29     2008-09-14 Wakaba <wakaba@suika.fam.cx>
30    
31 wakaba 1.17 * DecodeHandle.pm: EUCJP class reimplemented using |read|-centric
32     model.
33    
34     2008-09-14 Wakaba <wakaba@suika.fam.cx>
35    
36 wakaba 1.16 * DecodeHandle.pm: Typo fixed.
37    
38     2008-09-14 Wakaba <wakaba@suika.fam.cx>
39    
40 wakaba 1.15 * DecodeHandle.pm, UnicodeChecker.pm: Change |manakai_getc_until|
41     to |manakai_read_until| to reduce the number of string copies.
42    
43     2008-09-14 Wakaba <wakaba@suika.fam.cx>
44    
45 wakaba 1.14 * DecodeHandle.pm (CharString): New class.
46     (Encode read): Don't remove read string from |{char_buffer}|,
47     to decease the number of string operations and to enable
48     |manakai_getc_until| ungetc'ing without any string operation.
49     (manakai_getc_until): New method.
50    
51     * UnicodeChecker.pm (getc): Don't |read| more than one
52     character, to prevent characters being bufferred
53     such that mixture of |getc| and |manakai_getc_until|
54     calls does not make the result broken.
55    
56 wakaba 1.11 2008-09-11 Wakaba <wakaba@suika.fam.cx>
57    
58 wakaba 1.13 * UnicodeChecker.pm, DecodeHandle.pm: Try to reduce the
59     number of string copies and method calls, first round.
60    
61     2008-09-11 Wakaba <wakaba@suika.fam.cx>
62    
63 wakaba 1.12 * UnicodeChecker.pm, DecodeHandle.pm: Tentative support
64     for |read| method.
65    
66     2008-09-11 Wakaba <wakaba@suika.fam.cx>
67    
68 wakaba 1.11 * UnicodeChecker.pm: New module.
69    
70 wakaba 1.10 2008-09-10 Wakaba <wakaba@suika.fam.cx>
71    
72     * DecodeHandle.pm: Set error levels.
73    
74     * WebLatin1.pm: Support for |us-ascii| and |iso-8859-5|
75     charsets (this module no longer for Latin1, but for Latin*
76     encodings).
77    
78     * WebThai.pm: Support for |tis-620| charset.
79    
80 wakaba 1.7 2008-05-18 Wakaba <wakaba@suika.fam.cx>
81    
82 wakaba 1.9 * WebThai.pm, WebLatin1.pm: New modules.
83    
84     * DecodeHandle.pm: Support for fallback character mappings,
85     used for WebLatin1 and WebThai encodings.
86    
87     2008-05-18 Wakaba <wakaba@suika.fam.cx>
88    
89 wakaba 1.8 * DecodeHandle.pm (ShiftJIS): \xF0-\xFC should be considered
90     as part of double-byte characters for more user-friendly error
91     recovery.
92    
93     2008-05-18 Wakaba <wakaba@suika.fam.cx>
94    
95 wakaba 1.7 * DecodeHandle.pm (getc): Ignore BOM if BOM pattern is specified.
96    
97 wakaba 1.6 2008-05-17 Wakaba <wakaba@suika.fam.cx>
98    
99     * DecodeHandle.pm (ByteBuffer): New class.
100    
101 wakaba 1.5 2008-02-10 Wakaba <wakaba@suika.fam.cx>
102    
103     * CharDet.pm (detect): Catch an error in detection function.
104    
105 wakaba 1.4 2007-11-19 Wakaba <wakaba@suika.fam.cx>
106    
107     * Makefile: New file.
108    
109     * UniversalCharDet.pm, UniversalCharDet.pod: New files.
110    
111 wakaba 1.2 2007-07-15 Wakaba <wakaba@suika.fam.cx>
112    
113 wakaba 1.3 * DecodeHandle.pm (create_decode_handle): Set canonical
114     or specified name to |input_encoding| attribute.
115     (uri_to_name): Reimplemented.
116     (input_encoding): Return charset name returned
117     by |uri_to_name| if available.
118     ($CharsetDef): Property |xml_name| is now contain
119     only name defined in XML specifications.
120    
121     2007-07-15 Wakaba <wakaba@suika.fam.cx>
122    
123 wakaba 1.2 * DecodeHandle.pm: New Perl module (created
124     from manakai's |Encode.dis|).
125    
126     2007-07-15 Wakaba <wakaba@suika.fam.cx>
127    
128     * ChangeLog: New file.
129    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24