/[suikacvs]/markup/html/whatpm/Whatpm/Charset/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/Charset/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.14 - (hide annotations) (download)
Sun Sep 14 01:51:08 2008 UTC (16 years, 10 months ago) by wakaba
Branch: MAIN
Changes since 1.13: +13 -0 lines
++ whatpm/Whatpm/ChangeLog	14 Sep 2008 01:47:27 -0000
2008-09-14  Wakaba  <wakaba@suika.fam.cx>

	* HTML.pm.src (parse_char_string): Use newly created
	|Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
	standard feature to |open| a string as a filehandle,
	since Perl's string filehandle seems not supporting |ungetc|
	method correctly.
	(parse_char_stream): Define |{getc_until}| method.
	(DATA_STATE): Experimental support for |getc_until| feature.

++ whatpm/Whatpm/Charset/ChangeLog	14 Sep 2008 01:50:52 -0000
2008-09-14  Wakaba  <wakaba@suika.fam.cx>

	* DecodeHandle.pm (CharString): New class.
	(Encode read): Don't remove read string from |{char_buffer}|,
	to decease the number of string operations and to enable
	|manakai_getc_until| ungetc'ing without any string operation.
	(manakai_getc_until): New method.

	* UnicodeChecker.pm (getc): Don't |read| more than one
	character, to prevent characters being bufferred
	such that mixture of |getc| and |manakai_getc_until|
	calls does not make the result broken.

1 wakaba 1.14 2008-09-14 Wakaba <wakaba@suika.fam.cx>
2    
3     * DecodeHandle.pm (CharString): New class.
4     (Encode read): Don't remove read string from |{char_buffer}|,
5     to decease the number of string operations and to enable
6     |manakai_getc_until| ungetc'ing without any string operation.
7     (manakai_getc_until): New method.
8    
9     * UnicodeChecker.pm (getc): Don't |read| more than one
10     character, to prevent characters being bufferred
11     such that mixture of |getc| and |manakai_getc_until|
12     calls does not make the result broken.
13    
14 wakaba 1.11 2008-09-11 Wakaba <wakaba@suika.fam.cx>
15    
16 wakaba 1.13 * UnicodeChecker.pm, DecodeHandle.pm: Try to reduce the
17     number of string copies and method calls, first round.
18    
19     2008-09-11 Wakaba <wakaba@suika.fam.cx>
20    
21 wakaba 1.12 * UnicodeChecker.pm, DecodeHandle.pm: Tentative support
22     for |read| method.
23    
24     2008-09-11 Wakaba <wakaba@suika.fam.cx>
25    
26 wakaba 1.11 * UnicodeChecker.pm: New module.
27    
28 wakaba 1.10 2008-09-10 Wakaba <wakaba@suika.fam.cx>
29    
30     * DecodeHandle.pm: Set error levels.
31    
32     * WebLatin1.pm: Support for |us-ascii| and |iso-8859-5|
33     charsets (this module no longer for Latin1, but for Latin*
34     encodings).
35    
36     * WebThai.pm: Support for |tis-620| charset.
37    
38 wakaba 1.7 2008-05-18 Wakaba <wakaba@suika.fam.cx>
39    
40 wakaba 1.9 * WebThai.pm, WebLatin1.pm: New modules.
41    
42     * DecodeHandle.pm: Support for fallback character mappings,
43     used for WebLatin1 and WebThai encodings.
44    
45     2008-05-18 Wakaba <wakaba@suika.fam.cx>
46    
47 wakaba 1.8 * DecodeHandle.pm (ShiftJIS): \xF0-\xFC should be considered
48     as part of double-byte characters for more user-friendly error
49     recovery.
50    
51     2008-05-18 Wakaba <wakaba@suika.fam.cx>
52    
53 wakaba 1.7 * DecodeHandle.pm (getc): Ignore BOM if BOM pattern is specified.
54    
55 wakaba 1.6 2008-05-17 Wakaba <wakaba@suika.fam.cx>
56    
57     * DecodeHandle.pm (ByteBuffer): New class.
58    
59 wakaba 1.5 2008-02-10 Wakaba <wakaba@suika.fam.cx>
60    
61     * CharDet.pm (detect): Catch an error in detection function.
62    
63 wakaba 1.4 2007-11-19 Wakaba <wakaba@suika.fam.cx>
64    
65     * Makefile: New file.
66    
67     * UniversalCharDet.pm, UniversalCharDet.pod: New files.
68    
69 wakaba 1.2 2007-07-15 Wakaba <wakaba@suika.fam.cx>
70    
71 wakaba 1.3 * DecodeHandle.pm (create_decode_handle): Set canonical
72     or specified name to |input_encoding| attribute.
73     (uri_to_name): Reimplemented.
74     (input_encoding): Return charset name returned
75     by |uri_to_name| if available.
76     ($CharsetDef): Property |xml_name| is now contain
77     only name defined in XML specifications.
78    
79     2007-07-15 Wakaba <wakaba@suika.fam.cx>
80    
81 wakaba 1.2 * DecodeHandle.pm: New Perl module (created
82     from manakai's |Encode.dis|).
83    
84     2007-07-15 Wakaba <wakaba@suika.fam.cx>
85    
86     * ChangeLog: New file.
87    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24