/[suikacvs]/markup/html/whatpm/Whatpm/Charset/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/Charset/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.14 - (show annotations) (download)
Sun Sep 14 01:51:08 2008 UTC (16 years, 8 months ago) by wakaba
Branch: MAIN
Changes since 1.13: +13 -0 lines
++ whatpm/Whatpm/ChangeLog	14 Sep 2008 01:47:27 -0000
2008-09-14  Wakaba  <wakaba@suika.fam.cx>

	* HTML.pm.src (parse_char_string): Use newly created
	|Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
	standard feature to |open| a string as a filehandle,
	since Perl's string filehandle seems not supporting |ungetc|
	method correctly.
	(parse_char_stream): Define |{getc_until}| method.
	(DATA_STATE): Experimental support for |getc_until| feature.

++ whatpm/Whatpm/Charset/ChangeLog	14 Sep 2008 01:50:52 -0000
2008-09-14  Wakaba  <wakaba@suika.fam.cx>

	* DecodeHandle.pm (CharString): New class.
	(Encode read): Don't remove read string from |{char_buffer}|,
	to decease the number of string operations and to enable
	|manakai_getc_until| ungetc'ing without any string operation.
	(manakai_getc_until): New method.

	* UnicodeChecker.pm (getc): Don't |read| more than one
	character, to prevent characters being bufferred
	such that mixture of |getc| and |manakai_getc_until|
	calls does not make the result broken.

1 2008-09-14 Wakaba <wakaba@suika.fam.cx>
2
3 * DecodeHandle.pm (CharString): New class.
4 (Encode read): Don't remove read string from |{char_buffer}|,
5 to decease the number of string operations and to enable
6 |manakai_getc_until| ungetc'ing without any string operation.
7 (manakai_getc_until): New method.
8
9 * UnicodeChecker.pm (getc): Don't |read| more than one
10 character, to prevent characters being bufferred
11 such that mixture of |getc| and |manakai_getc_until|
12 calls does not make the result broken.
13
14 2008-09-11 Wakaba <wakaba@suika.fam.cx>
15
16 * UnicodeChecker.pm, DecodeHandle.pm: Try to reduce the
17 number of string copies and method calls, first round.
18
19 2008-09-11 Wakaba <wakaba@suika.fam.cx>
20
21 * UnicodeChecker.pm, DecodeHandle.pm: Tentative support
22 for |read| method.
23
24 2008-09-11 Wakaba <wakaba@suika.fam.cx>
25
26 * UnicodeChecker.pm: New module.
27
28 2008-09-10 Wakaba <wakaba@suika.fam.cx>
29
30 * DecodeHandle.pm: Set error levels.
31
32 * WebLatin1.pm: Support for |us-ascii| and |iso-8859-5|
33 charsets (this module no longer for Latin1, but for Latin*
34 encodings).
35
36 * WebThai.pm: Support for |tis-620| charset.
37
38 2008-05-18 Wakaba <wakaba@suika.fam.cx>
39
40 * WebThai.pm, WebLatin1.pm: New modules.
41
42 * DecodeHandle.pm: Support for fallback character mappings,
43 used for WebLatin1 and WebThai encodings.
44
45 2008-05-18 Wakaba <wakaba@suika.fam.cx>
46
47 * DecodeHandle.pm (ShiftJIS): \xF0-\xFC should be considered
48 as part of double-byte characters for more user-friendly error
49 recovery.
50
51 2008-05-18 Wakaba <wakaba@suika.fam.cx>
52
53 * DecodeHandle.pm (getc): Ignore BOM if BOM pattern is specified.
54
55 2008-05-17 Wakaba <wakaba@suika.fam.cx>
56
57 * DecodeHandle.pm (ByteBuffer): New class.
58
59 2008-02-10 Wakaba <wakaba@suika.fam.cx>
60
61 * CharDet.pm (detect): Catch an error in detection function.
62
63 2007-11-19 Wakaba <wakaba@suika.fam.cx>
64
65 * Makefile: New file.
66
67 * UniversalCharDet.pm, UniversalCharDet.pod: New files.
68
69 2007-07-15 Wakaba <wakaba@suika.fam.cx>
70
71 * DecodeHandle.pm (create_decode_handle): Set canonical
72 or specified name to |input_encoding| attribute.
73 (uri_to_name): Reimplemented.
74 (input_encoding): Return charset name returned
75 by |uri_to_name| if available.
76 ($CharsetDef): Property |xml_name| is now contain
77 only name defined in XML specifications.
78
79 2007-07-15 Wakaba <wakaba@suika.fam.cx>
80
81 * DecodeHandle.pm: New Perl module (created
82 from manakai's |Encode.dis|).
83
84 2007-07-15 Wakaba <wakaba@suika.fam.cx>
85
86 * ChangeLog: New file.
87

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24