/[suikacvs]/markup/html/whatpm/Whatpm/Charset/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/Charset/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.17 - (show annotations) (download)
Sun Sep 14 06:32:49 2008 UTC (16 years, 10 months ago) by wakaba
Branch: MAIN
Changes since 1.16: +5 -0 lines
++ whatpm/Whatpm/ChangeLog	14 Sep 2008 06:32:02 -0000
	* HTML.pm.src ($char_onerror): Have character decoder's |line|
	and |column| a higher priority than the one set by the
	tokenizer's input handler.
	($self->{read_until}): Exclude U+FFFD (but this might
	not be necessary, since now we do line/column fixup in
	the character decode handle).

2008-09-14  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/Charset/ChangeLog	14 Sep 2008 06:32:40 -0000
	* DecodeHandle.pm: EUCJP class reimplemented using |read|-centric
	model.

2008-09-14  Wakaba  <wakaba@suika.fam.cx>

1 2008-09-14 Wakaba <wakaba@suika.fam.cx>
2
3 * DecodeHandle.pm: EUCJP class reimplemented using |read|-centric
4 model.
5
6 2008-09-14 Wakaba <wakaba@suika.fam.cx>
7
8 * DecodeHandle.pm: Typo fixed.
9
10 2008-09-14 Wakaba <wakaba@suika.fam.cx>
11
12 * DecodeHandle.pm, UnicodeChecker.pm: Change |manakai_getc_until|
13 to |manakai_read_until| to reduce the number of string copies.
14
15 2008-09-14 Wakaba <wakaba@suika.fam.cx>
16
17 * DecodeHandle.pm (CharString): New class.
18 (Encode read): Don't remove read string from |{char_buffer}|,
19 to decease the number of string operations and to enable
20 |manakai_getc_until| ungetc'ing without any string operation.
21 (manakai_getc_until): New method.
22
23 * UnicodeChecker.pm (getc): Don't |read| more than one
24 character, to prevent characters being bufferred
25 such that mixture of |getc| and |manakai_getc_until|
26 calls does not make the result broken.
27
28 2008-09-11 Wakaba <wakaba@suika.fam.cx>
29
30 * UnicodeChecker.pm, DecodeHandle.pm: Try to reduce the
31 number of string copies and method calls, first round.
32
33 2008-09-11 Wakaba <wakaba@suika.fam.cx>
34
35 * UnicodeChecker.pm, DecodeHandle.pm: Tentative support
36 for |read| method.
37
38 2008-09-11 Wakaba <wakaba@suika.fam.cx>
39
40 * UnicodeChecker.pm: New module.
41
42 2008-09-10 Wakaba <wakaba@suika.fam.cx>
43
44 * DecodeHandle.pm: Set error levels.
45
46 * WebLatin1.pm: Support for |us-ascii| and |iso-8859-5|
47 charsets (this module no longer for Latin1, but for Latin*
48 encodings).
49
50 * WebThai.pm: Support for |tis-620| charset.
51
52 2008-05-18 Wakaba <wakaba@suika.fam.cx>
53
54 * WebThai.pm, WebLatin1.pm: New modules.
55
56 * DecodeHandle.pm: Support for fallback character mappings,
57 used for WebLatin1 and WebThai encodings.
58
59 2008-05-18 Wakaba <wakaba@suika.fam.cx>
60
61 * DecodeHandle.pm (ShiftJIS): \xF0-\xFC should be considered
62 as part of double-byte characters for more user-friendly error
63 recovery.
64
65 2008-05-18 Wakaba <wakaba@suika.fam.cx>
66
67 * DecodeHandle.pm (getc): Ignore BOM if BOM pattern is specified.
68
69 2008-05-17 Wakaba <wakaba@suika.fam.cx>
70
71 * DecodeHandle.pm (ByteBuffer): New class.
72
73 2008-02-10 Wakaba <wakaba@suika.fam.cx>
74
75 * CharDet.pm (detect): Catch an error in detection function.
76
77 2007-11-19 Wakaba <wakaba@suika.fam.cx>
78
79 * Makefile: New file.
80
81 * UniversalCharDet.pm, UniversalCharDet.pod: New files.
82
83 2007-07-15 Wakaba <wakaba@suika.fam.cx>
84
85 * DecodeHandle.pm (create_decode_handle): Set canonical
86 or specified name to |input_encoding| attribute.
87 (uri_to_name): Reimplemented.
88 (input_encoding): Return charset name returned
89 by |uri_to_name| if available.
90 ($CharsetDef): Property |xml_name| is now contain
91 only name defined in XML specifications.
92
93 2007-07-15 Wakaba <wakaba@suika.fam.cx>
94
95 * DecodeHandle.pm: New Perl module (created
96 from manakai's |Encode.dis|).
97
98 2007-07-15 Wakaba <wakaba@suika.fam.cx>
99
100 * ChangeLog: New file.
101

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24