2008-10-05 Wakaba * UniversalCharDet.pod: Typo fixed. 2008-09-20 Wakaba * DecodeHandleOld.pm: DecodeHandle.pm revision 1.6 reintroduced since the latest implementation of DeocdeHandle.pm shows considerably bad performance when combined with Message::DOM::XMLParserTemp. 2008-09-15 Wakaba * handles.en.html: New documentation. 2008-09-15 Wakaba * DecodeHandle.pm (onerror): Return |undef| if no explicit value is set. * UnicodeChecker.pm: Support for HTML5 parse errors. (onerror): Return |undef| if no explicit value is set. 2008-09-15 Wakaba * UnicodeChecker.pm: Use hash for better performance. 2008-09-14 Wakaba * DecodeHandle.pm (CharString onerror): New method. * UnicodeString.pm (read): New. (getc): Removed. (manakai_read_until): Checking operation implemented. 2008-09-14 Wakaba * DecodeHandle.pm: Merge the ShiftJIS class into the Encode class. 2008-09-14 Wakaba * DecodeHandle.pm: Merge the EUCJP class into the Encode class. 2008-09-14 Wakaba * DecodeHandle.pm: EUCJP class reimplemented using |read|-centric model. 2008-09-14 Wakaba * DecodeHandle.pm: Typo fixed. 2008-09-14 Wakaba * DecodeHandle.pm, UnicodeChecker.pm: Change |manakai_getc_until| to |manakai_read_until| to reduce the number of string copies. 2008-09-14 Wakaba * DecodeHandle.pm (CharString): New class. (Encode read): Don't remove read string from |{char_buffer}|, to decease the number of string operations and to enable |manakai_getc_until| ungetc'ing without any string operation. (manakai_getc_until): New method. * UnicodeChecker.pm (getc): Don't |read| more than one character, to prevent characters being bufferred such that mixture of |getc| and |manakai_getc_until| calls does not make the result broken. 2008-09-11 Wakaba * UnicodeChecker.pm, DecodeHandle.pm: Try to reduce the number of string copies and method calls, first round. 2008-09-11 Wakaba * UnicodeChecker.pm, DecodeHandle.pm: Tentative support for |read| method. 2008-09-11 Wakaba * UnicodeChecker.pm: New module. 2008-09-10 Wakaba * DecodeHandle.pm: Set error levels. * WebLatin1.pm: Support for |us-ascii| and |iso-8859-5| charsets (this module no longer for Latin1, but for Latin* encodings). * WebThai.pm: Support for |tis-620| charset. 2008-05-18 Wakaba * WebThai.pm, WebLatin1.pm: New modules. * DecodeHandle.pm: Support for fallback character mappings, used for WebLatin1 and WebThai encodings. 2008-05-18 Wakaba * DecodeHandle.pm (ShiftJIS): \xF0-\xFC should be considered as part of double-byte characters for more user-friendly error recovery. 2008-05-18 Wakaba * DecodeHandle.pm (getc): Ignore BOM if BOM pattern is specified. 2008-05-17 Wakaba * DecodeHandle.pm (ByteBuffer): New class. 2008-02-10 Wakaba * CharDet.pm (detect): Catch an error in detection function. 2007-11-19 Wakaba * Makefile: New file. * UniversalCharDet.pm, UniversalCharDet.pod: New files. 2007-07-15 Wakaba * DecodeHandle.pm (create_decode_handle): Set canonical or specified name to |input_encoding| attribute. (uri_to_name): Reimplemented. (input_encoding): Return charset name returned by |uri_to_name| if available. ($CharsetDef): Property |xml_name| is now contain only name defined in XML specifications. 2007-07-15 Wakaba * DecodeHandle.pm: New Perl module (created from manakai's |Encode.dis|). 2007-07-15 Wakaba * ChangeLog: New file.