++ ChangeLog 5 Oct 2008 06:41:29 -0000 2008-10-05 Wakaba <wakaba@suika.fam.cx> * readme.en.html: Missing link to Whatpm::RDFXML module is added. Typo fixed. Noted that Media Query Level 3 is not supported yet. Linked to WebHACC supported standards documentation. ++ whatpm/Whatpm/Charset/ChangeLog 5 Oct 2008 06:41:56 -0000 2008-10-05 Wakaba <wakaba@suika.fam.cx> * UniversalCharDet.pod: Typo fixed.
++ whatpm/Whatpm/Charset/ChangeLog 20 Sep 2008 04:46:38 -0000 2008-09-20 Wakaba <wakaba@suika.fam.cx> * DecodeHandleOld.pm: DecodeHandle.pm revision 1.6 reintroduced since the latest implementation of DecodeHandle.pm shows considerably bad performance when combined with Message::DOM::XMLParserTemp.
++ ChangeLog 15 Sep 2008 12:22:04 -0000 2008-09-15 Wakaba <wakaba@suika.fam.cx> * readme.en.html: Link to handles.en.html. ++ whatpm/Whatpm/Charset/ChangeLog 15 Sep 2008 12:21:33 -0000 * handles.en.html: New documentation. 2008-09-15 Wakaba <wakaba@suika.fam.cx>
++ whatpm/Whatpm/ChangeLog 15 Sep 2008 07:17:34 -0000 * HTML.pm.src: Remove checking for control character, surrogate pair, or noncharacter code points and non-Unicode code points (they should be handled by Whatpm::Charset::UnicodeChecker). (parse_char_stream): Support for the |$get_wrapper| argument and character stream error handlers. 2008-09-15 Wakaba <wakaba@suika.fam.cx> ++ whatpm/Whatpm/Charset/ChangeLog 15 Sep 2008 07:18:45 -0000 * DecodeHandle.pm (onerror): Return |undef| if no explicit value is set. * UnicodeChecker.pm: Support for HTML5 parse errors. (onerror): Return |undef| if no explicit value is set. 2008-09-15 Wakaba <wakaba@suika.fam.cx>
++ whatpm/Whatpm/Charset/ChangeLog 15 Sep 2008 00:48:59 -0000 2008-09-15 Wakaba <wakaba@suika.fam.cx> * UnicodeChecker.pm: Use hash for better performance.
++ whatpm/Whatpm/ChangeLog 14 Sep 2008 11:56:24 -0000 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html| now reports character errors. 2008-09-14 Wakaba <wakaba@suika.fam.cx> ++ whatpm/Whatpm/Charset/ChangeLog 14 Sep 2008 11:57:38 -0000 * DecodeHandle.pm (CharString onerror): New method. * UnicodeString.pm (read): New. (getc): Removed. (manakai_read_until): Checking operation implemented. 2008-09-14 Wakaba <wakaba@suika.fam.cx>
++ whatpm/Whatpm/ChangeLog 14 Sep 2008 07:19:17 -0000 * HTML.pm.src: Make a "bare ero" error for unknown entities point to the "&" character. 2008-09-14 Wakaba <wakaba@suika.fam.cx> ++ whatpm/Whatpm/Charset/ChangeLog 14 Sep 2008 07:19:40 -0000 * DecodeHandle.pm: Merge the ShiftJIS class into the Encode class. 2008-09-14 Wakaba <wakaba@suika.fam.cx>
++ whatpm/Whatpm/ChangeLog 14 Sep 2008 06:57:36 -0000 * HTML.pm.src: It turns out that U+FFFD doesn't have to be added to the list of excluded characters. 2008-09-14 Wakaba <wakaba@suika.fam.cx> ++ whatpm/Whatpm/Charset/ChangeLog 14 Sep 2008 06:58:07 -0000 * DecodeHandle.pm: Merge the EUCJP class into the Encode class. 2008-09-14 Wakaba <wakaba@suika.fam.cx>
++ whatpm/Whatpm/ChangeLog 14 Sep 2008 06:32:02 -0000 * HTML.pm.src ($char_onerror): Give the character decoder's |line| and |column| a higher priority than those set by the tokenizer's input handler. ($self->{read_until}): Exclude U+FFFD (but this might not be necessary, since now we do line/column fixup in the character decode handle). 2008-09-14 Wakaba <wakaba@suika.fam.cx> ++ whatpm/Whatpm/Charset/ChangeLog 14 Sep 2008 06:32:40 -0000 * DecodeHandle.pm: EUCJP class reimplemented using |read|-centric model. 2008-09-14 Wakaba <wakaba@suika.fam.cx>
++ whatpm/Whatpm/ChangeLog 14 Sep 2008 03:58:42 -0000 * HTML.pm.src: Use |{read_until}| where possible. 2008-09-14 Wakaba <wakaba@suika.fam.cx> ++ whatpm/Whatpm/Charset/ChangeLog 14 Sep 2008 03:59:00 -0000 * DecodeHandle.pm: Typo fixed. 2008-09-14 Wakaba <wakaba@suika.fam.cx>
++ whatpm/Whatpm/ChangeLog 14 Sep 2008 03:06:56 -0000 * HTML.pm.src: Change |{getc_until}| to |{read_until}| and |manakai_getc_until| to |manakai_read_until| to reduce the number of string copies. 2008-09-14 Wakaba <wakaba@suika.fam.cx> ++ whatpm/Whatpm/Charset/ChangeLog 14 Sep 2008 03:07:37 -0000 * DecodeHandle.pm, UnicodeChecker.pm: Change |manakai_getc_until| to |manakai_read_until| to reduce the number of string copies. 2008-09-14 Wakaba <wakaba@suika.fam.cx>
++ whatpm/Whatpm/ChangeLog 14 Sep 2008 01:47:27 -0000 2008-09-14 Wakaba <wakaba@suika.fam.cx> * HTML.pm.src (parse_char_string): Use newly created |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's standard feature to |open| a string as a filehandle, since Perl's string filehandle does not seem to support the |ungetc| method correctly. (parse_char_stream): Define |{getc_until}| method. (DATA_STATE): Experimental support for |getc_until| feature. ++ whatpm/Whatpm/Charset/ChangeLog 14 Sep 2008 01:50:52 -0000 2008-09-14 Wakaba <wakaba@suika.fam.cx> * DecodeHandle.pm (CharString): New class. (Encode read): Don't remove read string from |{char_buffer}|, to decrease the number of string operations and to enable |manakai_getc_until| ungetc'ing without any string operation. (manakai_getc_until): New method. * UnicodeChecker.pm (getc): Don't |read| more than one character, to prevent characters from being buffered, so that a mixture of |getc| and |manakai_getc_until| calls does not break the result.
++ whatpm/Whatpm/Charset/ChangeLog 11 Sep 2008 12:09:15 -0000 * UnicodeChecker.pm, DecodeHandle.pm: Try to reduce the number of string copies and method calls, first round. 2008-09-11 Wakaba <wakaba@suika.fam.cx>
++ whatpm/Whatpm/Charset/ChangeLog 11 Sep 2008 09:55:54 -0000 * UnicodeChecker.pm, DecodeHandle.pm: Tentative support for |read| method. 2008-09-11 Wakaba <wakaba@suika.fam.cx>
++ whatpm/Whatpm/ChangeLog 11 Sep 2008 09:12:24 -0000 2008-09-11 Wakaba <wakaba@suika.fam.cx> * HTML.pm.src: Methods now accept an additional parameter, $get_wrapper, which can be used to insert some wrapper between the character stream handle and the tokenizer. (It is currently not supported for |set_inner_html| for |Element|s). ++ whatpm/Whatpm/Charset/ChangeLog 11 Sep 2008 09:10:24 -0000 2008-09-11 Wakaba <wakaba@suika.fam.cx> * UnicodeChecker.pm: New module.
++ whatpm/Whatpm/ChangeLog 10 Sep 2008 10:25:19 -0000 2008-09-10 Wakaba <wakaba@suika.fam.cx> * ContentChecker.pm: Support for charset-layer error levels. * HTML.pm.src: Don't specify |text| argument for the |chardecode:fallback| error, since it is not the encoding being used alternatively. ++ whatpm/Whatpm/Charset/ChangeLog 10 Sep 2008 10:26:52 -0000 2008-09-10 Wakaba <wakaba@suika.fam.cx> * DecodeHandle.pm: Set error levels. * WebLatin1.pm: Support for |us-ascii| and |iso-8859-5| charsets (this module is no longer for Latin1, but for Latin* encodings). * WebThai.pm: Support for |tis-620| charset.
++ whatpm/Whatpm/Charset/ChangeLog 18 May 2008 06:07:15 -0000 * WebThai.pm, WebLatin1.pm: New modules. * DecodeHandle.pm: Support for fallback character mappings, used for WebLatin1 and WebThai encodings. 2008-05-18 Wakaba <wakaba@suika.fam.cx>
++ whatpm/Whatpm/ChangeLog 18 May 2008 04:15:00 -0000 * HTML.pm.src (parse_byte_string): Redefined to invoke |parse_byte_stream|. (parse_byte_stream): New method. 2008-05-18 Wakaba <wakaba@suika.fam.cx> ++ whatpm/Whatpm/Charset/ChangeLog 18 May 2008 04:15:42 -0000 * DecodeHandle.pm (ShiftJIS): \xF0-\xFC should be considered as part of double-byte characters for more user-friendly error recovery. 2008-05-18 Wakaba <wakaba@suika.fam.cx>
++ whatpm/Whatpm/ChangeLog 18 May 2008 03:46:02 -0000 2008-05-18 Wakaba <wakaba@suika.fam.cx> * HTML.pm.src (parse_byte_string): Fix the column number reported by encoding layer error reporter. ++ whatpm/Whatpm/Charset/ChangeLog 18 May 2008 03:46:17 -0000 2008-05-18 Wakaba <wakaba@suika.fam.cx> * DecodeHandle.pm (getc): Ignore BOM if BOM pattern is specified.
++ whatpm/Whatpm/ChangeLog 17 May 2008 12:28:47 -0000 * HTML.pm.src (parse_byte_string): Use streaming decoder rather than converting the whole byte string and then parsing. Propagate errors in character encoding layer. (get_next_token): Precise error reporting for |bare stago| error. 2008-05-17 Wakaba <wakaba@suika.fam.cx> ++ whatpm/Whatpm/Charset/ChangeLog 17 May 2008 12:29:09 -0000 2008-05-17 Wakaba <wakaba@suika.fam.cx> * DecodeHandle.pm (ByteBuffer): New class.
++ whatpm/t/ChangeLog 10 Feb 2008 07:34:02 -0000 2008-02-10 Wakaba <wakaba@suika.fam.cx> * css-3.dat: New test file. * CSS-Parser-1.t: |css-3.dat| is added. '-moz-pre-wrap' added. Typo for 'collapse' value fixed. Support for '@namespace' serialization. * css-1.dat: New test data for selectors parsing. * css-font.dat: Error locations for 'u' errors are corrected. * css-table.dat: Test data for 'border-collapse' are added. * css-text.dat: Test data for '-moz-pre-wrap' are added. * css-visual.dat: Test data for 'background-position' are added. ++ whatpm/Whatpm/Charset/ChangeLog 10 Feb 2008 07:31:32 -0000 2008-02-10 Wakaba <wakaba@suika.fam.cx> * CharDet.pm (detect): Catch an error in detection function. ++ whatpm/Whatpm/CSS/ChangeLog 10 Feb 2008 07:30:34 -0000 2008-02-10 Wakaba <wakaba@suika.fam.cx> * Parser.pm: |attr(prefix|localname)| did not work. Support for uppercase namespace prefixes (namespace prefixes were not normalized into lowercase when they were looked up.). Report the property name token when an unknown property error is raised. The '-moz-pre-wrap' value is supported for 'white-space' property. ('background-position'): 'center left' and 'center right' were not supported. * SelectorsParser.pm: Report the namespace prefix token when an undeclared prefix error is raised. S_TOKEN in COMBINATOR_STATE was not handled correctly.
++ ChangeLog 19 Nov 2007 12:16:52 -0000 2007-11-19 Wakaba <wakaba@suika.fam.cx> * readme.en.html: Link to |Whatpm::Charset::UniversalCharDet|. ++ whatpm/Whatpm/ChangeLog 19 Nov 2007 12:17:47 -0000 2007-11-19 Wakaba <wakaba@suika.fam.cx> * HTML.pm.src (parse_byte_string): Detect charset by universalchardet if charset parameter is not specified. * Makefile (Charset-all, Charset-clean): New rules. ++ whatpm/Whatpm/Charset/ChangeLog 19 Nov 2007 11:54:20 -0000 2007-11-19 Wakaba <wakaba@suika.fam.cx> * Makefile: New file. * UniversalCharDet.pm, UniversalCharDet.pod: New files.
++ whatpm/t/ChangeLog 15 Jul 2007 16:51:05 -0000 * Charset-DecodeHandler.t: Cases for charset names have been changed to match the module's implementation. 2007-07-15 Wakaba <wakaba@suika.fam.cx> ++ whatpm/Whatpm/Charset/ChangeLog 15 Jul 2007 16:49:53 -0000 * DecodeHandle.pm (create_decode_handle): Set canonical or specified name to |input_encoding| attribute. (uri_to_name): Reimplemented. (input_encoding): Return charset name returned by |uri_to_name| if available. ($CharsetDef): Property |xml_name| now contains only names defined in XML specifications. 2007-07-15 Wakaba <wakaba@suika.fam.cx>
++ whatpm/t/ChangeLog 15 Jul 2007 12:41:07 -0000 2007-07-15 Wakaba <wakaba@suika.fam.cx> * Charset-DecodeHandler.t: New test script. ++ whatpm/Whatpm/ChangeLog 15 Jul 2007 08:22:21 -0000 * Charset/: New directory. 2007-07-15 Wakaba <wakaba@suika.fam.cx> ++ whatpm/Whatpm/Charset/ChangeLog 15 Jul 2007 08:22:53 -0000 2007-07-15 Wakaba <wakaba@suika.fam.cx> * DecodeHandle.pm: New Perl module (created from manakai's |Encode.dis|). 2007-07-15 Wakaba <wakaba@suika.fam.cx> * ChangeLog: New file.