/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.294 by wakaba, Sat Sep 13 07:51:32 2008 UTC revision 1.310 by wakaba, Mon Sep 15 02:54:12 2008 UTC
# Line 1  Line 1 
1    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
2    
3            * ContentChecker.pm: Don't call |loda_ns_module|
4            for null-namespace elements/attributes.
5    
6            * HTML.pm.src: Fact out $disallowed_control_chars
7            as a hash.
8    
9    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
10    
11            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
12            and |{next_char}| initializations are moved to initialization
13            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
14            with |parse_char_stream|.
15    
16    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
17    
18            * HTML.pm.src (parse_char_stream): Make |set_next_char|
19            invoke |manakai_read_until|, not only |read|, where
20            possible, to decrease the number of |read| method calls.
21    
22            * mkhtmlparser.pl: Related changes to the aforementioned
23            modification.
24    
25    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
26    
27            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
28            would report character error from now.
29    
30    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
31    
32            * HTML.pm.src: White-space-leaded non-white-space character
33            tokens in "before head insertion mode" was not
34            correctly handled.
35            (set_inner_html): Reimplemented using CharString decodehandle
36            class.  Support for $get_wrapper argument.  Support
37            for |{read_until}| feature.
38    
39    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
40    
41            * HTML.pm.src: Make a "bare ero" error for unknown
42            entities point the "&" character.
43    
44    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
45    
46            * HTML.pm.src: It turns out that U+FFFD don't have to
47            be added to the list of excluded characters.
48    
49    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
50    
51            * HTML.pm.src ($char_onerror): Have character decoder's |line|
52            and |column| a higher priority than the one set by the
53            tokenizer's input handler.
54            ($self->{read_until}): Exclude U+FFFD (but this might
55            not be necessary, since now we do line/column fixup in
56            the character decode handle).
57    
58    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
59    
60            * HTML.pm.src: Use |{read_until}| where possible.
61    
62    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
63    
64            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
65            and |manakai_getc_until| to |manakai_read_until| to
66            reduce the number of string copies.
67    
68    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
69    
70            * HTML.pm.src (parse_char_string): Use newly created
71            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
72            standard feature to |open| a string as a filehandle,
73            since Perl's string filehandle seems not supporting |ungetc|
74            method correctly.
75            (parse_char_stream): Define |{getc_until}| method.
76            (DATA_STATE): Experimental support for |getc_until| feature.
77    
78    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
79    
80            * HTML.pm.src: Check points added to newly added branches.
81    
82    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
83    
84            * HTML.pm.src: Remove |{char}|, which is no longer used.
85            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
86            and replaced by |{prev_state}|.
87    
88            * mkhtmlparser.pl: Remove |{char}| feature.
89            Remove |!!!back-next-input-character;| macro.
90    
91    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
92    
93            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
94            entity related tokenizer states in favor of new states
95            implementing the consume character reference algorithm.
96    
97    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
98    
99            * HTML.pm.src: "Consume a character reference" algorithm is
100            now implemented as a tokenizer's state, rather than
101            a method, with minimum changes (more changes will
102            be made, in due course).  "Bogus comment state"'s inner
103            loop gets removed.
104    
105    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
106    
107            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
108            into their own tokenizer states.
109    
110  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>
111    
112          * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|          * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|

Legend:
Removed from v.1.294  
changed lines
  Added in v.1.310

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24