/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.276 by wakaba, Sun Aug 17 05:09:12 2008 UTC revision 1.308 by wakaba, Sun Sep 14 13:09:00 2008 UTC
# Line 1  Line 1 
1    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src (parse_char_stream): Make |set_next_char|
4            invoke |manakai_read_until|, not only |read|, where
5            possible, to decrease the number of |read| method calls.
6    
7            * mkhtmlparser.pl: Related changes to the aforementioned
8            modification.
9    
10    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
11    
12            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
13            would report character error from now.
14    
15    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
16    
17            * HTML.pm.src: White-space-leaded non-white-space character
18            tokens in "before head insertion mode" was not
19            correctly handled.
20            (set_inner_html): Reimplemented using CharString decodehandle
21            class.  Support for $get_wrapper argument.  Support
22            for |{read_until}| feature.
23    
24    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
25    
26            * HTML.pm.src: Make a "bare ero" error for unknown
27            entities point the "&" character.
28    
29    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
30    
31            * HTML.pm.src: It turns out that U+FFFD don't have to
32            be added to the list of excluded characters.
33    
34    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
35    
36            * HTML.pm.src ($char_onerror): Have character decoder's |line|
37            and |column| a higher priority than the one set by the
38            tokenizer's input handler.
39            ($self->{read_until}): Exclude U+FFFD (but this might
40            not be necessary, since now we do line/column fixup in
41            the character decode handle).
42    
43    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
44    
45            * HTML.pm.src: Use |{read_until}| where possible.
46    
47    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
48    
49            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
50            and |manakai_getc_until| to |manakai_read_until| to
51            reduce the number of string copies.
52    
53    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
54    
55            * HTML.pm.src (parse_char_string): Use newly created
56            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
57            standard feature to |open| a string as a filehandle,
58            since Perl's string filehandle seems not supporting |ungetc|
59            method correctly.
60            (parse_char_stream): Define |{getc_until}| method.
61            (DATA_STATE): Experimental support for |getc_until| feature.
62    
63    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
64    
65            * HTML.pm.src: Check points added to newly added branches.
66    
67    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
68    
69            * HTML.pm.src: Remove |{char}|, which is no longer used.
70            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
71            and replaced by |{prev_state}|.
72    
73            * mkhtmlparser.pl: Remove |{char}| feature.
74            Remove |!!!back-next-input-character;| macro.
75    
76    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
77    
78            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
79            entity related tokenizer states in favor of new states
80            implementing the consume character reference algorithm.
81    
82    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
83    
84            * HTML.pm.src: "Consume a character reference" algorithm is
85            now implemented as a tokenizer's state, rather than
86            a method, with minimum changes (more changes will
87            be made, in due course).  "Bogus comment state"'s inner
88            loop gets removed.
89    
90    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
91    
92            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
93            into their own tokenizer states.
94    
95    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
96    
97            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
98            is split into three states.
99    
100    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
101    
102            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
103            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
104            no longer does the tokenizer have to push back next input
105            characters in those states.
106    
107    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
108    
109            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
110            into four states so that no longer does the tokenizer have to push
111            back next input characters in that state.
112    
113    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
114    
115            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
116            which can be used to insert some wrapper between the character
117            stream handle and the tokenizer.  (It is currently not supported
118            for |set_inner_html| for |Element|s).
119    
120    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
121    
122            * HTML.pm.src: Ignore punctuations in charset names.
123    
124    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
125    
126            * ContentChecker.pm: Support for charset-layer error levels.
127    
128            * HTML.pm.src: Don't specify |text| argument for the
129            |chardecode:fallback| error, since it is not the encoding
130            being used alternatively.
131    
132    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
133    
134            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
135    
136    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
137    
138            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
139    
140    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
141    
142            * HTML.pm.src: Bug fix and sync with the spec with regard
143            to after after frameset insertion mode processing (HTML5
144            revision 1909).  Note that the implementation was wrong
145            per the old spec before the r1909 changes.
146    
147    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
148    
149            * HTMLTable.pm: scope=auto algorithm fix synced with the
150            spec (HTML5 revision 2093).
151            ($process_row): Algorithm step numbers synced with the
152            spec (HTML5 revision 2092).
153    
154    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
155    
156            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
157            revision 2094).
158    
159    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
160    
161            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
162    
163    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
164    
165            * HTML.pm.src: '"' and "'" at the end of attribute
166            name (after another attribute) now raise parse error (HTML5
167            revision 2123).  Empty unquoted attribute values are no
168            longer allowed (HTML5 revision 2122).
169    
170    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
171    
172            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
173            revision 2130).
174    
175    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
176    
177            * ContentChecker.pm: |xml:lang| attribute value must be same
178            as |lang| attribute value for HTML elements (HTML5 revision 2062
179            and so on).
180    
181    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
182    
183            * ContentChecker.pm: Error level definition for |xml_id_error|
184            was missing.
185    
186            * URIChecker.pm: The end of the URL should be marked as the
187            error location for an empty path error.  The position
188            between the userinfo and the port components should be
189            marked as the error location for an empty host error.
190    
191    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
192    
193            * URIChecker.pm: Set parameters representing where in the
194            value the error occurs for errors.  Report unknown
195            address format error in warning level, since address
196            formats are rarely added.  Path segments starting with "/.."
197            were misinterpreted as a dot-segment.
198    
199    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
200    
201            * URIChecker.pm (check_iri_reference): Requires
202            |Message::DOM::DOMImplementation|.
203    
204    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
205    
206            * IMTChecker.pm: Updated for the new error reporting architecture.
207    
208            * ContentChecker.pm: Error levels for IMTs are added.
209    
210  2008-08-17  Wakaba  <wakaba@suika.fam.cx>  2008-08-17  Wakaba  <wakaba@suika.fam.cx>
211    
212          * H2H.pm (_shift_token): Support for unquoted HTML attribute          * H2H.pm (_shift_token): Support for unquoted HTML attribute

Legend:
Removed from v.1.276  
changed lines
  Added in v.1.308

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24