Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.232 by wakaba, Fri May 16 13:56:16 2008 UTC revision 1.306 by wakaba, Sun Sep 14 09:05:54 2008 UTC
# Line 1  Line 1 
1    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Character tokens that start with white space
4            and continue with non-white-space characters were not
5            correctly handled in the "before head" insertion mode.
6            (set_inner_html): Reimplemented using CharString decodehandle
7            class.  Support for $get_wrapper argument.  Support
8            for |{read_until}| feature.
9    
10    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
11    
12            * HTML.pm.src: Make a "bare ero" error for unknown
13            entities point to the "&" character.
14    
15    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
16    
17            * HTML.pm.src: It turns out that U+FFFD doesn't have to
18            be added to the list of excluded characters.
19    
20    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
21    
22            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
23            and |column| a higher priority than those set by the
24            tokenizer's input handler.
25            ($self->{read_until}): Exclude U+FFFD (but this might
26            not be necessary, since now we do line/column fixup in
27            the character decode handle).
28    
29    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
30    
31            * HTML.pm.src: Use |{read_until}| where possible.
32    
33    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
34    
35            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
36            and |manakai_getc_until| to |manakai_read_until| to
37            reduce the number of string copies.
38    
39    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
40    
41            * HTML.pm.src (parse_char_string): Use newly created
42            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
43            standard feature to |open| a string as a filehandle,
44            since Perl's string filehandle does not seem to support the
45            |ungetc| method correctly.
46            (parse_char_stream): Define |{getc_until}| method.
47            (DATA_STATE): Experimental support for |getc_until| feature.
48    
49    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
50    
51            * HTML.pm.src: Check points added to newly added branches.
52    
53    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
54    
55            * HTML.pm.src: Remove |{char}|, which is no longer used.
56            Replace |{entity_in_attr}| and |{last_attribute_value_state}|
57            with |{prev_state}|.
58    
59            * mkhtmlparser.pl: Remove |{char}| feature.
60            Remove |!!!back-next-input-character;| macro.
61    
62    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
63    
64            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
65            entity related tokenizer states in favor of new states
66            implementing the consume character reference algorithm.
67    
68    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
69    
70            * HTML.pm.src: "Consume a character reference" algorithm is
71            now implemented as a tokenizer's state, rather than
72            a method, with minimum changes (more changes will
73            be made, in due course).  "Bogus comment state"'s inner
74            loop gets removed.
75    
76    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
77    
78            * HTML.pm.src: Split |PUBLIC| and |SYSTEM| keyword tokenizing
79            into their own tokenizer states.
80    
81    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
82    
83            * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
84            is split into three states.
85    
86    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
87    
88            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
89            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
90            the tokenizer no longer has to push back next input
91            characters in those states.
92    
93    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
94    
95            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
96            into four states so that the tokenizer no longer has to push
97            back next input characters in that state.
98    
99    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
100    
101            * HTML.pm.src: Methods now accept an additional parameter, $get_wrapper,
102            which can be used to insert some wrapper between the character
103            stream handle and the tokenizer.  (It is currently not supported
104            for |set_inner_html| for |Element|s).
105    
106    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
107    
108            * HTML.pm.src: Ignore punctuation in charset names.
109    
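A minimal sketch of what ignoring punctuation in charset names can look like when two labels are compared. It is written in Python for illustration, not taken from HTML.pm.src; the helper names are hypothetical, and the assumption that "punctuation" means ASCII punctuation is ours, since the entry does not list the exact characters.

```python
import string

# Assumption: "punctuation" means ASCII punctuation; the exact character
# set the parser ignores is not stated in the ChangeLog entry.
_PUNCT_TABLE = str.maketrans("", "", string.punctuation)


def normalize_charset_name(name: str) -> str:
    """Hypothetical helper: drop ASCII punctuation, then lowercase."""
    return name.translate(_PUNCT_TABLE).lower()


def same_charset(a: str, b: str) -> bool:
    """Compare two charset labels after normalization."""
    return normalize_charset_name(a) == normalize_charset_name(b)
```

With this normalization, labels such as "UTF-8" and "utf8" compare equal, while "utf-8" and "utf-16" still differ.
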
110    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
111    
112            * ContentChecker.pm: Support for charset-layer error levels.
113    
114            * HTML.pm.src: Don't specify |text| argument for the
115            |chardecode:fallback| error, since it is not the encoding
116            being used alternatively.
117    
118    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
119    
120            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
121    
122    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
123    
124            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
125    
126    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
127    
128            * HTML.pm.src: Bug fix and sync with the spec with regard
129            to after after frameset insertion mode processing (HTML5
130            revision 1909).  Note that the implementation was wrong
131            per the old spec before the r1909 changes.
132    
133    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
134    
135            * HTMLTable.pm: scope=auto algorithm fix synced with the
136            spec (HTML5 revision 2093).
137            ($process_row): Algorithm step numbers synced with the
138            spec (HTML5 revision 2092).
139    
140    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
141    
142            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
143            revision 2094).
144    
145    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
146    
147            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
148    
149    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
150    
151            * HTML.pm.src: '"' and "'" at the end of attribute
152            name (after another attribute) now raise parse error (HTML5
153            revision 2123).  Empty unquoted attribute values are no
154            longer allowed (HTML5 revision 2122).
155    
156    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
157    
158            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
159            revision 2130).
160    
161    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
162    
163            * ContentChecker.pm: |xml:lang| attribute value must be the same
164            as |lang| attribute value for HTML elements (HTML5 revision 2062
165            and so on).
166    
167    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
168    
169            * ContentChecker.pm: Error level definition for |xml_id_error|
170            was missing.
171    
172            * URIChecker.pm: The end of the URL should be marked as the
173            error location for an empty path error.  The position
174            between the userinfo and the port components should be
175            marked as the error location for an empty host error.
176    
177    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
178    
179            * URIChecker.pm: Set parameters indicating where in the
180            value each error occurs.  Report the unknown
181            address format error at warning level, since address
182            formats are rarely added.  Path segments starting with "/.."
183            were misinterpreted as dot-segments.
184    
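The dot-segment fix in the entry above can be illustrated with a small Python sketch. It is not the URIChecker.pm code: both function names are hypothetical, and the "naive" variant only guesses at the class of bug described (treating any segment that starts with ".." as a dot-segment).

```python
def naive_has_dot_segment(path: str) -> bool:
    # Substring check: also fires on ordinary segments such as "..b".
    return "/.." in path


def has_dot_segment(path: str) -> bool:
    # A dot-segment is a complete path segment equal to "." or "..".
    return any(segment in (".", "..") for segment in path.split("/"))
```

For "/a/..b/c" the naive check reports a dot-segment while the segment-wise check does not; for "/a/../c" both report one.
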
185    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
186    
187            * URIChecker.pm (check_iri_reference): Requires
188            |Message::DOM::DOMImplementation|.
189    
190    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
191    
192            * IMTChecker.pm: Updated for the new error reporting architecture.
193    
194            * ContentChecker.pm: Error levels for IMTs are added.
195    
196    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
197    
198            * H2H.pm (_shift_token): Support for unquoted HTML attribute
199            values.
200    
201    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
202    
203            * CacheManifest.pm: Support for new style of error
204            reports.
205    
206            * HTML.pm.src: Set line=1, column=1 to the document node.
207    
208    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
209    
210            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
211            and URL checkers.  Support for more error levels for bogus
212            language tag and URL "standards".
213    
214            * LangTag.pm, URIChecker.pm: Support for new style error
215            level reporting.
216    
217    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
218    
219            * ContentChecker.pm: Support for RDF/XML error levels.
220    
221            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
222            specifying.  Error types are revised.
223    
224    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
225    
226            * ContentChecker.pm: All error reporting method calls are
227            renewed.
228    
229    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
230    
231            * HTML.pm.src: All error type names and "text" parameters
232            are revised.  Use new style for "level" specification.
233    
234            * mkhtmlparser.pl: Use new style for "level" specification.
235    
236    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
237    
238            * WebIDL.pm (parse_char_string): Simplified error
239            reporting process for broken ignored valuetype definition.
240            (Valuetype idl_text): Support for special "DOMString" name.
241    
242    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
243    
244            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
245            terminal of the ScopedName is "DOMString", so that whether
246            the last part of the scoped name is "DOMString" or "_DOMString"
247            can be determined later.  This is necessary to determine whether
248            a |typedef| definition should be ignored or not.
249            (parse_char_string): Unescape the identifier of
250            exception members.
251            ($resolve): Return undef for builtin types and sequence<T>
252            types (we might not have to do this, however...).
253            (check): Support checking for Exceptions, Valuetypes,
254            and Typedefs.
255            ($serialize_type): Support for "DOMString::::" syntax.
256            (Typedef idl_text): Output Type as "DOMString" if it
257            is really "DOMString" (i.e. its internal representation
258            is "::DOMString::").
259    
260    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
261    
262            * WebIDL.pm ($resolve): New code, based on resolve code
263            for constant types in the |check| method.
264            (check): Support for checking of attributes, operations, and
265            arguments.
266            (Attribute/Operation idl_text): Exception names in getraises,
267            setraises, and raises clauses are serialized by |$serialize_type|
268            code.
269    
270    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
271    
272            * WebIDL.pm ($integer): Order of alternatives is changed to match
273            hexadecimal numbers (the original pattern, taken from the spec,
274            did not work for hexadecimal numbers, because the "0" prefix
275            matches the [0-7]* part (as an empty string) and therefore
276            the pattern does not match the remaining "x..." part of a "0x..."
277            integer literal).
278            ($get_type): It now returns a string, not an array reference,
279            for regular types and |sequence| types (i.e. it in any case
280            returns a string).
281            ($get_next_token): The second item in the array that represents
282            an integer or float token is now a Perl number value, not the
283            original string representation of the number.
284            (check): Support for const value consistency checking.
285            No extended attribute is defined for constants.
286            (Node subclasses): Use simple strings rather than array references
287            for default data type values.
288            ($serialize_type): Type values are now simple strings.
289            (value): If the new attribute value is a false value, then
290            a FALSE value is set to the attribute.
291    
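The |$integer| ordering problem described in the entry above comes from leftmost-first alternation, which Perl and Python regular expressions share. The sketch below is in Python and uses assumed patterns: the exact expression in the spec and in WebIDL.pm is not quoted in the entry, so both patterns here are illustrative reconstructions.

```python
import re

# Assumed spec-style order: the octal alternative comes first, so on
# "0x1F" it matches "0" (with [0-7]* empty) and stops before "x1F".
SPEC_ORDER = re.compile(r"-?(?:0[0-7]*|0[Xx][0-9A-Fa-f]+|[1-9][0-9]*)")

# Reordered: the hexadecimal alternative is tried before the octal one.
REORDERED = re.compile(r"-?(?:0[Xx][0-9A-Fa-f]+|0[0-7]*|[1-9][0-9]*)")


def leading_integer(pattern: re.Pattern, text: str) -> str:
    """Return the integer literal matched at the start of text, or ''."""
    match = pattern.match(text)
    return match.group() if match else ""
```

Calling leading_integer(SPEC_ORDER, "0x1F") yields only "0", whereas leading_integer(REORDERED, "0x1F") yields the whole literal; octal and decimal literals are matched the same way by both patterns.
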
292    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
293    
294            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
295            in their stringified format ("scoped name" as defined in the
296            spec).  Note that a future version of this module should not use
297            array references for type values and the |type_text| attribute
298            should be made obsolete.
299            (parse_char_string): Unescape attribute names.
300            (check): Support for checking of whether inherited interfaces
301            are actually defined or not.  Support for checking of whether
302            interface member identifiers are duplicated or not.
303            ($serialize_type): Scoped names are returned as is.  A future
304            version of this code should escape identifiers other than "DOMString",
305            otherwise the idl_text would be non-conforming.
306    
307    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
308    
309            * WebIDL.pm (parse_char_string): Set line/column numbers
310            to generated nodes.  Unescape identifiers.  Extended attributes
311            for Definition's were ignored.
312            (append_child): Set |parent_node| attribute.
313            (parent_node): New attribute.
314            (check): Support interface/exception members.  Support
315            extended attributes.  Support definition identifier uniqueness
316            constraint.
317            (qualified_name): New attribute.
318            (Interface/Exception idl_text): Extended attributes were
319            not prepended to the returned text.
320    
321    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
322    
323            * WebIDL.pm (parse_char_string): Set line/column numbers
324            to interface object experimentally.  s/shift/pop/g, shift
325            would make things wrong.  Support for interface forward
326            declarations was missing.  Broken interface declarations
327            with no block were not ignored entirely.
328            (Whatpm::WebIDL::Node): New abstract class.  This class
329            makes things easier.
330            (child_nodes): New attribute.  Unlike DOM's attribute with
331            the same name, this attribute returns a dead list of nodes for
332            simplicity.
333            (get_user_data, set_user_data): New methods.
334            (Module idl_text): A SPACE character should be inserted
335            before the |{| character.
336            (Interface idl_text): Support for interface forward declarations.
337            (is_forward_declaration): New attribute.
338    
339    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
340    
341            * WebIDL.pm (type_text): Better serializer.
342    
343    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
344    
345            * WebIDL.pm: Revise forward-compatible parsing so that
346            it can now handle broken extended attributes and the like.
347    
348    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
349    
350            * WebIDL.pm: Real support for extended attributes.
351            Support for extended attributes with arguments.
352    
353    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
354    
355            * WebIDL.pm: Support for |exception| syntax.
356            (Interface->idl_text): Tentative support for inheritances.
357    
358    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
359    
360            * WebIDL.pm: Hierarchical scoped name support was broken.
361            Support for raises, setraises, and getraises syntaxes.
362    
363    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
364    
365            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
366            proper support for types, extended attributes, and exceptions yet).
367            WebIDL parser, version 1 (no support for exceptions yet,
368            no proper support for extended attributes yet).
369    
370    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
371    
372            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
373    
374    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
375    
376            * WebIDL.pm: Support for valuetype and const.
377    
378    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
379            
380            * WebIDL.pm: New module.
381    
382    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
383    
384            * Makefile (Entities.html): URI changed.
385    
386    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
387    
388            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
389    
390    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
391    
392            * HTML.pm.src (_get_next_token): A parse error was missing.
393    
394    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
395    
396            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
397            spec (commented out). (HTML5 revision 1711).
398    
399    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
400    
401            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
402    
403            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
404            as UTF-16 (HTML5 revision 1701).
405    
406    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
407    
408            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
409            1692).
410    
411    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
412    
413            * HTML.pm.src: The secondary insertion mode used when switching
414            to foreign content is the "in body" insertion mode (HTML5 revision
415            1696).
416    
417    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
418    
419            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
420            revision 1697).
421    
422    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
423    
424            * HTML.pm.src: Support for end-of-file token in foreign content
425            insertion mode (HTML5 revision 1693).  Update SVG camelCase
426            attribute list (HTML5 revision 1700).  <textarea> closes
427            </select> (HTML5 revision 1699).  More start tags close in
428            foreign content insertion mode (HTML5 revision 1698).
429    
430    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
431    
432            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
433    
434    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
435    
436            * HTML.pm.src: More robust charset parameter detection (HTML5
437            revision 1674).
438    
439    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
440    
441            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
442            revision 1676).
443    
444    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
445    
446            * HTML.pm.src: Ignore language part of public identifiers for
447            quirks mode detection (HTML5 revision 1679).
448    
449    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
450    
451            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
452            revision 1685).
453    
454    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
455    
456            * HTML.pm.src: Support for EOF in new states for tags (HTML5
457            revision 1684).
458    
459    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
460    
461            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
462            work (HTML5 revision 1690).
463    
464    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
465    
466            * HTML.pm.src (_tree_construction_main): Change handling of
467            end tags in head insertion modes (HTML5 revision 1686).
468            (parse_char_string): Bug fix for non-utf8 character string handling.
469            (parse_char_stream): |ungetc| does not work well for this context.
470    
471    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
472    
473            * HTML.pm.src (parse_byte_string): Redefined to invoke
474            |parse_byte_stream|.
475            (parse_byte_stream): New method.
476    
477    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
478    
479            * HTML.pm.src (parse_byte_string): Fix the column number reported
480            by encoding layer error reporter.
481    
482    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
483    
484            * HTML.pm.src (parse_byte_string): Use streaming decoder
485            rather than converting the whole byte string and then parsing.
486            Propagate errors in character encoding layer.
487            (get_next_token): Precise error reporting for |bare stago| error.
488    
489    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
490    
491            * HTML.pm.src (parse_char_stream): New method.
492            (parse_char_string): This method is now defined as an invocation
493            of the |parse_char_stream| method.
494    
495    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
496    
497            * HTML.pm.src (parse_byte_string): Report various statuses
498            of the sniffing as info-level errors.  Support for new
499            decoding framework in parser resetting.
500            (new): Various default error levels were not set.
501    
502    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
503    
504            * HTML.pm.src (parse_byte_string): HTML5 encoding sniffing
505            algorithm, except for the actual sniffing, is implemented
506            with new framework with Message::Charset::Info.
507    
508    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
509    
510            * CacheManifest.pm (_parse): Drop fragment identifiers from
