/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.224 by wakaba, Mon May 5 06:12:43 2008 UTC revision 1.299 by wakaba, Sat Sep 13 12:25:44 2008 UTC
# Line 1  Line 1 
1    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Check points added to newly added branches.
4    
5    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
6    
7            * HTML.pm.src: Remove |{char}|, which is no longer used.
8            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
9            and replaced by |{prev_state}|.
10    
11            * mkhtmlparser.pl: Remove |{char}| feature.
12            Remove |!!!back-next-input-character;| macro.
13    
14    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
15    
16            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
17            entity related tokenizer states in favor of new states
18            implementing the consume character reference algorithm.
19    
20    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
21    
22            * HTML.pm.src: "Consume a character reference" algorithm is
23            now implemented as a tokenizer's state, rather than
24            a method, with minimum changes (more changes will
25            be made, in due course).  "Bogus comment state"'s inner
26            loop gets removed.
27    
28    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
29    
30            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
31            into their own tokenizer states.
32    
33    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
34    
35            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
36            is split into three states.
37    
38    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
39    
40            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
41            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
42            no longer does the tokenizer have to push back next input
43            characters in those states.
44    
45    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
46    
47            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
48            into four states so that no longer does the tokenizer have to push
49            back next input characters in that state.
50    
51    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
52    
53            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
54            which can be used to insert some wrapper between the character
55            stream handle and the tokenizer.  (It is currently not supported
56            for |set_inner_html| for |Element|s).
57    
58    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
59    
60            * HTML.pm.src: Ignore punctuations in charset names.
61    
62    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
63    
64            * ContentChecker.pm: Support for charset-layer error levels.
65    
66            * HTML.pm.src: Don't specify |text| argument for the
67            |chardecode:fallback| error, since it is not the encoding
68            being used alternatively.
69    
70    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
71    
72            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
73    
74    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
75    
76            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
77    
78    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
79    
80            * HTML.pm.src: Bug fix and sync with the spec with regard
81            to after after frameset insertion mode processing (HTML5
82            revision 1909).  Note that the implementation was wrong
83            per the old spec before the r1909 changes.
84    
85    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
86    
87            * HTMLTable.pm: scope=auto algorithm fix synced with the
88            spec (HTML5 revision 2093).
89            ($process_row): Algorithm step numbers synced with the
90            spec (HTML5 revision 2092).
91    
92    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
93    
94            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
95            revision 2094).
96    
97    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
98    
99            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
100    
101    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
102    
103            * HTML.pm.src: '"' and "'" at the end of attribute
104            name (after another attribute) now raise parse error (HTML5
105            revision 2123).  Empty unquoted attribute values are no
106            longer allowed (HTML5 revision 2122).
107    
108    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
109    
110            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
111            revision 2130).
112    
113    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
114    
115            * ContentChecker.pm: |xml:lang| attribute value must be same
116            as |lang| attribute value for HTML elements (HTML5 revision 2062
117            and so on).
118    
119    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
120    
121            * ContentChecker.pm: Error level definition for |xml_id_error|
122            was missing.
123    
124            * URIChecker.pm: The end of the URL should be marked as the
125            error location for an empty path error.  The position
126            between the userinfo and the port components should be
127            marked as the error location for an empty host error.
128    
129    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
130    
131            * URIChecker.pm: Set parameters representing where in the
132            value the error occurs for errors.  Report unknown
133            address format error in warning level, since address
134            formats are rarely added.  Path segments starting with "/.."
135            were misinterpreted as a dot-segment.
136    
137    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
138    
139            * URIChecker.pm (check_iri_reference): Requires
140            |Message::DOM::DOMImplementation|.
141    
142    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
143    
144            * IMTChecker.pm: Updated for the new error reporting architecture.
145    
146            * ContentChecker.pm: Error levels for IMTs are added.
147    
148    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
149    
150            * H2H.pm (_shift_token): Support for unquoted HTML attribute
151            values.
152    
153    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
154    
155            * CacheManifest.pm: Support for new style of error
156            reports.
157    
158            * HTML.pm.src: Set line=1, column=1 to the document node.
159    
160    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
161    
162            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
163            and URL checkers.  Support for more error levels for bogus
164            langauge tag and URL "standards".
165    
166            * LangTag.pm, URIChecker.pm: Support for new style error
167            level reporting.
168    
169    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
170    
171            * ContentChecker.pm: Support for RDF/XML error levels.
172    
173            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
174            specifying.  Error types are revised.
175    
176    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
177    
178            * ContentChecker.pm: All error reporting method calls are
179            renewed.
180    
181    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
182    
183            * HTML.pm.src: All error type names and "text" parameters
184            are revised.  Use new style for "level" specification.
185    
186            * mkhtmlparser.pl: Use new style for "level" specification.
187    
188    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
189    
190            * WebIDL.pm (parse_char_string): Simplified error
191            reporting process for broken ignored valuetype definition.
192            (Valuetype idl_text): Support for special "DOMString" name.
193    
194    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
195    
196            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
197            terminal of the ScopedName is "DOMString", such that whether
198            the last part of the scoped name is "DOMString" or "_DOMString"
199            later.  It is necessary to determine whether a |typedef|
200            definition should be ignored or not.
201            (parse_char_string): Unescape the identifier of
202            exception members.
203            ($resolve): Return undef for builtin types and sequence<T>
204            types (we might not have to do this, however...).
205            (check): Support checking for Exceptions, Valuetypes,
206            and Typedefs.
207            ($serialize_type): Support for "DOMString::::" syntax.
208            (Typedef idl_text): Output Type as "DOMString" if it
209            is really "DOMString" (i.e. its internal representation
210            is "::DOMString::").
211    
212    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
213    
214            * WebIDL.pm ($resolve): New code, based on resolve code
215            for constant types in the |check| method.
216            (check): Support for checking of attributes, operations, and
217            arguments.
218            (Attribute/Operation idl_text): Exception names in getraises,
219            setraises, and raises clauses is serizlied by |$serialize_type|
220            code.
221    
222    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
223    
224            * WebIDL.pm ($integer): Order of selections are changed to match
225            hexadecimal numbers (the original pattern, taken from the spec,
226            was not work for hexadecimal numbers, because the "0" prefix
227            matches to the [0-7]* part (as an empty string) and therefore
228            it does not match with remaining "x..." part of a "0x..." integer
229            literal.
230            ($get_type): It now returns a string, not an array reference,
231            for regular types and |sequence| types (i.e. it in any case
232            returns a string).
233            ($get_next_token): The second item in the array that represents
234            a integer or float token is now a Perl number value, not the
235            original string representation of the number.
236            (check): Support for const value consistency checking.
237            No extended attribute is defined for constants.
238            (Node subclasses): Use simple strings rather than array references
239            for default data type values.
240            ($serialize_type): Type values are now simple strings.
241            (value): If the new attribute value is a false value, then
242            a FALSE value is set to the attribute.
243    
244    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
245    
246            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
247            in its stringified format ("scoped name" as defined in the
248            spec).  Note that future version of this module should not use
249            array references for type values and the |type_text| attribute
250            should be made obsolete.
251            (parse_char_string): Unescape attribute names.
252            (check): Support for checking of whether inherited interfaces
253            are actually defined or not.  Support for checking of whether
254            interface member identifiers are duplicated or not.
255            ($serialize_type): Scoped names are returned as is.  A future
256            version of this code should escape identifiers other than "DOMString",
257            otherwise the idl_text would be non-conforming.
258    
259    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
260    
261            * WebIDL.pm (parse_char_string): Set line/column numbers
262            to generated nodes.  Unescape identifiers.  Extended attributes
263            for Definition's were ignored.
264            (append_child): Set |parent_node| attribute.
265            (parent_node): New attribute.
266            (check): Support interface/exception members.  Support
267            extended attributes.  Support definition identifier uniqueness
268            constraint.
269            (qualified_name): New attribute.
270            (Interface/Exception idl_text): Extended attributes were
271            not prepended to the returned text.
272    
273    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
274    
275            * WebIDL.pm (parse_char_string): Set line/column numbers
276            to interface object experimentally.  s/shift/pop/g, shift
277            would make things wrong.  Support for interface forward
278            declarations was missing.  Broken interface declarations
279            with no block were not ignored entirely.
280            (Whatpm::WebIDL::Node): New abstract class.  This class
281            makes things easier.
282            (child_nodes): New attribute.  Unlike DOM's attribute with
283            same name, this attribute returns a dead list of nodes for
284            simplicity.
285            (get_user_data, set_user_data): New methods.
286            (Module idl_text): A SPACE character should be inserted
287            before the |{| character.
288            (Interface idl_text): Support for interface forward declarations.
289            (is_forward_declaration): New attribute.
290    
291    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
292    
293            * WebIDL.pm (type_text): Better serializer.
294    
295    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
296    
297            * WebIDL.pm: Revise forward-compatible parsing so that
298            it now can handle broken extended attributes and as such.
299    
300    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
301    
302            * WebIDL.pm: Real support for extended attributes.
303            Support for extended attributes with arguments.
304    
305    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
306    
307            * WebIDL.pm: Support for |exception| syntax.
308            (Interface->idl_text): Tentative support for inheritances.
309    
310    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
311    
312            * WebIDL.pm: Hierarchical scoped name support was broken.
313            Support for raises, setraises, and getraises syntaxes.
314    
315    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
316    
317            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
318            proper support for types, extended attributes, and exceptions yet).
319            WebIDL parser, version 1 (no support for exceptions yet,
320            no proper support for extended attributes yet).
321    
322    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
323    
324            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
325    
326    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
327    
328            * WebIDL.pm: Support for valuetype and const.
329    
330    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
331            
332            * WebIDL.pm: New module.
333    
334    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
335    
336            * Makefile (Entities.html): URI changed.
337    
338    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
339    
340            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
341    
342    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
343    
344            * HTML.pm.src (_get_next_token): A parse error was missing.
345    
346    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
347    
348            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
349            spec (commented out). (HTML5 revision 1711).
350    
351    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
352    
353            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
354    
355            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
356            as UTF-16 (HTML5 revision 1701).
357    
358    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
359    
360            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
361            1692).
362    
363    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
364    
365            * HTML.pm.src: The secondary insertion mode used when switching
366            to foreign content is the "in body" insertion mode (HTML5 revision
367            1696).
368    
369    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
370    
371            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
372            revision 1697).
373    
374    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
375    
376            * HTML.pm.src: Support for end-of-file token in foreign content
377            insertion mode (HTML5 revision 1693).  Update SVG camelCase
378            attribute list (HTML5 revision 1700).  <textarea> closes
379            </select> (HTML5 revision 1699).  More start tags close in
380            foreign content insertion mode (HTML5 revision 1698).
381    
382    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
383    
384            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
385    
386    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
387    
388            * HTML.pm.src: More robust charset parameter detection (HTML5
389            revision 1674).
390    
391    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
392    
393            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
394            revision 1676).
395    
396    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
397    
398            * HTML.pm.src: Ignore language part of public identifiers for
399            quriks mode detection (HTML5 revision 1679).
400    
401    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
402    
403            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
404            revision 1685).
405    
406    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
407    
408            * HTML.pm.src: Support for EOF in new states for tags (HTML5
409            revision 1684).
410    
411    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
412    
413            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
414            work (HTML5 revision 1690).
415    
416    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
417    
418            * HTML.pm.src (_tree_construction_main): Change handling of
419            end tags in head insertion modes (HTML5 revision 1686).
420            (parse_char_string): Bug fix for non-utf8 character string handlings.
421            (parse_char_stream): |ungetc| does not work well for this context.
422    
423    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
424    
425            * HTML.pm.src (parse_byte_string): Redefined to invoke
426            |parse_byte_stream|.
427            (parse_byte_stream): New method.
428    
429    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
430    
431            * HTML.pm.src (parse_byte_string): Fix the column number reported
432            by encoding layer error reporter.
433    
434    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
435    
436            * HTML.pm.src (parse_byte_string): Use streaming decoder
437            rather than converting the whole byte string and then parsing.
438            Propagate errors in character encoding layer.
439            (get_next_token): Precise error reporting for |bare stago| error.
440    
441    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
442    
443            * HTML.pm.src (parse_char_stream): New method.
444            (parse_char_string): This method is now defined as an invocation
445            of the |parse_char_stream| method.
446    
447    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
448    
449            * HTML.pm.src (parse_byte_string): Report various status
450            of the sniffing as info-level errors.  Support for new
451            decoding framework in parser resestting.
452            (new): Various default error levels were not set.
453    
454    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
455    
456            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
457            algorithm, except for the actual sniffing, is implemented
458            with new framework with Message::Charset::Info.
459    
460    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
461    
462            * CacheManifest.pm (_parse): Drop fragment identifiers from
463            URIs in fallback section (HTML5 revision 1596).
464    
465    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
466    
467            * Makefile (Entities.html): URI has changed.
468    
469    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
470    
471            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
472            1553).
473    
474    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
475    
476            * ContentChecker.pm: Noted that those returned in |table| are
477            no longer table elements, but table objects returned
478            by Whatpm::HTMLTable.
479    
480            * HTMLTable.pm (form_table): Return table element node
481            as |$table->{element}|.
482            (assign_header): Support for the |headers=""| attribute.
483    
484    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
485    
486            * HTMLTable.pm (assign_header): New function; first version
487            with no support for headers="".
488            (form_table): Include table width and height to the returned
489            table object for covenience.  Indexing in column assignement was wrong.
490            Set whether a data cell is empty or not for convenience.
491    
492    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
493    
494            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
495    
496    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
497    
498            * HTMLTable.pm: How table model errors are detected is
499            changed (HTML5 revision 1387).
500    
501    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
502    
503            * HTMLTable.pm: The algorithm now moves |tfoot| elements
504            to the end of the table (HTML5 revision 1380).
505    
506  2008-05-05  Wakaba  <wakaba@suika.fam.cx>  2008-05-05  Wakaba  <wakaba@suika.fam.cx>
507    
508          * HTMLTable.pm: The algorithm is now 0-based indexing, instead          * HTMLTable.pm: The algorithm is now 0-based indexing, instead

Legend:
Removed from v.1.224  
changed lines
  Added in v.1.299

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24