/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.218 by wakaba, Tue Apr 29 08:16:46 2008 UTC revision 1.297 by wakaba, Sat Sep 13 10:49:21 2008 UTC
# Line 1  Line 1 
1    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
4            entity related tokenizer states in favor of new states
5            implementing the consume character reference algorithm.
6    
7    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
8    
9            * HTML.pm.src: "Consume a character reference" algorithm is
10            now implemented as a tokenizer's state, rather than
11            a method, with minimum changes (more changes will
12            be made, in due course).  "Bogus comment state"'s inner
13            loop gets removed.
14    
15    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
16    
17            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
18            into their own tokenizer states.
19    
20    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
21    
22            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
23            is split into three states.
24    
25    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
26    
27            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
28            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
29            no longer does the tokenizer have to push back next input
30            characters in those states.
31    
32    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
33    
34            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
35            into four states so that no longer does the tokenizer have to push
36            back next input characters in that state.
37    
38    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
39    
40            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
41            which can be used to insert some wrapper between the character
42            stream handle and the tokenizer.  (It is currently not supported
43            for |set_inner_html| for |Element|s).
44    
45    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
46    
47            * HTML.pm.src: Ignore punctuations in charset names.
48    
49    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
50    
51            * ContentChecker.pm: Support for charset-layer error levels.
52    
53            * HTML.pm.src: Don't specify |text| argument for the
54            |chardecode:fallback| error, since it is not the encoding
55            being used alternatively.
56    
57    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
58    
59            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
60    
61    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
62    
63            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
64    
65    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
66    
67            * HTML.pm.src: Bug fix and sync with the spec with regard
68            to after after frameset insertion mode processing (HTML5
69            revision 1909).  Note that the implementation was wrong
70            per the old spec before the r1909 changes.
71    
72    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
73    
74            * HTMLTable.pm: scope=auto algorithm fix synced with the
75            spec (HTML5 revision 2093).
76            ($process_row): Algorithm step numbers synced with the
77            spec (HTML5 revision 2092).
78    
79    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
80    
81            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
82            revision 2094).
83    
84    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
85    
86            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
87    
88    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
89    
90            * HTML.pm.src: '"' and "'" at the end of attribute
91            name (after another attribute) now raise parse error (HTML5
92            revision 2123).  Empty unquoted attribute values are no
93            longer allowed (HTML5 revision 2122).
94    
95    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
96    
97            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
98            revision 2130).
99    
100    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
101    
102            * ContentChecker.pm: |xml:lang| attribute value must be same
103            as |lang| attribute value for HTML elements (HTML5 revision 2062
104            and so on).
105    
106    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
107    
108            * ContentChecker.pm: Error level definition for |xml_id_error|
109            was missing.
110    
111            * URIChecker.pm: The end of the URL should be marked as the
112            error location for an empty path error.  The position
113            between the userinfo and the port components should be
114            marked as the error location for an empty host error.
115    
116    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
117    
118            * URIChecker.pm: Set parameters representing where in the
119            value the error occurs for errors.  Report unknown
120            address format error in warning level, since address
121            formats are rarely added.  Path segments starting with "/.."
122            were misinterpreted as a dot-segment.
123    
124    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
125    
126            * URIChecker.pm (check_iri_reference): Requires
127            |Message::DOM::DOMImplementation|.
128    
129    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
130    
131            * IMTChecker.pm: Updated for the new error reporting architecture.
132    
133            * ContentChecker.pm: Error levels for IMTs are added.
134    
135    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
136    
137            * H2H.pm (_shift_token): Support for unquoted HTML attribute
138            values.
139    
140    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
141    
142            * CacheManifest.pm: Support for new style of error
143            reports.
144    
145            * HTML.pm.src: Set line=1, column=1 to the document node.
146    
147    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
148    
149            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
150            and URL checkers.  Support for more error levels for bogus
151            langauge tag and URL "standards".
152    
153            * LangTag.pm, URIChecker.pm: Support for new style error
154            level reporting.
155    
156    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
157    
158            * ContentChecker.pm: Support for RDF/XML error levels.
159    
160            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
161            specifying.  Error types are revised.
162    
163    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
164    
165            * ContentChecker.pm: All error reporting method calls are
166            renewed.
167    
168    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
169    
170            * HTML.pm.src: All error type names and "text" parameters
171            are revised.  Use new style for "level" specification.
172    
173            * mkhtmlparser.pl: Use new style for "level" specification.
174    
175    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
176    
177            * WebIDL.pm (parse_char_string): Simplified error
178            reporting process for broken ignored valuetype definition.
179            (Valuetype idl_text): Support for special "DOMString" name.
180    
181    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
182    
183            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
184            terminal of the ScopedName is "DOMString", such that whether
185            the last part of the scoped name is "DOMString" or "_DOMString"
186            later.  It is necessary to determine whether a |typedef|
187            definition should be ignored or not.
188            (parse_char_string): Unescape the identifier of
189            exception members.
190            ($resolve): Return undef for builtin types and sequence<T>
191            types (we might not have to do this, however...).
192            (check): Support checking for Exceptions, Valuetypes,
193            and Typedefs.
194            ($serialize_type): Support for "DOMString::::" syntax.
195            (Typedef idl_text): Output Type as "DOMString" if it
196            is really "DOMString" (i.e. its internal representation
197            is "::DOMString::").
198    
199    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
200    
201            * WebIDL.pm ($resolve): New code, based on resolve code
202            for constant types in the |check| method.
203            (check): Support for checking of attributes, operations, and
204            arguments.
205            (Attribute/Operation idl_text): Exception names in getraises,
206            setraises, and raises clauses is serizlied by |$serialize_type|
207            code.
208    
209    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
210    
211            * WebIDL.pm ($integer): Order of selections are changed to match
212            hexadecimal numbers (the original pattern, taken from the spec,
213            was not work for hexadecimal numbers, because the "0" prefix
214            matches to the [0-7]* part (as an empty string) and therefore
215            it does not match with remaining "x..." part of a "0x..." integer
216            literal.
217            ($get_type): It now returns a string, not an array reference,
218            for regular types and |sequence| types (i.e. it in any case
219            returns a string).
220            ($get_next_token): The second item in the array that represents
221            a integer or float token is now a Perl number value, not the
222            original string representation of the number.
223            (check): Support for const value consistency checking.
224            No extended attribute is defined for constants.
225            (Node subclasses): Use simple strings rather than array references
226            for default data type values.
227            ($serialize_type): Type values are now simple strings.
228            (value): If the new attribute value is a false value, then
229            a FALSE value is set to the attribute.
230    
231    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
232    
233            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
234            in its stringified format ("scoped name" as defined in the
235            spec).  Note that future version of this module should not use
236            array references for type values and the |type_text| attribute
237            should be made obsolete.
238            (parse_char_string): Unescape attribute names.
239            (check): Support for checking of whether inherited interfaces
240            are actually defined or not.  Support for checking of whether
241            interface member identifiers are duplicated or not.
242            ($serialize_type): Scoped names are returned as is.  A future
243            version of this code should escape identifiers other than "DOMString",
244            otherwise the idl_text would be non-conforming.
245    
246    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
247    
248            * WebIDL.pm (parse_char_string): Set line/column numbers
249            to generated nodes.  Unescape identifiers.  Extended attributes
250            for Definition's were ignored.
251            (append_child): Set |parent_node| attribute.
252            (parent_node): New attribute.
253            (check): Support interface/exception members.  Support
254            extended attributes.  Support definition identifier uniqueness
255            constraint.
256            (qualified_name): New attribute.
257            (Interface/Exception idl_text): Extended attributes were
258            not prepended to the returned text.
259    
260    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
261    
262            * WebIDL.pm (parse_char_string): Set line/column numbers
263            to interface object experimentally.  s/shift/pop/g, shift
264            would make things wrong.  Support for interface forward
265            declarations was missing.  Broken interface declarations
266            with no block were not ignored entirely.
267            (Whatpm::WebIDL::Node): New abstract class.  This class
268            makes things easier.
269            (child_nodes): New attribute.  Unlike DOM's attribute with
270            same name, this attribute returns a dead list of nodes for
271            simplicity.
272            (get_user_data, set_user_data): New methods.
273            (Module idl_text): A SPACE character should be inserted
274            before the |{| character.
275            (Interface idl_text): Support for interface forward declarations.
276            (is_forward_declaration): New attribute.
277    
278    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
279    
280            * WebIDL.pm (type_text): Better serializer.
281    
282    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
283    
284            * WebIDL.pm: Revise forward-compatible parsing so that
285            it now can handle broken extended attributes and as such.
286    
287    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
288    
289            * WebIDL.pm: Real support for extended attributes.
290            Support for extended attributes with arguments.
291    
292    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
293    
294            * WebIDL.pm: Support for |exception| syntax.
295            (Interface->idl_text): Tentative support for inheritances.
296    
297    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
298    
299            * WebIDL.pm: Hierarchical scoped name support was broken.
300            Support for raises, setraises, and getraises syntaxes.
301    
302    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
303    
304            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
305            proper support for types, extended attributes, and exceptions yet).
306            WebIDL parser, version 1 (no support for exceptions yet,
307            no proper support for extended attributes yet).
308    
309    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
310    
311            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
312    
313    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
314    
315            * WebIDL.pm: Support for valuetype and const.
316    
317    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
318            
319            * WebIDL.pm: New module.
320    
321    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
322    
323            * Makefile (Entities.html): URI changed.
324    
325    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
326    
327            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
328    
329    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
330    
331            * HTML.pm.src (_get_next_token): A parse error was missing.
332    
333    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
334    
335            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
336            spec (commented out). (HTML5 revision 1711).
337    
338    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
339    
340            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
341    
342            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
343            as UTF-16 (HTML5 revision 1701).
344    
345    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
346    
347            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
348            1692).
349    
350    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
351    
352            * HTML.pm.src: The secondary insertion mode used when switching
353            to foreign content is the "in body" insertion mode (HTML5 revision
354            1696).
355    
356    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
357    
358            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
359            revision 1697).
360    
361    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
362    
363            * HTML.pm.src: Support for end-of-file token in foreign content
364            insertion mode (HTML5 revision 1693).  Update SVG camelCase
365            attribute list (HTML5 revision 1700).  <textarea> closes
366            </select> (HTML5 revision 1699).  More start tags close in
367            foreign content insertion mode (HTML5 revision 1698).
368    
369    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
370    
371            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
372    
373    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
374    
375            * HTML.pm.src: More robust charset parameter detection (HTML5
376            revision 1674).
377    
378    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
379    
380            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
381            revision 1676).
382    
383    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
384    
385            * HTML.pm.src: Ignore language part of public identifiers for
386            quriks mode detection (HTML5 revision 1679).
387    
388    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
389    
390            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
391            revision 1685).
392    
393    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
394    
395            * HTML.pm.src: Support for EOF in new states for tags (HTML5
396            revision 1684).
397    
398    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
399    
400            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
401            work (HTML5 revision 1690).
402    
403    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
404    
405            * HTML.pm.src (_tree_construction_main): Change handling of
406            end tags in head insertion modes (HTML5 revision 1686).
407            (parse_char_string): Bug fix for non-utf8 character string handlings.
408            (parse_char_stream): |ungetc| does not work well for this context.
409    
410    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
411    
412            * HTML.pm.src (parse_byte_string): Redefined to invoke
413            |parse_byte_stream|.
414            (parse_byte_stream): New method.
415    
416    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
417    
418            * HTML.pm.src (parse_byte_string): Fix the column number reported
419            by encoding layer error reporter.
420    
421    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
422    
423            * HTML.pm.src (parse_byte_string): Use streaming decoder
424            rather than converting the whole byte string and then parsing.
425            Propagate errors in character encoding layer.
426            (get_next_token): Precise error reporting for |bare stago| error.
427    
428    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
429    
430            * HTML.pm.src (parse_char_stream): New method.
431            (parse_char_string): This method is now defined as an invocation
432            of the |parse_char_stream| method.
433    
434    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
435    
436            * HTML.pm.src (parse_byte_string): Report various status
437            of the sniffing as info-level errors.  Support for new
438            decoding framework in parser resestting.
439            (new): Various default error levels were not set.
440    
441    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
442    
443            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
444            algorithm, except for the actual sniffing, is implemented
445            with new framework with Message::Charset::Info.
446    
447    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
448    
449            * CacheManifest.pm (_parse): Drop fragment identifiers from
450            URIs in fallback section (HTML5 revision 1596).
451    
452    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
453    
454            * Makefile (Entities.html): URI has changed.
455    
456    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
457    
458            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
459            1553).
460    
461    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
462    
463            * ContentChecker.pm: Noted that those returned in |table| are
464            no longer table elements, but table objects returned
465            by Whatpm::HTMLTable.
466    
467            * HTMLTable.pm (form_table): Return table element node
468            as |$table->{element}|.
469            (assign_header): Support for the |headers=""| attribute.
470    
471    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
472    
473            * HTMLTable.pm (assign_header): New function; first version
474            with no support for headers="".
475            (form_table): Include table width and height to the returned
476            table object for covenience.  Indexing in column assignement was wrong.
477            Set whether a data cell is empty or not for convenience.
478    
479    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
480    
481            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
482    
483    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
484    
485            * HTMLTable.pm: How table model errors are detected is
486            changed (HTML5 revision 1387).
487    
488    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
489    
490            * HTMLTable.pm: The algorithm now moves |tfoot| elements
491            to the end of the table (HTML5 revision 1380).
492    
493    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
494    
495            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
496            of 1-based (HTML5 revision 1376).
497    
498    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
499    
500            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
501            and "Content-Encoding" no longer prevent sniffing (HTML5
502            revision 1288).
503    
504    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
505    
506            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
507            1282).
508    
509    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
510    
511            * ContentChecker.pm: Support for global attributes.
512            Status of XML specs are added.
513    
514    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
515    
516            * ContentChecker.pm (check_element): Support for |template=""|
517            and |ref=""| attribute (referent element type checking).
518    
519  2008-04-29  Wakaba  <wakaba@suika.fam.cx>  2008-04-29  Wakaba  <wakaba@suika.fam.cx>
520    
521          * CacheManifest.pm (_parse): New same origin definition          * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
522          is defined (except for IDNA part and URI-scheme-specific          1500) is implemented (except for IDNA part and URI-scheme-specific
523          knowledge).  Line number counting was wrong for LF-only          knowledge).  Line number counting was wrong for LF-only
524          documents.          documents.
525    

Legend:
Removed from v.1.218  
changed lines
  Added in v.1.297

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24