/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.129 by wakaba, Sun Nov 25 03:46:07 2007 UTC revision 1.311 by wakaba, Mon Sep 15 07:19:03 2008 UTC
# Line 1  Line 1 
1    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Remove checking for control character, surrogate
4            pair, or noncharacter code points and non-Unicode code
5            points (they should be handled by Whatpm::Charset::UnicodeChecker).
6            (parse_char_stream): Support for the |$get_wrapper| argument and
7            character stream error handlers.
8    
9    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
10    
11            * ContentChecker.pm: Don't call |loda_ns_module|
12            for null-namespace elements/attributes.
13    
14            * HTML.pm.src: Fact out $disallowed_control_chars
15            as a hash.
16    
17    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
18    
19            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
20            and |{next_char}| initializations are moved to initialization
21            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
22            with |parse_char_stream|.
23    
24    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
25    
26            * HTML.pm.src (parse_char_stream): Make |set_next_char|
27            invoke |manakai_read_until|, not only |read|, where
28            possible, to decrease the number of |read| method calls.
29    
30            * mkhtmlparser.pl: Related changes to the aforementioned
31            modification.
32    
33    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
34    
35            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
36            would report character error from now.
37    
38    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
39    
40            * HTML.pm.src: White-space-leaded non-white-space character
41            tokens in "before head insertion mode" was not
42            correctly handled.
43            (set_inner_html): Reimplemented using CharString decodehandle
44            class.  Support for $get_wrapper argument.  Support
45            for |{read_until}| feature.
46    
47    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
48    
49            * HTML.pm.src: Make a "bare ero" error for unknown
50            entities point the "&" character.
51    
52    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
53    
54            * HTML.pm.src: It turns out that U+FFFD don't have to
55            be added to the list of excluded characters.
56    
57    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
58    
59            * HTML.pm.src ($char_onerror): Have character decoder's |line|
60            and |column| a higher priority than the one set by the
61            tokenizer's input handler.
62            ($self->{read_until}): Exclude U+FFFD (but this might
63            not be necessary, since now we do line/column fixup in
64            the character decode handle).
65    
66    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
67    
68            * HTML.pm.src: Use |{read_until}| where possible.
69    
70    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
71    
72            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
73            and |manakai_getc_until| to |manakai_read_until| to
74            reduce the number of string copies.
75    
76    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
77    
78            * HTML.pm.src (parse_char_string): Use newly created
79            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
80            standard feature to |open| a string as a filehandle,
81            since Perl's string filehandle seems not supporting |ungetc|
82            method correctly.
83            (parse_char_stream): Define |{getc_until}| method.
84            (DATA_STATE): Experimental support for |getc_until| feature.
85    
86    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
87    
88            * HTML.pm.src: Check points added to newly added branches.
89    
90    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
91    
92            * HTML.pm.src: Remove |{char}|, which is no longer used.
93            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
94            and replaced by |{prev_state}|.
95    
96            * mkhtmlparser.pl: Remove |{char}| feature.
97            Remove |!!!back-next-input-character;| macro.
98    
99    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
100    
101            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
102            entity related tokenizer states in favor of new states
103            implementing the consume character reference algorithm.
104    
105    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
106    
107            * HTML.pm.src: "Consume a character reference" algorithm is
108            now implemented as a tokenizer's state, rather than
109            a method, with minimum changes (more changes will
110            be made, in due course).  "Bogus comment state"'s inner
111            loop gets removed.
112    
113    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
114    
115            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
116            into their own tokenizer states.
117    
118    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
119    
120            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
121            is split into three states.
122    
123    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
124    
125            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
126            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
127            no longer does the tokenizer have to push back next input
128            characters in those states.
129    
130    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
131    
132            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
133            into four states so that no longer does the tokenizer have to push
134            back next input characters in that state.
135    
136    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
137    
138            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
139            which can be used to insert some wrapper between the character
140            stream handle and the tokenizer.  (It is currently not supported
141            for |set_inner_html| for |Element|s).
142    
143    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
144    
145            * HTML.pm.src: Ignore punctuations in charset names.
146    
147    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
148    
149            * ContentChecker.pm: Support for charset-layer error levels.
150    
151            * HTML.pm.src: Don't specify |text| argument for the
152            |chardecode:fallback| error, since it is not the encoding
153            being used alternatively.
154    
155    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
156    
157            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
158    
159    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
160    
161            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
162    
163    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
164    
165            * HTML.pm.src: Bug fix and sync with the spec with regard
166            to after after frameset insertion mode processing (HTML5
167            revision 1909).  Note that the implementation was wrong
168            per the old spec before the r1909 changes.
169    
170    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
171    
172            * HTMLTable.pm: scope=auto algorithm fix synced with the
173            spec (HTML5 revision 2093).
174            ($process_row): Algorithm step numbers synced with the
175            spec (HTML5 revision 2092).
176    
177    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
178    
179            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
180            revision 2094).
181    
182    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
183    
184            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
185    
186    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
187    
188            * HTML.pm.src: '"' and "'" at the end of attribute
189            name (after another attribute) now raise parse error (HTML5
190            revision 2123).  Empty unquoted attribute values are no
191            longer allowed (HTML5 revision 2122).
192    
193    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
194    
195            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
196            revision 2130).
197    
198    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
199    
200            * ContentChecker.pm: |xml:lang| attribute value must be same
201            as |lang| attribute value for HTML elements (HTML5 revision 2062
202            and so on).
203    
204    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
205    
206            * ContentChecker.pm: Error level definition for |xml_id_error|
207            was missing.
208    
209            * URIChecker.pm: The end of the URL should be marked as the
210            error location for an empty path error.  The position
211            between the userinfo and the port components should be
212            marked as the error location for an empty host error.
213    
214    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
215    
216            * URIChecker.pm: Set parameters representing where in the
217            value the error occurs for errors.  Report unknown
218            address format error in warning level, since address
219            formats are rarely added.  Path segments starting with "/.."
220            were misinterpreted as a dot-segment.
221    
222    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
223    
224            * URIChecker.pm (check_iri_reference): Requires
225            |Message::DOM::DOMImplementation|.
226    
227    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
228    
229            * IMTChecker.pm: Updated for the new error reporting architecture.
230    
231            * ContentChecker.pm: Error levels for IMTs are added.
232    
233    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
234    
235            * H2H.pm (_shift_token): Support for unquoted HTML attribute
236            values.
237    
238    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
239    
240            * CacheManifest.pm: Support for new style of error
241            reports.
242    
243            * HTML.pm.src: Set line=1, column=1 to the document node.
244    
245    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
246    
247            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
248            and URL checkers.  Support for more error levels for bogus
249            langauge tag and URL "standards".
250    
251            * LangTag.pm, URIChecker.pm: Support for new style error
252            level reporting.
253    
254    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
255    
256            * ContentChecker.pm: Support for RDF/XML error levels.
257    
258            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
259            specifying.  Error types are revised.
260    
261    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
262    
263            * ContentChecker.pm: All error reporting method calls are
264            renewed.
265    
266    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
267    
268            * HTML.pm.src: All error type names and "text" parameters
269            are revised.  Use new style for "level" specification.
270    
271            * mkhtmlparser.pl: Use new style for "level" specification.
272    
273    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
274    
275            * WebIDL.pm (parse_char_string): Simplified error
276            reporting process for broken ignored valuetype definition.
277            (Valuetype idl_text): Support for special "DOMString" name.
278    
279    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
280    
281            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
282            terminal of the ScopedName is "DOMString", such that whether
283            the last part of the scoped name is "DOMString" or "_DOMString"
284            later.  It is necessary to determine whether a |typedef|
285            definition should be ignored or not.
286            (parse_char_string): Unescape the identifier of
287            exception members.
288            ($resolve): Return undef for builtin types and sequence<T>
289            types (we might not have to do this, however...).
290            (check): Support checking for Exceptions, Valuetypes,
291            and Typedefs.
292            ($serialize_type): Support for "DOMString::::" syntax.
293            (Typedef idl_text): Output Type as "DOMString" if it
294            is really "DOMString" (i.e. its internal representation
295            is "::DOMString::").
296    
297    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
298    
299            * WebIDL.pm ($resolve): New code, based on resolve code
300            for constant types in the |check| method.
301            (check): Support for checking of attributes, operations, and
302            arguments.
303            (Attribute/Operation idl_text): Exception names in getraises,
304            setraises, and raises clauses is serizlied by |$serialize_type|
305            code.
306    
307    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
308    
309            * WebIDL.pm ($integer): Order of selections are changed to match
310            hexadecimal numbers (the original pattern, taken from the spec,
311            was not work for hexadecimal numbers, because the "0" prefix
312            matches to the [0-7]* part (as an empty string) and therefore
313            it does not match with remaining "x..." part of a "0x..." integer
314            literal.
315            ($get_type): It now returns a string, not an array reference,
316            for regular types and |sequence| types (i.e. it in any case
317            returns a string).
318            ($get_next_token): The second item in the array that represents
319            a integer or float token is now a Perl number value, not the
320            original string representation of the number.
321            (check): Support for const value consistency checking.
322            No extended attribute is defined for constants.
323            (Node subclasses): Use simple strings rather than array references
324            for default data type values.
325            ($serialize_type): Type values are now simple strings.
326            (value): If the new attribute value is a false value, then
327            a FALSE value is set to the attribute.
328    
329    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
330    
331            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
332            in its stringified format ("scoped name" as defined in the
333            spec).  Note that future version of this module should not use
334            array references for type values and the |type_text| attribute
335            should be made obsolete.
336            (parse_char_string): Unescape attribute names.
337            (check): Support for checking of whether inherited interfaces
338            are actually defined or not.  Support for checking of whether
339            interface member identifiers are duplicated or not.
340            ($serialize_type): Scoped names are returned as is.  A future
341            version of this code should escape identifiers other than "DOMString",
342            otherwise the idl_text would be non-conforming.
343    
344    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
345    
346            * WebIDL.pm (parse_char_string): Set line/column numbers
347            to generated nodes.  Unescape identifiers.  Extended attributes
348            for Definition's were ignored.
349            (append_child): Set |parent_node| attribute.
350            (parent_node): New attribute.
351            (check): Support interface/exception members.  Support
352            extended attributes.  Support definition identifier uniqueness
353            constraint.
354            (qualified_name): New attribute.
355            (Interface/Exception idl_text): Extended attributes were
356            not prepended to the returned text.
357    
358    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
359    
360            * WebIDL.pm (parse_char_string): Set line/column numbers
361            to interface object experimentally.  s/shift/pop/g, shift
362            would make things wrong.  Support for interface forward
363            declarations was missing.  Broken interface declarations
364            with no block were not ignored entirely.
365            (Whatpm::WebIDL::Node): New abstract class.  This class
366            makes things easier.
367            (child_nodes): New attribute.  Unlike DOM's attribute with
368            same name, this attribute returns a dead list of nodes for
369            simplicity.
370            (get_user_data, set_user_data): New methods.
371            (Module idl_text): A SPACE character should be inserted
372            before the |{| character.
373            (Interface idl_text): Support for interface forward declarations.
374            (is_forward_declaration): New attribute.
375    
376    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
377    
378            * WebIDL.pm (type_text): Better serializer.
379    
380    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
381    
382            * WebIDL.pm: Revise forward-compatible parsing so that
383            it now can handle broken extended attributes and as such.
384    
385    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
386    
387            * WebIDL.pm: Real support for extended attributes.
388            Support for extended attributes with arguments.
389    
390    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
391    
392            * WebIDL.pm: Support for |exception| syntax.
393            (Interface->idl_text): Tentative support for inheritances.
394    
395    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
396    
397            * WebIDL.pm: Hierarchical scoped name support was broken.
398            Support for raises, setraises, and getraises syntaxes.
399    
400    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
401    
402            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
403            proper support for types, extended attributes, and exceptions yet).
404            WebIDL parser, version 1 (no support for exceptions yet,
405            no proper support for extended attributes yet).
406    
407    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
408    
409            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
410    
411    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
412    
413            * WebIDL.pm: Support for valuetype and const.
414    
415    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
416            
417            * WebIDL.pm: New module.
418    
419    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
420    
421            * Makefile (Entities.html): URI changed.
422    
423    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
424    
425            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
426    
427    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
428    
429            * HTML.pm.src (_get_next_token): A parse error was missing.
430    
431    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
432    
433            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
434            spec (commented out). (HTML5 revision 1711).
435    
436    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
437    
438            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
439    
440            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
441            as UTF-16 (HTML5 revision 1701).
442    
443    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
444    
445            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
446            1692).
447    
448    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
449    
450            * HTML.pm.src: The secondary insertion mode used when switching
451            to foreign content is the "in body" insertion mode (HTML5 revision
452            1696).
453    
454    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
455    
456            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
457            revision 1697).
458    
459    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
460    
461            * HTML.pm.src: Support for end-of-file token in foreign content
462            insertion mode (HTML5 revision 1693).  Update SVG camelCase
463            attribute list (HTML5 revision 1700).  <textarea> closes
464            </select> (HTML5 revision 1699).  More start tags close in
465            foreign content insertion mode (HTML5 revision 1698).
466    
467    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
468    
469            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
470    
471    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
472    
473            * HTML.pm.src: More robust charset parameter detection (HTML5
474            revision 1674).
475    
476    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
477    
478            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
479            revision 1676).
480    
481    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
482    
483            * HTML.pm.src: Ignore language part of public identifiers for
484            quriks mode detection (HTML5 revision 1679).
485    
486    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
487    
488            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
489            revision 1685).
490    
491    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
492    
493            * HTML.pm.src: Support for EOF in new states for tags (HTML5
494            revision 1684).
495    
496    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
497    
498            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
499            work (HTML5 revision 1690).
500    
501    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
502    
503            * HTML.pm.src (_tree_construction_main): Change handling of
504            end tags in head insertion modes (HTML5 revision 1686).
505            (parse_char_string): Bug fix for non-utf8 character string handlings.
506            (parse_char_stream): |ungetc| does not work well for this context.
507    
508    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
509    
510            * HTML.pm.src (parse_byte_string): Redefined to invoke
511            |parse_byte_stream|.
512            (parse_byte_stream): New method.
513    
514    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
515    
516            * HTML.pm.src (parse_byte_string): Fix the column number reported
517            by encoding layer error reporter.
518    
519    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
520    
521            * HTML.pm.src (parse_byte_string): Use streaming decoder
522            rather than converting the whole byte string and then parsing.
523            Propagate errors in character encoding layer.
524            (get_next_token): Precise error reporting for |bare stago| error.
525    
526    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
527    
528            * HTML.pm.src (parse_char_stream): New method.
529            (parse_char_string): This method is now defined as an invocation
530            of the |parse_char_stream| method.
531    
532    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
533    
534            * HTML.pm.src (parse_byte_string): Report various status
535            of the sniffing as info-level errors.  Support for new
536            decoding framework in parser resestting.
537            (new): Various default error levels were not set.
538    
539    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
540    
541            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
542            algorithm, except for the actual sniffing, is implemented
543            with new framework with Message::Charset::Info.
544    
545    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
546    
547            * CacheManifest.pm (_parse): Drop fragment identifiers from
548            URIs in fallback section (HTML5 revision 1596).
549    
550    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
551    
552            * Makefile (Entities.html): URI has changed.
553    
554    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
555    
556            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
557            1553).
558    
559    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
560    
561            * ContentChecker.pm: Noted that those returned in |table| are
562            no longer table elements, but table objects returned
563            by Whatpm::HTMLTable.
564    
565            * HTMLTable.pm (form_table): Return table element node
566            as |$table->{element}|.
567            (assign_header): Support for the |headers=""| attribute.
568    
569    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
570    
571            * HTMLTable.pm (assign_header): New function; first version
572            with no support for headers="".
573            (form_table): Include table width and height to the returned
574            table object for covenience.  Indexing in column assignement was wrong.
575            Set whether a data cell is empty or not for convenience.
576    
577    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
578    
579            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
580    
581    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
582    
583            * HTMLTable.pm: How table model errors are detected is
584            changed (HTML5 revision 1387).
585    
586    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
587    
588            * HTMLTable.pm: The algorithm now moves |tfoot| elements
589            to the end of the table (HTML5 revision 1380).
590    
591    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
592    
593            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
594            of 1-based (HTML5 revision 1376).
595    
596    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
597    
598            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
599            and "Content-Encoding" no longer prevent sniffing (HTML5
600            revision 1288).
601    
602    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
603    
604            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
605            1282).
606    
607    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
608    
609            * ContentChecker.pm: Support for global attributes.
610            Status of XML specs are added.
611    
612    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
613    
614            * ContentChecker.pm (check_element): Support for |template=""|
615            and |ref=""| attribute (referent element type checking).
616    
617    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
618    
619            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
620            1500) is implemented (except for IDNA part and URI-scheme-specific
621            knowledge).  Line number counting was wrong for LF-only
622            documents.
623    
624    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
625    
626            * HTML.pm.src: Raise an parse error for any disallowed
627            character (HTML5 revision 1263).
628    
629    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
630    
631            * mkentitylist.pl: Support for new HTML5 entity table format (the
632            definition for |AElig;| was missing).
633    
634    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
635    
636            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
637            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
638    
639    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
640    
641            * HTML.pm.src: List of element names that close foreign content
642            insertion mode is added (HTML5 revisions 1412 and 1418).
643    
644    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
645    
646            * HTML.pm.src: Support for |mglyph| and |malignmark|
647            elements (HTML5 revision 1410).
648    
649    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
650    
651            * HTML.pm.src: Support for new long MathML entities (HTML5
652            revision 1406).
653    
654    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
655    
656            * HTML.pm.src: CDATA section support for MathML and SVG
657            elements (HTML5 revision 1404 and 1420).
658    
659    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
660    
661            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
662            elements (HTML5 revision 1404).  Unused !!!macro definitions
663            are removed.
664    
665    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
666    
667            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
668            are raised is changed (HTML5 revision 1404).
669    
670    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
671    
672            * HTML.pm.src: Category sets that are no longer used are removed.
673    
674    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
675    
676            * HTML.pm.src: The ->[1] property of stack entries are now
677            replaced by constants representing element category.
678    
679    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
680    
681            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
682            for error reporting.  (This is a preparation for using constant
683            value for ->[1].)
684    
685    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
686    
687            * RDFXML.pm: Typo fixed.
688    
689    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
690    
691            * ContentChecker.pm: |fact_level| is now treated
692            as same as |must_level|, i.e. level = |m|.
693            (check_element): Make list of URIs in the DOM.
694    
695    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
696    
697            * RDFXML.pm: Language accessor implemented.
698            Local (null-namespace) attribute support.
699    
700    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
701    
702            * RDFXML.pm: Factored out ID checking code.
703    
704    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
705    
706            * RDFXML.pm: TODO items noted.  Validation of ID
707            and URI attributes is implemented.  Warn if unknown
708            value is used in rdf:parseType="" attribute.
709    
710            * URIChecker.pm (check_rdf_uri_reference): New function.
711    
712    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
713    
714            * RDFXML.pm: bnodeid implemented.  Relative references
715            are now resolved.
716    
717    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
718    
719            * ContentChecker.pm: RDF reification implemented.
720    
721            * RDFXML.pm: undef vs false bug fixed.
722            Reification implemented.
723    
724    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
725    
726            * RDFXML.pm: s/id/ID/ for attribute name.
727            The |node| arguments are added for |ontriple| calls.
728            Too many "attribute not allowed" errors were raised.
729    
730            * ContentChecker.pm: Initial experimental support for rdf:RDF
731            element.
732    
733    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
734    
735            * RDFXML.pm: New module.
736    
737    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
738    
739            * HTML.pm.src (set_inner_html): Line/column number
740            code was old one yet.
741    
742    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
743    
744            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
745            errors.  Line/column markings for DOCTYPE, comment, and
746            character tokens are reintroduced; otherwise,
747            error location for "not HTML5" error and errors
748            for implied elements are not attached.
749    
750    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
751    
752            * HTML.pm.src: Set line/column numbers to attributes.
753    
754            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
755            (value): Setter implemented.
756    
757            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
758    
759    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
760    
761            * HTML.pm.src: Unused line/column markings are removed.
762    
763    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
764    
765            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
766            flag, which is no longer used.
767    
768    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
769    
770            * HTML.pm.src: Set line/column information to element nodes.
771    
772            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
773            and !!!insert-element-t): Set line/column information to
774            element nodes.
775    
776    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
777    
778            * HTML.pm.src (_get_next_token): The first "<" character
779            in "<?", "<>", or "</>" should be the error point.
780    
781    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
782    
783            * HTML.pm.src: Some more fixes on error position reporting.
784    
785    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
786    
787            * HTML.pm.src: Token-level precious error reporting.
788    
789    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
790    
791            * HTML.pm.src: Preparation for more precious error point
792            reporting.
793    
794    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
795    
796            * HTML.pm.src: Error type revised.
797    
798    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
799    
800            * HTML.pm.src: Similar codes are merged together, again.
801    
802    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
803    
804            * HTML.pm.src: Similar codes are merged together.
805    
806    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
807    
808            * mkhtmlparser.pl: Set "level" parameter to parse errors.
809    
810            * HTML.pm.src: Code refined.
811    
812    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
813    
814            * HTML.pm.src: |</body>| treatement has been changed (HTML5
815            revision 1348).  Note that I really don't know this makes
816            any difference in the black-box behavior of the parser.
817    
818    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
819    
820            * HTML.pm.src: New end-of-file token implementation (HTML5
821            revision 1348).
822    
823    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
824    
825            * HTML.pm.src: |applet| support (HTML5 revision 1347).
826    
827    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
828    
829            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
830    
831    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
832    
833            * HTML.pm.src: Support for |<input>| in the "in select" insertion
834            mode and support for the "in select in table" insertion mode (HTML5
835            revision 1342).
836    
837    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
838    
839            * HTML.pm.src: No foster parenting for <script> and <script>
840            in non-tainted <table>s (HTML5 revision 1336).
841    
842    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
843    
844            * HTML.pm.src: Ignore white space characters between <html>
845            and <head> (HTML5 revision 1332).
846    
847    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
848    
849            * HTML.pm.src: Treat <input type=hidden> as if it were a
850            white space (HTML5 revision 1331).
851    
852    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
853    
854            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
855            element (HTML5 revision 1330).
856    
857    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
858    
859            * HTML.pm.src: <title> is always appended to the current
860            element (HTML5 revision 1328).
861    
862    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
863    
864            * HTML.pm.src: White space in tainted tables are moved
865            into foster parents (HTML5 revision 1326).
866    
867    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
868    
869            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
870            revision 1321).
871    
872    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
873    
874            * HTML.pm.src: |</h/n/>| case code rearranged to align with
875            the spec (HTML5 revision 1320).  Note that we finally complete
876            all of HTML5 revision 1320 changes.
877    
878    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
879    
880            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
881            tags (HTML5 revision 1320).
882    
883    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
884    
885            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
886    
887    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
888    
889            * HTML.pm.src: A "generate implied end tags" code (t409.1)
890            could not be reached so that it is now removed (HTML5 revision
891            1320).
892    
893    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
894    
895            * HTML.pm.src: Code for the case of |</div>| and so on are revised
896            to align with new spec text (HTML5 revision 1320).
897    
898    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
899    
900            * HTML.pm.src: Remove strange |if| condition; however, it should
901            have had no harm in theory.
902    
903    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
904    
905            * HTML.pm.src (_tree_construction_main): '</p>' in body
906            case is split from other end tags for the preparation
907            of implementing HTML revision 1320.
908    
909    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
910    
911            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
912            revision 1320).
913    
914    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
915    
916            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
917            and "div" (HTML5 revision 1318).  Add comments to where
918            |form| pointer association codes should be inserted (HTML5 revision
919            1319).
920    
921    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
922    
923            * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
924    
925    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
926    
927            * HTML.pm.src: Since the case t268 should never be reached (no
928            other token type, there are), it is replaced by a |die| statement.
929    
930    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
931    
932            * HTML.pm.src: Typo fixed.
933    
934    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
935    
936            * HTML.pm.src (_tree_construction_initial): Some limited quirks
937            doctypes were not uppercased for comparison.
938    
939    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
940    
941            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
942            are added.
943    
944    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
945    
946            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
947            are set.  Cases that are unlikely reached are noted as so.
948    
949    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
950    
951            * HTML.pm.src: Checkpoints for debugging are added.
952    
953            * mkhtmlparser.pl: Support for |!!!cp| syntax.
954    
955    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
956    
957            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
958            for simplicity.
959    
960    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
961    
962            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
963            revision 1307).
964    
965    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
966    
967            * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
968            is set are changed (HTML5 revision 1305).
969    
970    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
971    
972            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
973            1292).  Entities are not parsed in comment-like part in RCDATA
974            elements (HTML5 revision 1294).  Allow bare & at the end
975            of attribute value literals (HTML5 revision 1296).  More
976            quirks mode doctypes (HTML5 revision 1302).  Requires spaces
977            between attributes and ban attribute names or unquoted
978            attribute values containing single or double quotes (HTML5
979            revision 1303).
980    
981    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
982    
983            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
984            and related errors unless it is an HTML document (though the spec
985            is unclear on whether it is applied to XHTML document).
986    
987            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
988            are added.
989    
990    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
991    
992            * HTML.pm.src: s/local_name/manakai_local_name/g.
993    
994    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
995    
996            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
997    
998            * HTML.pm.src: |charset| in |content| attribute is
999            case-insensitive (HTML5 revision 1270).
1000    
1001    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
1002    
1003            * ContentChecker.pm: New status constants are added.
1004            ($ElementDefault): |status| added.
1005            (check_element): Err for non-standard or deprecated elements.
1006            (_attr_status_info): For non-standard or deprecated attributes.
1007    
1008    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1009    
1010            * ContentChecker.pm (_attr_status_info): New internal method.
1011    
1012    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1013    
1014            * ContentChecker.pm (check_element): Element standardized
1015            status information is now dispatched.
1016    
1017    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1018    
1019            * ContentChecker.pm (check_element): Fix |del|-and-significant
1020            problem by adding some more arguments.
1021    
1022    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1023    
1024            * ContentChecker.pm (check_element): Use context of
1025            container-for-the-purpose-of-content-model element (not transparent
1026            element) for |check_child_element| calling and significant
1027            text flag marking.  This reintroduces |<del>|-and-significant
1028            problem again.
1029    
1030    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1031    
1032            * ContentChecker.pm (check_element): Make semi-transparent
1033            elements ignored for the purpose of phase changes in
1034            content model checking.
1035    
1036    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1037    
1038            * ContentChecker.pm (check_element): In-element state
1039            was not properly managed for transparent cases.
1040    
1041    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1042    
1043            * ContentChecker.pm (check_element): Support for |video|
1044            and |audio| as semi-transparent elements.
1045    
1046    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1047    
1048            * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1049            (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1050            Support for |html:object| as a semi-transparent element.
1051    
1052    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1053    
1054            * ContentChecker.pm (check_element): The way to traverse
1055            the tree is entirely revised to make it easier to track
1056            the state of ancestors/descendants.  As a result of this
1057            revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1058            support for content model checking for HTML elements |figure|,
1059            |object|, |video|, and |audio| and checking for XML elements (and
1060            some XMLNS checkings) are dropped for now.  They will be
1061            reimplemented in due cource.
1062    
1063    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1064    
1065            * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1066            token (HTML5 revision 1225).
1067    
1068    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1069    
1070            * ContentChecker.pm ({unsupported_level}): New value.
1071    
1072            * HTML.pm.src: Save whether |meta| |content| attribute
1073            contains character references or not.
1074    
1075    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1076    
1077            * ContentChecker.pm (_get_children): (Incompleted) attempt to
1078            imlement significant content checking for contents
1079            with |del| elements.
1080    
1081    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1082    
1083            * ContenteChecker.pm ($HTMLTransparentElements): More
1084            elements are added.
1085            (_get_children): HTML |object| elements are now semi-transparent.
1086    
1087            * NanoDOM.pm (manakai_html, manakai_head): New methods.
1088    
1089    2008-02-16  Wakaba  <wakaba@suika.fam.cx>
1090    
1091            * CacheManifest.pm: HTML5 revision 1211 implemented.
1092    
1093            * CacheManifest.pod: Updated.
1094    
1095    2008-02-10  Wakaba  <wakaba@suika.fam.cx>
1096    
1097            * ContentChecker.pm (check_document, check_element): Support
1098            for second argument ($onsubdoc).
1099            (_get_css_parser): Removed (now it is part of WDCC).
1100    
1101    2008-02-09  Wakaba  <wakaba@suika.fam.cx>
1102    
1103            * ContentChecker.pm (_get_css_parser): New.
1104    
1105    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1106    
1107            * ContentChecker.pm ($AnyChecker): Old way to add child elements
1108            for checking had been used.
1109    
1110  2007-11-25  Wakaba  <wakaba@suika.fam.cx>  2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1111    
1112          * ContentChecker.pm (check_element): New todo item type |descendant|.          * ContentChecker.pm (check_element): New todo item type |descendant|.

Legend:
Removed from v.1.129  
changed lines
  Added in v.1.311

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24