Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.106 by wakaba, Mon Sep 24 04:23:43 2007 UTC revision 1.350 by wakaba, Mon Oct 13 08:22:29 2008 UTC
1    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Element category constants redefined.
4            
5    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
6    
7            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
8            construction stage synced with the spec (HTML5 revisions 2139 and
9            2302).
10    
11    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
12    
13            * ContentChecker.pm: New error level "html5_fact" added, which
14            should be tentatively used until all of requirements are properly
15            specced as RFC 2119 "MUST" in HTML5.
16    
17    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
18    
19            * ContentChecker.pod: Note on internal flags is added.
20    
21    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
22    
23            * HTML.pm.src: An AAA bug fixed.
24    
25    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
26    
27            * HTML.pm.src: If another node is inserted by the parser, don't
28            reuse existing Text node to append a character (HTML5 revision
29            2124).
30    
31    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
32    
33            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
34            revisions 1731 and 2128).
35    
36    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
37    
38            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
39            Support for end tags of camelCase SVG elements was broken.  A
40            wrong error type text fixed.
41    
42    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
43    
44            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
45    
46    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
47    
48            * HTML.pm.src: Support for new definition of |param| and |source|
49            start tag parsing (HTML5 revision 1731).
50    
51    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
52    
53            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
54    
55    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
56    
57            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
58            revisions 1731 and 1831).
59    
60    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
61    
62            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
63            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
64            revision 1731).
65    
66    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
67    
68            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
69            revision 1731).  End tags of |option| and |optgroup| elements are
70            now optional (HTML5 revision 1731).
71    
72    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
73    
74            * HTML.pm.src: New "special" elements added to the list (HTML5
75            revision 1778).  "strile" -> "strike".
76    
77    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
78    
79            * ContentType.pm (get_sniffed_type): Support for the "better"
80            content sniffing (HTML5 revision 1927).  In some cases the official
81            type was not returned when the method was invoked in list
82            context.
83    
84    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
85    
86            * HTML.pm.src: Character references for non-space C0 characters,
87            including U+000B VT, DEL character, noncharacter code points, are
88            now converted to the U+FFFD character (cf. HTML5 revision 2138).
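
The mapping described in this entry can be sketched roughly as follows. This is a minimal Python illustration, not the Perl code in HTML.pm.src: the helper name char_ref_to_text and the exact set of characters treated as "space" C0 characters are assumptions drawn from the wording of the entry. Code points above U+10FFFF are outside the scope of the sketch.

```python
REPLACEMENT = "\uFFFD"

# C0 characters assumed to count as "space" characters (TAB, LF, FF, CR).
C0_SPACES = {0x09, 0x0A, 0x0C, 0x0D}


def is_noncharacter(cp):
    """True for U+FDD0..U+FDEF and for code points ending in FFFE or FFFF."""
    return 0xFDD0 <= cp <= 0xFDEF or (cp & 0xFFFE) == 0xFFFE


def char_ref_to_text(cp):
    """Return the text for a numeric character reference value.

    Hypothetical helper: non-space C0 characters (including U+000B),
    DEL, and noncharacter code points are replaced by U+FFFD.
    """
    if cp < 0x20 and cp not in C0_SPACES:
        return REPLACEMENT
    if cp == 0x7F or is_noncharacter(cp):
        return REPLACEMENT
    return chr(cp)
```

For example, char_ref_to_text(0x0B) yields U+FFFD, while char_ref_to_text(0x41) yields "A".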
89    
90    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
91    
92            * ContentChecker.pm: |form=""| check support added.
93    
94    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
95    
96            * ContentChecker.pm: |contextmenu| validness is now checked using
97            |id| and |id_type| properties, and |menu| property is removed.
98    
99    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
100    
101            * ContentChecker.pm: Prepare for |form| |name| attribute's
102            duplication checking.
103    
104    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
105    
106            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
107            unsupported charset="" parameter value (HTML5 revision 2131).
108    
109    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
110    
111            * HTML.pm.src: Remaining places where U+000B was allowed as a space
112            character are fixed (cf. HTML5 revision 1738).
113    
114            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
115            space characters (HTML5 revision 1738).
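
The effect of dropping U+000B from the space characters can be illustrated with a short Python sketch. The names HTML_SPACE_CHARS and split_on_html_spaces are hypothetical and are not part of Whatpm. The sketch assumes the HTML5 space characters are TAB, LF, FF, CR, and SPACE.

```python
import re

# Assumed HTML5 space characters after the change: TAB, LF, FF, CR, SPACE.
# U+000B (VT) is deliberately absent.
HTML_SPACE_CHARS = "\t\n\x0c\r "

# A token is a maximal run of characters that are not space characters.
_TOKEN = re.compile(r"[^\t\n\x0c\r ]+")


def split_on_html_spaces(value):
    """Split a value on space characters, keeping U+000B inside tokens."""
    return _TOKEN.findall(value)
```

Note that Python's own str.split() still treats U+000B as white space, so an explicit character set like this is needed to follow the definition in the entry.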
116    
117    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
118    
119            * HTML.pm.src: The "anything else" case for the "after after body"
120            insertion mode was not updated to switch to the "in body"
121            insertion mode.  U+000B is no longer a space character for the
122            purpose of tree construction phase (HTML5 revision 1738).
123    
124    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
125    
126            * HTML.pm.src: U+000B is no longer a space character (HTML5
127            revision 1738).
128    
129    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
130    
131            * ContentType.pm: 0x0B is no longer a space character (HTML5
132            revision 1738).
133    
134            * HTML.pm.src: U+000B is no longer a space character for the
135            algorithm for extracting an encoding from a Content-Type (HTML5
136            revision 1738).
137    
138    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
139    
140            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
141    
142    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
143    
144            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
145    
146            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
147            matching.
148    
149    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
150    
151            * LangTag.pm: Warn for private use language subtags.  Error level
152            typos fixed.  Support for Suppress-Script field.
153    
154            * mklangreg.pl: Support for dumping of nested structure.
155    
156    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
157    
158            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
159            recommended case as per RFC 4646.
160    
161    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
162    
163            * LangTag.pm (check_rfc4646_langtag): New method.
164    
165    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
166    
167            * mklangreg.pl: New script.
168    
169            * Makefile: Updated for creation of the module for language subtag
170            registry.
171            
172    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
173    
174            * Makefile: WebIDL.html added.
175    
176            * WebIDL.pod: New documentation.
177    
178    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
179    
180            * WebIDL.pm: Checker's error types are redefined.
181    
182    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
183    
184            * WebIDL.pm: Parser's error types are redefined.  Some forward
185            compatible parsing bugs are fixed.  Some unreachable code is
186            commented out.
187    
188    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
189    
190            * WebIDL.pm: Support for the remaining extended attributes is
191            added.  It does not satisfy the definition that a forward
192            interface declaration has an extended attribute.  It seems that
193            unless explicitly allowed, multiple extended attributes with the
194            same name are not allowed, though it is not explicitly mentioned in
195            the spec.
196    
197    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
198    
199            * WebIDL.pm: Unescapes extended attribute names and extended
200            attribute identifiers.  Preserve whether an extended attribute has
201            an argument list or not.  Support for extended attributes:
202            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
203            NameGetter, NameSetter, and Null.
204            (has_argument_list): New attribute.
205            (idl_text): Stringifies argument lists, if any, even if they are
206            empty.
207    
208    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
209    
210            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
211            |{s_kwd}| in DATA_STATE as default.
212    
213    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
214    
215            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
216            by |{s_kwd}| in DATA_STATE.
217    
218    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
219    
220            * HTML.pm.src: Shorten keys.
221    
222    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
223    
224            * HTML.pm.src: Remove checking for control character, surrogate
225            pair, or noncharacter code points and non-Unicode code
226            points (they should be handled by Whatpm::Charset::UnicodeChecker).
227            (parse_char_stream): Support for the |$get_wrapper| argument and
228            character stream error handlers.
229    
230    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
231    
232            * ContentChecker.pm: Don't call |load_ns_module|
233            for null-namespace elements/attributes.
234    
235            * HTML.pm.src: Factor out $disallowed_control_chars
236            as a hash.
237    
238    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
239    
240            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
241            and |{next_char}| initializations are moved to initialization
242            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
243            with |parse_char_stream|.
244    
245    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
246    
247            * HTML.pm.src (parse_char_stream): Make |set_next_char|
248            invoke |manakai_read_until|, not only |read|, where
249            possible, to decrease the number of |read| method calls.
250    
251            * mkhtmlparser.pl: Related changes to the aforementioned
252            modification.
253    
254    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
255    
256            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
257            now reports character errors.
258    
259    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
260    
261            * HTML.pm.src: Non-white-space character tokens with leading
262            white space in the "before head" insertion mode were not
263            correctly handled.
264            (set_inner_html): Reimplemented using CharString decodehandle
265            class.  Support for $get_wrapper argument.  Support
266            for |{read_until}| feature.
267    
268    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
269    
270            * HTML.pm.src: Make a "bare ero" error for unknown
271            entities point the "&" character.
272    
273    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
274    
275            * HTML.pm.src: It turns out that U+FFFD doesn't have to
276            be added to the list of excluded characters.
277    
278    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
279    
280            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
281            and |column| a higher priority than those set by the
282            tokenizer's input handler.
283            ($self->{read_until}): Exclude U+FFFD (but this might
284            not be necessary, since now we do line/column fixup in
285            the character decode handle).
286    
287    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
288    
289            * HTML.pm.src: Use |{read_until}| where possible.
290    
291    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
292    
293            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
294            and |manakai_getc_until| to |manakai_read_until| to
295            reduce the number of string copies.
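
The idea of reading a whole run of characters in one call, instead of one character per call, can be sketched in Python as below. This is only an illustration under stated assumptions: the class BufferedSource, its chunk size, and the stop-character interface are hypothetical and do not reproduce the |manakai_read_until| interface.

```python
import io


class BufferedSource:
    """Hypothetical character source with a read_until operation."""

    def __init__(self, text, chunk_size=64):
        self._stream = io.StringIO(text)
        self._buf = ""
        self._chunk_size = chunk_size
        self.read_calls = 0  # number of read() calls on the stream

    def _fill(self):
        """Read one more chunk into the buffer; False at end of input."""
        self.read_calls += 1
        data = self._stream.read(self._chunk_size)
        self._buf += data
        return bool(data)

    def read_char(self):
        """Return the next character, or "" at end of input."""
        if not self._buf and not self._fill():
            return ""
        ch, self._buf = self._buf[0], self._buf[1:]
        return ch

    def read_until(self, stop_chars):
        """Return the run of characters before the next stop character."""
        run = []
        while True:
            if not self._buf and not self._fill():
                break  # end of input
            i = 0
            while i < len(self._buf) and self._buf[i] not in stop_chars:
                i += 1
            run.append(self._buf[:i])
            self._buf = self._buf[i:]
            if self._buf:
                break  # stopped in front of a stop character
        return "".join(run)
```

A tokenizer in a data-like state could then call read_until("<&") once to obtain a whole run of text and handle only the stop character individually.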
296    
297    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
298    
299            * HTML.pm.src (parse_char_string): Use newly created
300            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
301            standard feature to |open| a string as a filehandle,
302            since Perl's string filehandle does not seem to support the |ungetc|
303            method correctly.
304            (parse_char_stream): Define |{getc_until}| method.
305            (DATA_STATE): Experimental support for |getc_until| feature.
306    
307    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
308    
309            * HTML.pm.src: Check points added to newly added branches.
310    
311    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
312    
313            * HTML.pm.src: Remove |{char}|, which is no longer used.
314            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
315            and replaced by |{prev_state}|.
316    
317            * mkhtmlparser.pl: Remove |{char}| feature.
318            Remove |!!!back-next-input-character;| macro.
319    
320    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
321    
322            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
323            entity related tokenizer states in favor of new states
324            implementing the consume character reference algorithm.
325    
326    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
327    
328            * HTML.pm.src: "Consume a character reference" algorithm is
329            now implemented as a tokenizer's state, rather than
330            a method, with minimum changes (more changes will
331            be made, in due course).  "Bogus comment state"'s inner
332            loop gets removed.
333    
334    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
335    
336            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
337            into their own tokenizer states.
338    
339    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
340    
341            * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
342            is split into three states.
343    
344    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
345    
346            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
347            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
348            no longer does the tokenizer have to push back next input
349            characters in those states.
350    
351    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
352    
353            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
354            into four states so that no longer does the tokenizer have to push
355            back next input characters in that state.
356    
357    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
358    
359            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
360            which can be used to insert some wrapper between the character
361            stream handle and the tokenizer.  (It is currently not supported
362            for |set_inner_html| for |Element|s).
363    
364    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
365    
366            * HTML.pm.src: Ignore punctuation in charset names.
367    
368    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
369    
370            * ContentChecker.pm: Support for charset-layer error levels.
371    
372            * HTML.pm.src: Don't specify |text| argument for the
373            |chardecode:fallback| error, since it is not the encoding
374            being used alternatively.
375    
376    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
377    
378            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
379    
380    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
381    
382            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
383    
384    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
385    
386            * HTML.pm.src: Bug fix and sync with the spec with regard
387            to after after frameset insertion mode processing (HTML5
388            revision 1909).  Note that the implementation was wrong
389            per the old spec before the r1909 changes.
390    
391    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
392    
393            * HTMLTable.pm: scope=auto algorithm fix synced with the
394            spec (HTML5 revision 2093).
395            ($process_row): Algorithm step numbers synced with the
396            spec (HTML5 revision 2092).
397    
398    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
399    
400            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
401            revision 2094).
402    
403    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
404    
405            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
406    
407    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
408    
409            * HTML.pm.src: '"' and "'" at the end of attribute
410            name (after another attribute) now raise parse error (HTML5
411            revision 2123).  Empty unquoted attribute values are no
412            longer allowed (HTML5 revision 2122).
413    
414    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
415    
416            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
417            revision 2130).
418    
419    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
420    
421            * ContentChecker.pm: |xml:lang| attribute value must be the same
422            as |lang| attribute value for HTML elements (HTML5 revision 2062
423            and so on).
424    
425    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
426    
427            * ContentChecker.pm: Error level definition for |xml_id_error|
428            was missing.
429    
430            * URIChecker.pm: The end of the URL should be marked as the
431            error location for an empty path error.  The position
432            between the userinfo and the port components should be
433            marked as the error location for an empty host error.
434    
435    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
436    
437            * URIChecker.pm: Set parameters representing where in the
438            value the error occurs for errors.  Report unknown
439            address format error in warning level, since address
440            formats are rarely added.  Path segments starting with "/.."
441            were misinterpreted as a dot-segment.
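
The dot-segment fix in this entry comes down to comparing whole path segments rather than prefixes. A minimal Python sketch, assuming a hypothetical helper has_dot_segment (the actual URIChecker.pm logic is not shown in this entry):

```python
def has_dot_segment(path):
    """True if any complete path segment is exactly "." or ".."."""
    return any(segment in (".", "..") for segment in path.split("/"))


def has_dot_segment_prefix_check(path):
    """Faulty variant: flags any segment that merely starts with ".."."""
    return "/.." in path
```

With these definitions, "/a/..b/c" is flagged by the prefix check but not by the whole-segment comparison, while "/a/../c" is flagged by both.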
442    
443    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
444    
445            * URIChecker.pm (check_iri_reference): Requires
446            |Message::DOM::DOMImplementation|.
447    
448    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
449    
450            * IMTChecker.pm: Updated for the new error reporting architecture.
451    
452            * ContentChecker.pm: Error levels for IMTs are added.
453    
454    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
455    
456            * H2H.pm (_shift_token): Support for unquoted HTML attribute
457            values.
458    
459    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
460    
461            * CacheManifest.pm: Support for new style of error
462            reports.
463    
464            * HTML.pm.src: Set line=1, column=1 to the document node.
465    
466    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
467    
468            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
469            and URL checkers.  Support for more error levels for bogus
470            language tag and URL "standards".
471    
472            * LangTag.pm, URIChecker.pm: Support for new style error
473            level reporting.
474    
475    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
476    
477            * ContentChecker.pm: Support for RDF/XML error levels.
478    
479            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
480            specifying.  Error types are revised.
481    
482    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
483    
484            * ContentChecker.pm: All error reporting method calls are
485            renewed.
486    
487    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
488    
489            * HTML.pm.src: All error type names and "text" parameters
490            are revised.  Use new style for "level" specification.
491    
492            * mkhtmlparser.pl: Use new style for "level" specification.
493    
494    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
495    
496            * WebIDL.pm (parse_char_string): Simplified error
497            reporting process for broken ignored valuetype definition.
498            (Valuetype idl_text): Support for special "DOMString" name.
499    
500    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
501    
502            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
503            terminal of the ScopedName is "DOMString", so that it can be
504            determined later whether the last part of the scoped name is
505            "DOMString" or "_DOMString".  This is necessary to decide whether
506            a |typedef| definition should be ignored or not.
507            (parse_char_string): Unescape the identifier of
508            exception members.
509            ($resolve): Return undef for builtin types and sequence<T>
510            types (we might not have to do this, however...).
511            (check): Support checking for Exceptions, Valuetypes,
512            and Typedefs.
513            ($serialize_type): Support for "DOMString::::" syntax.
514            (Typedef idl_text): Output Type as "DOMString" if it
515            is really "DOMString" (i.e. its internal representation
516            is "::DOMString::").
517    
518    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
519    
520            * WebIDL.pm ($resolve): New code, based on resolve code
521            for constant types in the |check| method.
522            (check): Support for checking of attributes, operations, and
523            arguments.
524            (Attribute/Operation idl_text): Exception names in getraises,
525            setraises, and raises clauses are serialized by the |$serialize_type|
526            code.
527    
528    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
529    
530            * WebIDL.pm ($integer): Order of alternatives is changed to match
531            hexadecimal numbers (the original pattern, taken from the spec,
532            did not work for hexadecimal numbers, because the [0-7]* part
533            after the "0" prefix matches an empty string, so the
534            remaining "x..." part of a "0x..." integer literal is never
535            matched).
536            ($get_type): It now returns a string, not an array reference,
537            for regular types and |sequence| types (i.e. it in any case
538            returns a string).
539            ($get_next_token): The second item in the array that represents
540            an integer or float token is now a Perl number value, not the
541            original string representation of the number.
542            (check): Support for const value consistency checking.
543            No extended attribute is defined for constants.
544            (Node subclasses): Use simple strings rather than array references
545            for default data type values.
546            ($serialize_type): Type values are now simple strings.
547            (value): If the new attribute value is a false value, then
548            a FALSE value is set to the attribute.
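
The alternation-order problem noted for ($integer) in this entry can be reproduced with a short Python sketch. The two patterns are approximations written for illustration, not the exact pattern from the spec or from WebIDL.pm, and the helper leading_integer is hypothetical. Python's re module, like Perl, tries alternatives from left to right.

```python
import re

# Approximation of the original ordering: the octal branch is tried first.
SPEC_ORDER = re.compile(r"-?0([0-7]*|[Xx][0-9A-Fa-f]+)|-?[1-9][0-9]*")

# Reordered: the hexadecimal branch is tried before the octal one.
HEX_FIRST = re.compile(r"-?0([Xx][0-9A-Fa-f]+|[0-7]*)|-?[1-9][0-9]*")


def leading_integer(pattern, text):
    """Return the integer literal matched at the start of text, if any."""
    match = pattern.match(text)
    return match.group(0) if match else None
```

With the first ordering, "0x1F" is matched only as "0", because [0-7]* succeeds with an empty string. With the hexadecimal branch first, the whole literal is matched, and octal literals such as "017" still match through the second branch.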
549    
550    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
551    
552            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
553            in its stringified format ("scoped name" as defined in the
554            spec).  Note that a future version of this module should not use
555            array references for type values and the |type_text| attribute
556            should be made obsolete.
557            (parse_char_string): Unescape attribute names.
558            (check): Support for checking of whether inherited interfaces
559            are actually defined or not.  Support for checking of whether
560            interface member identifiers are duplicated or not.
561            ($serialize_type): Scoped names are returned as is.  A future
562            version of this code should escape identifiers other than "DOMString",
563            otherwise the idl_text would be non-conforming.
564    
565    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
566    
567            * WebIDL.pm (parse_char_string): Set line/column numbers
568            to generated nodes.  Unescape identifiers.  Extended attributes
569            for Definition's were ignored.
570            (append_child): Set |parent_node| attribute.
571            (parent_node): New attribute.
572            (check): Support interface/exception members.  Support
573            extended attributes.  Support definition identifier uniqueness
574            constraint.
575            (qualified_name): New attribute.
576            (Interface/Exception idl_text): Extended attributes were
577            not prepended to the returned text.
578    
579    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
580    
581            * WebIDL.pm (parse_char_string): Set line/column numbers
582            to interface object experimentally.  s/shift/pop/g, shift
583            would make things wrong.  Support for interface forward
584            declarations was missing.  Broken interface declarations
585            with no block were not ignored entirely.
586            (Whatpm::WebIDL::Node): New abstract class.  This class
587            makes things easier.
588            (child_nodes): New attribute.  Unlike DOM's attribute with
589            the same name, this attribute returns a dead list of nodes for
590            simplicity.
591            (get_user_data, set_user_data): New methods.
592            (Module idl_text): A SPACE character should be inserted
593            before the |{| character.
594            (Interface idl_text): Support for interface forward declarations.
595            (is_forward_declaration): New attribute.
596    
597    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
598    
599            * WebIDL.pm (type_text): Better serializer.
600    
601    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
602    
603            * WebIDL.pm: Revise forward-compatible parsing so that
604            it now can handle broken extended attributes and as such.
605    
606    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
607    
608            * WebIDL.pm: Real support for extended attributes.
609            Support for extended attributes with arguments.
610    
611    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
612    
613            * WebIDL.pm: Support for |exception| syntax.
614            (Interface->idl_text): Tentative support for inheritances.
615    
616    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
617    
618            * WebIDL.pm: Hierarchical scoped name support was broken.
619            Support for raises, setraises, and getraises syntaxes.
620    
621    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
622    
623            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
624            proper support for types, extended attributes, and exceptions yet).
625            WebIDL parser, version 1 (no support for exceptions yet,
626            no proper support for extended attributes yet).
627    
628    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
629    
630            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
631    
632    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
633    
634            * WebIDL.pm: Support for valuetype and const.
635    
636    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
637            
638            * WebIDL.pm: New module.
639    
640    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
641    
642            * Makefile (Entities.html): URI changed.
643    
644    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
645    
646            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
647    
648    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
649    
650            * HTML.pm.src (_get_next_token): A parse error was missing.
651    
652    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
653    
654            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
655            spec (commented out). (HTML5 revision 1711).
656    
657    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
658    
659            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
660    
661            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
662            as UTF-16 (HTML5 revision 1701).
663    
664    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
665    
666            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
667            1692).
668    
669    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
670    
671            * HTML.pm.src: The secondary insertion mode used when switching
672            to foreign content is the "in body" insertion mode (HTML5 revision
673            1696).
674    
675    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
676    
677            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
678            revision 1697).
679    
680    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
681    
682            * HTML.pm.src: Support for end-of-file token in foreign content
683            insertion mode (HTML5 revision 1693).  Update SVG camelCase
684            attribute list (HTML5 revision 1700).  <textarea> closes
685            </select> (HTML5 revision 1699).  More start tags close in
686            foreign content insertion mode (HTML5 revision 1698).
687    
688    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
689    
690            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
691    
692    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
693    
694            * HTML.pm.src: More robust charset parameter detection (HTML5
695            revision 1674).
696    
697    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
698    
699            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
700            revision 1676).
701    
702    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
703    
704            * HTML.pm.src: Ignore language part of public identifiers for
705            quirks mode detection (HTML5 revision 1679).
706    
707    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
708    
709            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
710            revision 1685).
711    
712    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
713    
714            * HTML.pm.src: Support for EOF in new states for tags (HTML5
715            revision 1684).
716    
717    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
718    
719            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
720            work (HTML5 revision 1690).
721    
722    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
723    
724            * HTML.pm.src (_tree_construction_main): Change handling of
725            end tags in head insertion modes (HTML5 revision 1686).
726            (parse_char_string): Bug fix for non-utf8 character string handling.
727            (parse_char_stream): |ungetc| does not work well in this context.
728    
729    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
730    
731            * HTML.pm.src (parse_byte_string): Redefined to invoke
732            |parse_byte_stream|.
733            (parse_byte_stream): New method.
734    
735    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
736    
737            * HTML.pm.src (parse_byte_string): Fix the column number reported
738            by encoding layer error reporter.
739    
740    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
741    
742            * HTML.pm.src (parse_byte_string): Use streaming decoder
743            rather than converting the whole byte string and then parsing.
744            Propagate errors in character encoding layer.
745            (get_next_token): Precise error reporting for |bare stago| error.
746    
747    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
748    
749            * HTML.pm.src (parse_char_stream): New method.
750            (parse_char_string): This method is now defined as an invocation
751            of the |parse_char_stream| method.
752    
753    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
754    
755            * HTML.pm.src (parse_byte_string): Report various status
756            of the sniffing as info-level errors.  Support for new
757            decoding framework in parser resetting.
758            (new): Various default error levels were not set.
759    
760    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
761    
762            * HTML.pm.src (parse_byte_string): HTML5 encoding sniffing
763            algorithm, except for the actual sniffing, is implemented
764            with new framework with Message::Charset::Info.
765    
766    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
767    
768            * CacheManifest.pm (_parse): Drop fragment identifiers from
769            URIs in fallback section (HTML5 revision 1596).
770    
771    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
772    
773            * Makefile (Entities.html): URI has changed.
774    
775    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
776    
777            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
778            1553).
779    
780    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
781    
782            * ContentChecker.pm: Noted that those returned in |table| are
783            no longer table elements, but table objects returned
784            by Whatpm::HTMLTable.
785    
786            * HTMLTable.pm (form_table): Return table element node
787            as |$table->{element}|.
788            (assign_header): Support for the |headers=""| attribute.
789    
790    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
791    
792            * HTMLTable.pm (assign_header): New function; first version
793            with no support for headers="".
794            (form_table): Include table width and height to the returned
795            table object for convenience.  Indexing in column assignment was wrong.
796            Set whether a data cell is empty or not for convenience.
797    
798    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
799    
800            * HTMLTable.pm: More robust caption support (HTML5 revision 1393).
801    
802    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
803    
804            * HTMLTable.pm: How table model errors are detected is
805            changed (HTML5 revision 1387).
806    
807    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
808    
809            * HTMLTable.pm: The algorithm now moves |tfoot| elements
810            to the end of the table (HTML5 revision 1380).
811    
812    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
813    
814            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
815            of 1-based (HTML5 revision 1376).
816    
817    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
818    
819            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
820            and "Content-Encoding" no longer prevent sniffing (HTML5
821            revision 1288).
822    
823    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
824    
825            * ContentType.pm: Skip BOMs in feed or HTML algorithm (HTML5 revision
826            1282).
827    
828    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
829    
830            * ContentChecker.pm: Support for global attributes.
831            Status of XML specs is added.
832    
833    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
834    
835            * ContentChecker.pm (check_element): Support for |template=""|
836            and |ref=""| attributes (referent element type checking).
837    
838    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
839    
840            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
841            1500) is implemented (except for IDNA part and URI-scheme-specific
842            knowledge).  Line number counting was wrong for LF-only
843            documents.
844    
845    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
846    
847            * HTML.pm.src: Raise a parse error for any disallowed
848            character (HTML5 revision 1263).
849    
850    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
851    
852            * mkentitylist.pl: Support for new HTML5 entity table format (the
853            definition for |AElig;| was missing).
854    
855    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
856    
857            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
858            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
859    
860    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
861    
862            * HTML.pm.src: List of element names that close foreign content
863            insertion mode is added (HTML5 revisions 1412 and 1418).
864    
865    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
866    
867            * HTML.pm.src: Support for |mglyph| and |malignmark|
868            elements (HTML5 revision 1410).
869    
870    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
871    
872            * HTML.pm.src: Support for new long MathML entities (HTML5
873            revision 1406).
874    
875    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
876    
877            * HTML.pm.src: CDATA section support for MathML and SVG
878            elements (HTML5 revisions 1404 and 1420).
879    
880    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
881    
882            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
883            elements (HTML5 revision 1404).  Unused !!!macro definitions
884            are removed.
885    
886    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
887    
888            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
889            are raised is changed (HTML5 revision 1404).
890    
891    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
892    
893            * HTML.pm.src: Category sets that are no longer used are removed.
894    
895    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
896    
897            * HTML.pm.src: The ->[1] property of stack entries is now
898            replaced by constants representing element category.
899    
900    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
901    
902            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
903            for error reporting.  (This is a preparation for using constant
904            value for ->[1].)
905    
906    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
907    
908            * RDFXML.pm: Typo fixed.
909    
910    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
911    
912            * ContentChecker.pm: |fact_level| is now treated
913            the same as |must_level|, i.e. level = |m|.
914            (check_element): Make list of URIs in the DOM.
915    
916    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
917    
918            * RDFXML.pm: Language accessor implemented.
919            Local (null-namespace) attribute support.
920    
921    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
922    
923            * RDFXML.pm: Factored out ID checking code.
924    
925    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
926    
927            * RDFXML.pm: TODO items noted.  Validation of ID
928            and URI attributes is implemented.  Warn if unknown
929            value is used in rdf:parseType="" attribute.
930    
931            * URIChecker.pm (check_rdf_uri_reference): New function.
932    
933    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
934    
935            * RDFXML.pm: bnodeid implemented.  Relative references
936            are now resolved.
937    
938    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
939    
940            * ContentChecker.pm: RDF reification implemented.
941    
942            * RDFXML.pm: undef vs false bug fixed.
943            Reification implemented.
944    
945    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
946    
947            * RDFXML.pm: s/id/ID/ for attribute name.
948            The |node| arguments are added for |ontriple| calls.
949            Too many "attribute not allowed" errors were raised.
950    
951            * ContentChecker.pm: Initial experimental support for rdf:RDF
952            element.
953    
954    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
955    
956            * RDFXML.pm: New module.
957    
958    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
959    
960            * HTML.pm.src (set_inner_html): Line/column number
961            code was still the old one.
962    
963    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
964    
965            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
966            errors.  Line/column markings for DOCTYPE, comment, and
967            character tokens are reintroduced; otherwise,
968            error location for "not HTML5" error and errors
969            for implied elements are not attached.
970    
971    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
972    
973            * HTML.pm.src: Set line/column numbers to attributes.
974    
975            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
976            (value): Setter implemented.
977    
978            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
979    
980    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
981    
982            * HTML.pm.src: Unused line/column markings are removed.
983    
984    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
985    
986            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
987            flag, which is no longer used.
988    
989    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
990    
991            * HTML.pm.src: Set line/column information to element nodes.
992    
993            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
994            and !!!insert-element-t): Set line/column information to
995            element nodes.
996    
997    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
998    
999            * HTML.pm.src (_get_next_token): The first "<" character
1000            in "<?", "<>", or "</>" should be the error point.
1001    
1002    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1003    
1004            * HTML.pm.src: Some more fixes on error position reporting.
1005    
1006    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1007    
1008            * HTML.pm.src: Token-level precise error reporting.
1009    
1010    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1011    
1012            * HTML.pm.src: Preparation for more precise error point
1013            reporting.
1014    
1015    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1016    
1017            * HTML.pm.src: Error type revised.
1018    
1019    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1020    
1021            * HTML.pm.src: Similar codes are merged together, again.
1022    
1023    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1024    
1025            * HTML.pm.src: Similar codes are merged together.
1026    
1027    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
1028    
1029            * mkhtmlparser.pl: Set "level" parameter to parse errors.
1030    
1031            * HTML.pm.src: Code refined.
1032    
1033    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1034    
1035            * HTML.pm.src: |</body>| treatment has been changed (HTML5
1036            revision 1348).  Note that I really don't know whether this makes
1037            any difference in the black-box behavior of the parser.
1038    
1039    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1040    
1041            * HTML.pm.src: New end-of-file token implementation (HTML5
1042            revision 1348).
1043    
1044    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1045    
1046            * HTML.pm.src: |applet| support (HTML5 revision 1347).
1047    
1048    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1049    
1050            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1051    
1052    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1053    
1054            * HTML.pm.src: Support for |<input>| in the "in select" insertion
1055            mode and support for the "in select in table" insertion mode (HTML5
1056            revision 1342).
1057    
1058    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1059    
1060            * HTML.pm.src: No foster parenting for <style> and <script>
1061            in non-tainted <table>s (HTML5 revision 1336).
1062    
1063    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1064    
1065            * HTML.pm.src: Ignore white space characters between <html>
1066            and <head> (HTML5 revision 1332).
1067    
1068    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1069    
1070            * HTML.pm.src: Treat <input type=hidden> as if it were a
1071            white space (HTML5 revision 1331).
1072    
1073    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1074    
1075            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1076            element (HTML5 revision 1330).
1077    
1078    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1079    
1080            * HTML.pm.src: <title> is always appended to the current
1081            element (HTML5 revision 1328).
1082    
1083    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1084    
1085            * HTML.pm.src: White space in tainted tables is moved
1086            into foster parents (HTML5 revision 1326).
1087    
1088    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1089    
1090            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1091            revision 1321).
1092    
1093    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1094    
1095            * HTML.pm.src: |</h/n/>| case code rearranged to align with
1096            the spec (HTML5 revision 1320).  Note that we finally complete
1097            all of HTML5 revision 1320 changes.
1098    
1099    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1100    
1101            * HTML.pm.src: |</form>| now works similarly to |</div>| for unclosed
1102            tags (HTML5 revision 1320).
1103    
1104    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1105    
1106            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1107    
1108    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1109    
1110            * HTML.pm.src: A "generate implied end tags" code (t409.1)
1111            could not be reached, so it is now removed (HTML5 revision
1112            1320).
1113    
1114    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1115    
1116            * HTML.pm.src: Code for the case of |</div>| and so on is revised
1117            to align with new spec text (HTML5 revision 1320).
1118    
1119    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1120    
1121            * HTML.pm.src: Remove strange |if| condition; however, it should
1122            have done no harm in theory.
1123    
1124    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1125    
1126            * HTML.pm.src (_tree_construction_main): '</p>' in body
1127            case is split from other end tags for the preparation
1128            of implementing HTML5 revision 1320.
1129    
1130    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1131    
1132            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1133            revision 1320).
1134    
1135    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1136    
1137            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1138            and "div" (HTML5 revision 1318).  Add comments to where
1139            |form| pointer association codes should be inserted (HTML5 revision
1140            1319).
1141    
1142    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
1143    
1144            * HTML.pm.src: <html> treatment refined (HTML5 revision 1314).
1145    
1146    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1147    
1148            * HTML.pm.src: Since the case t268 should never be reached (there
1149            are no other token types), it is replaced by a |die| statement.
1150    
1151    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1152    
1153            * HTML.pm.src: Typo fixed.
1154    
1155    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
1156    
1157            * HTML.pm.src (_tree_construction_initial): Some limited quirks
1158            doctypes were not uppercased for comparison.
1159    
1160    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1161    
1162            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1163            are added.
1164    
1165    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1166    
1167            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1168            are set.  Cases that are unlikely to be reached are noted as such.
1169    
1170    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1171    
1172            * HTML.pm.src: Checkpoints for debugging are added.
1173    
1174            * mkhtmlparser.pl: Support for |!!!cp| syntax.
1175    
1176    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1177    
1178            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1179            for simplicity.
1180    
1181    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1182    
1183            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1184            revision 1307).
1185    
1186    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1187    
1188            * HTML.pm.src (_get_next_token): The points where the /incorrect/
1189            flag is set are changed (HTML5 revision 1305).
1190    
1191    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1192    
1193            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1194            1292).  Entities are not parsed in comment-like part in RCDATA
1195            elements (HTML5 revision 1294).  Allow bare & at the end
1196            of attribute value literals (HTML5 revision 1296).  More
1197            quirks mode doctypes (HTML5 revision 1302).  Require spaces
1198            between attributes and ban attribute names or unquoted
1199            attribute values containing single or double quotes (HTML5
1200            revision 1303).
1201    
1202    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1203    
1204            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
1205            and related errors unless it is an HTML document (though the spec
1206            is unclear on whether it applies to XHTML documents).
1207    
1208            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1209            are added.
1210    
1211    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1212    
1213            * HTML.pm.src: s/local_name/manakai_local_name/g.
1214    
1215    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
1216    
1217            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1218    
1219            * HTML.pm.src: |charset| in |content| attribute is
1220            case-insensitive (HTML5 revision 1270).
1221    
1222    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
1223    
1224            * ContentChecker.pm: New status constants are added.
1225            ($ElementDefault): |status| added.
1226            (check_element): Err for non-standard or deprecated elements.
1227            (_attr_status_info): For non-standard or deprecated attributes.
1228    
1229    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1230    
1231            * ContentChecker.pm (_attr_status_info): New internal method.
1232    
1233    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1234    
1235            * ContentChecker.pm (check_element): Element standardized
1236            status information is now dispatched.
1237    
1238    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1239    
1240            * ContentChecker.pm (check_element): Fix |del|-and-significant
1241            problem by adding some more arguments.
1242    
1243    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1244    
1245            * ContentChecker.pm (check_element): Use context of
1246            container-for-the-purpose-of-content-model element (not transparent
1247            element) for |check_child_element| calling and significant
1248            text flag marking.  This reintroduces the |<del>|-and-significant
1249            problem.
1250    
1251    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1252    
1253            * ContentChecker.pm (check_element): Make semi-transparent
1254            elements ignored for the purpose of phase changes in
1255            content model checking.
1256    
1257    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1258    
1259            * ContentChecker.pm (check_element): In-element state
1260            was not properly managed for transparent cases.
1261    
1262    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1263    
1264            * ContentChecker.pm (check_element): Support for |video|
1265            and |audio| as semi-transparent elements.
1266    
1267    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1268    
1269            * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1270            (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1271            Support for |html:object| as a semi-transparent element.
1272    
1273    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1274    
1275            * ContentChecker.pm (check_element): The way to traverse
1276            the tree is entirely revised to make it easier to track
1277            the state of ancestors/descendants.  As a result of this
1278            revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1279            support for content model checking for HTML elements |figure|,
1280            |object|, |video|, and |audio| and checking for XML elements (and
1281            some XMLNS checkings) are dropped for now.  They will be
1282            reimplemented in due course.
1283    
1284    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1285    
1286            * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1287            token (HTML5 revision 1225).
1288    
1289    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1290    
1291            * ContentChecker.pm ({unsupported_level}): New value.
1292    
1293            * HTML.pm.src: Save whether |meta| |content| attribute
1294            contains character references or not.
1295    
1296    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1297    
1298            * ContentChecker.pm (_get_children): (Incomplete) attempt to
1299            implement significant content checking for contents
1300            with |del| elements.
1301    
1302    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1303    
1304            * ContentChecker.pm ($HTMLTransparentElements): More
1305            elements are added.
1306            (_get_children): HTML |object| elements are now semi-transparent.
1307    
1308            * NanoDOM.pm (manakai_html, manakai_head): New methods.
1309    
1310    2008-02-16  Wakaba  <wakaba@suika.fam.cx>
1311    
1312            * CacheManifest.pm: HTML5 revision 1211 implemented.
1313    
1314            * CacheManifest.pod: Updated.
1315    
1316    2008-02-10  Wakaba  <wakaba@suika.fam.cx>
1317    
1318            * ContentChecker.pm (check_document, check_element): Support
1319            for second argument ($onsubdoc).
1320            (_get_css_parser): Removed (now it is part of WDCC).
1321    
1322    2008-02-09  Wakaba  <wakaba@suika.fam.cx>
1323    
1324            * ContentChecker.pm (_get_css_parser): New.
1325    
1326    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1327    
1328            * ContentChecker.pm ($AnyChecker): Old way to add child elements
1329            for checking had been used.
1330    
1331    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1332    
1333            * ContentChecker.pm (check_element): New todo item type |descendant|.
1334    
1335    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1336    
1337            * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1338            specced in RFC 2046 and RFC 4288.
1339            (application/atom+xml): Definition added.
1340    
1341    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1342    
1343            * URIChecker.pm: Make RFC 3986 should-level errors
1344            warnings (rather than SHOULD-level errors).
1345    
1346    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1347    
1348            * NanoDOM.pm (get_user_data, set_user_data): New methods.
1349    
1350            * HTML.pm.src: A flag for character references in attribute
1351            values is added.  Set |manakai_has_reference| user data
1352            to |charset| attribute.
1353    
1354    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1355    
1356            * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1357            attributes.
1358    
1359            * ContentChecker.pm (check_document): Warn if charset requirements
1360            cannot be tested.
1361    
1362    2007-11-19  Wakaba  <wakaba@suika.fam.cx>
1363    
1364            * HTML.pm.src (parse_byte_string): Detect charset
1365            by universalchardet if charset parameter is not specified.
1366    
1367            * Makefile (Charset-all, Charset-clean): New rules.
1368    
1369    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1370    
1371            * ContentChecker.pm (check_document): Check the existence
1372            of character encoding declaration and selection of encoding
1373            for HTML document.
1374    
1375    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1376    
1377            * ContentType.pm (get_sniffed_type): Return also the official
1378            type in list context.
1379    
1380    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1381    
1382            * ContentType.pm: Sniffing with leading white space
1383            ignoring (HTML5 revisions 1013 and 1016).
1384    
1385    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1386    
1387            * ContentType.pm: HTML5 revision 1013 changes, except for leading
1388            white spaces, are implemented.
1389    
1390    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1391    
1392            * HTML.pm.src (parse_char_string): Set |inner_encoding|
1393            attribute if possible.
1394    
1395    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1396    
1397            * HTML.pm.src (parse_byte_string): New method.
1398            (parse_char_string): New alias for |parse_string|.
1399            (main phase): Invoking "change the encoding" algorithm if desired.
1400    
1401            * HTML.pod: Updated.
1402    
1403    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1404    
1405            * HTML.pod (get_inner_html): Removed.
1406    
1407            * Makefile (HTML-all, HTML-clean): New.
1408    
1409    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1410    
1411            * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1412    
1413    2007-11-08  Wakaba  <wakaba@suika.fam.cx>
1414    
1415            * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1416    
1417    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1418    
1419            * Makefile: |CacheManifest.html| is added.
1420    
1421            * CacheManifest.pod: New file.
1422    
1423    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1424    
1425            * CacheManifest.pm: New module.
1426    
1427    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1428    
1429            * HTML.pm.src: Support for application cache selection algorithm
1430            callback.
1431    
1432    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1433    
1434            * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1435    
1436    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1437    
1438            * Makefile (clean): New rule.
1439    
1440            * NanoDOM.pm (public_id, system_id): New attributes.
1441    
1442    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1443    
1444            * Makefile (CSS-all, CSS-clean, clean): New rules.
1445    
1446    2007-10-14  Wakaba  <wakaba@suika.fam.cx>
1447    
1448            * ContentChecker.pm (check_document): Support for
1449            new |is_xml_root| flag.
1450            (check_element): Support for new |pluses| state.
1451            (_add_pluses): New method.
1452            (_remove_minuses): Support for new |minus| item.
1453    
1454    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1455    
1456            * ContentChecker.pm: Raise specific error for invalid
1457            root element.
1458    
1459    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1460    
1461            * ContentChecker.pm: Set level values for later uses.

