/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.17 by wakaba, Sat May 19 14:29:09 2007 UTC revision 1.394 by wakaba, Sun Sep 6 02:20:52 2009 UTC
# Line 1  Line 1 
1    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Simplified foster parenting (HTML5 revision 2732).
4    
5    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
6    
7            * HTML.pm.src: No longer does foster parenting |style|, |script|,
8            and |input type=hidden| elements (HTML5 revision 2988).
9    
10    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
11    
12            * HTML.pm.src: Dropped the |eventsource| element (HTML5 revision
13            2870).
14    
15    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
16    
17            * HTML.pm.src (_tree_construction_main): Treate |keygen| as a void
18            element (HTML5 revision 2960).
19    
20    2009-09-05  Wakaba  <wakaba@suika.fam.cx>
21    
22            * HTML.pm.src: Moved the "in table text" insertion mode code up
23            such that it does also work as specced in HTML5 in the remaining
24            cases (HTML5 revision 3382).
25    
26    2009-09-05  Wakaba  <wakaba@suika.fam.cx>
27    
28            * HTML.pm.src: Implementing the "in table text" insertion mode,
29            first half (HTML5 revision 3382).
30    
31    2009-08-30  Wakaba  <wakaba@suika.fam.cx>
32    
33            * ContentChecker.pm: Allow |xml:lang| in HTML document if the
34            owner element is not an HTML element (HTML5 revision 3697).
35    
36    2009-08-23  Wakaba  <wakaba@suika.fam.cx>
37    
38            * ContentChecker.pm: Allow <input type=hidden> in context where
39            interactive content is disallowed (HTML5 revision 2392).
40    
41    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
42    
43            * HTML.pm.src: Dropped support for "XSLT-compat" and added
44            "about:legacy-compat" (HTML5 revision 2725).
45    
46    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
47    
48            * HTML.pm.src: "<!DOCTYPE>" is now explicitly supported in the
49            spec (HTML5 revision 2700).
50    
51    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
52    
53            * HTML.pm.src: Added support for <script> element in the "in
54            select" and "in select in table" insertion modes (HTML5 revision
55            2500).
56    
57    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
58    
59            * HTML.pm.src: An <hn> start tag in an <hn> element should close
60            the earlier element (HTML5 revision 2501).
61    
62    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
63    
64            * HTML.pm.src: The </br> end tags before the <body> start tag
65            should equally survive as the first br element child of the body
66            element (HTML5 revision 2586).
67    
68    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
69    
70            * HTML.pm.src: <font> in SVG land is handled as an SVG element as
71            long as it did not have HTML attributes (HTML5 revision 2590).
72    
73    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
74    
75            * HTML.pm.src (_tree_construction_main): Improved error reporting
76            on "select" start tag in "in select" and "in select in table"
77            parsing modes (c.f. HTML5 revision 3604).
78    
79    2009-07-26  Wakaba  <wakaba@suika.fam.cx>
80    
81            * mkhtmlclass.pl: Made xmlns:xlink with wrong namespace URL
82            invalid (HTML5 revision 2701).
83    
84    2009-07-26  Wakaba  <wakaba@suika.fam.cx>
85    
86            * HTML.pm.src: </rt>, </rp>, </option>, </optgroup> become
87            optional before </body> (HTML5 revision 2806).
88    
89    2009-07-25  Wakaba  <wakaba@suika.fam.cx>
90    
91            * HTML.pm.src: Marked where SVG |script| execution should be
92            implemented (HTML5 revision 2904).
93    
94    2009-07-25  Wakaba  <wakaba@suika.fam.cx>
95    
96            * HTML.pm.src (set_inner_html): The fragment parsing algorithm
97            inherits the document quirksness flag of the main document (HTML5
98            revision 2935).
99    
100    2009-07-05  Wakaba  <wakaba@suika.fam.cx>
101    
102            * HTML.pm.src (_tree_construction_main): Put |table| in |p| in
103            quirks mode (The Hixie Quirk, HTML5 revision 3132).
104    
105    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
106    
107            * HTML.pm.src: <kegten> breaks out of <select> (HTML5 revision
108            3163).
109    
110    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
111    
112            * HTML.pm.src: Don't change encoding to UTF-8 if the input
113            encoding is UTF-16 (HTML5 revision 3204).
114    
115    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
116    
117            * HTML.pm.src (parse_byte_stream): Don't change to another
118            encoding if the input is UTF-16 (HTML5 revision 3205).
119    
120    2009-06-28  Wakaba  <wakaba@suika.fam.cx>
121    
122            * HTML.pm.src: |isindex| should no longer generate |p| element
123            node (HTML5 revision 3236).
124    
125    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
126    
127            * ContentChecker.pm: Introduced new flag |has_autofocus|.  Added
128            support for new |id_type| value of |any|, which means any kind of
129            element is allowed as a referenced element.
130    
131    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
132    
133            * ContentChecker.pm: Introduced |check_attrs2| method for the ease
134            of defining codes for checking required attribute and as such.
135    
136    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
137    
138            * IMTChecker.pm: Added more definitions for subtypes.
139    
140    2008-12-11  Wakaba  <wakaba@suika.fam.cx>
141    
142            * URIChecker.pm: Some of |pos_end| values were wrong.
143    
144    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
145    
146            * ContentChecker.pm (check_element): Added support for "no
147            referenced datalist" error.
148    
149    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
150    
151            * URIChecker.pm: Bug fix: It did not work unless
152            Message::DOM::DOMImplementation has been |require|d.
153    
154    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
155    
156            * NanoDOM.pm (document_uri): New attribute.
157    
158            * ContentChecker.pm: Don't use methods not implemented by NanoDOM.
159    
160    2008-11-07  Wakaba  <wakaba@suika.fam.cx>
161    
162            * NanoDOM.pm (text_content): Don't create a Text node if the new
163            value is empty.
164    
165    2008-11-06  Wakaba  <wakaba@suika.fam.cx>
166    
167            * SWML/: New directory.
168    
169    2008-10-20  Wakaba  <wakaba@suika.fam.cx>
170    
171            * NanoDOM.pm (specified, all_declarations_processed,
172            manakai_attribute_type): New attributes.
173    
174    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
175    
176            * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
177            array.
178    
179    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
180    
181            * NanoDOM.pm (notation_name): New attribute.
182    
183    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
184    
185            * NanoDOM.pm (public_id, system_id): New attributes.a
186    
187    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
188    
189            * NanoDOM.pm (text_content): Moved to Node from Element.  Setter
190            implemented.
191            (allowed_tokens, default_type, declared_type): Implemented.
192    
193    2008-10-17  Wakaba  <wakaba@suika.fam.cx>
194    
195            * NanoDOM.pm (node_name): New attribute.
196            (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
197            constants.
198            (create_element_type_definition_node, create_attribute_definition,
199            create_notation, create_general_entity,
200            get_element_type_definition_node,
201            set_element_type_definition_node, get_general_entity_node,
202            set_general_entity_node, get_notation_node, set_notation_node,
203            get_attribute_definition_node, set_attribute_definition_node): New
204            methods.
205            (element_types, entities, notations, attribute_definitions): New
206            attributes.
207            (DocumentType): Support for child nodes, entities, notations, and
208            element types.
209            (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
210            New classes.
211    
212            * Dumper.pm: Support for general entities, notations, element type
213            definitions, and attribute definitions.
214    
215    2008-10-15  Wakaba  <wakaba@suika.fam.cx>
216    
217            * NanoDOM.pm (create_processing_instruction): New method.
218            (xml_version, xml_encoding, xml_standalone): New attributes.
219            (ProcessingInstruction): New class.
220    
221    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
222    
223            * HTML.pm.src: Handling of end tags in the foreign content
224            insertion mode was partially wrong, because of wrong bit
225            operations.
226    
227    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
228    
229            * NanoDOM.pm (dom_config): New attribute (do nothing), for
230            Whatpm::XML::Parser support.
231    
232    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
233    
234            * Makefile: New rule to make HTML/Tokenizer.pm is added.
235    
236            * HTML.pm.src: Tokenizer part moved to another file.
237    
238    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
239    
240            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
241    
242    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
243    
244            * HTML.pm.src: Element category constants redefined.
245            
246    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
247    
248            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
249            construction stage synced with the spec (HTML5 revisions 2139 and
250            2302).
251    
252    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
253    
254            * ContentChecker.pm: New error level "html5_fact" added, which
255            should be tentatively used until all of requirements are properly
256            specced as RFC 2119 "MUST" in HTML5.
257    
258    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
259    
260            * ContentChecker.pod: Note on internal flags is added.
261    
262    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
263    
264            * HTML.pm.src: An AAA bug fixed.
265    
266    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
267    
268            * HTML.pm.src: If another node is inserted by the parser, don't
269            reuse existing Text node to append a character (HTML5 revision
270            2124).
271    
272    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
273    
274            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
275            revisions 1731 and 2128).
276    
277    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
278    
279            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
280            Support for end tags of camelCase SVG elements were broken.  A
281            wrong error type text fixed.
282    
283    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
284    
285            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
286    
287    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
288    
289            * HTML.pm.src: Support for new definition of |param| and |source|
290            start tag parsing (HTML5 revision 1731).
291    
292    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
293    
294            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
295    
296    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
297    
298            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
299            revisions 1731 and 1831).
300    
301    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
302    
303            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
304            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
305            revision 1731).
306    
307    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
308    
309            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
310            revision 1731).  End tags of |option| and |optgroup| elements are
311            now optional (HTML5 revision 1731).
312    
313    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
314    
315            * HTML.pm.src: New "special" elements added to the list (HTML5
316            revision 1778).  "strile" -> "strike".
317    
318    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
319    
320            * ContentType.pm (get_sniffed_type): Support for the "better"
321            content sniffing (HTML5 revision 1927).  In a case the official
322            type was not returned when the method is invoked in the list
323            context.
324    
325    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
326    
327            * HTML.pm.src: Character references for non-space C0 characters,
328            including U+000B VT, DEL character, noncharacter code points, are
329            now converted to the U+FFFD character (cf. HTML5 revision 2138).
330    
331    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
332    
333            * ContentChecker.pm: |form=""| check support added.
334    
335    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
336    
337            * ContentChecker.pm: |contextmenu| validness is now checked using
338            |id| and |id_type| properties, and |menu| property is removed.
339    
340    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
341    
342            * ContentChecker.pm: Prepare for |form| |name| attribute's
343            duplication checking.
344    
345    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
346    
347            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
348            unsupported charset="" parameter value (HTML5 revision 2131).
349    
350    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
351    
352            * HTML.pm.src: Reminding places where U+000B is allowed as a space
353            character is fixed (cf. HTML5 revision 1738).
354    
355            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
356            space characters (HTML5 revision 1738).
357    
358    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
359    
360            * HTML.pm.src: The "anything else" case for the "after after body"
361            insertion mode was not updated to swtich to the "in body"
362            insertion mode.  U+000B is no longer a space character for the
363            purpose of tree construction phase (HTML5 revision 1738).
364    
365    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
366    
367            * HTML.pm.src: U+000B is no longer a space character (HTML5
368            revision 1738).
369    
370    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
371    
372            * ContentType.pm: 0x0B is no longer a space character (HTML5
373            revision 1738).
374    
375            * HTML.pm.src: U+000B is no longer a space character for the
376            algorithm for extracting an encoding from a Content-Type (HTML5
377            revision 1738).
378    
379    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
380    
381            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
382    
383    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
384    
385            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
386    
387            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
388            matching.
389    
390    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
391    
392            * LangTag.pm: Warn for private use language subtags.  Error level
393            typos fixed.  Support for Suppress-Script field.
394    
395            * mklangreg.pl: Support for dumping of nested structure.
396    
397    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
398    
399            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
400            recommended case as per RFC 4646.
401    
402    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
403    
404            * LangTag.pm (check_rfc4646_langtag): New method.
405    
406    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
407    
408            * mklangreg.pl: New script.
409    
410            * Makefile: Updated for creation of the module for language subtag
411            registry.
412            
413    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
414    
415            * Makefile: WebIDL.html added.
416    
417            * WebIDL.pod: New documentation.
418    
419    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
420    
421            * WebIDL.pm: Checker's error types are redefined.
422    
423    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
424    
425            * WebIDL.pm: Parser's error types are redefined.  Some forward
426            compatible parsing bugs are fixed.  Some unreachable codes are
427            commented out.
428    
429    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
430    
431            * WebIDL.pm: Support for the reminding extended attributes are
432            added.  It does not satisfy the definition that a forward
433            interface declaration has an extended attribute.  It seems that
434            unless explicitly allowed multiple extended attributes with the
435            same name is not allowed, though it is not explicitly mentioned in
436            the spec.
437    
438    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
439    
440            * WebIDL.pm: Unescapes extended attribute names and extended
441            attribute identifiers.  Preserve whether an extended attribute has
442            an argument list of not.  Support for extended attributes:
443            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
444            NameGetter, NameSetter, and Null.
445            (has_argument_list): New attribute.
446            (idl_text): Stringifies argument lists, if any, even if it is
447            empty.
448    
449    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
450    
451            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
452            |{s_kwd}| in DATA_STATE as default.
453    
454    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
455    
456            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
457            by |{s_kwd}| in DATA_STATE.
458    
459    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
460    
461            * HTML.pm.src: Shorten keys.
462    
463    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
464    
465            * HTML.pm.src: Remove checking for control character, surrogate
466            pair, or noncharacter code points and non-Unicode code
467            points (they should be handled by Whatpm::Charset::UnicodeChecker).
468            (parse_char_stream): Support for the |$get_wrapper| argument and
469            character stream error handlers.
470    
471    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
472    
473            * ContentChecker.pm: Don't call |loda_ns_module|
474            for null-namespace elements/attributes.
475    
476            * HTML.pm.src: Fact out $disallowed_control_chars
477            as a hash.
478    
479    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
480    
481            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
482            and |{next_char}| initializations are moved to initialization
483            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
484            with |parse_char_stream|.
485    
486    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
487    
488            * HTML.pm.src (parse_char_stream): Make |set_next_char|
489            invoke |manakai_read_until|, not only |read|, where
490            possible, to decrease the number of |read| method calls.
491    
492            * mkhtmlparser.pl: Related changes to the aforementioned
493            modification.
494    
495    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
496    
497            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
498            would report character error from now.
499    
500    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
501    
502            * HTML.pm.src: White-space-leaded non-white-space character
503            tokens in "before head insertion mode" was not
504            correctly handled.
505            (set_inner_html): Reimplemented using CharString decodehandle
506            class.  Support for $get_wrapper argument.  Support
507            for |{read_until}| feature.
508    
509    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
510    
511            * HTML.pm.src: Make a "bare ero" error for unknown
512            entities point the "&" character.
513    
514    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
515    
516            * HTML.pm.src: It turns out that U+FFFD don't have to
517            be added to the list of excluded characters.
518    
519    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
520    
521            * HTML.pm.src ($char_onerror): Have character decoder's |line|
522            and |column| a higher priority than the one set by the
523            tokenizer's input handler.
524            ($self->{read_until}): Exclude U+FFFD (but this might
525            not be necessary, since now we do line/column fixup in
526            the character decode handle).
527    
528    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
529    
530            * HTML.pm.src: Use |{read_until}| where possible.
531    
532    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
533    
534            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
535            and |manakai_getc_until| to |manakai_read_until| to
536            reduce the number of string copies.
537    
538    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
539    
540            * HTML.pm.src (parse_char_string): Use newly created
541            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
542            standard feature to |open| a string as a filehandle,
543            since Perl's string filehandle seems not supporting |ungetc|
544            method correctly.
545            (parse_char_stream): Define |{getc_until}| method.
546            (DATA_STATE): Experimental support for |getc_until| feature.
547    
548    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
549    
550            * HTML.pm.src: Check points added to newly added branches.
551    
552    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
553    
554            * HTML.pm.src: Remove |{char}|, which is no longer used.
555            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
556            and replaced by |{prev_state}|.
557    
558            * mkhtmlparser.pl: Remove |{char}| feature.
559            Remove |!!!back-next-input-character;| macro.
560    
561    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
562    
563            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
564            entity related tokenizer states in favor of new states
565            implementing the consume character reference algorithm.
566    
567    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
568    
569            * HTML.pm.src: "Consume a character reference" algorithm is
570            now implemented as a tokenizer's state, rather than
571            a method, with minimum changes (more changes will
572            be made, in due course).  "Bogus comment state"'s inner
573            loop gets removed.
574    
575    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
576    
577            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
578            into their own tokenizer states.
579    
580    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
581    
582            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
583            is split into three states.
584    
585    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
586    
587            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
588            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
589            no longer does the tokenizer have to push back next input
590            characters in those states.
591    
592    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
593    
594            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
595            into four states so that no longer does the tokenizer have to push
596            back next input characters in that state.
597    
598    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
599    
600            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
601            which can be used to insert some wrapper between the character
602            stream handle and the tokenizer.  (It is currently not supported
603            for |set_inner_html| for |Element|s).
604    
605    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
606    
607            * HTML.pm.src: Ignore punctuations in charset names.
608    
609    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
610    
611            * ContentChecker.pm: Support for charset-layer error levels.
612    
613            * HTML.pm.src: Don't specify |text| argument for the
614            |chardecode:fallback| error, since it is not the encoding
615            being used alternatively.
616    
617    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
618    
619            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
620    
621    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
622    
623            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
624    
625    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
626    
627            * HTML.pm.src: Bug fix and sync with the spec with regard
628            to after after frameset insertion mode processing (HTML5
629            revision 1909).  Note that the implementation was wrong
630            per the old spec before the r1909 changes.
631    
632    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
633    
634            * HTMLTable.pm: scope=auto algorithm fix synced with the
635            spec (HTML5 revision 2093).
636            ($process_row): Algorithm step numbers synced with the
637            spec (HTML5 revision 2092).
638    
639    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
640    
641            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
642            revision 2094).
643    
644    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
645    
646            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
647    
648    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
649    
650            * HTML.pm.src: '"' and "'" at the end of attribute
651            name (after another attribute) now raise parse error (HTML5
652            revision 2123).  Empty unquoted attribute values are no
653            longer allowed (HTML5 revision 2122).
654    
655    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
656    
657            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
658            revision 2130).
659    
660    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
661    
662            * ContentChecker.pm: |xml:lang| attribute value must be same
663            as |lang| attribute value for HTML elements (HTML5 revision 2062
664            and so on).
665    
666    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
667    
668            * ContentChecker.pm: Error level definition for |xml_id_error|
669            was missing.
670    
671            * URIChecker.pm: The end of the URL should be marked as the
672            error location for an empty path error.  The position
673            between the userinfo and the port components should be
674            marked as the error location for an empty host error.
675    
676    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
677    
678            * URIChecker.pm: Set parameters representing where in the
679            value the error occurs for errors.  Report unknown
680            address format error in warning level, since address
681            formats are rarely added.  Path segments starting with "/.."
682            were misinterpreted as a dot-segment.
683    
684    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
685    
686            * URIChecker.pm (check_iri_reference): Requires
687            |Message::DOM::DOMImplementation|.
688    
689    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
690    
691            * IMTChecker.pm: Updated for the new error reporting architecture.
692    
693            * ContentChecker.pm: Error levels for IMTs are added.
694    
695    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
696    
697            * H2H.pm (_shift_token): Support for unquoted HTML attribute
698            values.
699    
700    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
701    
702            * CacheManifest.pm: Support for new style of error
703            reports.
704    
705            * HTML.pm.src: Set line=1, column=1 to the document node.
706    
707    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
708    
709            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
710            and URL checkers.  Support for more error levels for bogus
711            langauge tag and URL "standards".
712    
713            * LangTag.pm, URIChecker.pm: Support for new style error
714            level reporting.
715    
716    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
717    
718            * ContentChecker.pm: Support for RDF/XML error levels.
719    
720            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
721            specifying.  Error types are revised.
722    
723    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
724    
725            * ContentChecker.pm: All error reporting method calls are
726            renewed.
727    
728    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
729    
730            * HTML.pm.src: All error type names and "text" parameters
731            are revised.  Use new style for "level" specification.
732    
733            * mkhtmlparser.pl: Use new style for "level" specification.
734    
735    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
736    
737            * WebIDL.pm (parse_char_string): Simplified error
738            reporting process for broken ignored valuetype definition.
739            (Valuetype idl_text): Support for special "DOMString" name.
740    
741    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
742    
743            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
744            terminal of the ScopedName is "DOMString", such that whether
745            the last part of the scoped name is "DOMString" or "_DOMString"
746            later.  It is necessary to determine whether a |typedef|
747            definition should be ignored or not.
748            (parse_char_string): Unescape the identifier of
749            exception members.
750            ($resolve): Return undef for builtin types and sequence<T>
751            types (we might not have to do this, however...).
752            (check): Support checking for Exceptions, Valuetypes,
753            and Typedefs.
754            ($serialize_type): Support for "DOMString::::" syntax.
755            (Typedef idl_text): Output Type as "DOMString" if it
756            is really "DOMString" (i.e. its internal representation
757            is "::DOMString::").
758    
759    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
760    
761            * WebIDL.pm ($resolve): New code, based on resolve code
762            for constant types in the |check| method.
763            (check): Support for checking of attributes, operations, and
764            arguments.
765            (Attribute/Operation idl_text): Exception names in getraises,
766            setraises, and raises clauses is serizlied by |$serialize_type|
767            code.
768    
769    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
770    
771            * WebIDL.pm ($integer): Order of selections are changed to match
772            hexadecimal numbers (the original pattern, taken from the spec,
773            was not work for hexadecimal numbers, because the "0" prefix
774            matches to the [0-7]* part (as an empty string) and therefore
775            it does not match with remaining "x..." part of a "0x..." integer
776            literal.
777            ($get_type): It now returns a string, not an array reference,
778            for regular types and |sequence| types (i.e. it in any case
779            returns a string).
780            ($get_next_token): The second item in the array that represents
781            a integer or float token is now a Perl number value, not the
782            original string representation of the number.
783            (check): Support for const value consistency checking.
784            No extended attribute is defined for constants.
785            (Node subclasses): Use simple strings rather than array references
786            for default data type values.
787            ($serialize_type): Type values are now simple strings.
788            (value): If the new attribute value is a false value, then
789            a FALSE value is set to the attribute.
790    
791    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
792    
793            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
794            in its stringified format ("scoped name" as defined in the
795            spec).  Note that future version of this module should not use
796            array references for type values and the |type_text| attribute
797            should be made obsolete.
798            (parse_char_string): Unescape attribute names.
799            (check): Support for checking of whether inherited interfaces
800            are actually defined or not.  Support for checking of whether
801            interface member identifiers are duplicated or not.
802            ($serialize_type): Scoped names are returned as is.  A future
803            version of this code should escape identifiers other than "DOMString",
804            otherwise the idl_text would be non-conforming.
805    
806    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
807    
808            * WebIDL.pm (parse_char_string): Set line/column numbers
809            to generated nodes.  Unescape identifiers.  Extended attributes
810            for Definition's were ignored.
811            (append_child): Set |parent_node| attribute.
812            (parent_node): New attribute.
813            (check): Support interface/exception members.  Support
814            extended attributes.  Support definition identifier uniqueness
815            constraint.
816            (qualified_name): New attribute.
817            (Interface/Exception idl_text): Extended attributes were
818            not prepended to the returned text.
819    
820    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
821    
822            * WebIDL.pm (parse_char_string): Set line/column numbers
823            to interface object experimentally.  s/shift/pop/g, shift
824            would make things wrong.  Support for interface forward
825            declarations was missing.  Broken interface declarations
826            with no block were not ignored entirely.
827            (Whatpm::WebIDL::Node): New abstract class.  This class
828            makes things easier.
829            (child_nodes): New attribute.  Unlike DOM's attribute with
830            same name, this attribute returns a dead list of nodes for
831            simplicity.
832            (get_user_data, set_user_data): New methods.
833            (Module idl_text): A SPACE character should be inserted
834            before the |{| character.
835            (Interface idl_text): Support for interface forward declarations.
836            (is_forward_declaration): New attribute.
837    
838    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
839    
840            * WebIDL.pm (type_text): Better serializer.
841    
842    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
843    
844            * WebIDL.pm: Revise forward-compatible parsing so that
845            it now can handle broken extended attributes and as such.
846    
847    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
848    
849            * WebIDL.pm: Real support for extended attributes.
850            Support for extended attributes with arguments.
851    
852    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
853    
854            * WebIDL.pm: Support for |exception| syntax.
855            (Interface->idl_text): Tentative support for inheritances.
856    
857    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
858    
859            * WebIDL.pm: Hierarchical scoped name support was broken.
860            Support for raises, setraises, and getraises syntaxes.
861    
862    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
863    
864            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
865            proper support for types, extended attributes, and exceptions yet).
866            WebIDL parser, version 1 (no support for exceptions yet,
867            no proper support for extended attributes yet).
868    
869    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
870    
871            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
872    
873    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
874    
875            * WebIDL.pm: Support for valuetype and const.
876    
877    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
878            
879            * WebIDL.pm: New module.
880    
881    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
882    
883            * Makefile (Entities.html): URI changed.
884    
885    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
886    
887            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
888    
889    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
890    
891            * HTML.pm.src (_get_next_token): A parse error was missing.
892    
893    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
894    
895            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
896            spec (commented out). (HTML5 revision 1711).
897    
898    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
899    
900            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
901    
902            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
903            as UTF-16 (HTML5 revision 1701).
904    
905    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
906    
907            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
908            1692).
909    
910    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
911    
912            * HTML.pm.src: The secondary insertion mode used when switching
913            to foreign content is the "in body" insertion mode (HTML5 revision
914            1696).
915    
916    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
917    
918            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
919            revision 1697).
920    
921    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
922    
923            * HTML.pm.src: Support for end-of-file token in foreign content
924            insertion mode (HTML5 revision 1693).  Update SVG camelCase
925            attribute list (HTML5 revision 1700).  <textarea> closes
926            </select> (HTML5 revision 1699).  More start tags close in
927            foreign content insertion mode (HTML5 revision 1698).
928    
929    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
930    
931            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
932    
933    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
934    
935            * HTML.pm.src: More robust charset parameter detection (HTML5
936            revision 1674).
937    
938    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
939    
940            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
941            revision 1676).
942    
943    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
944    
945            * HTML.pm.src: Ignore language part of public identifiers for
946            quriks mode detection (HTML5 revision 1679).
947    
948    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
949    
950            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
951            revision 1685).
952    
953    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
954    
955            * HTML.pm.src: Support for EOF in new states for tags (HTML5
956            revision 1684).
957    
958    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
959    
960            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
961            work (HTML5 revision 1690).
962    
963    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
964    
965            * HTML.pm.src (_tree_construction_main): Change handling of
966            end tags in head insertion modes (HTML5 revision 1686).
967            (parse_char_string): Bug fix for non-utf8 character string handlings.
968            (parse_char_stream): |ungetc| does not work well for this context.
969    
970    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
971    
972            * HTML.pm.src (parse_byte_string): Redefined to invoke
973            |parse_byte_stream|.
974            (parse_byte_stream): New method.
975    
976    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
977    
978            * HTML.pm.src (parse_byte_string): Fix the column number reported
979            by encoding layer error reporter.
980    
981    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
982    
983            * HTML.pm.src (parse_byte_string): Use streaming decoder
984            rather than converting the whole byte string and then parsing.
985            Propagate errors in character encoding layer.
986            (get_next_token): Precise error reporting for |bare stago| error.
987    
988    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
989    
990            * HTML.pm.src (parse_char_stream): New method.
991            (parse_char_string): This method is now defined as an invocation
992            of the |parse_char_stream| method.
993    
994    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
995    
996            * HTML.pm.src (parse_byte_string): Report various status
997            of the sniffing as info-level errors.  Support for new
998            decoding framework in parser resestting.
999            (new): Various default error levels were not set.
1000    
1001    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
1002    
1003            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
1004            algorithm, except for the actual sniffing, is implemented
1005            with new framework with Message::Charset::Info.
1006    
1007    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
1008    
1009            * CacheManifest.pm (_parse): Drop fragment identifiers from
1010            URIs in fallback section (HTML5 revision 1596).
1011    
1012    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
1013    
1014            * Makefile (Entities.html): URI has changed.
1015    
1016    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
1017    
1018            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
1019            1553).
1020    
1021    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
1022    
1023            * ContentChecker.pm: Noted that those returned in |table| are
1024            no longer table elements, but table objects returned
1025            by Whatpm::HTMLTable.
1026    
1027            * HTMLTable.pm (form_table): Return table element node
1028            as |$table->{element}|.
1029            (assign_header): Support for the |headers=""| attribute.
1030    
1031    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
1032    
1033            * HTMLTable.pm (assign_header): New function; first version
1034            with no support for headers="".
1035            (form_table): Include table width and height to the returned
1036            table object for covenience.  Indexing in column assignement was wrong.
1037            Set whether a data cell is empty or not for convenience.
1038    
1039    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1040    
1041            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
1042    
1043    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1044    
1045            * HTMLTable.pm: How table model errors are detected is
1046            changed (HTML5 revision 1387).
1047    
1048    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1049    
1050            * HTMLTable.pm: The algorithm now moves |tfoot| elements
1051            to the end of the table (HTML5 revision 1380).
1052    
1053    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1054    
1055            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
1056            of 1-based (HTML5 revision 1376).
1057    
1058    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1059    
1060            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
1061            and "Content-Encoding" no longer prevent sniffing (HTML5
1062            revision 1288).
1063    
1064    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1065    
1066            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
1067            1282).
1068    
1069    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
1070    
1071            * ContentChecker.pm: Support for global attributes.
1072            Status of XML specs are added.
1073    
1074    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
1075    
1076            * ContentChecker.pm (check_element): Support for |template=""|
1077            and |ref=""| attribute (referent element type checking).
1078    
1079    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
1080    
1081            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
1082            1500) is implemented (except for IDNA part and URI-scheme-specific
1083            knowledge).  Line number counting was wrong for LF-only
1084            documents.
1085    
1086    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1087    
1088            * HTML.pm.src: Raise an parse error for any disallowed
1089            character (HTML5 revision 1263).
1090    
1091    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1092    
1093            * mkentitylist.pl: Support for new HTML5 entity table format (the
1094            definition for |AElig;| was missing).
1095    
1096    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1097    
1098            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
1099            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
1100    
1101    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1102    
1103            * HTML.pm.src: List of element names that close foreign content
1104            insertion mode is added (HTML5 revisions 1412 and 1418).
1105    
1106    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1107    
1108            * HTML.pm.src: Support for |mglyph| and |malignmark|
1109            elements (HTML5 revision 1410).
1110    
1111    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1112    
1113            * HTML.pm.src: Support for new long MathML entities (HTML5
1114            revision 1406).
1115    
1116    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1117    
1118            * HTML.pm.src: CDATA section support for MathML and SVG
1119            elements (HTML5 revision 1404 and 1420).
1120    
1121    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
1122    
1123            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
1124            elements (HTML5 revision 1404).  Unused !!!macro definitions
1125            are removed.
1126    
1127    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
1128    
1129            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
1130            are raised is changed (HTML5 revision 1404).
1131    
1132    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
1133    
1134            * HTML.pm.src: Category sets that are no longer used are removed.
1135    
1136    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
1137    
1138            * HTML.pm.src: The ->[1] property of stack entries are now
1139            replaced by constants representing element category.
1140    
1141    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
1142    
1143            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
1144            for error reporting.  (This is a preparation for using constant
1145            value for ->[1].)
1146    
1147    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
1148    
1149            * RDFXML.pm: Typo fixed.
1150    
1151    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
1152    
1153            * ContentChecker.pm: |fact_level| is now treated
1154            as same as |must_level|, i.e. level = |m|.
1155            (check_element): Make list of URIs in the DOM.
1156    
1157    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1158    
1159            * RDFXML.pm: Language accessor implemented.
1160            Local (null-namespace) attribute support.
1161    
1162    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1163    
1164            * RDFXML.pm: Factored out ID checking code.
1165    
1166    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1167    
1168            * RDFXML.pm: TODO items noted.  Validation of ID
1169            and URI attributes is implemented.  Warn if unknown
1170            value is used in rdf:parseType="" attribute.
1171    
1172            * URIChecker.pm (check_rdf_uri_reference): New function.
1173    
1174    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1175    
1176            * RDFXML.pm: bnodeid implemented.  Relative references
1177            are now resolved.
1178    
1179    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1180    
1181            * ContentChecker.pm: RDF reification implemented.
1182    
1183            * RDFXML.pm: undef vs false bug fixed.
1184            Reification implemented.
1185    
1186    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1187    
1188            * RDFXML.pm: s/id/ID/ for attribute name.
1189            The |node| arguments are added for |ontriple| calls.
1190            Too many "attribute not allowed" errors were raised.
1191    
1192            * ContentChecker.pm: Initial experimental support for rdf:RDF
1193            element.
1194    
1195    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1196    
1197            * RDFXML.pm: New module.
1198    
1199    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1200    
1201            * HTML.pm.src (set_inner_html): Line/column number
1202            code was old one yet.
1203    
1204    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1205    
1206            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
1207            errors.  Line/column markings for DOCTYPE, comment, and
1208            character tokens are reintroduced; otherwise,
1209            error location for "not HTML5" error and errors
1210            for implied elements are not attached.
1211    
1212    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1213    
1214            * HTML.pm.src: Set line/column numbers to attributes.
1215    
1216            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
1217            (value): Setter implemented.
1218    
1219            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
1220    
1221    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1222    
1223            * HTML.pm.src: Unused line/column markings are removed.
1224    
1225    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1226    
1227            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
1228            flag, which is no longer used.
1229    
1230    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
1231    
1232            * HTML.pm.src: Set line/column information to element nodes.
1233    
1234            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
1235            and !!!insert-element-t): Set line/column information to
1236            element nodes.
1237    
1238    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
1239    
1240            * HTML.pm.src (_get_next_token): The first "<" character
1241            in "<?", "<>", or "</>" should be the error point.
1242    
1243    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1244    
1245            * HTML.pm.src: Some more fixes on error position reporting.
1246    
1247    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1248    
1249            * HTML.pm.src: Token-level precious error reporting.
1250    
1251    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1252    
1253            * HTML.pm.src: Preparation for more precious error point
1254            reporting.
1255    
1256    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1257    
1258            * HTML.pm.src: Error type revised.
1259    
1260    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1261    
1262            * HTML.pm.src: Similar codes are merged together, again.
1263    
1264    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1265    
1266            * HTML.pm.src: Similar codes are merged together.
1267    
1268    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
1269    
1270            * mkhtmlparser.pl: Set "level" parameter to parse errors.
1271    
1272            * HTML.pm.src: Code refined.
1273    
1274    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1275    
1276            * HTML.pm.src: |</body>| treatement has been changed (HTML5
1277            revision 1348).  Note that I really don't know this makes
1278            any difference in the black-box behavior of the parser.
1279    
1280    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1281    
1282            * HTML.pm.src: New end-of-file token implementation (HTML5
1283            revision 1348).
1284    
1285    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1286    
1287            * HTML.pm.src: |applet| support (HTML5 revision 1347).
1288    
1289    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1290    
1291            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1292    
1293    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1294    
1295            * HTML.pm.src: Support for |<input>| in the "in select" insertion
1296            mode and support for the "in select in table" insertion mode (HTML5
1297            revision 1342).
1298    
1299    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1300    
1301            * HTML.pm.src: No foster parenting for <script> and <script>
1302            in non-tainted <table>s (HTML5 revision 1336).
1303    
1304    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1305    
1306            * HTML.pm.src: Ignore white space characters between <html>
1307            and <head> (HTML5 revision 1332).
1308    
1309    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1310    
1311            * HTML.pm.src: Treat <input type=hidden> as if it were a
1312            white space (HTML5 revision 1331).
1313    
1314    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1315    
1316            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1317            element (HTML5 revision 1330).
1318    
1319    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1320    
1321            * HTML.pm.src: <title> is always appended to the current
1322            element (HTML5 revision 1328).
1323    
1324    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1325    
1326            * HTML.pm.src: White space in tainted tables are moved
1327            into foster parents (HTML5 revision 1326).
1328    
1329    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1330    
1331            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1332            revision 1321).
1333    
1334    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1335    
1336            * HTML.pm.src: |</h/n/>| case code rearranged to align with
1337            the spec (HTML5 revision 1320).  Note that we finally complete
1338            all of HTML5 revision 1320 changes.
1339    
1340    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1341    
1342            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1343            tags (HTML5 revision 1320).
1344    
1345    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1346    
1347            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1348    
1349    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1350    
1351            * HTML.pm.src: A "generate implied end tags" code (t409.1)
1352            could not be reached so that it is now removed (HTML5 revision
1353            1320).
1354    
1355    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1356    
1357            * HTML.pm.src: Code for the case of |</div>| and so on are revised
1358            to align with new spec text (HTML5 revision 1320).
1359    
1360    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1361    
1362            * HTML.pm.src: Remove strange |if| condition; however, it should
1363            have had no harm in theory.
1364    
1365    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1366    
1367            * HTML.pm.src (_tree_construction_main): '</p>' in body
1368            case is split from other end tags for the preparation
1369            of implementing HTML revision 1320.
1370    
1371    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1372    
1373            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1374            revision 1320).
1375    
1376    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1377    
1378            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1379            and "div" (HTML5 revision 1318).  Add comments to where
1380            |form| pointer association codes should be inserted (HTML5 revision
1381            1319).
1382    
1383    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
1384    
1385            * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1386    
1387    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1388    
1389            * HTML.pm.src: Since the case t268 should never be reached (no
1390            other token type, there are), it is replaced by a |die| statement.
1391    
1392    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1393    
1394            * HTML.pm.src: Typo fixed.
1395    
1396    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
1397    
1398            * HTML.pm.src (_tree_construction_initial): Some limited quirks
1399            doctypes were not uppercased for comparison.
1400    
1401    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1402    
1403            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1404            are added.
1405    
1406    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1407    
1408            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1409            are set.  Cases that are unlikely reached are noted as so.
1410    
1411    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1412    
1413            * HTML.pm.src: Checkpoints for debugging are added.
1414    
1415            * mkhtmlparser.pl: Support for |!!!cp| syntax.
1416    
1417    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1418    
1419            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1420            for simplicity.
1421    
1422    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1423    
1424            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1425            revision 1307).
1426    
1427    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1428    
1429            * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1430            is set are changed (HTML5 revision 1305).
1431    
1432    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1433    
1434            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1435            1292).  Entities are not parsed in comment-like part in RCDATA
1436            elements (HTML5 revision 1294).  Allow bare & at the end
1437            of attribute value literals (HTML5 revision 1296).  More
1438            quirks mode doctypes (HTML5 revision 1302).  Requires spaces
1439            between attributes and ban attribute names or unquoted
1440            attribute values containing single or double quotes (HTML5
1441            revision 1303).
1442    
1443    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1444    
1445            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
1446            and related errors unless it is an HTML document (though the spec
1447            is unclear on whether it is applied to XHTML document).
1448    
1449            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1450            are added.
1451    
1452    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1453    
1454            * HTML.pm.src: s/local_name/manakai_local_name/g.
1455    
1456    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
1457    
1458            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1459    
1460            * HTML.pm.src: |charset| in |content| attribute is
1461            case-insensitive (HTML5 revision 1270).
1462    
1463    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
1464    
1465            * ContentChecker.pm: New status constants are added.
1466            ($ElementDefault): |status| added.
1467            (check_element): Err for non-standard or deprecated elements.
1468            (_attr_status_info): For non-standard or deprecated attributes.
1469    
1470    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1471    
1472            * ContentChecker.pm (_attr_status_info): New internal method.
1473    
1474    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1475    
1476            * ContentChecker.pm (check_element): Element standardized
1477            status information is now dispatched.
1478    
1479    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1480    
1481            * ContentChecker.pm (check_element): Fix |del|-and-significant
1482            problem by adding some more arguments.
1483    
1484    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1485    
1486            * ContentChecker.pm (check_element): Use context of
1487            container-for-the-purpose-of-content-model element (not transparent
1488            element) for |check_child_element| calling and significant
1489            text flag marking.  This reintroduces |<del>|-and-significant
1490            problem again.
1491    
1492    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1493    
1494            * ContentChecker.pm (check_element): Make semi-transparent
1495            elements ignored for the purpose of phase changes in
1496            content model checking.
1497    
1498    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1499    
1500            * ContentChecker.pm (check_element): In-element state
1501            was not properly managed for transparent cases.
1502    
1503    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1504    
1505            * ContentChecker.pm (check_element): Support for |video|
1506            and |audio| as semi-transparent elements.
1507    
1508    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1509    
1510            * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1511            (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1512            Support for |html:object| as a semi-transparent element.
1513    
1514    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1515    
1516            * ContentChecker.pm (check_element): The way to traverse
1517            the tree is entirely revised to make it easier to track
1518            the state of ancestors/descendants.  As a result of this
1519            revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1520            support for content model checking for HTML elements |figure|,
1521            |object|, |video|, and |audio| and checking for XML elements (and
1522            some XMLNS checkings) are dropped for now.  They will be
1523            reimplemented in due cource.
1524    
1525    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1526    
1527            * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1528            token (HTML5 revision 1225).
1529    
1530    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1531    
1532            * ContentChecker.pm ({unsupported_level}): New value.
1533    
1534            * HTML.pm.src: Save whether |meta| |content| attribute
1535            contains character references or not.
1536    
1537    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1538    
1539            * ContentChecker.pm (_get_children): (Incompleted) attempt to
1540            imlement significant content checking for contents
1541            with |del| elements.
1542    
1543    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1544    
1545            * ContenteChecker.pm ($HTMLTransparentElements): More
1546            elements are added.
1547            (_get_children): HTML |object| elements are now semi-transparent.
1548    
1549            * NanoDOM.pm (manakai_html, manakai_head): New methods.
1550    
1551    2008-02-16  Wakaba  <wakaba@suika.fam.cx>
1552    
1553            * CacheManifest.pm: HTML5 revision 1211 implemented.
1554    
1555            * CacheManifest.pod: Updated.
1556    
1557    2008-02-10  Wakaba  <wakaba@suika.fam.cx>
1558    
1559            * ContentChecker.pm (check_document, check_element): Support
1560            for second argument ($onsubdoc).
1561            (_get_css_parser): Removed (now it is part of WDCC).
1562    
1563    2008-02-09  Wakaba  <wakaba@suika.fam.cx>
1564    
1565            * ContentChecker.pm (_get_css_parser): New.
1566    
1567    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1568    
1569            * ContentChecker.pm ($AnyChecker): Old way to add child elements
1570            for checking had been used.
1571    
1572    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1573    
1574            * ContentChecker.pm (check_element): New todo item type |descendant|.
1575    
1576    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1577    
1578            * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1579            specced in RFC 2046 and RFC 4288.
1580            (application/atom+xml): Definition added.
1581    
1582    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1583    
1584            * URIChecker.pm: Make RFC 3986 should-level errors
1585            warnings (rather than SHOULD-level errors).
1586    
1587    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1588    
1589            * NanoDOM.pm (get_user_data, set_user_data): New methods.
1590    
1591            * HTML.pm.src: A flag for character references in attribute
1592            values are added.  Set |manakai_has_reference| user data
1593            to |charset| attribute.
1594    
1595    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1596    
1597            * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1598            attributes.
1599    
1600            * ContentChecker.pm (check_document): Warn if charset requirements
1601            cannot be tested.
1602    
1603    2007-11-19  Wakaba  <wakaba@suika.fam.cx>
1604    
1605            * HTML.pm.src (parse_byte_string): Detect charset
1606            by universalchardet if charset parameter is not specified.
1607    
1608            * Makefile (Charset-all, Charset-clean): New rules.
1609    
1610    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1611    
1612            * ContentChecker.pm (check_document): Check the existence
1613            of character encoding declaration and selection of encoding
1614            for HTML document.
1615    
1616    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1617    
1618            * ContentType.pm (get_sniffed_type): Return also the official
1619            type in list context.
1620    
1621    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1622    
1623            * ContentType.pm: Sniffing with leading white space
1624            ignoring (HTML5 revisions 1013 and 1016).
1625    
1626    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1627    
1628            * ContentType.pm: HTML5 revision 1013 changes, except for leading
1629            white spaces, are implemented.
1630    
1631    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1632    
1633            * HTML.pm.src (parse_char_string): Set |inner_encoding|
1634            attribute if possible.
1635    
1636    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1637    
1638            * HTML.pm.src (parse_byte_string): New method.
1639            (parse_char_string): New alias for |parse_string|.
1640            (main phase): Invoking "change the encoding" algorithm if desired.
1641    
1642            * HTML.pod: Updated.
1643    
1644    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1645    
1646            * HTML.pod (get_inner_html): Removed.
1647    
1648            * Makefile (HTML-all, HTML-clean): New.
1649    
1650    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1651    
1652            * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1653    
1654    2007-11-08  Wakaba  <wakaba@suika.fam.cx>
1655    
1656            * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1657    
1658    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1659    
1660            * Makefile: |CacheManifest.html| is added.
1661    
1662            * CacheManifest.pod: New file.
1663    
1664    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1665    
1666            * CacheManifest.pm: New module.
1667    
1668    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1669    
1670            * HTML.pm.src: Support for application cache selection algorithm
1671            callback.
1672    
1673    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1674    
1675            * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1676    
1677    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1678    
1679            * Makefile (clean): New rule.
1680    
1681            * NanoDOM.pm (public_id, system_id): New attributes.
1682    
1683    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1684    
1685            * Makefile (CSS-all, CSS-clean, clean): New rules.
1686    
1687    2007-10-14  Wakaba  <wakaba@suika.fam.cx>
1688    
1689            * ContentChecker.pm (check_document): Support for
1690            new |is_xml_root| flag.
1691            (check_element): Support for new |pluses| state.
1692            (_add_pluses): New method.
1693            (_remove_minuses): Support for new |minus| item.
1694    
1695    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1696    
1697            * ContentChecker.pm: Raise specific error for invalid
1698            root element.
1699    
1700    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1701    
1702            * ContentChecker.pm: Set level values for later uses.
1703    
1704    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1705    
1706            * ContentChecker.pm: Support for language tag validation.
1707    
1708    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1709    
1710            * LangTag.pm (check_rfc3066_language_tag): New method.
1711    
1712    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1713    
1714            * LangTag.pm: New module.
1715    
1716    2007-09-04  Wakaba  <wakaba@suika.fam.cx>
1717    
1718            * HTML.pm.src: Some error types were wrong.
1719    
1720    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1721    
1722            * CSS/: New directory.
1723    
1724    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1725    
1726            * ContentChecker.pm (_check_get_children): Support
1727            for |noscript| in |head|.
1728    
1729    2007-08-12  Wakaba  <wakaba@suika.fam.cx>
1730    
1731            * URI/: New directory.
1732    
1733    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1734    
1735            * HTML.pm.src: Tokenizer's states are now represented in
1736            number.
1737    
1738    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1739    
1740            * HTML.pm.src: |or|s for insertion modes are replaced
1741            by |&|s.
1742    
1743    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1744    
1745            * HTML.pm.src: Token types are now represented in number.
1746    
1747    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1748    
1749            * ContentType.pm (SEE ALSO): Updated.
1750    
1751            * HTML.pm.src: Insertion modes are now represented in number.
1752    
1753    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1754    
1755            * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1756            999) is implemented.
1757    
1758    2007-08-08  Wakaba  <wakaba@suika.fam.cx>
1759    
1760            * ContentType.pm: Sniffing for |<script| (HTML5 revision
1761            983) is implemented.
1762    
1763    2007-08-06  Wakaba  <wakaba@suika.fam.cx>
1764    
1765            * ContentChecker.pod: New documentation.
1766    
1767            * Makefile: A rule for |ContentChecker.html| is added.
1768    
1769            * ContentChecker.pm: A pod "LICENSE" section is added.
1770    
1771            * NanoDOM.pm ($VERSION): New variable.
1772    
1773    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1774    
1775            * H2H.pm: |b|, |i|, and |sub| are added to the
1776            list of allowed HTML elements.
1777    
1778    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1779    
1780            * H2H.pm: |samp| is added to the list of allowed
1781            HTML elements.
1782    
1783            * URIChecker.pm (check_iri): New.
1784            (check_iri_reference): Error type for IRI reference
1785            syntax error is changed.
1786    
1787    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1788    
1789            * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1790            is added.
1791            (check_document): Load appropriate module before validation.
1792    
1793    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1794    
1795            * ContentChecker/: New directory.
1796    
1797    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1798    
1799            * ContentChecker.pm: HTML |time| element is implemented.
1800    
1801            * HTMLTable.pm: Comments are updated as HTML5 is revised.
1802    
1803    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1804    
1805            * ContentChecker.pm (check_document): Return value
1806            even if no document element is found.
1807    
1808    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1809    
1810            * HTML.pm.src: |$in_body| is no longer a function.
1811    
1812    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1813    
1814            * HTML.pm.src: The |$in_body| code has been moved down.
1815    
1816    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1817    
1818            * HTML.pm.src: The "trailing end" insertion mode
1819            is split into "after html body" and "after html frameset"
1820            insertion modes.  Their codes are merged with "after body"
1821            and "after frameset" codes.  |$previous_insertion_mode|
1822            has been removed.  "after frameset" code is
1823            merged with "in frameset" code.
1824    
1825    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1826    
1827            * HTML.pm.src: The "before head" insertion mode is
1828            merged with the "in head" insertion mode.
1829    
1830    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1831    
1832            * HTML.pm.src: Cases in "in head" insertion mode are
1833            reorganized.
1834    
1835    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1836    
1837            * HTML.pm.src: Some cases in "in table" insertion mode
1838            are merged.
1839    
1840    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1841    
1842            * HTML.pm.src: The "in row" insertion mode is merged
1843            with "in table" insertion mode.
1844    
1845    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1846    
1847            * HTML.pm.src: The "in table" and "in table body" insertion
1848            modes are merged.
1849    
1850    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1851    
1852            * HTML.pm.src: There is no "in table head" or "in table foot"
1853            insertion mode!
1854    
1855    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1856    
1857            * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1858            now directly invoke the handler.
1859    
1860    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1861    
1862            * HTML.pm.src: Codes for "in cell" insertion mode
1863            is merged to the "in body" insertion mode code.
1864    
1865    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1866    
1867            * HTML.pm.src: Codes for "in body" and "in caption"
1868            insertion modes are merged.
1869    
1870    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1871    
1872            * HTML.pm.src: Two |!!!next-token|s were missing.
1873    
1874    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1875    
1876            * HTML.pm.src: Use numeric constant for |{content_mode}|
1877            instead of string constant for |{content_model_flag}|.
1878    
1879    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1880    
1881            * HTML.pm.src: Add the name of the attribute
1882            to the "duplicate attribute" error.
1883    
1884    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
1885    
1886            * ContentChecker.pm: Return the |class| node list.
1887    
1888    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
1889    
1890            * ContentChecker.pm: Return the |id| node list.
1891    
1892            * HTML.pm.src: A typo is fixed.
1893    
1894    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1895    
1896            * ContentChecker.pm: Drop wrong |level => 'error'| specification
1897            from "in HTML:xml:lang" error.  Character position
1898            is now the last part of the error type in the URI error
1899            description.  Report "unsupported" status for language
1900            tags, media queries, script codes, and style sheets.
1901    
1902    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1903    
1904            * ContentChecker.pm: Report error if |xml:lang|
1905            in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1906            in XML.
1907    
1908            * NanoDOM.pm (Attr.owner_document): New attribute.
1909    
1910    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1911    
1912            * HTML.pm.src: The character immediately following
1913            a bare |hcro| was discarded.  Fix handling of
1914            entity references in attribute values.
1915    
1916    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1917    
1918            * HTML.pm.src (main and trailin end phases): Token
1919            types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1920            factored out.  Error types |in html:#DOCTYPE|
1921            and |after html:#DOCTYPE| are merged
1922            into |DOCTYPE in the middle|.  |</frameset>|
1923            in fragment parsing mode changed the
1924            insertion mode.
1925    
1926    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1927    
1928            * HTML.pm.src: |$phase| has been removed; The |trailing end|
1929            phase is now an insertion mode.  Treatments for white
1930            space character tokens were incorrect for some
1931            insertion modes.  An old |meta| case was not removed.
1932    
1933    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1934    
1935            * HTML.pm.src: |meta| charset declaration extraction
1936            implemented (but changing the encoding is not yet:-).
1937    
1938    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1939    
1940            * Charset/: New directory.
1941    
1942    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1943    
1944            * H2H.pm: New Perl module (created from
1945            manakai's H2H.dis).
1946    
1947    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1948    
1949            * XMLSerializer.pm: New Perl module (created from
1950            manakai's SimpleLS.dis).
1951    
1952    2007-07-07  Wakaba  <wakaba@suika.fam.cx>
1953    
1954            * HTML.pm.src: |<!---x-->| was not processed correctly.
1955    
1956    2007-07-01  Wakaba  <wakaba@suika.fam.cx>
1957    
1958            * HTML.pm.src: Report correct error message
1959            for |<body></div></body>|.
1960    
1961    2007-07-01  Wakaba  <wakaba@suika.fam.cx>
1962    
1963            * HTMLTable.pm: An error description was incorrect.
1964    
1965    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
1966    
1967            * ContentChecker.pm: Return |{term}| list.
1968    
1969    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
1970    
1971            * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1972            nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1973            and <title> outside of head).
1974    
1975    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
1976    
1977            * IMTChecker.pm: Report warning for unregistered
1978            and private types/subtypes.
1979    
1980            * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1981            URIChecker.pm, HTMLTable.pm: Error messages are now
1982            consistent; they are all listed in
1983            <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1984    
1985    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
1986    
1987            * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1988            is now erred.  |<datalist>| is implemented.
1989            Attribute checker for |<command>| and |<menu>| are
1990            added.  Support for |contextmenu| global attribute
1991            is added.
1992    
1993    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
1994    
1995            * HTML.pm.src (_reset_insertion_mode): Interpretation
1996            of Step 3 has been changed.
1997    
1998    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
1999    
2000            * HTML.pm.src: Late |<html>| parse error is implemented.
2001    
2002    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
2003    
2004            * URIChecker.pm (check_iri_reference): A |decode| method name was
2005            incorrect.
2006    
2007            * ContentChecker.pm: Support for the |footer| element.
2008            Check URI syntax for space-separated URI attributes.
2009            Support for the |tabindex| attribute.  Support
2010            for |datetime| attribute.
2011    
2012    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
2013    
2014            * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
2015            character references).  HTML5 revision 1.146 (white space
2016            characters before root start tag).  HTML5 revision
2017            1.148 (named character references in attribute values).
2018            HTML5 revision 1.152 (<plaintext>.innerHTML get).
2019    
2020    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
2021    
2022            * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
2023            in <head>).
2024    
2025    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
2026    
2027            * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
2028            in body).
2029    
2030            * ContentChecker.pm: HTML5 revision 938 (scoped="").
2031    
2032    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
2033    
2034            * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
2035            or RCDATA in fragment parsing mode).  HTML5 revision
2036            924 (<!--> and <!--->).  HTML5 revision 926 (hn in hn).
2037    
2038    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
2039    
2040            * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
2041            for <pre> and <textarea>).
2042    
2043    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
2044    
2045            * HTML.pm.src: HTML5 revision 920 (<isindex>).
2046    
2047    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
2048    
2049            * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
2050    
2051    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
2052    
2053            * HTML.pm.src: HTML5 revision 916 (</body>).
2054            HTML5 revision 917 (conforming bare &).
2055    
2056    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
2057    
2058            * NanoDOM.pm (manakai_is_html): Setting to false did not work.
2059    
2060            * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
2061            HTML5 revision 915 (<nobr>).
2062    
2063    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
2064    
2065            * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
2066            
2067            * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
2068            New attributes.
2069    
2070    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
2071    
2072            * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
2073            in tags).
2074    
2075    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
2076    
2077            * .cvsignore: |Entities.html| is added.
2078    
2079            * HTML.pm.src: |$entity_char| is removed and
2080            requires |Whatpm::_NamedEntityList| instead.
2081            HTML5 revision 898 (refc), except that lack of refc
2082            is parse error.
2083    
2084            * mkentitylist.pl: New script.
2085    
2086            * Makefile (all): |_NamedEntityList.pm| is added.
2087            (_NamedEntityList.pm, Entities.html): New rules.
2088    
2089    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
2090    
2091            * HTML.pm.src: Parse errors immediately after U+000D
2092            were ignored and U+000D immediately following another
2093            U+000D was not converted to U+000A.
2094    
2095    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
2096    
2097            * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
2098            nodes before appended).  Parser was not ready for NULL
2099            parse error and escape flag.
2100    
2101            * NanoDOM.pm (adopt_node): New.
2102    
2103    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
2104    
2105            * HTML.pm.src: HTML5 revision 886 (insane comment in
2106            CDATA and RCDATA).  Note that current implementation
2107            is simply repeating what the spec says and it is maybe not
2108            a best way to do it.
2109    
2110    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
2111    
2112            * HTML.pm.src: HTML5 revision 884 (</form> don't close
2113            the form element if a descendant element without implied end tag has
2114            still been open).
2115    
2116    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
2117    
2118            * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
2119            space characters non-conforming).
2120    
2121    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
2122    
2123            * HTML.pm.src: An error message was incorrect.
2124            HTML5 revision 869 (C1 character references).
2125            
2126    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
2127    
2128            * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
2129            a |textarea| is removed).
2130    
2131    2007-06-05  Wakaba  <wakaba@suika.fam.cx>
2132    
2133            * NanoDOM.pm (get_attribute_node_ns): New method.
2134    
2135            * ContentChecker.pm: |script| |async| and |defer|
2136            no longer require |src|.  |async| MUST NOT be
2137            specified if |defer|.  (HTML5 revision 858).
2138    
2139    2007-05-30  Wakaba  <wakaba@suika.fam.cx>
2140    
2141            * HTML.pm.src: |<form><form>| went to inifinite loop.
2142    
2143    2007-05-27  Wakaba  <wakaba@suika.fam.cx>
2144    
2145            * ContentChecker.pm (html): Set |is_root| (allowed
2146            as a document element) flag on.
2147            (new): Removed.
2148            (check_document): New method.
2149    
2150    2007-05-27  Wakaba  <wakaba@suika.fam.cx>
2151    
2152            * ContentChecker.pm (thead, tfoot): Checker specifications
2153            were incorrect.
2154    
2155    2007-05-27  Wakaba  <wakaba@suika.fam.cx>
2156    
2157            * ContentChecker.pm ($HTMLURIAttrChecker): Include
2158            error position in the |type| option of the error.
2159    
2160            * HTMLTable.pm (form_table): The |$onerror| parameter
2161            is now optional.  Some bugs are fixed.
2162    
2163    2007-05-27  Wakaba  <wakaba@suika.fam.cx>
2164    
2165            * HTMLTable.pm: New module.
2166    
2167            * ContentChecker.pm (table): Invoke table model error checker.
2168    
2169            * NanoDOM.pm (first_child, get_attribute_ns): New.
2170    
2171    2007-05-26  Wakaba  <wakaba@suika.fam.cx>
2172    
2173            * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
2174            (link/@rel, a/@rel, area/@rel): Use new checker.
2175    
2176            * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
2177    
2178            * _LinkTypeList.pm: New file.
2179    
2180            * mklinktypelist.pl: New file.
2181    
2182            * .cvsignore: |RelExtensions.html| added.
2183    
2184            * NanoDOM.pm (child_nodes): Returns an empty array
2185            for non-child-containing node types.
2186            (text_content): New attribute.
2187    
2188    2007-05-26  Wakaba  <wakaba@suika.fam.cx>
2189    
2190            * IMTChecker.pm: New module.
2191    
2192            * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
2193            to test parameter value validity.
2194    
2195            * HTML.pm.src ($style_start_tag): Attributes were
2196            discarded.
2197    
2198    2007-05-25  Wakaba  <wakaba@suika.fam.cx>
2199    
2200            * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
2201    
2202    2007-05-25  Wakaba  <wakaba@suika.fam.cx>
2203    
2204            * URIChecker.pm: All recommendations from RFC 3986
2205            and RFC 3987 are listed (not all testable items are checked yet).
2206    
2207    2007-05-25  Wakaba  <wakaba@suika.fam.cx>
2208    
2209            * URIChecker.pm: New module.
2210    
2211    2007-05-20  Wakaba  <wakaba@suika.fam.cx>
2212    
2213            * ContentChecker.pm: Now most attributes are implemented
2214            or associated to some placeholder.
2215            ($ElementDefault): Warn unknown attributes
2216            for unknown elements as "attribute not supported".
2217            ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
2218            ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
2219            (|a| attribute checker): Reimplemented.
2220    
2221    2007-05-20  Wakaba  <wakaba@suika.fam.cx>
2222    
2223            * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
2224            ($HTMLAttrChecker): Event handler content attributes
2225            are added.
2226            (link, embed): Required attribute is now checked.
2227            (embed): Unknown local attributes are no longer warned.
2228    
2229    2007-05-20  Wakaba  <wakaba@suika.fam.cx>
2230    
2231            * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
2232            ($HTMLIMTAttrChecker): New checker.
2233            (link@rel, link@href, link@type, style@type,
2234            a@href, a@ping, a@ping, a@type, embed@src, embed@type,
2235            object@data, object@type, source@src, source@type, area@alt,
2236            area@shape, area@coords, area@href,
2237            area@ping, area@rel, area@type, script@src,
2238            script@defer, script@async, script@type): Checkers added.
2239    
2240    2007-05-20  Wakaba  <wakaba@suika.fam.cx>
2241    
2242            * ContentChecker.pm: Descendant checking was incorrect.
2243    
2244  2007-05-19  Wakaba  <wakaba@suika.fam.cx>  2007-05-19  Wakaba  <wakaba@suika.fam.cx>
2245    
2246          * ContentChecker.pm: Support |xml:*| and |xmlns:*|          * ContentChecker.pm: Support |xml:*| and |xmlns:*|

Legend:
Removed from v.1.17  
changed lines
  Added in v.1.394

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24