/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.129 by wakaba, Sun Nov 25 03:46:07 2007 UTC revision 1.396 by wakaba, Sun Sep 6 08:15:37 2009 UTC
# Line 1  Line 1 
1    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Added |xmp| to the list of Special elements (HTML5
4            revision 3689).  It should make no difference since the |xmp|
5            element has the PCDATA content and cannot be the non-bottommost
6            element in the stack of open elements.
7    
8    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
9    
10            * HTML.pm.src: Set the |confidence| flag to |irrlevant| when the
11            fragment parsing algorithm is invoked (HTML5 revision 2699).
12    
13    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
14    
15            * HTML.pm.src: Simplified foster parenting (HTML5 revision 2732).
16    
17    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
18    
19            * HTML.pm.src: No longer does foster parenting |style|, |script|,
20            and |input type=hidden| elements (HTML5 revision 2988).
21    
22    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
23    
24            * HTML.pm.src: Dropped the |eventsource| element (HTML5 revision
25            2870).
26    
27    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
28    
29            * HTML.pm.src (_tree_construction_main): Treate |keygen| as a void
30            element (HTML5 revision 2960).
31    
32    2009-09-05  Wakaba  <wakaba@suika.fam.cx>
33    
34            * HTML.pm.src: Moved the "in table text" insertion mode code up
35            such that it does also work as specced in HTML5 in the remaining
36            cases (HTML5 revision 3382).
37    
38    2009-09-05  Wakaba  <wakaba@suika.fam.cx>
39    
40            * HTML.pm.src: Implementing the "in table text" insertion mode,
41            first half (HTML5 revision 3382).
42    
43    2009-08-30  Wakaba  <wakaba@suika.fam.cx>
44    
45            * ContentChecker.pm: Allow |xml:lang| in HTML document if the
46            owner element is not an HTML element (HTML5 revision 3697).
47    
48    2009-08-23  Wakaba  <wakaba@suika.fam.cx>
49    
50            * ContentChecker.pm: Allow <input type=hidden> in context where
51            interactive content is disallowed (HTML5 revision 2392).
52    
53    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
54    
55            * HTML.pm.src: Dropped support for "XSLT-compat" and added
56            "about:legacy-compat" (HTML5 revision 2725).
57    
58    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
59    
60            * HTML.pm.src: "<!DOCTYPE>" is now explicitly supported in the
61            spec (HTML5 revision 2700).
62    
63    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
64    
65            * HTML.pm.src: Added support for <script> element in the "in
66            select" and "in select in table" insertion modes (HTML5 revision
67            2500).
68    
69    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
70    
71            * HTML.pm.src: An <hn> start tag in an <hn> element should close
72            the earlier element (HTML5 revision 2501).
73    
74    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
75    
76            * HTML.pm.src: The </br> end tags before the <body> start tag
77            should equally survive as the first br element child of the body
78            element (HTML5 revision 2586).
79    
80    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
81    
82            * HTML.pm.src: <font> in SVG land is handled as an SVG element as
83            long as it did not have HTML attributes (HTML5 revision 2590).
84    
85    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
86    
87            * HTML.pm.src (_tree_construction_main): Improved error reporting
88            on "select" start tag in "in select" and "in select in table"
89            parsing modes (c.f. HTML5 revision 3604).
90    
91    2009-07-26  Wakaba  <wakaba@suika.fam.cx>
92    
93            * mkhtmlclass.pl: Made xmlns:xlink with wrong namespace URL
94            invalid (HTML5 revision 2701).
95    
96    2009-07-26  Wakaba  <wakaba@suika.fam.cx>
97    
98            * HTML.pm.src: </rt>, </rp>, </option>, </optgroup> become
99            optional before </body> (HTML5 revision 2806).
100    
101    2009-07-25  Wakaba  <wakaba@suika.fam.cx>
102    
103            * HTML.pm.src: Marked where SVG |script| execution should be
104            implemented (HTML5 revision 2904).
105    
106    2009-07-25  Wakaba  <wakaba@suika.fam.cx>
107    
108            * HTML.pm.src (set_inner_html): The fragment parsing algorithm
109            inherits the document quirksness flag of the main document (HTML5
110            revision 2935).
111    
112    2009-07-05  Wakaba  <wakaba@suika.fam.cx>
113    
114            * HTML.pm.src (_tree_construction_main): Put |table| in |p| in
115            quirks mode (The Hixie Quirk, HTML5 revision 3132).
116    
117    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
118    
119            * HTML.pm.src: <kegten> breaks out of <select> (HTML5 revision
120            3163).
121    
122    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
123    
124            * HTML.pm.src: Don't change encoding to UTF-8 if the input
125            encoding is UTF-16 (HTML5 revision 3204).
126    
127    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
128    
129            * HTML.pm.src (parse_byte_stream): Don't change to another
130            encoding if the input is UTF-16 (HTML5 revision 3205).
131    
132    2009-06-28  Wakaba  <wakaba@suika.fam.cx>
133    
134            * HTML.pm.src: |isindex| should no longer generate |p| element
135            node (HTML5 revision 3236).
136    
137    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
138    
139            * ContentChecker.pm: Introduced new flag |has_autofocus|.  Added
140            support for new |id_type| value of |any|, which means any kind of
141            element is allowed as a referenced element.
142    
143    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
144    
145            * ContentChecker.pm: Introduced |check_attrs2| method for the ease
146            of defining codes for checking required attribute and as such.
147    
148    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
149    
150            * IMTChecker.pm: Added more definitions for subtypes.
151    
152    2008-12-11  Wakaba  <wakaba@suika.fam.cx>
153    
154            * URIChecker.pm: Some of |pos_end| values were wrong.
155    
156    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
157    
158            * ContentChecker.pm (check_element): Added support for "no
159            referenced datalist" error.
160    
161    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
162    
163            * URIChecker.pm: Bug fix: It did not work unless
164            Message::DOM::DOMImplementation has been |require|d.
165    
166    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
167    
168            * NanoDOM.pm (document_uri): New attribute.
169    
170            * ContentChecker.pm: Don't use methods not implemented by NanoDOM.
171    
172    2008-11-07  Wakaba  <wakaba@suika.fam.cx>
173    
174            * NanoDOM.pm (text_content): Don't create a Text node if the new
175            value is empty.
176    
177    2008-11-06  Wakaba  <wakaba@suika.fam.cx>
178    
179            * SWML/: New directory.
180    
181    2008-10-20  Wakaba  <wakaba@suika.fam.cx>
182    
183            * NanoDOM.pm (specified, all_declarations_processed,
184            manakai_attribute_type): New attributes.
185    
186    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
187    
188            * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
189            array.
190    
191    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
192    
193            * NanoDOM.pm (notation_name): New attribute.
194    
195    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
196    
197            * NanoDOM.pm (public_id, system_id): New attributes.a
198    
199    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
200    
201            * NanoDOM.pm (text_content): Moved to Node from Element.  Setter
202            implemented.
203            (allowed_tokens, default_type, declared_type): Implemented.
204    
205    2008-10-17  Wakaba  <wakaba@suika.fam.cx>
206    
207            * NanoDOM.pm (node_name): New attribute.
208            (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
209            constants.
210            (create_element_type_definition_node, create_attribute_definition,
211            create_notation, create_general_entity,
212            get_element_type_definition_node,
213            set_element_type_definition_node, get_general_entity_node,
214            set_general_entity_node, get_notation_node, set_notation_node,
215            get_attribute_definition_node, set_attribute_definition_node): New
216            methods.
217            (element_types, entities, notations, attribute_definitions): New
218            attributes.
219            (DocumentType): Support for child nodes, entities, notations, and
220            element types.
221            (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
222            New classes.
223    
224            * Dumper.pm: Support for general entities, notations, element type
225            definitions, and attribute definitions.
226    
227    2008-10-15  Wakaba  <wakaba@suika.fam.cx>
228    
229            * NanoDOM.pm (create_processing_instruction): New method.
230            (xml_version, xml_encoding, xml_standalone): New attributes.
231            (ProcessingInstruction): New class.
232    
233    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
234    
235            * HTML.pm.src: Handling of end tags in the foreign content
236            insertion mode was partially wrong, because of wrong bit
237            operations.
238    
239    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
240    
241            * NanoDOM.pm (dom_config): New attribute (do nothing), for
242            Whatpm::XML::Parser support.
243    
244    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
245    
246            * Makefile: New rule to make HTML/Tokenizer.pm is added.
247    
248            * HTML.pm.src: Tokenizer part moved to another file.
249    
250    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
251    
252            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
253    
254    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
255    
256            * HTML.pm.src: Element category constants redefined.
257            
258    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
259    
260            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
261            construction stage synced with the spec (HTML5 revisions 2139 and
262            2302).
263    
264    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
265    
266            * ContentChecker.pm: New error level "html5_fact" added, which
267            should be tentatively used until all of requirements are properly
268            specced as RFC 2119 "MUST" in HTML5.
269    
270    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
271    
272            * ContentChecker.pod: Note on internal flags is added.
273    
274    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
275    
276            * HTML.pm.src: An AAA bug fixed.
277    
278    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
279    
280            * HTML.pm.src: If another node is inserted by the parser, don't
281            reuse existing Text node to append a character (HTML5 revision
282            2124).
283    
284    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
285    
286            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
287            revisions 1731 and 2128).
288    
289    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
290    
291            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
292            Support for end tags of camelCase SVG elements were broken.  A
293            wrong error type text fixed.
294    
295    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
296    
297            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
298    
299    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
300    
301            * HTML.pm.src: Support for new definition of |param| and |source|
302            start tag parsing (HTML5 revision 1731).
303    
304    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
305    
306            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
307    
308    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
309    
310            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
311            revisions 1731 and 1831).
312    
313    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
314    
315            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
316            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
317            revision 1731).
318    
319    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
320    
321            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
322            revision 1731).  End tags of |option| and |optgroup| elements are
323            now optional (HTML5 revision 1731).
324    
325    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
326    
327            * HTML.pm.src: New "special" elements added to the list (HTML5
328            revision 1778).  "strile" -> "strike".
329    
330    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
331    
332            * ContentType.pm (get_sniffed_type): Support for the "better"
333            content sniffing (HTML5 revision 1927).  In a case the official
334            type was not returned when the method is invoked in the list
335            context.
336    
337    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
338    
339            * HTML.pm.src: Character references for non-space C0 characters,
340            including U+000B VT, DEL character, noncharacter code points, are
341            now converted to the U+FFFD character (cf. HTML5 revision 2138).
342    
343    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
344    
345            * ContentChecker.pm: |form=""| check support added.
346    
347    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
348    
349            * ContentChecker.pm: |contextmenu| validness is now checked using
350            |id| and |id_type| properties, and |menu| property is removed.
351    
352    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
353    
354            * ContentChecker.pm: Prepare for |form| |name| attribute's
355            duplication checking.
356    
357    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
358    
359            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
360            unsupported charset="" parameter value (HTML5 revision 2131).
361    
362    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
363    
364            * HTML.pm.src: Reminding places where U+000B is allowed as a space
365            character is fixed (cf. HTML5 revision 1738).
366    
367            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
368            space characters (HTML5 revision 1738).
369    
370    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
371    
372            * HTML.pm.src: The "anything else" case for the "after after body"
373            insertion mode was not updated to swtich to the "in body"
374            insertion mode.  U+000B is no longer a space character for the
375            purpose of tree construction phase (HTML5 revision 1738).
376    
377    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
378    
379            * HTML.pm.src: U+000B is no longer a space character (HTML5
380            revision 1738).
381    
382    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
383    
384            * ContentType.pm: 0x0B is no longer a space character (HTML5
385            revision 1738).
386    
387            * HTML.pm.src: U+000B is no longer a space character for the
388            algorithm for extracting an encoding from a Content-Type (HTML5
389            revision 1738).
390    
391    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
392    
393            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
394    
395    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
396    
397            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
398    
399            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
400            matching.
401    
402    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
403    
404            * LangTag.pm: Warn for private use language subtags.  Error level
405            typos fixed.  Support for Suppress-Script field.
406    
407            * mklangreg.pl: Support for dumping of nested structure.
408    
409    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
410    
411            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
412            recommended case as per RFC 4646.
413    
414    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
415    
416            * LangTag.pm (check_rfc4646_langtag): New method.
417    
418    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
419    
420            * mklangreg.pl: New script.
421    
422            * Makefile: Updated for creation of the module for language subtag
423            registry.
424            
425    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
426    
427            * Makefile: WebIDL.html added.
428    
429            * WebIDL.pod: New documentation.
430    
431    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
432    
433            * WebIDL.pm: Checker's error types are redefined.
434    
435    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
436    
437            * WebIDL.pm: Parser's error types are redefined.  Some forward
438            compatible parsing bugs are fixed.  Some unreachable codes are
439            commented out.
440    
441    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
442    
443            * WebIDL.pm: Support for the reminding extended attributes are
444            added.  It does not satisfy the definition that a forward
445            interface declaration has an extended attribute.  It seems that
446            unless explicitly allowed multiple extended attributes with the
447            same name is not allowed, though it is not explicitly mentioned in
448            the spec.
449    
450    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
451    
452            * WebIDL.pm: Unescapes extended attribute names and extended
453            attribute identifiers.  Preserve whether an extended attribute has
454            an argument list of not.  Support for extended attributes:
455            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
456            NameGetter, NameSetter, and Null.
457            (has_argument_list): New attribute.
458            (idl_text): Stringifies argument lists, if any, even if it is
459            empty.
460    
461    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
462    
463            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
464            |{s_kwd}| in DATA_STATE as default.
465    
466    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
467    
468            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
469            by |{s_kwd}| in DATA_STATE.
470    
471    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
472    
473            * HTML.pm.src: Shorten keys.
474    
475    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
476    
477            * HTML.pm.src: Remove checking for control character, surrogate
478            pair, or noncharacter code points and non-Unicode code
479            points (they should be handled by Whatpm::Charset::UnicodeChecker).
480            (parse_char_stream): Support for the |$get_wrapper| argument and
481            character stream error handlers.
482    
483    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
484    
485            * ContentChecker.pm: Don't call |loda_ns_module|
486            for null-namespace elements/attributes.
487    
488            * HTML.pm.src: Fact out $disallowed_control_chars
489            as a hash.
490    
491    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
492    
493            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
494            and |{next_char}| initializations are moved to initialization
495            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
496            with |parse_char_stream|.
497    
498    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
499    
500            * HTML.pm.src (parse_char_stream): Make |set_next_char|
501            invoke |manakai_read_until|, not only |read|, where
502            possible, to decrease the number of |read| method calls.
503    
504            * mkhtmlparser.pl: Related changes to the aforementioned
505            modification.
506    
507    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
508    
509            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
510            would report character error from now.
511    
512    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
513    
514            * HTML.pm.src: White-space-leaded non-white-space character
515            tokens in "before head insertion mode" was not
516            correctly handled.
517            (set_inner_html): Reimplemented using CharString decodehandle
518            class.  Support for $get_wrapper argument.  Support
519            for |{read_until}| feature.
520    
521    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
522    
523            * HTML.pm.src: Make a "bare ero" error for unknown
524            entities point the "&" character.
525    
526    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
527    
528            * HTML.pm.src: It turns out that U+FFFD don't have to
529            be added to the list of excluded characters.
530    
531    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
532    
533            * HTML.pm.src ($char_onerror): Have character decoder's |line|
534            and |column| a higher priority than the one set by the
535            tokenizer's input handler.
536            ($self->{read_until}): Exclude U+FFFD (but this might
537            not be necessary, since now we do line/column fixup in
538            the character decode handle).
539    
540    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
541    
542            * HTML.pm.src: Use |{read_until}| where possible.
543    
544    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
545    
546            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
547            and |manakai_getc_until| to |manakai_read_until| to
548            reduce the number of string copies.
549    
550    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
551    
552            * HTML.pm.src (parse_char_string): Use newly created
553            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
554            standard feature to |open| a string as a filehandle,
555            since Perl's string filehandle seems not supporting |ungetc|
556            method correctly.
557            (parse_char_stream): Define |{getc_until}| method.
558            (DATA_STATE): Experimental support for |getc_until| feature.
559    
560    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
561    
562            * HTML.pm.src: Check points added to newly added branches.
563    
564    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
565    
566            * HTML.pm.src: Remove |{char}|, which is no longer used.
567            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
568            and replaced by |{prev_state}|.
569    
570            * mkhtmlparser.pl: Remove |{char}| feature.
571            Remove |!!!back-next-input-character;| macro.
572    
573    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
574    
575            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
576            entity related tokenizer states in favor of new states
577            implementing the consume character reference algorithm.
578    
579    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
580    
581            * HTML.pm.src: "Consume a character reference" algorithm is
582            now implemented as a tokenizer's state, rather than
583            a method, with minimum changes (more changes will
584            be made, in due course).  "Bogus comment state"'s inner
585            loop gets removed.
586    
587    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
588    
589            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
590            into their own tokenizer states.
591    
592    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
593    
594            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
595            is split into three states.
596    
597    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
598    
599            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
600            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
601            no longer does the tokenizer have to push back next input
602            characters in those states.
603    
604    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
605    
606            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
607            into four states so that no longer does the tokenizer have to push
608            back next input characters in that state.
609    
610    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
611    
612            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
613            which can be used to insert some wrapper between the character
614            stream handle and the tokenizer.  (It is currently not supported
615            for |set_inner_html| for |Element|s).
616    
617    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
618    
619            * HTML.pm.src: Ignore punctuations in charset names.
620    
621    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
622    
623            * ContentChecker.pm: Support for charset-layer error levels.
624    
625            * HTML.pm.src: Don't specify |text| argument for the
626            |chardecode:fallback| error, since it is not the encoding
627            being used alternatively.
628    
629    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
630    
631            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
632    
633    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
634    
635            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
636    
637    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
638    
639            * HTML.pm.src: Bug fix and sync with the spec with regard
640            to after after frameset insertion mode processing (HTML5
641            revision 1909).  Note that the implementation was wrong
642            per the old spec before the r1909 changes.
643    
644    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
645    
646            * HTMLTable.pm: scope=auto algorithm fix synced with the
647            spec (HTML5 revision 2093).
648            ($process_row): Algorithm step numbers synced with the
649            spec (HTML5 revision 2092).
650    
651    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
652    
653            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
654            revision 2094).
655    
656    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
657    
658            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
659    
660    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
661    
662            * HTML.pm.src: '"' and "'" at the end of attribute
663            name (after another attribute) now raise parse error (HTML5
664            revision 2123).  Empty unquoted attribute values are no
665            longer allowed (HTML5 revision 2122).
666    
667    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
668    
669            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
670            revision 2130).
671    
672    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
673    
674            * ContentChecker.pm: |xml:lang| attribute value must be same
675            as |lang| attribute value for HTML elements (HTML5 revision 2062
676            and so on).
677    
678    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
679    
680            * ContentChecker.pm: Error level definition for |xml_id_error|
681            was missing.
682    
683            * URIChecker.pm: The end of the URL should be marked as the
684            error location for an empty path error.  The position
685            between the userinfo and the port components should be
686            marked as the error location for an empty host error.
687    
688    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
689    
690            * URIChecker.pm: Set parameters representing where in the
691            value the error occurs for errors.  Report unknown
692            address format error in warning level, since address
693            formats are rarely added.  Path segments starting with "/.."
694            were misinterpreted as a dot-segment.
695    
696    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
697    
698            * URIChecker.pm (check_iri_reference): Requires
699            |Message::DOM::DOMImplementation|.
700    
701    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
702    
703            * IMTChecker.pm: Updated for the new error reporting architecture.
704    
705            * ContentChecker.pm: Error levels for IMTs are added.
706    
707    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
708    
709            * H2H.pm (_shift_token): Support for unquoted HTML attribute
710            values.
711    
712    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
713    
714            * CacheManifest.pm: Support for new style of error
715            reports.
716    
717            * HTML.pm.src: Set line=1, column=1 to the document node.
718    
719    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
720    
721            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
722            and URL checkers.  Support for more error levels for bogus
723            langauge tag and URL "standards".
724    
725            * LangTag.pm, URIChecker.pm: Support for new style error
726            level reporting.
727    
728    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
729    
730            * ContentChecker.pm: Support for RDF/XML error levels.
731    
732            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
733            specifying.  Error types are revised.
734    
735    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
736    
737            * ContentChecker.pm: All error reporting method calls are
738            renewed.
739    
740    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
741    
742            * HTML.pm.src: All error type names and "text" parameters
743            are revised.  Use new style for "level" specification.
744    
745            * mkhtmlparser.pl: Use new style for "level" specification.
746    
747    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
748    
749            * WebIDL.pm (parse_char_string): Simplified error
750            reporting process for broken ignored valuetype definition.
751            (Valuetype idl_text): Support for special "DOMString" name.
752    
753    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
754    
755            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
756            terminal of the ScopedName is "DOMString", such that whether
757            the last part of the scoped name is "DOMString" or "_DOMString"
758            later.  It is necessary to determine whether a |typedef|
759            definition should be ignored or not.
760            (parse_char_string): Unescape the identifier of
761            exception members.
762            ($resolve): Return undef for builtin types and sequence<T>
763            types (we might not have to do this, however...).
764            (check): Support checking for Exceptions, Valuetypes,
765            and Typedefs.
766            ($serialize_type): Support for "DOMString::::" syntax.
767            (Typedef idl_text): Output Type as "DOMString" if it
768            is really "DOMString" (i.e. its internal representation
769            is "::DOMString::").
770    
771    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
772    
773            * WebIDL.pm ($resolve): New code, based on resolve code
774            for constant types in the |check| method.
775            (check): Support for checking of attributes, operations, and
776            arguments.
777            (Attribute/Operation idl_text): Exception names in getraises,
778            setraises, and raises clauses is serizlied by |$serialize_type|
779            code.
780    
781    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
782    
783            * WebIDL.pm ($integer): Order of selections are changed to match
784            hexadecimal numbers (the original pattern, taken from the spec,
785            was not work for hexadecimal numbers, because the "0" prefix
786            matches to the [0-7]* part (as an empty string) and therefore
787            it does not match with remaining "x..." part of a "0x..." integer
788            literal.
789            ($get_type): It now returns a string, not an array reference,
790            for regular types and |sequence| types (i.e. it in any case
791            returns a string).
792            ($get_next_token): The second item in the array that represents
793            a integer or float token is now a Perl number value, not the
794            original string representation of the number.
795            (check): Support for const value consistency checking.
796            No extended attribute is defined for constants.
797            (Node subclasses): Use simple strings rather than array references
798            for default data type values.
799            ($serialize_type): Type values are now simple strings.
800            (value): If the new attribute value is a false value, then
801            a FALSE value is set to the attribute.
802    
803    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
804    
805            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
806            in its stringified format ("scoped name" as defined in the
807            spec).  Note that future version of this module should not use
808            array references for type values and the |type_text| attribute
809            should be made obsolete.
810            (parse_char_string): Unescape attribute names.
811            (check): Support for checking of whether inherited interfaces
812            are actually defined or not.  Support for checking of whether
813            interface member identifiers are duplicated or not.
814            ($serialize_type): Scoped names are returned as is.  A future
815            version of this code should escape identifiers other than "DOMString",
816            otherwise the idl_text would be non-conforming.
817    
818    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
819    
820            * WebIDL.pm (parse_char_string): Set line/column numbers
821            to generated nodes.  Unescape identifiers.  Extended attributes
822            for Definition's were ignored.
823            (append_child): Set |parent_node| attribute.
824            (parent_node): New attribute.
825            (check): Support interface/exception members.  Support
826            extended attributes.  Support definition identifier uniqueness
827            constraint.
828            (qualified_name): New attribute.
829            (Interface/Exception idl_text): Extended attributes were
830            not prepended to the returned text.
831    
832    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
833    
834            * WebIDL.pm (parse_char_string): Set line/column numbers
835            to interface object experimentally.  s/shift/pop/g, shift
836            would make things wrong.  Support for interface forward
837            declarations was missing.  Broken interface declarations
838            with no block were not ignored entirely.
839            (Whatpm::WebIDL::Node): New abstract class.  This class
840            makes things easier.
841            (child_nodes): New attribute.  Unlike DOM's attribute with
842            same name, this attribute returns a dead list of nodes for
843            simplicity.
844            (get_user_data, set_user_data): New methods.
845            (Module idl_text): A SPACE character should be inserted
846            before the |{| character.
847            (Interface idl_text): Support for interface forward declarations.
848            (is_forward_declaration): New attribute.
849    
850    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
851    
852            * WebIDL.pm (type_text): Better serializer.
853    
854    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
855    
856            * WebIDL.pm: Revise forward-compatible parsing so that
857            it now can handle broken extended attributes and as such.
858    
859    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
860    
861            * WebIDL.pm: Real support for extended attributes.
862            Support for extended attributes with arguments.
863    
864    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
865    
866            * WebIDL.pm: Support for |exception| syntax.
867            (Interface->idl_text): Tentative support for inheritances.
868    
869    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
870    
871            * WebIDL.pm: Hierarchical scoped name support was broken.
872            Support for raises, setraises, and getraises syntaxes.
873    
874    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
875    
876            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
877            proper support for types, extended attributes, and exceptions yet).
878            WebIDL parser, version 1 (no support for exceptions yet,
879            no proper support for extended attributes yet).
880    
881    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
882    
883            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
884    
885    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
886    
887            * WebIDL.pm: Support for valuetype and const.
888    
889    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
890            
891            * WebIDL.pm: New module.
892    
893    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
894    
895            * Makefile (Entities.html): URI changed.
896    
897    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
898    
899            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
900    
901    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
902    
903            * HTML.pm.src (_get_next_token): A parse error was missing.
904    
905    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
906    
907            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
908            spec (commented out). (HTML5 revision 1711).
909    
910    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
911    
912            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
913    
914            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
915            as UTF-16 (HTML5 revision 1701).
916    
917    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
918    
919            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
920            1692).
921    
922    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
923    
924            * HTML.pm.src: The secondary insertion mode used when switching
925            to foreign content is the "in body" insertion mode (HTML5 revision
926            1696).
927    
928    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
929    
930            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
931            revision 1697).
932    
933    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
934    
935            * HTML.pm.src: Support for end-of-file token in foreign content
936            insertion mode (HTML5 revision 1693).  Update SVG camelCase
937            attribute list (HTML5 revision 1700).  <textarea> closes
938            </select> (HTML5 revision 1699).  More start tags close in
939            foreign content insertion mode (HTML5 revision 1698).
940    
941    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
942    
943            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
944    
945    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
946    
947            * HTML.pm.src: More robust charset parameter detection (HTML5
948            revision 1674).
949    
950    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
951    
952            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
953            revision 1676).
954    
955    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
956    
957            * HTML.pm.src: Ignore language part of public identifiers for
958            quriks mode detection (HTML5 revision 1679).
959    
960    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
961    
962            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
963            revision 1685).
964    
965    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
966    
967            * HTML.pm.src: Support for EOF in new states for tags (HTML5
968            revision 1684).
969    
970    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
971    
972            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
973            work (HTML5 revision 1690).
974    
975    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
976    
977            * HTML.pm.src (_tree_construction_main): Change handling of
978            end tags in head insertion modes (HTML5 revision 1686).
979            (parse_char_string): Bug fix for non-utf8 character string handlings.
980            (parse_char_stream): |ungetc| does not work well for this context.
981    
982    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
983    
984            * HTML.pm.src (parse_byte_string): Redefined to invoke
985            |parse_byte_stream|.
986            (parse_byte_stream): New method.
987    
988    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
989    
990            * HTML.pm.src (parse_byte_string): Fix the column number reported
991            by encoding layer error reporter.
992    
993    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
994    
995            * HTML.pm.src (parse_byte_string): Use streaming decoder
996            rather than converting the whole byte string and then parsing.
997            Propagate errors in character encoding layer.
998            (get_next_token): Precise error reporting for |bare stago| error.
999    
1000    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
1001    
1002            * HTML.pm.src (parse_char_stream): New method.
1003            (parse_char_string): This method is now defined as an invocation
1004            of the |parse_char_stream| method.
1005    
1006    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
1007    
1008            * HTML.pm.src (parse_byte_string): Report various status
1009            of the sniffing as info-level errors.  Support for new
1010            decoding framework in parser resestting.
1011            (new): Various default error levels were not set.
1012    
1013    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
1014    
1015            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
1016            algorithm, except for the actual sniffing, is implemented
1017            with new framework with Message::Charset::Info.
1018    
1019    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
1020    
1021            * CacheManifest.pm (_parse): Drop fragment identifiers from
1022            URIs in fallback section (HTML5 revision 1596).
1023    
1024    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
1025    
1026            * Makefile (Entities.html): URI has changed.
1027    
1028    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
1029    
1030            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
1031            1553).
1032    
1033    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
1034    
1035            * ContentChecker.pm: Noted that those returned in |table| are
1036            no longer table elements, but table objects returned
1037            by Whatpm::HTMLTable.
1038    
1039            * HTMLTable.pm (form_table): Return table element node
1040            as |$table->{element}|.
1041            (assign_header): Support for the |headers=""| attribute.
1042    
1043    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
1044    
1045            * HTMLTable.pm (assign_header): New function; first version
1046            with no support for headers="".
1047            (form_table): Include table width and height to the returned
1048            table object for covenience.  Indexing in column assignement was wrong.
1049            Set whether a data cell is empty or not for convenience.
1050    
1051    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1052    
1053            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
1054    
1055    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1056    
1057            * HTMLTable.pm: How table model errors are detected is
1058            changed (HTML5 revision 1387).
1059    
1060    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1061    
1062            * HTMLTable.pm: The algorithm now moves |tfoot| elements
1063            to the end of the table (HTML5 revision 1380).
1064    
1065    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1066    
1067            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
1068            of 1-based (HTML5 revision 1376).
1069    
1070    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1071    
1072            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
1073            and "Content-Encoding" no longer prevent sniffing (HTML5
1074            revision 1288).
1075    
1076    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1077    
1078            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
1079            1282).
1080    
1081    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
1082    
1083            * ContentChecker.pm: Support for global attributes.
1084            Status of XML specs are added.
1085    
1086    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
1087    
1088            * ContentChecker.pm (check_element): Support for |template=""|
1089            and |ref=""| attribute (referent element type checking).
1090    
1091    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
1092    
1093            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
1094            1500) is implemented (except for IDNA part and URI-scheme-specific
1095            knowledge).  Line number counting was wrong for LF-only
1096            documents.
1097    
1098    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1099    
1100            * HTML.pm.src: Raise an parse error for any disallowed
1101            character (HTML5 revision 1263).
1102    
1103    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1104    
1105            * mkentitylist.pl: Support for new HTML5 entity table format (the
1106            definition for |AElig;| was missing).
1107    
1108    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1109    
1110            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
1111            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
1112    
1113    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1114    
1115            * HTML.pm.src: List of element names that close foreign content
1116            insertion mode is added (HTML5 revisions 1412 and 1418).
1117    
1118    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1119    
1120            * HTML.pm.src: Support for |mglyph| and |malignmark|
1121            elements (HTML5 revision 1410).
1122    
1123    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1124    
1125            * HTML.pm.src: Support for new long MathML entities (HTML5
1126            revision 1406).
1127    
1128    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1129    
1130            * HTML.pm.src: CDATA section support for MathML and SVG
1131            elements (HTML5 revision 1404 and 1420).
1132    
1133    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
1134    
1135            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
1136            elements (HTML5 revision 1404).  Unused !!!macro definitions
1137            are removed.
1138    
1139    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
1140    
1141            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
1142            are raised is changed (HTML5 revision 1404).
1143    
1144    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
1145    
1146            * HTML.pm.src: Category sets that are no longer used are removed.
1147    
1148    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
1149    
1150            * HTML.pm.src: The ->[1] property of stack entries are now
1151            replaced by constants representing element category.
1152    
1153    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
1154    
1155            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
1156            for error reporting.  (This is a preparation for using constant
1157            value for ->[1].)
1158    
1159    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
1160    
1161            * RDFXML.pm: Typo fixed.
1162    
1163    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
1164    
1165            * ContentChecker.pm: |fact_level| is now treated
1166            as same as |must_level|, i.e. level = |m|.
1167            (check_element): Make list of URIs in the DOM.
1168    
1169    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1170    
1171            * RDFXML.pm: Language accessor implemented.
1172            Local (null-namespace) attribute support.
1173    
1174    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1175    
1176            * RDFXML.pm: Factored out ID checking code.
1177    
1178    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1179    
1180            * RDFXML.pm: TODO items noted.  Validation of ID
1181            and URI attributes is implemented.  Warn if unknown
1182            value is used in rdf:parseType="" attribute.
1183    
1184            * URIChecker.pm (check_rdf_uri_reference): New function.
1185    
1186    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1187    
1188            * RDFXML.pm: bnodeid implemented.  Relative references
1189            are now resolved.
1190    
1191    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1192    
1193            * ContentChecker.pm: RDF reification implemented.
1194    
1195            * RDFXML.pm: undef vs false bug fixed.
1196            Reification implemented.
1197    
1198    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1199    
1200            * RDFXML.pm: s/id/ID/ for attribute name.
1201            The |node| arguments are added for |ontriple| calls.
1202            Too many "attribute not allowed" errors were raised.
1203    
1204            * ContentChecker.pm: Initial experimental support for rdf:RDF
1205            element.
1206    
1207    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1208    
1209            * RDFXML.pm: New module.
1210    
1211    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1212    
1213            * HTML.pm.src (set_inner_html): Line/column number
1214            code was old one yet.
1215    
1216    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1217    
1218            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
1219            errors.  Line/column markings for DOCTYPE, comment, and
1220            character tokens are reintroduced; otherwise,
1221            error location for "not HTML5" error and errors
1222            for implied elements are not attached.
1223    
1224    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1225    
1226            * HTML.pm.src: Set line/column numbers to attributes.
1227    
1228            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
1229            (value): Setter implemented.
1230    
1231            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
1232    
1233    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1234    
1235            * HTML.pm.src: Unused line/column markings are removed.
1236    
1237    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1238    
1239            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
1240            flag, which is no longer used.
1241    
1242    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
1243    
1244            * HTML.pm.src: Set line/column information to element nodes.
1245    
1246            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
1247            and !!!insert-element-t): Set line/column information to
1248            element nodes.
1249    
1250    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
1251    
1252            * HTML.pm.src (_get_next_token): The first "<" character
1253            in "<?", "<>", or "</>" should be the error point.
1254    
1255    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1256    
1257            * HTML.pm.src: Some more fixes on error position reporting.
1258    
1259    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1260    
1261            * HTML.pm.src: Token-level precious error reporting.
1262    
1263    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1264    
1265            * HTML.pm.src: Preparation for more precious error point
1266            reporting.
1267    
1268    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1269    
1270            * HTML.pm.src: Error type revised.
1271    
1272    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1273    
1274            * HTML.pm.src: Similar codes are merged together, again.
1275    
1276    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1277    
1278            * HTML.pm.src: Similar codes are merged together.
1279    
1280    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
1281    
1282            * mkhtmlparser.pl: Set "level" parameter to parse errors.
1283    
1284            * HTML.pm.src: Code refined.
1285    
1286    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1287    
1288            * HTML.pm.src: |</body>| treatement has been changed (HTML5
1289            revision 1348).  Note that I really don't know this makes
1290            any difference in the black-box behavior of the parser.
1291    
1292    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1293    
1294            * HTML.pm.src: New end-of-file token implementation (HTML5
1295            revision 1348).
1296    
1297    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1298    
1299            * HTML.pm.src: |applet| support (HTML5 revision 1347).
1300    
1301    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1302    
1303            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1304    
1305    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1306    
1307            * HTML.pm.src: Support for |<input>| in the "in select" insertion
1308            mode and support for the "in select in table" insertion mode (HTML5
1309            revision 1342).
1310    
1311    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1312    
1313            * HTML.pm.src: No foster parenting for <script> and <script>
1314            in non-tainted <table>s (HTML5 revision 1336).
1315    
1316    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1317    
1318            * HTML.pm.src: Ignore white space characters between <html>
1319            and <head> (HTML5 revision 1332).
1320    
1321    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1322    
1323            * HTML.pm.src: Treat <input type=hidden> as if it were a
1324            white space (HTML5 revision 1331).
1325    
1326    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1327    
1328            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1329            element (HTML5 revision 1330).
1330    
1331    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1332    
1333            * HTML.pm.src: <title> is always appended to the current
1334            element (HTML5 revision 1328).
1335    
1336    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1337    
1338            * HTML.pm.src: White space in tainted tables are moved
1339            into foster parents (HTML5 revision 1326).
1340    
1341    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1342    
1343            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1344            revision 1321).
1345    
1346    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1347    
1348            * HTML.pm.src: |</h/n/>| case code rearranged to align with
1349            the spec (HTML5 revision 1320).  Note that we finally complete
1350            all of HTML5 revision 1320 changes.
1351    
1352    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1353    
1354            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1355            tags (HTML5 revision 1320).
1356    
1357    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1358    
1359            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1360    
1361    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1362    
1363            * HTML.pm.src: A "generate implied end tags" code (t409.1)
1364            could not be reached so that it is now removed (HTML5 revision
1365            1320).
1366    
1367    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1368    
1369            * HTML.pm.src: Code for the case of |</div>| and so on are revised
1370            to align with new spec text (HTML5 revision 1320).
1371    
1372    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1373    
1374            * HTML.pm.src: Remove strange |if| condition; however, it should
1375            have had no harm in theory.
1376    
1377    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1378    
1379            * HTML.pm.src (_tree_construction_main): '</p>' in body
1380            case is split from other end tags for the preparation
1381            of implementing HTML revision 1320.
1382    
1383    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1384    
1385            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1386            revision 1320).
1387    
1388    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1389    
1390            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1391            and "div" (HTML5 revision 1318).  Add comments to where
1392            |form| pointer association codes should be inserted (HTML5 revision
1393            1319).
1394    
1395    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
1396    
1397            * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1398    
1399    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1400    
1401            * HTML.pm.src: Since the case t268 should never be reached (no
1402            other token type, there are), it is replaced by a |die| statement.
1403    
1404    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1405    
1406            * HTML.pm.src: Typo fixed.
1407    
1408    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
1409    
1410            * HTML.pm.src (_tree_construction_initial): Some limited quirks
1411            doctypes were not uppercased for comparison.
1412    
1413    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1414    
1415            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1416            are added.
1417    
1418    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1419    
1420            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1421            are set.  Cases that are unlikely reached are noted as so.
1422    
1423    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1424    
1425            * HTML.pm.src: Checkpoints for debugging are added.
1426    
1427            * mkhtmlparser.pl: Support for |!!!cp| syntax.
1428    
1429    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1430    
1431            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1432            for simplicity.
1433    
1434    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1435    
1436            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1437            revision 1307).
1438    
1439    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1440    
1441            * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1442            is set are changed (HTML5 revision 1305).
1443    
1444    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1445    
1446            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1447            1292).  Entities are not parsed in comment-like part in RCDATA
1448            elements (HTML5 revision 1294).  Allow bare & at the end
1449            of attribute value literals (HTML5 revision 1296).  More
1450            quirks mode doctypes (HTML5 revision 1302).  Requires spaces
1451            between attributes and ban attribute names or unquoted
1452            attribute values containing single or double quotes (HTML5
1453            revision 1303).
1454    
1455    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1456    
1457            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
1458            and related errors unless it is an HTML document (though the spec
1459            is unclear on whether it is applied to XHTML document).
1460    
1461            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1462            are added.
1463    
1464    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1465    
1466            * HTML.pm.src: s/local_name/manakai_local_name/g.
1467    
1468    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
1469    
1470            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1471    
1472            * HTML.pm.src: |charset| in |content| attribute is
1473            case-insensitive (HTML5 revision 1270).
1474    
1475    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
1476    
1477            * ContentChecker.pm: New status constants are added.
1478            ($ElementDefault): |status| added.
1479            (check_element): Err for non-standard or deprecated elements.
1480            (_attr_status_info): For non-standard or deprecated attributes.
1481    
1482    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1483    
1484            * ContentChecker.pm (_attr_status_info): New internal method.
1485    
1486    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1487    
1488            * ContentChecker.pm (check_element): Element standardized
1489            status information is now dispatched.
1490    
1491    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1492    
1493            * ContentChecker.pm (check_element): Fix |del|-and-significant
1494            problem by adding some more arguments.
1495    
1496    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1497    
1498            * ContentChecker.pm (check_element): Use context of
1499            container-for-the-purpose-of-content-model element (not transparent
1500            element) for |check_child_element| calling and significant
1501            text flag marking.  This reintroduces |<del>|-and-significant
1502            problem again.
1503    
1504    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1505    
1506            * ContentChecker.pm (check_element): Make semi-transparent
1507            elements ignored for the purpose of phase changes in
1508            content model checking.
1509    
1510    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1511    
1512            * ContentChecker.pm (check_element): In-element state
1513            was not properly managed for transparent cases.
1514    
1515    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1516    
1517            * ContentChecker.pm (check_element): Support for |video|
1518            and |audio| as semi-transparent elements.
1519    
1520    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1521    
1522            * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1523            (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1524            Support for |html:object| as a semi-transparent element.
1525    
1526    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1527    
1528            * ContentChecker.pm (check_element): The way to traverse
1529            the tree is entirely revised to make it easier to track
1530            the state of ancestors/descendants.  As a result of this
1531            revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1532            support for content model checking for HTML elements |figure|,
1533            |object|, |video|, and |audio| and checking for XML elements (and
1534            some XMLNS checkings) are dropped for now.  They will be
1535            reimplemented in due cource.
1536    
1537    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1538    
1539            * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1540            token (HTML5 revision 1225).
1541    
1542    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1543    
1544            * ContentChecker.pm ({unsupported_level}): New value.
1545    
1546            * HTML.pm.src: Save whether |meta| |content| attribute
1547            contains character references or not.
1548    
1549    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1550    
1551            * ContentChecker.pm (_get_children): (Incompleted) attempt to
1552            imlement significant content checking for contents
1553            with |del| elements.
1554    
1555    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1556    
1557            * ContenteChecker.pm ($HTMLTransparentElements): More
1558            elements are added.
1559            (_get_children): HTML |object| elements are now semi-transparent.
1560    
1561            * NanoDOM.pm (manakai_html, manakai_head): New methods.
1562    
1563    2008-02-16  Wakaba  <wakaba@suika.fam.cx>
1564    
1565            * CacheManifest.pm: HTML5 revision 1211 implemented.
1566    
1567            * CacheManifest.pod: Updated.
1568    
1569    2008-02-10  Wakaba  <wakaba@suika.fam.cx>
1570    
1571            * ContentChecker.pm (check_document, check_element): Support
1572            for second argument ($onsubdoc).
1573            (_get_css_parser): Removed (now it is part of WDCC).
1574    
1575    2008-02-09  Wakaba  <wakaba@suika.fam.cx>
1576    
1577            * ContentChecker.pm (_get_css_parser): New.
1578    
1579    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1580    
1581            * ContentChecker.pm ($AnyChecker): Old way to add child elements
1582            for checking had been used.
1583    
1584  2007-11-25  Wakaba  <wakaba@suika.fam.cx>  2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1585    
1586          * ContentChecker.pm (check_element): New todo item type |descendant|.          * ContentChecker.pm (check_element): New todo item type |descendant|.

Legend:
Removed from v.1.129  
changed lines
  Added in v.1.396

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24