/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.48 by wakaba, Sat Jun 23 16:42:43 2007 UTC revision 1.400 by wakaba, Sun Sep 6 10:21:13 2009 UTC
# Line 1  Line 1 
1    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: |<hr/>| was incorrectly marked as an error.
4    
5    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
6    
7            * HTML.pm.src: Changed the handling of </html> end tag of "in
8            body" insertion mode according to the latest spec definition.
9    
10    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
11    
12            * HTML.pm.src: Rewrote end tag handling in |head| area (cf. HTML5
13            revision 2730, but it was entirely broken, maybe I missed some
14            spec changes before rev.2730).
15    
16    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
17    
18            * HTML.pm.src: Added the |hgroup| element (HTML5 revision 3039 and
19            HTML5 revision 3040).
20    
21    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
22    
23            * HTML.pm.src: Added |xmp| to the list of Special elements (HTML5
24            revision 3689).  It should make no difference since the |xmp|
25            element has the PCDATA content and cannot be the non-bottommost
26            element in the stack of open elements.
27    
28    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
29    
30            * HTML.pm.src: Set the |confidence| flag to |irrlevant| when the
31            fragment parsing algorithm is invoked (HTML5 revision 2699).
32    
33    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
34    
35            * HTML.pm.src: Simplified foster parenting (HTML5 revision 2732).
36    
37    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
38    
39            * HTML.pm.src: No longer does foster parenting |style|, |script|,
40            and |input type=hidden| elements (HTML5 revision 2988).
41    
42    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
43    
44            * HTML.pm.src: Dropped the |eventsource| element (HTML5 revision
45            2870).
46    
47    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
48    
49            * HTML.pm.src (_tree_construction_main): Treate |keygen| as a void
50            element (HTML5 revision 2960).
51    
52    2009-09-05  Wakaba  <wakaba@suika.fam.cx>
53    
54            * HTML.pm.src: Moved the "in table text" insertion mode code up
55            such that it does also work as specced in HTML5 in the remaining
56            cases (HTML5 revision 3382).
57    
58    2009-09-05  Wakaba  <wakaba@suika.fam.cx>
59    
60            * HTML.pm.src: Implementing the "in table text" insertion mode,
61            first half (HTML5 revision 3382).
62    
63    2009-08-30  Wakaba  <wakaba@suika.fam.cx>
64    
65            * ContentChecker.pm: Allow |xml:lang| in HTML document if the
66            owner element is not an HTML element (HTML5 revision 3697).
67    
68    2009-08-23  Wakaba  <wakaba@suika.fam.cx>
69    
70            * ContentChecker.pm: Allow <input type=hidden> in context where
71            interactive content is disallowed (HTML5 revision 2392).
72    
73    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
74    
75            * HTML.pm.src: Dropped support for "XSLT-compat" and added
76            "about:legacy-compat" (HTML5 revision 2725).
77    
78    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
79    
80            * HTML.pm.src: "<!DOCTYPE>" is now explicitly supported in the
81            spec (HTML5 revision 2700).
82    
83    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
84    
85            * HTML.pm.src: Added support for <script> element in the "in
86            select" and "in select in table" insertion modes (HTML5 revision
87            2500).
88    
89    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
90    
91            * HTML.pm.src: An <hn> start tag in an <hn> element should close
92            the earlier element (HTML5 revision 2501).
93    
94    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
95    
96            * HTML.pm.src: The </br> end tags before the <body> start tag
97            should equally survive as the first br element child of the body
98            element (HTML5 revision 2586).
99    
100    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
101    
102            * HTML.pm.src: <font> in SVG land is handled as an SVG element as
103            long as it did not have HTML attributes (HTML5 revision 2590).
104    
105    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
106    
107            * HTML.pm.src (_tree_construction_main): Improved error reporting
108            on "select" start tag in "in select" and "in select in table"
109            parsing modes (c.f. HTML5 revision 3604).
110    
111    2009-07-26  Wakaba  <wakaba@suika.fam.cx>
112    
113            * mkhtmlclass.pl: Made xmlns:xlink with wrong namespace URL
114            invalid (HTML5 revision 2701).
115    
116    2009-07-26  Wakaba  <wakaba@suika.fam.cx>
117    
118            * HTML.pm.src: </rt>, </rp>, </option>, </optgroup> become
119            optional before </body> (HTML5 revision 2806).
120    
121    2009-07-25  Wakaba  <wakaba@suika.fam.cx>
122    
123            * HTML.pm.src: Marked where SVG |script| execution should be
124            implemented (HTML5 revision 2904).
125    
126    2009-07-25  Wakaba  <wakaba@suika.fam.cx>
127    
128            * HTML.pm.src (set_inner_html): The fragment parsing algorithm
129            inherits the document quirksness flag of the main document (HTML5
130            revision 2935).
131    
132    2009-07-05  Wakaba  <wakaba@suika.fam.cx>
133    
134            * HTML.pm.src (_tree_construction_main): Put |table| in |p| in
135            quirks mode (The Hixie Quirk, HTML5 revision 3132).
136    
137    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
138    
139            * HTML.pm.src: <kegten> breaks out of <select> (HTML5 revision
140            3163).
141    
142    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
143    
144            * HTML.pm.src: Don't change encoding to UTF-8 if the input
145            encoding is UTF-16 (HTML5 revision 3204).
146    
147    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
148    
149            * HTML.pm.src (parse_byte_stream): Don't change to another
150            encoding if the input is UTF-16 (HTML5 revision 3205).
151    
152    2009-06-28  Wakaba  <wakaba@suika.fam.cx>
153    
154            * HTML.pm.src: |isindex| should no longer generate |p| element
155            node (HTML5 revision 3236).
156    
157    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
158    
159            * ContentChecker.pm: Introduced new flag |has_autofocus|.  Added
160            support for new |id_type| value of |any|, which means any kind of
161            element is allowed as a referenced element.
162    
163    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
164    
165            * ContentChecker.pm: Introduced |check_attrs2| method for the ease
166            of defining codes for checking required attribute and as such.
167    
168    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
169    
170            * IMTChecker.pm: Added more definitions for subtypes.
171    
172    2008-12-11  Wakaba  <wakaba@suika.fam.cx>
173    
174            * URIChecker.pm: Some of |pos_end| values were wrong.
175    
176    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
177    
178            * ContentChecker.pm (check_element): Added support for "no
179            referenced datalist" error.
180    
181    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
182    
183            * URIChecker.pm: Bug fix: It did not work unless
184            Message::DOM::DOMImplementation has been |require|d.
185    
186    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
187    
188            * NanoDOM.pm (document_uri): New attribute.
189    
190            * ContentChecker.pm: Don't use methods not implemented by NanoDOM.
191    
192    2008-11-07  Wakaba  <wakaba@suika.fam.cx>
193    
194            * NanoDOM.pm (text_content): Don't create a Text node if the new
195            value is empty.
196    
197    2008-11-06  Wakaba  <wakaba@suika.fam.cx>
198    
199            * SWML/: New directory.
200    
201    2008-10-20  Wakaba  <wakaba@suika.fam.cx>
202    
203            * NanoDOM.pm (specified, all_declarations_processed,
204            manakai_attribute_type): New attributes.
205    
206    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
207    
208            * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
209            array.
210    
211    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
212    
213            * NanoDOM.pm (notation_name): New attribute.
214    
215    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
216    
217            * NanoDOM.pm (public_id, system_id): New attributes.a
218    
219    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
220    
221            * NanoDOM.pm (text_content): Moved to Node from Element.  Setter
222            implemented.
223            (allowed_tokens, default_type, declared_type): Implemented.
224    
225    2008-10-17  Wakaba  <wakaba@suika.fam.cx>
226    
227            * NanoDOM.pm (node_name): New attribute.
228            (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
229            constants.
230            (create_element_type_definition_node, create_attribute_definition,
231            create_notation, create_general_entity,
232            get_element_type_definition_node,
233            set_element_type_definition_node, get_general_entity_node,
234            set_general_entity_node, get_notation_node, set_notation_node,
235            get_attribute_definition_node, set_attribute_definition_node): New
236            methods.
237            (element_types, entities, notations, attribute_definitions): New
238            attributes.
239            (DocumentType): Support for child nodes, entities, notations, and
240            element types.
241            (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
242            New classes.
243    
244            * Dumper.pm: Support for general entities, notations, element type
245            definitions, and attribute definitions.
246    
247    2008-10-15  Wakaba  <wakaba@suika.fam.cx>
248    
249            * NanoDOM.pm (create_processing_instruction): New method.
250            (xml_version, xml_encoding, xml_standalone): New attributes.
251            (ProcessingInstruction): New class.
252    
253    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
254    
255            * HTML.pm.src: Handling of end tags in the foreign content
256            insertion mode was partially wrong, because of wrong bit
257            operations.
258    
259    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
260    
261            * NanoDOM.pm (dom_config): New attribute (do nothing), for
262            Whatpm::XML::Parser support.
263    
264    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
265    
266            * Makefile: New rule to make HTML/Tokenizer.pm is added.
267    
268            * HTML.pm.src: Tokenizer part moved to another file.
269    
270    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
271    
272            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
273    
274    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
275    
276            * HTML.pm.src: Element category constants redefined.
277            
278    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
279    
280            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
281            construction stage synced with the spec (HTML5 revisions 2139 and
282            2302).
283    
284    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
285    
286            * ContentChecker.pm: New error level "html5_fact" added, which
287            should be tentatively used until all of requirements are properly
288            specced as RFC 2119 "MUST" in HTML5.
289    
290    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
291    
292            * ContentChecker.pod: Note on internal flags is added.
293    
294    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
295    
296            * HTML.pm.src: An AAA bug fixed.
297    
298    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
299    
300            * HTML.pm.src: If another node is inserted by the parser, don't
301            reuse existing Text node to append a character (HTML5 revision
302            2124).
303    
304    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
305    
306            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
307            revisions 1731 and 2128).
308    
309    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
310    
311            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
312            Support for end tags of camelCase SVG elements were broken.  A
313            wrong error type text fixed.
314    
315    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
316    
317            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
318    
319    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
320    
321            * HTML.pm.src: Support for new definition of |param| and |source|
322            start tag parsing (HTML5 revision 1731).
323    
324    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
325    
326            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
327    
328    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
329    
330            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
331            revisions 1731 and 1831).
332    
333    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
334    
335            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
336            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
337            revision 1731).
338    
339    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
340    
341            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
342            revision 1731).  End tags of |option| and |optgroup| elements are
343            now optional (HTML5 revision 1731).
344    
345    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
346    
347            * HTML.pm.src: New "special" elements added to the list (HTML5
348            revision 1778).  "strile" -> "strike".
349    
350    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
351    
352            * ContentType.pm (get_sniffed_type): Support for the "better"
353            content sniffing (HTML5 revision 1927).  In a case the official
354            type was not returned when the method is invoked in the list
355            context.
356    
357    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
358    
359            * HTML.pm.src: Character references for non-space C0 characters,
360            including U+000B VT, DEL character, noncharacter code points, are
361            now converted to the U+FFFD character (cf. HTML5 revision 2138).
362    
363    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
364    
365            * ContentChecker.pm: |form=""| check support added.
366    
367    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
368    
369            * ContentChecker.pm: |contextmenu| validness is now checked using
370            |id| and |id_type| properties, and |menu| property is removed.
371    
372    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
373    
374            * ContentChecker.pm: Prepare for |form| |name| attribute's
375            duplication checking.
376    
377    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
378    
379            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
380            unsupported charset="" parameter value (HTML5 revision 2131).
381    
382    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
383    
384            * HTML.pm.src: Reminding places where U+000B is allowed as a space
385            character is fixed (cf. HTML5 revision 1738).
386    
387            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
388            space characters (HTML5 revision 1738).
389    
390    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
391    
392            * HTML.pm.src: The "anything else" case for the "after after body"
393            insertion mode was not updated to swtich to the "in body"
394            insertion mode.  U+000B is no longer a space character for the
395            purpose of tree construction phase (HTML5 revision 1738).
396    
397    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
398    
399            * HTML.pm.src: U+000B is no longer a space character (HTML5
400            revision 1738).
401    
402    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
403    
404            * ContentType.pm: 0x0B is no longer a space character (HTML5
405            revision 1738).
406    
407            * HTML.pm.src: U+000B is no longer a space character for the
408            algorithm for extracting an encoding from a Content-Type (HTML5
409            revision 1738).
410    
411    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
412    
413            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
414    
415    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
416    
417            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
418    
419            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
420            matching.
421    
422    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
423    
424            * LangTag.pm: Warn for private use language subtags.  Error level
425            typos fixed.  Support for Suppress-Script field.
426    
427            * mklangreg.pl: Support for dumping of nested structure.
428    
429    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
430    
431            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
432            recommended case as per RFC 4646.
433    
434    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
435    
436            * LangTag.pm (check_rfc4646_langtag): New method.
437    
438    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
439    
440            * mklangreg.pl: New script.
441    
442            * Makefile: Updated for creation of the module for language subtag
443            registry.
444            
445    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
446    
447            * Makefile: WebIDL.html added.
448    
449            * WebIDL.pod: New documentation.
450    
451    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
452    
453            * WebIDL.pm: Checker's error types are redefined.
454    
455    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
456    
457            * WebIDL.pm: Parser's error types are redefined.  Some forward
458            compatible parsing bugs are fixed.  Some unreachable codes are
459            commented out.
460    
461    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
462    
463            * WebIDL.pm: Support for the reminding extended attributes are
464            added.  It does not satisfy the definition that a forward
465            interface declaration has an extended attribute.  It seems that
466            unless explicitly allowed multiple extended attributes with the
467            same name is not allowed, though it is not explicitly mentioned in
468            the spec.
469    
470    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
471    
472            * WebIDL.pm: Unescapes extended attribute names and extended
473            attribute identifiers.  Preserve whether an extended attribute has
474            an argument list of not.  Support for extended attributes:
475            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
476            NameGetter, NameSetter, and Null.
477            (has_argument_list): New attribute.
478            (idl_text): Stringifies argument lists, if any, even if it is
479            empty.
480    
481    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
482    
483            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
484            |{s_kwd}| in DATA_STATE as default.
485    
486    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
487    
488            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
489            by |{s_kwd}| in DATA_STATE.
490    
491    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
492    
493            * HTML.pm.src: Shorten keys.
494    
495    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
496    
497            * HTML.pm.src: Remove checking for control character, surrogate
498            pair, or noncharacter code points and non-Unicode code
499            points (they should be handled by Whatpm::Charset::UnicodeChecker).
500            (parse_char_stream): Support for the |$get_wrapper| argument and
501            character stream error handlers.
502    
503    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
504    
505            * ContentChecker.pm: Don't call |loda_ns_module|
506            for null-namespace elements/attributes.
507    
508            * HTML.pm.src: Fact out $disallowed_control_chars
509            as a hash.
510    
511    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
512    
513            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
514            and |{next_char}| initializations are moved to initialization
515            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
516            with |parse_char_stream|.
517    
518    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
519    
520            * HTML.pm.src (parse_char_stream): Make |set_next_char|
521            invoke |manakai_read_until|, not only |read|, where
522            possible, to decrease the number of |read| method calls.
523    
524            * mkhtmlparser.pl: Related changes to the aforementioned
525            modification.
526    
527    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
528    
529            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
530            would report character error from now.
531    
532    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
533    
534            * HTML.pm.src: White-space-leaded non-white-space character
535            tokens in "before head insertion mode" was not
536            correctly handled.
537            (set_inner_html): Reimplemented using CharString decodehandle
538            class.  Support for $get_wrapper argument.  Support
539            for |{read_until}| feature.
540    
541    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
542    
543            * HTML.pm.src: Make a "bare ero" error for unknown
544            entities point the "&" character.
545    
546    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
547    
548            * HTML.pm.src: It turns out that U+FFFD don't have to
549            be added to the list of excluded characters.
550    
551    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
552    
553            * HTML.pm.src ($char_onerror): Have character decoder's |line|
554            and |column| a higher priority than the one set by the
555            tokenizer's input handler.
556            ($self->{read_until}): Exclude U+FFFD (but this might
557            not be necessary, since now we do line/column fixup in
558            the character decode handle).
559    
560    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
561    
562            * HTML.pm.src: Use |{read_until}| where possible.
563    
564    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
565    
566            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
567            and |manakai_getc_until| to |manakai_read_until| to
568            reduce the number of string copies.
569    
570    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
571    
572            * HTML.pm.src (parse_char_string): Use newly created
573            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
574            standard feature to |open| a string as a filehandle,
575            since Perl's string filehandle seems not supporting |ungetc|
576            method correctly.
577            (parse_char_stream): Define |{getc_until}| method.
578            (DATA_STATE): Experimental support for |getc_until| feature.
579    
580    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
581    
582            * HTML.pm.src: Check points added to newly added branches.
583    
584    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
585    
586            * HTML.pm.src: Remove |{char}|, which is no longer used.
587            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
588            and replaced by |{prev_state}|.
589    
590            * mkhtmlparser.pl: Remove |{char}| feature.
591            Remove |!!!back-next-input-character;| macro.
592    
593    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
594    
595            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
596            entity related tokenizer states in favor of new states
597            implementing the consume character reference algorithm.
598    
599    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
600    
601            * HTML.pm.src: "Consume a character reference" algorithm is
602            now implemented as a tokenizer's state, rather than
603            a method, with minimum changes (more changes will
604            be made, in due course).  "Bogus comment state"'s inner
605            loop gets removed.
606    
607    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
608    
609            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
610            into their own tokenizer states.
611    
612    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
613    
614            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
615            is split into three states.
616    
617    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
618    
619            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
620            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
621            no longer does the tokenizer have to push back next input
622            characters in those states.
623    
624    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
625    
626            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
627            into four states so that no longer does the tokenizer have to push
628            back next input characters in that state.
629    
630    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
631    
632            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
633            which can be used to insert some wrapper between the character
634            stream handle and the tokenizer.  (It is currently not supported
635            for |set_inner_html| for |Element|s).
636    
637    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
638    
639            * HTML.pm.src: Ignore punctuations in charset names.
640    
641    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
642    
643            * ContentChecker.pm: Support for charset-layer error levels.
644    
645            * HTML.pm.src: Don't specify |text| argument for the
646            |chardecode:fallback| error, since it is not the encoding
647            being used alternatively.
648    
649    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
650    
651            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
652    
653    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
654    
655            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
656    
657    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
658    
659            * HTML.pm.src: Bug fix and sync with the spec with regard
660            to after after frameset insertion mode processing (HTML5
661            revision 1909).  Note that the implementation was wrong
662            per the old spec before the r1909 changes.
663    
664    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
665    
666            * HTMLTable.pm: scope=auto algorithm fix synced with the
667            spec (HTML5 revision 2093).
668            ($process_row): Algorithm step numbers synced with the
669            spec (HTML5 revision 2092).
670    
671    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
672    
673            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
674            revision 2094).
675    
676    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
677    
678            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
679    
680    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
681    
682            * HTML.pm.src: '"' and "'" at the end of attribute
683            name (after another attribute) now raise parse error (HTML5
684            revision 2123).  Empty unquoted attribute values are no
685            longer allowed (HTML5 revision 2122).
686    
687    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
688    
689            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
690            revision 2130).
691    
692    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
693    
694            * ContentChecker.pm: |xml:lang| attribute value must be same
695            as |lang| attribute value for HTML elements (HTML5 revision 2062
696            and so on).
697    
698    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
699    
700            * ContentChecker.pm: Error level definition for |xml_id_error|
701            was missing.
702    
703            * URIChecker.pm: The end of the URL should be marked as the
704            error location for an empty path error.  The position
705            between the userinfo and the port components should be
706            marked as the error location for an empty host error.
707    
708    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
709    
710            * URIChecker.pm: Set parameters representing where in the
711            value the error occurs for errors.  Report unknown
712            address format error in warning level, since address
713            formats are rarely added.  Path segments starting with "/.."
714            were misinterpreted as a dot-segment.
715    
716    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
717    
718            * URIChecker.pm (check_iri_reference): Requires
719            |Message::DOM::DOMImplementation|.
720    
721    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
722    
723            * IMTChecker.pm: Updated for the new error reporting architecture.
724    
725            * ContentChecker.pm: Error levels for IMTs are added.
726    
727    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
728    
729            * H2H.pm (_shift_token): Support for unquoted HTML attribute
730            values.
731    
732    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
733    
734            * CacheManifest.pm: Support for new style of error
735            reports.
736    
737            * HTML.pm.src: Set line=1, column=1 to the document node.
738    
739    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
740    
741            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
742            and URL checkers.  Support for more error levels for bogus
743            langauge tag and URL "standards".
744    
745            * LangTag.pm, URIChecker.pm: Support for new style error
746            level reporting.
747    
748    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
749    
750            * ContentChecker.pm: Support for RDF/XML error levels.
751    
752            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
753            specifying.  Error types are revised.
754    
755    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
756    
757            * ContentChecker.pm: All error reporting method calls are
758            renewed.
759    
760    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
761    
762            * HTML.pm.src: All error type names and "text" parameters
763            are revised.  Use new style for "level" specification.
764    
765            * mkhtmlparser.pl: Use new style for "level" specification.
766    
767    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
768    
769            * WebIDL.pm (parse_char_string): Simplified error
770            reporting process for broken ignored valuetype definition.
771            (Valuetype idl_text): Support for special "DOMString" name.
772    
773    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
774    
775            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
776            terminal of the ScopedName is "DOMString", such that whether
777            the last part of the scoped name is "DOMString" or "_DOMString"
778            later.  It is necessary to determine whether a |typedef|
779            definition should be ignored or not.
780            (parse_char_string): Unescape the identifier of
781            exception members.
782            ($resolve): Return undef for builtin types and sequence<T>
783            types (we might not have to do this, however...).
784            (check): Support checking for Exceptions, Valuetypes,
785            and Typedefs.
786            ($serialize_type): Support for "DOMString::::" syntax.
787            (Typedef idl_text): Output Type as "DOMString" if it
788            is really "DOMString" (i.e. its internal representation
789            is "::DOMString::").
790    
791    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
792    
793            * WebIDL.pm ($resolve): New code, based on resolve code
794            for constant types in the |check| method.
795            (check): Support for checking of attributes, operations, and
796            arguments.
797            (Attribute/Operation idl_text): Exception names in getraises,
798            setraises, and raises clauses is serizlied by |$serialize_type|
799            code.
800    
801    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
802    
803            * WebIDL.pm ($integer): Order of selections are changed to match
804            hexadecimal numbers (the original pattern, taken from the spec,
805            was not work for hexadecimal numbers, because the "0" prefix
806            matches to the [0-7]* part (as an empty string) and therefore
807            it does not match with remaining "x..." part of a "0x..." integer
808            literal.
809            ($get_type): It now returns a string, not an array reference,
810            for regular types and |sequence| types (i.e. it in any case
811            returns a string).
812            ($get_next_token): The second item in the array that represents
813            a integer or float token is now a Perl number value, not the
814            original string representation of the number.
815            (check): Support for const value consistency checking.
816            No extended attribute is defined for constants.
817            (Node subclasses): Use simple strings rather than array references
818            for default data type values.
819            ($serialize_type): Type values are now simple strings.
820            (value): If the new attribute value is a false value, then
821            a FALSE value is set to the attribute.
822    
823    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
824    
825            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
826            in its stringified format ("scoped name" as defined in the
827            spec).  Note that future version of this module should not use
828            array references for type values and the |type_text| attribute
829            should be made obsolete.
830            (parse_char_string): Unescape attribute names.
831            (check): Support for checking of whether inherited interfaces
832            are actually defined or not.  Support for checking of whether
833            interface member identifiers are duplicated or not.
834            ($serialize_type): Scoped names are returned as is.  A future
835            version of this code should escape identifiers other than "DOMString",
836            otherwise the idl_text would be non-conforming.
837    
838    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
839    
840            * WebIDL.pm (parse_char_string): Set line/column numbers
841            to generated nodes.  Unescape identifiers.  Extended attributes
842            for Definition's were ignored.
843            (append_child): Set |parent_node| attribute.
844            (parent_node): New attribute.
845            (check): Support interface/exception members.  Support
846            extended attributes.  Support definition identifier uniqueness
847            constraint.
848            (qualified_name): New attribute.
849            (Interface/Exception idl_text): Extended attributes were
850            not prepended to the returned text.
851    
852    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
853    
854            * WebIDL.pm (parse_char_string): Set line/column numbers
855            to interface object experimentally.  s/shift/pop/g, shift
856            would make things wrong.  Support for interface forward
857            declarations was missing.  Broken interface declarations
858            with no block were not ignored entirely.
859            (Whatpm::WebIDL::Node): New abstract class.  This class
860            makes things easier.
861            (child_nodes): New attribute.  Unlike DOM's attribute with
862            same name, this attribute returns a dead list of nodes for
863            simplicity.
864            (get_user_data, set_user_data): New methods.
865            (Module idl_text): A SPACE character should be inserted
866            before the |{| character.
867            (Interface idl_text): Support for interface forward declarations.
868            (is_forward_declaration): New attribute.
869    
870    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
871    
872            * WebIDL.pm (type_text): Better serializer.
873    
874    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
875    
876            * WebIDL.pm: Revise forward-compatible parsing so that
877            it now can handle broken extended attributes and as such.
878    
879    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
880    
881            * WebIDL.pm: Real support for extended attributes.
882            Support for extended attributes with arguments.
883    
884    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
885    
886            * WebIDL.pm: Support for |exception| syntax.
887            (Interface->idl_text): Tentative support for inheritances.
888    
889    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
890    
891            * WebIDL.pm: Hierarchical scoped name support was broken.
892            Support for raises, setraises, and getraises syntaxes.
893    
894    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
895    
896            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
897            proper support for types, extended attributes, and exceptions yet).
898            WebIDL parser, version 1 (no support for exceptions yet,
899            no proper support for extended attributes yet).
900    
901    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
902    
903            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
904    
905    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
906    
907            * WebIDL.pm: Support for valuetype and const.
908    
909    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
910            
911            * WebIDL.pm: New module.
912    
913    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
914    
915            * Makefile (Entities.html): URI changed.
916    
917    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
918    
919            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
920    
921    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
922    
923            * HTML.pm.src (_get_next_token): A parse error was missing.
924    
925    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
926    
927            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
928            spec (commented out). (HTML5 revision 1711).
929    
930    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
931    
932            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
933    
934            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
935            as UTF-16 (HTML5 revision 1701).
936    
937    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
938    
939            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
940            1692).
941    
942    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
943    
944            * HTML.pm.src: The secondary insertion mode used when switching
945            to foreign content is the "in body" insertion mode (HTML5 revision
946            1696).
947    
948    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
949    
950            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
951            revision 1697).
952    
953    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
954    
955            * HTML.pm.src: Support for end-of-file token in foreign content
956            insertion mode (HTML5 revision 1693).  Update SVG camelCase
957            attribute list (HTML5 revision 1700).  <textarea> closes
958            </select> (HTML5 revision 1699).  More start tags close in
959            foreign content insertion mode (HTML5 revision 1698).
960    
961    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
962    
963            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
964    
965    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
966    
967            * HTML.pm.src: More robust charset parameter detection (HTML5
968            revision 1674).
969    
970    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
971    
972            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
973            revision 1676).
974    
975    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
976    
977            * HTML.pm.src: Ignore language part of public identifiers for
978            quriks mode detection (HTML5 revision 1679).
979    
980    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
981    
982            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
983            revision 1685).
984    
985    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
986    
987            * HTML.pm.src: Support for EOF in new states for tags (HTML5
988            revision 1684).
989    
990    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
991    
992            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
993            work (HTML5 revision 1690).
994    
995    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
996    
997            * HTML.pm.src (_tree_construction_main): Change handling of
998            end tags in head insertion modes (HTML5 revision 1686).
999            (parse_char_string): Bug fix for non-utf8 character string handlings.
1000            (parse_char_stream): |ungetc| does not work well for this context.
1001    
1002    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
1003    
1004            * HTML.pm.src (parse_byte_string): Redefined to invoke
1005            |parse_byte_stream|.
1006            (parse_byte_stream): New method.
1007    
1008    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
1009    
1010            * HTML.pm.src (parse_byte_string): Fix the column number reported
1011            by encoding layer error reporter.
1012    
1013    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
1014    
1015            * HTML.pm.src (parse_byte_string): Use streaming decoder
1016            rather than converting the whole byte string and then parsing.
1017            Propagate errors in character encoding layer.
1018            (get_next_token): Precise error reporting for |bare stago| error.
1019    
1020    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
1021    
1022            * HTML.pm.src (parse_char_stream): New method.
1023            (parse_char_string): This method is now defined as an invocation
1024            of the |parse_char_stream| method.
1025    
1026    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
1027    
1028            * HTML.pm.src (parse_byte_string): Report various status
1029            of the sniffing as info-level errors.  Support for new
1030            decoding framework in parser resestting.
1031            (new): Various default error levels were not set.
1032    
1033    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
1034    
1035            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
1036            algorithm, except for the actual sniffing, is implemented
1037            with new framework with Message::Charset::Info.
1038    
1039    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
1040    
1041            * CacheManifest.pm (_parse): Drop fragment identifiers from
1042            URIs in fallback section (HTML5 revision 1596).
1043    
1044    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
1045    
1046            * Makefile (Entities.html): URI has changed.
1047    
1048    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
1049    
1050            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
1051            1553).
1052    
1053    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
1054    
1055            * ContentChecker.pm: Noted that those returned in |table| are
1056            no longer table elements, but table objects returned
1057            by Whatpm::HTMLTable.
1058    
1059            * HTMLTable.pm (form_table): Return table element node
1060            as |$table->{element}|.
1061            (assign_header): Support for the |headers=""| attribute.
1062    
1063    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
1064    
1065            * HTMLTable.pm (assign_header): New function; first version
1066            with no support for headers="".
1067            (form_table): Include table width and height to the returned
1068            table object for covenience.  Indexing in column assignement was wrong.
1069            Set whether a data cell is empty or not for convenience.
1070    
1071    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1072    
1073            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
1074    
1075    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1076    
1077            * HTMLTable.pm: How table model errors are detected is
1078            changed (HTML5 revision 1387).
1079    
1080    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1081    
1082            * HTMLTable.pm: The algorithm now moves |tfoot| elements
1083            to the end of the table (HTML5 revision 1380).
1084    
1085    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1086    
1087            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
1088            of 1-based (HTML5 revision 1376).
1089    
1090    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1091    
1092            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
1093            and "Content-Encoding" no longer prevent sniffing (HTML5
1094            revision 1288).
1095    
1096    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1097    
1098            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
1099            1282).
1100    
1101    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
1102    
1103            * ContentChecker.pm: Support for global attributes.
1104            Status of XML specs are added.
1105    
1106    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
1107    
1108            * ContentChecker.pm (check_element): Support for |template=""|
1109            and |ref=""| attribute (referent element type checking).
1110    
1111    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
1112    
1113            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
1114            1500) is implemented (except for IDNA part and URI-scheme-specific
1115            knowledge).  Line number counting was wrong for LF-only
1116            documents.
1117    
1118    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1119    
1120            * HTML.pm.src: Raise an parse error for any disallowed
1121            character (HTML5 revision 1263).
1122    
1123    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1124    
1125            * mkentitylist.pl: Support for new HTML5 entity table format (the
1126            definition for |AElig;| was missing).
1127    
1128    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1129    
1130            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
1131            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
1132    
1133    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1134    
1135            * HTML.pm.src: List of element names that close foreign content
1136            insertion mode is added (HTML5 revisions 1412 and 1418).
1137    
1138    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1139    
1140            * HTML.pm.src: Support for |mglyph| and |malignmark|
1141            elements (HTML5 revision 1410).
1142    
1143    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1144    
1145            * HTML.pm.src: Support for new long MathML entities (HTML5
1146            revision 1406).
1147    
1148    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1149    
1150            * HTML.pm.src: CDATA section support for MathML and SVG
1151            elements (HTML5 revision 1404 and 1420).
1152    
1153    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
1154    
1155            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
1156            elements (HTML5 revision 1404).  Unused !!!macro definitions
1157            are removed.
1158    
1159    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
1160    
1161            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
1162            are raised is changed (HTML5 revision 1404).
1163    
1164    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
1165    
1166            * HTML.pm.src: Category sets that are no longer used are removed.
1167    
1168    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
1169    
1170            * HTML.pm.src: The ->[1] property of stack entries are now
1171            replaced by constants representing element category.
1172    
1173    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
1174    
1175            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
1176            for error reporting.  (This is a preparation for using constant
1177            value for ->[1].)
1178    
1179    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
1180    
1181            * RDFXML.pm: Typo fixed.
1182    
1183    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
1184    
1185            * ContentChecker.pm: |fact_level| is now treated
1186            as same as |must_level|, i.e. level = |m|.
1187            (check_element): Make list of URIs in the DOM.
1188    
1189    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1190    
1191            * RDFXML.pm: Language accessor implemented.
1192            Local (null-namespace) attribute support.
1193    
1194    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1195    
1196            * RDFXML.pm: Factored out ID checking code.
1197    
1198    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1199    
1200            * RDFXML.pm: TODO items noted.  Validation of ID
1201            and URI attributes is implemented.  Warn if unknown
1202            value is used in rdf:parseType="" attribute.
1203    
1204            * URIChecker.pm (check_rdf_uri_reference): New function.
1205    
1206    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1207    
1208            * RDFXML.pm: bnodeid implemented.  Relative references
1209            are now resolved.
1210    
1211    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1212    
1213            * ContentChecker.pm: RDF reification implemented.
1214    
1215            * RDFXML.pm: undef vs false bug fixed.
1216            Reification implemented.
1217    
1218    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1219    
1220            * RDFXML.pm: s/id/ID/ for attribute name.
1221            The |node| arguments are added for |ontriple| calls.
1222            Too many "attribute not allowed" errors were raised.
1223    
1224            * ContentChecker.pm: Initial experimental support for rdf:RDF
1225            element.
1226    
1227    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1228    
1229            * RDFXML.pm: New module.
1230    
1231    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1232    
1233            * HTML.pm.src (set_inner_html): Line/column number
1234            code was old one yet.
1235    
1236    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1237    
1238            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
1239            errors.  Line/column markings for DOCTYPE, comment, and
1240            character tokens are reintroduced; otherwise,
1241            error location for "not HTML5" error and errors
1242            for implied elements are not attached.
1243    
1244    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1245    
1246            * HTML.pm.src: Set line/column numbers to attributes.
1247    
1248            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
1249            (value): Setter implemented.
1250    
1251            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
1252    
1253    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1254    
1255            * HTML.pm.src: Unused line/column markings are removed.
1256    
1257    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1258    
1259            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
1260            flag, which is no longer used.
1261    
1262    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
1263    
1264            * HTML.pm.src: Set line/column information to element nodes.
1265    
1266            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
1267            and !!!insert-element-t): Set line/column information to
1268            element nodes.
1269    
1270    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
1271    
1272            * HTML.pm.src (_get_next_token): The first "<" character
1273            in "<?", "<>", or "</>" should be the error point.
1274    
1275    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1276    
1277            * HTML.pm.src: Some more fixes on error position reporting.
1278    
1279    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1280    
1281            * HTML.pm.src: Token-level precious error reporting.
1282    
1283    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1284    
1285            * HTML.pm.src: Preparation for more precious error point
1286            reporting.
1287    
1288    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1289    
1290            * HTML.pm.src: Error type revised.
1291    
1292    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1293    
1294            * HTML.pm.src: Similar codes are merged together, again.
1295    
1296    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1297    
1298            * HTML.pm.src: Similar codes are merged together.
1299    
1300    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
1301    
1302            * mkhtmlparser.pl: Set "level" parameter to parse errors.
1303    
1304            * HTML.pm.src: Code refined.
1305    
1306    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1307    
1308            * HTML.pm.src: |</body>| treatement has been changed (HTML5
1309            revision 1348).  Note that I really don't know this makes
1310            any difference in the black-box behavior of the parser.
1311    
1312    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1313    
1314            * HTML.pm.src: New end-of-file token implementation (HTML5
1315            revision 1348).
1316    
1317    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1318    
1319            * HTML.pm.src: |applet| support (HTML5 revision 1347).
1320    
1321    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1322    
1323            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1324    
1325    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1326    
1327            * HTML.pm.src: Support for |<input>| in the "in select" insertion
1328            mode and support for the "in select in table" insertion mode (HTML5
1329            revision 1342).
1330    
1331    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1332    
1333            * HTML.pm.src: No foster parenting for <script> and <script>
1334            in non-tainted <table>s (HTML5 revision 1336).
1335    
1336    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1337    
1338            * HTML.pm.src: Ignore white space characters between <html>
1339            and <head> (HTML5 revision 1332).
1340    
1341    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1342    
1343            * HTML.pm.src: Treat <input type=hidden> as if it were a
1344            white space (HTML5 revision 1331).
1345    
1346    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1347    
1348            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1349            element (HTML5 revision 1330).
1350    
1351    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1352    
1353            * HTML.pm.src: <title> is always appended to the current
1354            element (HTML5 revision 1328).
1355    
1356    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1357    
1358            * HTML.pm.src: White space in tainted tables are moved
1359            into foster parents (HTML5 revision 1326).
1360    
1361    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1362    
1363            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1364            revision 1321).
1365    
1366    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1367    
1368            * HTML.pm.src: |</h/n/>| case code rearranged to align with
1369            the spec (HTML5 revision 1320).  Note that we finally complete
1370            all of HTML5 revision 1320 changes.
1371    
1372    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1373    
1374            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1375            tags (HTML5 revision 1320).
1376    
1377    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1378    
1379            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1380    
1381    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1382    
1383            * HTML.pm.src: A "generate implied end tags" code (t409.1)
1384            could not be reached so that it is now removed (HTML5 revision
1385            1320).
1386    
1387    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1388    
1389            * HTML.pm.src: Code for the case of |</div>| and so on are revised
1390            to align with new spec text (HTML5 revision 1320).
1391    
1392    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1393    
1394            * HTML.pm.src: Remove strange |if| condition; however, it should
1395            have had no harm in theory.
1396    
1397    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1398    
1399            * HTML.pm.src (_tree_construction_main): '</p>' in body
1400            case is split from other end tags for the preparation
1401            of implementing HTML revision 1320.
1402    
1403    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1404    
1405            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1406            revision 1320).
1407    
1408    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1409    
1410            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1411            and "div" (HTML5 revision 1318).  Add comments to where
1412            |form| pointer association codes should be inserted (HTML5 revision
1413            1319).
1414    
1415    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
1416    
1417            * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1418    
1419    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1420    
1421            * HTML.pm.src: Since the case t268 should never be reached (no
1422            other token type, there are), it is replaced by a |die| statement.
1423    
1424    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1425    
1426            * HTML.pm.src: Typo fixed.
1427    
1428    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
1429    
1430            * HTML.pm.src (_tree_construction_initial): Some limited quirks
1431            doctypes were not uppercased for comparison.
1432    
1433    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1434    
1435            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1436            are added.
1437    
1438    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1439    
1440            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1441            are set.  Cases that are unlikely reached are noted as so.
1442    
1443    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1444    
1445            * HTML.pm.src: Checkpoints for debugging are added.
1446    
1447            * mkhtmlparser.pl: Support for |!!!cp| syntax.
1448    
1449    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1450    
1451            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1452            for simplicity.
1453    
1454    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1455    
1456            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1457            revision 1307).
1458    
1459    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1460    
1461            * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1462            is set are changed (HTML5 revision 1305).
1463    
1464    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1465    
1466            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1467            1292).  Entities are not parsed in comment-like part in RCDATA
1468            elements (HTML5 revision 1294).  Allow bare & at the end
1469            of attribute value literals (HTML5 revision 1296).  More
1470            quirks mode doctypes (HTML5 revision 1302).  Requires spaces
1471            between attributes and ban attribute names or unquoted
1472            attribute values containing single or double quotes (HTML5
1473            revision 1303).
1474    
1475    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1476    
1477            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
1478            and related errors unless it is an HTML document (though the spec
1479            is unclear on whether it is applied to XHTML document).
1480    
1481            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1482            are added.
1483    
1484    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1485    
1486            * HTML.pm.src: s/local_name/manakai_local_name/g.
1487    
1488    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
1489    
1490            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1491    
1492            * HTML.pm.src: |charset| in |content| attribute is
1493            case-insensitive (HTML5 revision 1270).
1494    
1495    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
1496    
1497            * ContentChecker.pm: New status constants are added.
1498            ($ElementDefault): |status| added.
1499            (check_element): Err for non-standard or deprecated elements.
1500            (_attr_status_info): For non-standard or deprecated attributes.
1501    
1502    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1503    
1504            * ContentChecker.pm (_attr_status_info): New internal method.
1505    
1506    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1507    
1508            * ContentChecker.pm (check_element): Element standardized
1509            status information is now dispatched.
1510    
1511    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1512    
1513            * ContentChecker.pm (check_element): Fix |del|-and-significant
1514            problem by adding some more arguments.
1515    
1516    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1517    
1518            * ContentChecker.pm (check_element): Use context of
1519            container-for-the-purpose-of-content-model element (not transparent
1520            element) for |check_child_element| calling and significant
1521            text flag marking.  This reintroduces |<del>|-and-significant
1522            problem again.
1523    
1524    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1525    
1526            * ContentChecker.pm (check_element): Make semi-transparent
1527            elements ignored for the purpose of phase changes in
1528            content model checking.
1529    
1530    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1531    
1532            * ContentChecker.pm (check_element): In-element state
1533            was not properly managed for transparent cases.
1534    
1535    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1536    
1537            * ContentChecker.pm (check_element): Support for |video|
1538            and |audio| as semi-transparent elements.
1539    
1540    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1541    
1542            * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1543            (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1544            Support for |html:object| as a semi-transparent element.
1545    
1546    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1547    
1548            * ContentChecker.pm (check_element): The way to traverse
1549            the tree is entirely revised to make it easier to track
1550            the state of ancestors/descendants.  As a result of this
1551            revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1552            support for content model checking for HTML elements |figure|,
1553            |object|, |video|, and |audio| and checking for XML elements (and
1554            some XMLNS checkings) are dropped for now.  They will be
1555            reimplemented in due cource.
1556    
1557    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1558    
1559            * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1560            token (HTML5 revision 1225).
1561    
1562    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1563    
1564            * ContentChecker.pm ({unsupported_level}): New value.
1565    
1566            * HTML.pm.src: Save whether |meta| |content| attribute
1567            contains character references or not.
1568    
1569    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1570    
1571            * ContentChecker.pm (_get_children): (Incompleted) attempt to
1572            imlement significant content checking for contents
1573            with |del| elements.
1574    
1575    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1576    
1577            * ContenteChecker.pm ($HTMLTransparentElements): More
1578            elements are added.
1579            (_get_children): HTML |object| elements are now semi-transparent.
1580    
1581            * NanoDOM.pm (manakai_html, manakai_head): New methods.
1582    
1583    2008-02-16  Wakaba  <wakaba@suika.fam.cx>
1584    
1585            * CacheManifest.pm: HTML5 revision 1211 implemented.
1586    
1587            * CacheManifest.pod: Updated.
1588    
1589    2008-02-10  Wakaba  <wakaba@suika.fam.cx>
1590    
1591            * ContentChecker.pm (check_document, check_element): Support
1592            for second argument ($onsubdoc).
1593            (_get_css_parser): Removed (now it is part of WDCC).
1594    
1595    2008-02-09  Wakaba  <wakaba@suika.fam.cx>
1596    
1597            * ContentChecker.pm (_get_css_parser): New.
1598    
1599    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1600    
1601            * ContentChecker.pm ($AnyChecker): Old way to add child elements
1602            for checking had been used.
1603    
1604    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1605    
1606            * ContentChecker.pm (check_element): New todo item type |descendant|.
1607    
1608    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1609    
1610            * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1611            specced in RFC 2046 and RFC 4288.
1612            (application/atom+xml): Definition added.
1613    
1614    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1615    
1616            * URIChecker.pm: Make RFC 3986 should-level errors
1617            warnings (rather than SHOULD-level errors).
1618    
1619    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1620    
1621            * NanoDOM.pm (get_user_data, set_user_data): New methods.
1622    
1623            * HTML.pm.src: A flag for character references in attribute
1624            values are added.  Set |manakai_has_reference| user data
1625            to |charset| attribute.
1626    
1627    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1628    
1629            * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1630            attributes.
1631    
1632            * ContentChecker.pm (check_document): Warn if charset requirements
1633            cannot be tested.
1634    
1635    2007-11-19  Wakaba  <wakaba@suika.fam.cx>
1636    
1637            * HTML.pm.src (parse_byte_string): Detect charset
1638            by universalchardet if charset parameter is not specified.
1639    
1640            * Makefile (Charset-all, Charset-clean): New rules.
1641    
1642    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1643    
1644            * ContentChecker.pm (check_document): Check the existence
1645            of character encoding declaration and selection of encoding
1646            for HTML document.
1647    
1648    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1649    
1650            * ContentType.pm (get_sniffed_type): Return also the official
1651            type in list context.
1652    
1653    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1654    
1655            * ContentType.pm: Sniffing with leading white space
1656            ignoring (HTML5 revisions 1013 and 1016).
1657    
1658    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1659    
1660            * ContentType.pm: HTML5 revision 1013 changes, except for leading
1661            white spaces, are implemented.
1662    
1663    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1664    
1665            * HTML.pm.src (parse_char_string): Set |inner_encoding|
1666            attribute if possible.
1667    
1668    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1669    
1670            * HTML.pm.src (parse_byte_string): New method.
1671            (parse_char_string): New alias for |parse_string|.
1672            (main phase): Invoking "change the encoding" algorithm if desired.
1673    
1674            * HTML.pod: Updated.
1675    
1676    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1677    
1678            * HTML.pod (get_inner_html): Removed.
1679    
1680            * Makefile (HTML-all, HTML-clean): New.
1681    
1682    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1683    
1684            * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1685    
1686    2007-11-08  Wakaba  <wakaba@suika.fam.cx>
1687    
1688            * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1689    
1690    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1691    
1692            * Makefile: |CacheManifest.html| is added.
1693    
1694            * CacheManifest.pod: New file.
1695    
1696    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1697    
1698            * CacheManifest.pm: New module.
1699    
1700    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1701    
1702            * HTML.pm.src: Support for application cache selection algorithm
1703            callback.
1704    
1705    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1706    
1707            * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1708    
1709    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1710    
1711            * Makefile (clean): New rule.
1712    
1713            * NanoDOM.pm (public_id, system_id): New attributes.
1714    
1715    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1716    
1717            * Makefile (CSS-all, CSS-clean, clean): New rules.
1718    
1719    2007-10-14  Wakaba  <wakaba@suika.fam.cx>
1720    
1721            * ContentChecker.pm (check_document): Support for
1722            new |is_xml_root| flag.
1723            (check_element): Support for new |pluses| state.
1724            (_add_pluses): New method.
1725            (_remove_minuses): Support for new |minus| item.
1726    
1727    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1728    
1729            * ContentChecker.pm: Raise specific error for invalid
1730            root element.
1731    
1732    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1733    
1734            * ContentChecker.pm: Set level values for later uses.
1735    
1736    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1737    
1738            * ContentChecker.pm: Support for language tag validation.
1739    
1740    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1741    
1742            * LangTag.pm (check_rfc3066_language_tag): New method.
1743    
1744    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1745    
1746            * LangTag.pm: New module.
1747    
1748    2007-09-04  Wakaba  <wakaba@suika.fam.cx>
1749    
1750            * HTML.pm.src: Some error types were wrong.
1751    
1752    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1753    
1754            * CSS/: New directory.
1755    
1756    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1757    
1758            * ContentChecker.pm (_check_get_children): Support
1759            for |noscript| in |head|.
1760    
1761    2007-08-12  Wakaba  <wakaba@suika.fam.cx>
1762    
1763            * URI/: New directory.
1764    
1765    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1766    
1767            * HTML.pm.src: Tokenizer's states are now represented in
1768            number.
1769    
1770    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1771    
1772            * HTML.pm.src: |or|s for insertion modes are replaced
1773            by |&|s.
1774    
1775    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1776    
1777            * HTML.pm.src: Token types are now represented in number.
1778    
1779    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1780    
1781            * ContentType.pm (SEE ALSO): Updated.
1782    
1783            * HTML.pm.src: Insertion modes are now represented in number.
1784    
1785    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1786    
1787            * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1788            999) is implemented.
1789    
1790    2007-08-08  Wakaba  <wakaba@suika.fam.cx>
1791    
1792            * ContentType.pm: Sniffing for |<script| (HTML5 revision
1793            983) is implemented.
1794    
1795    2007-08-06  Wakaba  <wakaba@suika.fam.cx>
1796    
1797            * ContentChecker.pod: New documentation.
1798    
1799            * Makefile: A rule for |ContentChecker.html| is added.
1800    
1801            * ContentChecker.pm: A pod "LICENSE" section is added.
1802    
1803            * NanoDOM.pm ($VERSION): New variable.
1804    
1805    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1806    
1807            * H2H.pm: |b|, |i|, and |sub| are added to the
1808            list of allowed HTML elements.
1809    
1810    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1811    
1812            * H2H.pm: |samp| is added to the list of allowed
1813            HTML elements.
1814    
1815            * URIChecker.pm (check_iri): New.
1816            (check_iri_reference): Error type for IRI reference
1817            syntax error is changed.
1818    
1819    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1820    
1821            * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1822            is added.
1823            (check_document): Load appropriate module before validation.
1824    
1825    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1826    
1827            * ContentChecker/: New directory.
1828    
1829    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1830    
1831            * ContentChecker.pm: HTML |time| element is implemented.
1832    
1833            * HTMLTable.pm: Comments are updated as HTML5 is revised.
1834    
1835    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1836    
1837            * ContentChecker.pm (check_document): Return value
1838            even if no document element is found.
1839    
1840    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1841    
1842            * HTML.pm.src: |$in_body| is no longer a function.
1843    
1844    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1845    
1846            * HTML.pm.src: The |$in_body| code has been moved down.
1847    
1848    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1849    
1850            * HTML.pm.src: The "trailing end" insertion mode
1851            is split into "after html body" and "after html frameset"
1852            insertion modes.  Their codes are merged with "after body"
1853            and "after frameset" codes.  |$previous_insertion_mode|
1854            has been removed.  "after frameset" code is
1855            merged with "in frameset" code.
1856    
1857    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1858    
1859            * HTML.pm.src: The "before head" insertion mode is
1860            merged with the "in head" insertion mode.
1861    
1862    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1863    
1864            * HTML.pm.src: Cases in "in head" insertion mode are
1865            reorganized.
1866    
1867    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1868    
1869            * HTML.pm.src: Some cases in "in table" insertion mode
1870            are merged.
1871    
1872    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1873    
1874            * HTML.pm.src: The "in row" insertion mode is merged
1875            with "in table" insertion mode.
1876    
1877    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1878    
1879            * HTML.pm.src: The "in table" and "in table body" insertion
1880            modes are merged.
1881    
1882    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1883    
1884            * HTML.pm.src: There is no "in table head" or "in table foot"
1885            insertion mode!
1886    
1887    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1888    
1889            * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1890            now directly invoke the handler.
1891    
1892    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1893    
1894            * HTML.pm.src: Codes for "in cell" insertion mode
1895            is merged to the "in body" insertion mode code.
1896    
1897    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1898    
1899            * HTML.pm.src: Codes for "in body" and "in caption"
1900            insertion modes are merged.
1901    
1902    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1903    
1904            * HTML.pm.src: Two |!!!next-token|s were missing.
1905    
1906    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1907    
1908            * HTML.pm.src: Use numeric constant for |{content_mode}|
1909            instead of string constant for |{content_model_flag}|.
1910    
1911    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1912    
1913            * HTML.pm.src: Add the name of the attribute
1914            to the "duplicate attribute" error.
1915    
1916    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
1917    
1918            * ContentChecker.pm: Return the |class| node list.
1919    
1920    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
1921    
1922            * ContentChecker.pm: Return the |id| node list.
1923    
1924            * HTML.pm.src: A typo is fixed.
1925    
1926    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1927    
1928            * ContentChecker.pm: Drop wrong |level => 'error'| specification
1929            from "in HTML:xml:lang" error.  Character position
1930            is now the last part of the error type in the URI error
1931            description.  Report "unsupported" status for language
1932            tags, media queries, script codes, and style sheets.
1933    
1934    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1935    
1936            * ContentChecker.pm: Report error if |xml:lang|
1937            in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1938            in XML.
1939    
1940            * NanoDOM.pm (Attr.owner_document): New attribute.
1941    
1942    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1943    
1944            * HTML.pm.src: The character immediately following
1945            a bare |hcro| was discarded.  Fix handling of
1946            entity references in attribute values.
1947    
1948    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1949    
1950            * HTML.pm.src (main and trailin end phases): Token
1951            types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1952            factored out.  Error types |in html:#DOCTYPE|
1953            and |after html:#DOCTYPE| are merged
1954            into |DOCTYPE in the middle|.  |</frameset>|
1955            in fragment parsing mode changed the
1956            insertion mode.
1957    
1958    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1959    
1960            * HTML.pm.src: |$phase| has been removed; The |trailing end|
1961            phase is now an insertion mode.  Treatments for white
1962            space character tokens were incorrect for some
1963            insertion modes.  An old |meta| case was not removed.
1964    
1965    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1966    
1967            * HTML.pm.src: |meta| charset declaration extraction
1968            implemented (but changing the encoding is not yet:-).
1969    
1970    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1971    
1972            * Charset/: New directory.
1973    
1974    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1975    
1976            * H2H.pm: New Perl module (created from
1977            manakai's H2H.dis).
1978    
1979    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1980    
1981            * XMLSerializer.pm: New Perl module (created from
1982            manakai's SimpleLS.dis).
1983    
1984    2007-07-07  Wakaba  <wakaba@suika.fam.cx>
1985    
1986            * HTML.pm.src: |<!---x-->| was not processed correctly.
1987    
1988    2007-07-01  Wakaba  <wakaba@suika.fam.cx>
1989    
1990            * HTML.pm.src: Report correct error message
1991            for |<body></div></body>|.
1992    
1993    2007-07-01  Wakaba  <wakaba@suika.fam.cx>
1994    
1995            * HTMLTable.pm: An error description was incorrect.
1996    
1997    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
1998    
1999            * ContentChecker.pm: Return |{term}| list.
2000    
2001    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
2002    
2003            * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
2004            nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
2005            and <title> outside of head).
2006    
2007    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
2008    
2009            * IMTChecker.pm: Report warning for unregistered
2010            and private types/subtypes.
2011    
2012            * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
2013            URIChecker.pm, HTMLTable.pm: Error messages are now
2014            consistent; they are all listed in
2015            <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
2016    
2017    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
2018    
2019            * ContentChecker.pm: |<img ismap>| not in |<a></a>|
2020            is now erred.  |<datalist>| is implemented.
2021            Attribute checker for |<command>| and |<menu>| are
2022            added.  Support for |contextmenu| global attribute
2023            is added.
2024    
2025    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
2026    
2027            * HTML.pm.src (_reset_insertion_mode): Interpretation
2028            of Step 3 has been changed.
2029    
2030    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
2031    
2032            * HTML.pm.src: Late |<html>| parse error is implemented.
2033    
2034    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
2035    
2036            * URIChecker.pm (check_iri_reference): A |decode| method name was
2037            incorrect.
2038    
2039            * ContentChecker.pm: Support for the |footer| element.
2040            Check URI syntax for space-separated URI attributes.
2041            Support for the |tabindex| attribute.  Support
2042            for |datetime| attribute.
2043    
2044    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
2045    
2046            * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
2047            character references).  HTML5 revision 1.146 (white space
2048            characters before root start tag).  HTML5 revision
2049            1.148 (named character references in attribute values).
2050            HTML5 revision 1.152 (<plaintext>.innerHTML get).
2051    
2052    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
2053    
2054            * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
2055            in <head>).
2056    
2057  2007-06-24  Wakaba  <wakaba@suika.fam.cx>  2007-06-24  Wakaba  <wakaba@suika.fam.cx>
2058    
2059          * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>          * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>

Legend:
Removed from v.1.48  
changed lines
  Added in v.1.400

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24