/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.48 by wakaba, Sat Jun 23 16:42:43 2007 UTC revision 1.402 by wakaba, Sun Sep 6 13:02:20 2009 UTC
# Line 1  Line 1 
1    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Fixed typos on |frameset-ok| implementation.
4    
5    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
6    
7            * HTML.pm.src: The first cut to implement the |frameset-ok| flag
8            concept (HTML5 revision 2703).
9    
10    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
11    
12            * HTML.pm.src: |<hr/>| was incorrectly marked as an error.
13    
14    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
15    
16            * HTML.pm.src: Changed the handling of </html> end tag of "in
17            body" insertion mode according to the latest spec definition.
18    
19    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
20    
21            * HTML.pm.src: Rewrote end tag handling in |head| area (cf. HTML5
22            revision 2730, but it was entirely broken, maybe I missed some
23            spec changes before rev.2730).
24    
25    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
26    
27            * HTML.pm.src: Added the |hgroup| element (HTML5 revision 3039 and
28            HTML5 revision 3040).
29    
30    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
31    
32            * HTML.pm.src: Added |xmp| to the list of Special elements (HTML5
33            revision 3689).  It should make no difference since the |xmp|
34            element has the PCDATA content and cannot be the non-bottommost
35            element in the stack of open elements.
36    
37    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
38    
39            * HTML.pm.src: Set the |confidence| flag to |irrlevant| when the
40            fragment parsing algorithm is invoked (HTML5 revision 2699).
41    
42    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
43    
44            * HTML.pm.src: Simplified foster parenting (HTML5 revision 2732).
45    
46    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
47    
48            * HTML.pm.src: No longer does foster parenting |style|, |script|,
49            and |input type=hidden| elements (HTML5 revision 2988).
50    
51    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
52    
53            * HTML.pm.src: Dropped the |eventsource| element (HTML5 revision
54            2870).
55    
56    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
57    
58            * HTML.pm.src (_tree_construction_main): Treate |keygen| as a void
59            element (HTML5 revision 2960).
60    
61    2009-09-05  Wakaba  <wakaba@suika.fam.cx>
62    
63            * HTML.pm.src: Moved the "in table text" insertion mode code up
64            such that it does also work as specced in HTML5 in the remaining
65            cases (HTML5 revision 3382).
66    
67    2009-09-05  Wakaba  <wakaba@suika.fam.cx>
68    
69            * HTML.pm.src: Implementing the "in table text" insertion mode,
70            first half (HTML5 revision 3382).
71    
72    2009-08-30  Wakaba  <wakaba@suika.fam.cx>
73    
74            * ContentChecker.pm: Allow |xml:lang| in HTML document if the
75            owner element is not an HTML element (HTML5 revision 3697).
76    
77    2009-08-23  Wakaba  <wakaba@suika.fam.cx>
78    
79            * ContentChecker.pm: Allow <input type=hidden> in context where
80            interactive content is disallowed (HTML5 revision 2392).
81    
82    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
83    
84            * HTML.pm.src: Dropped support for "XSLT-compat" and added
85            "about:legacy-compat" (HTML5 revision 2725).
86    
87    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
88    
89            * HTML.pm.src: "<!DOCTYPE>" is now explicitly supported in the
90            spec (HTML5 revision 2700).
91    
92    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
93    
94            * HTML.pm.src: Added support for <script> element in the "in
95            select" and "in select in table" insertion modes (HTML5 revision
96            2500).
97    
98    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
99    
100            * HTML.pm.src: An <hn> start tag in an <hn> element should close
101            the earlier element (HTML5 revision 2501).
102    
103    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
104    
105            * HTML.pm.src: The </br> end tags before the <body> start tag
106            should equally survive as the first br element child of the body
107            element (HTML5 revision 2586).
108    
109    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
110    
111            * HTML.pm.src: <font> in SVG land is handled as an SVG element as
112            long as it did not have HTML attributes (HTML5 revision 2590).
113    
114    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
115    
116            * HTML.pm.src (_tree_construction_main): Improved error reporting
117            on "select" start tag in "in select" and "in select in table"
118            parsing modes (c.f. HTML5 revision 3604).
119    
120    2009-07-26  Wakaba  <wakaba@suika.fam.cx>
121    
122            * mkhtmlclass.pl: Made xmlns:xlink with wrong namespace URL
123            invalid (HTML5 revision 2701).
124    
125    2009-07-26  Wakaba  <wakaba@suika.fam.cx>
126    
127            * HTML.pm.src: </rt>, </rp>, </option>, </optgroup> become
128            optional before </body> (HTML5 revision 2806).
129    
130    2009-07-25  Wakaba  <wakaba@suika.fam.cx>
131    
132            * HTML.pm.src: Marked where SVG |script| execution should be
133            implemented (HTML5 revision 2904).
134    
135    2009-07-25  Wakaba  <wakaba@suika.fam.cx>
136    
137            * HTML.pm.src (set_inner_html): The fragment parsing algorithm
138            inherits the document quirksness flag of the main document (HTML5
139            revision 2935).
140    
141    2009-07-05  Wakaba  <wakaba@suika.fam.cx>
142    
143            * HTML.pm.src (_tree_construction_main): Put |table| in |p| in
144            quirks mode (The Hixie Quirk, HTML5 revision 3132).
145    
146    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
147    
148            * HTML.pm.src: <kegten> breaks out of <select> (HTML5 revision
149            3163).
150    
151    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
152    
153            * HTML.pm.src: Don't change encoding to UTF-8 if the input
154            encoding is UTF-16 (HTML5 revision 3204).
155    
156    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
157    
158            * HTML.pm.src (parse_byte_stream): Don't change to another
159            encoding if the input is UTF-16 (HTML5 revision 3205).
160    
161    2009-06-28  Wakaba  <wakaba@suika.fam.cx>
162    
163            * HTML.pm.src: |isindex| should no longer generate |p| element
164            node (HTML5 revision 3236).
165    
166    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
167    
168            * ContentChecker.pm: Introduced new flag |has_autofocus|.  Added
169            support for new |id_type| value of |any|, which means any kind of
170            element is allowed as a referenced element.
171    
172    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
173    
174            * ContentChecker.pm: Introduced |check_attrs2| method for the ease
175            of defining codes for checking required attribute and as such.
176    
177    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
178    
179            * IMTChecker.pm: Added more definitions for subtypes.
180    
181    2008-12-11  Wakaba  <wakaba@suika.fam.cx>
182    
183            * URIChecker.pm: Some of |pos_end| values were wrong.
184    
185    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
186    
187            * ContentChecker.pm (check_element): Added support for "no
188            referenced datalist" error.
189    
190    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
191    
192            * URIChecker.pm: Bug fix: It did not work unless
193            Message::DOM::DOMImplementation has been |require|d.
194    
195    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
196    
197            * NanoDOM.pm (document_uri): New attribute.
198    
199            * ContentChecker.pm: Don't use methods not implemented by NanoDOM.
200    
201    2008-11-07  Wakaba  <wakaba@suika.fam.cx>
202    
203            * NanoDOM.pm (text_content): Don't create a Text node if the new
204            value is empty.
205    
206    2008-11-06  Wakaba  <wakaba@suika.fam.cx>
207    
208            * SWML/: New directory.
209    
210    2008-10-20  Wakaba  <wakaba@suika.fam.cx>
211    
212            * NanoDOM.pm (specified, all_declarations_processed,
213            manakai_attribute_type): New attributes.
214    
215    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
216    
217            * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
218            array.
219    
220    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
221    
222            * NanoDOM.pm (notation_name): New attribute.
223    
224    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
225    
226            * NanoDOM.pm (public_id, system_id): New attributes.a
227    
228    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
229    
230            * NanoDOM.pm (text_content): Moved to Node from Element.  Setter
231            implemented.
232            (allowed_tokens, default_type, declared_type): Implemented.
233    
234    2008-10-17  Wakaba  <wakaba@suika.fam.cx>
235    
236            * NanoDOM.pm (node_name): New attribute.
237            (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
238            constants.
239            (create_element_type_definition_node, create_attribute_definition,
240            create_notation, create_general_entity,
241            get_element_type_definition_node,
242            set_element_type_definition_node, get_general_entity_node,
243            set_general_entity_node, get_notation_node, set_notation_node,
244            get_attribute_definition_node, set_attribute_definition_node): New
245            methods.
246            (element_types, entities, notations, attribute_definitions): New
247            attributes.
248            (DocumentType): Support for child nodes, entities, notations, and
249            element types.
250            (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
251            New classes.
252    
253            * Dumper.pm: Support for general entities, notations, element type
254            definitions, and attribute definitions.
255    
256    2008-10-15  Wakaba  <wakaba@suika.fam.cx>
257    
258            * NanoDOM.pm (create_processing_instruction): New method.
259            (xml_version, xml_encoding, xml_standalone): New attributes.
260            (ProcessingInstruction): New class.
261    
262    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
263    
264            * HTML.pm.src: Handling of end tags in the foreign content
265            insertion mode was partially wrong, because of wrong bit
266            operations.
267    
268    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
269    
270            * NanoDOM.pm (dom_config): New attribute (do nothing), for
271            Whatpm::XML::Parser support.
272    
273    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
274    
275            * Makefile: New rule to make HTML/Tokenizer.pm is added.
276    
277            * HTML.pm.src: Tokenizer part moved to another file.
278    
279    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
280    
281            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
282    
283    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
284    
285            * HTML.pm.src: Element category constants redefined.
286            
287    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
288    
289            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
290            construction stage synced with the spec (HTML5 revisions 2139 and
291            2302).
292    
293    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
294    
295            * ContentChecker.pm: New error level "html5_fact" added, which
296            should be tentatively used until all of requirements are properly
297            specced as RFC 2119 "MUST" in HTML5.
298    
299    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
300    
301            * ContentChecker.pod: Note on internal flags is added.
302    
303    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
304    
305            * HTML.pm.src: An AAA bug fixed.
306    
307    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
308    
309            * HTML.pm.src: If another node is inserted by the parser, don't
310            reuse existing Text node to append a character (HTML5 revision
311            2124).
312    
313    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
314    
315            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
316            revisions 1731 and 2128).
317    
318    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
319    
320            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
321            Support for end tags of camelCase SVG elements were broken.  A
322            wrong error type text fixed.
323    
324    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
325    
326            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
327    
328    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
329    
330            * HTML.pm.src: Support for new definition of |param| and |source|
331            start tag parsing (HTML5 revision 1731).
332    
333    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
334    
335            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
336    
337    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
338    
339            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
340            revisions 1731 and 1831).
341    
342    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
343    
344            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
345            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
346            revision 1731).
347    
348    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
349    
350            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
351            revision 1731).  End tags of |option| and |optgroup| elements are
352            now optional (HTML5 revision 1731).
353    
354    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
355    
356            * HTML.pm.src: New "special" elements added to the list (HTML5
357            revision 1778).  "strile" -> "strike".
358    
359    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
360    
361            * ContentType.pm (get_sniffed_type): Support for the "better"
362            content sniffing (HTML5 revision 1927).  In a case the official
363            type was not returned when the method is invoked in the list
364            context.
365    
366    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
367    
368            * HTML.pm.src: Character references for non-space C0 characters,
369            including U+000B VT, DEL character, noncharacter code points, are
370            now converted to the U+FFFD character (cf. HTML5 revision 2138).
371    
372    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
373    
374            * ContentChecker.pm: |form=""| check support added.
375    
376    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
377    
378            * ContentChecker.pm: |contextmenu| validness is now checked using
379            |id| and |id_type| properties, and |menu| property is removed.
380    
381    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
382    
383            * ContentChecker.pm: Prepare for |form| |name| attribute's
384            duplication checking.
385    
386    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
387    
388            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
389            unsupported charset="" parameter value (HTML5 revision 2131).
390    
391    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
392    
393            * HTML.pm.src: Reminding places where U+000B is allowed as a space
394            character is fixed (cf. HTML5 revision 1738).
395    
396            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
397            space characters (HTML5 revision 1738).
398    
399    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
400    
401            * HTML.pm.src: The "anything else" case for the "after after body"
402            insertion mode was not updated to swtich to the "in body"
403            insertion mode.  U+000B is no longer a space character for the
404            purpose of tree construction phase (HTML5 revision 1738).
405    
406    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
407    
408            * HTML.pm.src: U+000B is no longer a space character (HTML5
409            revision 1738).
410    
411    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
412    
413            * ContentType.pm: 0x0B is no longer a space character (HTML5
414            revision 1738).
415    
416            * HTML.pm.src: U+000B is no longer a space character for the
417            algorithm for extracting an encoding from a Content-Type (HTML5
418            revision 1738).
419    
420    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
421    
422            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
423    
424    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
425    
426            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
427    
428            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
429            matching.
430    
431    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
432    
433            * LangTag.pm: Warn for private use language subtags.  Error level
434            typos fixed.  Support for Suppress-Script field.
435    
436            * mklangreg.pl: Support for dumping of nested structure.
437    
438    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
439    
440            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
441            recommended case as per RFC 4646.
442    
443    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
444    
445            * LangTag.pm (check_rfc4646_langtag): New method.
446    
447    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
448    
449            * mklangreg.pl: New script.
450    
451            * Makefile: Updated for creation of the module for language subtag
452            registry.
453            
454    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
455    
456            * Makefile: WebIDL.html added.
457    
458            * WebIDL.pod: New documentation.
459    
460    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
461    
462            * WebIDL.pm: Checker's error types are redefined.
463    
464    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
465    
466            * WebIDL.pm: Parser's error types are redefined.  Some forward
467            compatible parsing bugs are fixed.  Some unreachable codes are
468            commented out.
469    
470    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
471    
472            * WebIDL.pm: Support for the reminding extended attributes are
473            added.  It does not satisfy the definition that a forward
474            interface declaration has an extended attribute.  It seems that
475            unless explicitly allowed multiple extended attributes with the
476            same name is not allowed, though it is not explicitly mentioned in
477            the spec.
478    
479    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
480    
481            * WebIDL.pm: Unescapes extended attribute names and extended
482            attribute identifiers.  Preserve whether an extended attribute has
483            an argument list of not.  Support for extended attributes:
484            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
485            NameGetter, NameSetter, and Null.
486            (has_argument_list): New attribute.
487            (idl_text): Stringifies argument lists, if any, even if it is
488            empty.
489    
490    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
491    
492            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
493            |{s_kwd}| in DATA_STATE as default.
494    
495    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
496    
497            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
498            by |{s_kwd}| in DATA_STATE.
499    
500    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
501    
502            * HTML.pm.src: Shorten keys.
503    
504    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
505    
506            * HTML.pm.src: Remove checking for control character, surrogate
507            pair, or noncharacter code points and non-Unicode code
508            points (they should be handled by Whatpm::Charset::UnicodeChecker).
509            (parse_char_stream): Support for the |$get_wrapper| argument and
510            character stream error handlers.
511    
512    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
513    
514            * ContentChecker.pm: Don't call |loda_ns_module|
515            for null-namespace elements/attributes.
516    
517            * HTML.pm.src: Fact out $disallowed_control_chars
518            as a hash.
519    
520    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
521    
522            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
523            and |{next_char}| initializations are moved to initialization
524            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
525            with |parse_char_stream|.
526    
527    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
528    
529            * HTML.pm.src (parse_char_stream): Make |set_next_char|
530            invoke |manakai_read_until|, not only |read|, where
531            possible, to decrease the number of |read| method calls.
532    
533            * mkhtmlparser.pl: Related changes to the aforementioned
534            modification.
535    
536    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
537    
538            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
539            would report character error from now.
540    
541    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
542    
543            * HTML.pm.src: White-space-leaded non-white-space character
544            tokens in "before head insertion mode" was not
545            correctly handled.
546            (set_inner_html): Reimplemented using CharString decodehandle
547            class.  Support for $get_wrapper argument.  Support
548            for |{read_until}| feature.
549    
550    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
551    
552            * HTML.pm.src: Make a "bare ero" error for unknown
553            entities point the "&" character.
554    
555    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
556    
557            * HTML.pm.src: It turns out that U+FFFD don't have to
558            be added to the list of excluded characters.
559    
560    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
561    
562            * HTML.pm.src ($char_onerror): Have character decoder's |line|
563            and |column| a higher priority than the one set by the
564            tokenizer's input handler.
565            ($self->{read_until}): Exclude U+FFFD (but this might
566            not be necessary, since now we do line/column fixup in
567            the character decode handle).
568    
569    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
570    
571            * HTML.pm.src: Use |{read_until}| where possible.
572    
573    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
574    
575            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
576            and |manakai_getc_until| to |manakai_read_until| to
577            reduce the number of string copies.
578    
579    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
580    
581            * HTML.pm.src (parse_char_string): Use newly created
582            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
583            standard feature to |open| a string as a filehandle,
584            since Perl's string filehandle seems not supporting |ungetc|
585            method correctly.
586            (parse_char_stream): Define |{getc_until}| method.
587            (DATA_STATE): Experimental support for |getc_until| feature.
588    
589    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
590    
591            * HTML.pm.src: Check points added to newly added branches.
592    
593    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
594    
595            * HTML.pm.src: Remove |{char}|, which is no longer used.
596            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
597            and replaced by |{prev_state}|.
598    
599            * mkhtmlparser.pl: Remove |{char}| feature.
600            Remove |!!!back-next-input-character;| macro.
601    
602    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
603    
604            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
605            entity related tokenizer states in favor of new states
606            implementing the consume character reference algorithm.
607    
608    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
609    
610            * HTML.pm.src: "Consume a character reference" algorithm is
611            now implemented as a tokenizer's state, rather than
612            a method, with minimum changes (more changes will
613            be made, in due course).  "Bogus comment state"'s inner
614            loop gets removed.
615    
616    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
617    
618            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
619            into their own tokenizer states.
620    
621    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
622    
623            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
624            is split into three states.
625    
626    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
627    
628            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
629            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
630            no longer does the tokenizer have to push back next input
631            characters in those states.
632    
633    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
634    
635            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
636            into four states so that no longer does the tokenizer have to push
637            back next input characters in that state.
638    
639    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
640    
641            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
642            which can be used to insert some wrapper between the character
643            stream handle and the tokenizer.  (It is currently not supported
644            for |set_inner_html| for |Element|s).
645    
646    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
647    
648            * HTML.pm.src: Ignore punctuations in charset names.
649    
650    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
651    
652            * ContentChecker.pm: Support for charset-layer error levels.
653    
654            * HTML.pm.src: Don't specify |text| argument for the
655            |chardecode:fallback| error, since it is not the encoding
656            being used alternatively.
657    
658    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
659    
660            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
661    
662    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
663    
664            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
665    
666    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
667    
668            * HTML.pm.src: Bug fix and sync with the spec with regard
669            to after after frameset insertion mode processing (HTML5
670            revision 1909).  Note that the implementation was wrong
671            per the old spec before the r1909 changes.
672    
673    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
674    
675            * HTMLTable.pm: scope=auto algorithm fix synced with the
676            spec (HTML5 revision 2093).
677            ($process_row): Algorithm step numbers synced with the
678            spec (HTML5 revision 2092).
679    
680    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
681    
682            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
683            revision 2094).
684    
685    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
686    
687            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
688    
689    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
690    
691            * HTML.pm.src: '"' and "'" at the end of attribute
692            name (after another attribute) now raise parse error (HTML5
693            revision 2123).  Empty unquoted attribute values are no
694            longer allowed (HTML5 revision 2122).
695    
696    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
697    
698            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
699            revision 2130).
700    
701    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
702    
703            * ContentChecker.pm: |xml:lang| attribute value must be same
704            as |lang| attribute value for HTML elements (HTML5 revision 2062
705            and so on).
706    
707    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
708    
709            * ContentChecker.pm: Error level definition for |xml_id_error|
710            was missing.
711    
712            * URIChecker.pm: The end of the URL should be marked as the
713            error location for an empty path error.  The position
714            between the userinfo and the port components should be
715            marked as the error location for an empty host error.
716    
717    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
718    
719            * URIChecker.pm: Set parameters representing where in the
720            value the error occurs for errors.  Report unknown
721            address format error in warning level, since address
722            formats are rarely added.  Path segments starting with "/.."
723            were misinterpreted as a dot-segment.
724    
725    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
726    
727            * URIChecker.pm (check_iri_reference): Requires
728            |Message::DOM::DOMImplementation|.
729    
730    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
731    
732            * IMTChecker.pm: Updated for the new error reporting architecture.
733    
734            * ContentChecker.pm: Error levels for IMTs are added.
735    
736    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
737    
738            * H2H.pm (_shift_token): Support for unquoted HTML attribute
739            values.
740    
741    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
742    
743            * CacheManifest.pm: Support for new style of error
744            reports.
745    
746            * HTML.pm.src: Set line=1, column=1 to the document node.
747    
748    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
749    
750            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
751            and URL checkers.  Support for more error levels for bogus
752            langauge tag and URL "standards".
753    
754            * LangTag.pm, URIChecker.pm: Support for new style error
755            level reporting.
756    
757    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
758    
759            * ContentChecker.pm: Support for RDF/XML error levels.
760    
761            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
762            specifying.  Error types are revised.
763    
764    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
765    
766            * ContentChecker.pm: All error reporting method calls are
767            renewed.
768    
769    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
770    
771            * HTML.pm.src: All error type names and "text" parameters
772            are revised.  Use new style for "level" specification.
773    
774            * mkhtmlparser.pl: Use new style for "level" specification.
775    
776    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
777    
778            * WebIDL.pm (parse_char_string): Simplified error
779            reporting process for broken ignored valuetype definition.
780            (Valuetype idl_text): Support for special "DOMString" name.
781    
782    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
783    
784            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
785            terminal of the ScopedName is "DOMString", such that whether
786            the last part of the scoped name is "DOMString" or "_DOMString"
787            later.  It is necessary to determine whether a |typedef|
788            definition should be ignored or not.
789            (parse_char_string): Unescape the identifier of
790            exception members.
791            ($resolve): Return undef for builtin types and sequence<T>
792            types (we might not have to do this, however...).
793            (check): Support checking for Exceptions, Valuetypes,
794            and Typedefs.
795            ($serialize_type): Support for "DOMString::::" syntax.
796            (Typedef idl_text): Output Type as "DOMString" if it
797            is really "DOMString" (i.e. its internal representation
798            is "::DOMString::").
799    
800    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
801    
802            * WebIDL.pm ($resolve): New code, based on resolve code
803            for constant types in the |check| method.
804            (check): Support for checking of attributes, operations, and
805            arguments.
806            (Attribute/Operation idl_text): Exception names in getraises,
807            setraises, and raises clauses is serizlied by |$serialize_type|
808            code.
809    
810    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
811    
812            * WebIDL.pm ($integer): Order of selections are changed to match
813            hexadecimal numbers (the original pattern, taken from the spec,
814            was not work for hexadecimal numbers, because the "0" prefix
815            matches to the [0-7]* part (as an empty string) and therefore
816            it does not match with remaining "x..." part of a "0x..." integer
817            literal.
818            ($get_type): It now returns a string, not an array reference,
819            for regular types and |sequence| types (i.e. it in any case
820            returns a string).
821            ($get_next_token): The second item in the array that represents
822            a integer or float token is now a Perl number value, not the
823            original string representation of the number.
824            (check): Support for const value consistency checking.
825            No extended attribute is defined for constants.
826            (Node subclasses): Use simple strings rather than array references
827            for default data type values.
828            ($serialize_type): Type values are now simple strings.
829            (value): If the new attribute value is a false value, then
830            a FALSE value is set to the attribute.
831    
832    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
833    
834            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
835            in its stringified format ("scoped name" as defined in the
836            spec).  Note that future version of this module should not use
837            array references for type values and the |type_text| attribute
838            should be made obsolete.
839            (parse_char_string): Unescape attribute names.
840            (check): Support for checking of whether inherited interfaces
841            are actually defined or not.  Support for checking of whether
842            interface member identifiers are duplicated or not.
843            ($serialize_type): Scoped names are returned as is.  A future
844            version of this code should escape identifiers other than "DOMString",
845            otherwise the idl_text would be non-conforming.
846    
847    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
848    
849            * WebIDL.pm (parse_char_string): Set line/column numbers
850            to generated nodes.  Unescape identifiers.  Extended attributes
851            for Definition's were ignored.
852            (append_child): Set |parent_node| attribute.
853            (parent_node): New attribute.
854            (check): Support interface/exception members.  Support
855            extended attributes.  Support definition identifier uniqueness
856            constraint.
857            (qualified_name): New attribute.
858            (Interface/Exception idl_text): Extended attributes were
859            not prepended to the returned text.
860    
861    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
862    
863            * WebIDL.pm (parse_char_string): Set line/column numbers
864            to interface object experimentally.  s/shift/pop/g, shift
865            would make things wrong.  Support for interface forward
866            declarations was missing.  Broken interface declarations
867            with no block were not ignored entirely.
868            (Whatpm::WebIDL::Node): New abstract class.  This class
869            makes things easier.
870            (child_nodes): New attribute.  Unlike DOM's attribute with
871            same name, this attribute returns a dead list of nodes for
872            simplicity.
873            (get_user_data, set_user_data): New methods.
874            (Module idl_text): A SPACE character should be inserted
875            before the |{| character.
876            (Interface idl_text): Support for interface forward declarations.
877            (is_forward_declaration): New attribute.
878    
879    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
880    
881            * WebIDL.pm (type_text): Better serializer.
882    
883    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
884    
885            * WebIDL.pm: Revise forward-compatible parsing so that
886            it now can handle broken extended attributes and as such.
887    
888    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
889    
890            * WebIDL.pm: Real support for extended attributes.
891            Support for extended attributes with arguments.
892    
893    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
894    
895            * WebIDL.pm: Support for |exception| syntax.
896            (Interface->idl_text): Tentative support for inheritances.
897    
898    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
899    
900            * WebIDL.pm: Hierarchical scoped name support was broken.
901            Support for raises, setraises, and getraises syntaxes.
902    
903    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
904    
905            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
906            proper support for types, extended attributes, and exceptions yet).
907            WebIDL parser, version 1 (no support for exceptions yet,
908            no proper support for extended attributes yet).
909    
910    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
911    
912            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
913    
914    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
915    
916            * WebIDL.pm: Support for valuetype and const.
917    
918    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
919            
920            * WebIDL.pm: New module.
921    
922    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
923    
924            * Makefile (Entities.html): URI changed.
925    
926    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
927    
928            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
929    
930    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
931    
932            * HTML.pm.src (_get_next_token): A parse error was missing.
933    
934    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
935    
936            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
937            spec (commented out). (HTML5 revision 1711).
938    
939    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
940    
941            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
942    
943            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
944            as UTF-16 (HTML5 revision 1701).
945    
946    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
947    
948            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
949            1692).
950    
951    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
952    
953            * HTML.pm.src: The secondary insertion mode used when switching
954            to foreign content is the "in body" insertion mode (HTML5 revision
955            1696).
956    
957    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
958    
959            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
960            revision 1697).
961    
962    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
963    
964            * HTML.pm.src: Support for end-of-file token in foreign content
965            insertion mode (HTML5 revision 1693).  Update SVG camelCase
966            attribute list (HTML5 revision 1700).  <textarea> closes
967            </select> (HTML5 revision 1699).  More start tags close in
968            foreign content insertion mode (HTML5 revision 1698).
969    
970    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
971    
972            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
973    
974    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
975    
976            * HTML.pm.src: More robust charset parameter detection (HTML5
977            revision 1674).
978    
979    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
980    
981            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
982            revision 1676).
983    
984    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
985    
986            * HTML.pm.src: Ignore language part of public identifiers for
987            quriks mode detection (HTML5 revision 1679).
988    
989    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
990    
991            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
992            revision 1685).
993    
994    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
995    
996            * HTML.pm.src: Support for EOF in new states for tags (HTML5
997            revision 1684).
998    
999    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
1000    
1001            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
1002            work (HTML5 revision 1690).
1003    
1004    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
1005    
1006            * HTML.pm.src (_tree_construction_main): Change handling of
1007            end tags in head insertion modes (HTML5 revision 1686).
1008            (parse_char_string): Bug fix for non-utf8 character string handlings.
1009            (parse_char_stream): |ungetc| does not work well for this context.
1010    
1011    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
1012    
1013            * HTML.pm.src (parse_byte_string): Redefined to invoke
1014            |parse_byte_stream|.
1015            (parse_byte_stream): New method.
1016    
1017    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
1018    
1019            * HTML.pm.src (parse_byte_string): Fix the column number reported
1020            by encoding layer error reporter.
1021    
1022    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
1023    
1024            * HTML.pm.src (parse_byte_string): Use streaming decoder
1025            rather than converting the whole byte string and then parsing.
1026            Propagate errors in character encoding layer.
1027            (get_next_token): Precise error reporting for |bare stago| error.
1028    
1029    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
1030    
1031            * HTML.pm.src (parse_char_stream): New method.
1032            (parse_char_string): This method is now defined as an invocation
1033            of the |parse_char_stream| method.
1034    
1035    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
1036    
1037            * HTML.pm.src (parse_byte_string): Report various status
1038            of the sniffing as info-level errors.  Support for new
1039            decoding framework in parser resestting.
1040            (new): Various default error levels were not set.
1041    
1042    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
1043    
1044            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
1045            algorithm, except for the actual sniffing, is implemented
1046            with new framework with Message::Charset::Info.
1047    
1048    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
1049    
1050            * CacheManifest.pm (_parse): Drop fragment identifiers from
1051            URIs in fallback section (HTML5 revision 1596).
1052    
1053    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
1054    
1055            * Makefile (Entities.html): URI has changed.
1056    
1057    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
1058    
1059            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
1060            1553).
1061    
1062    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
1063    
1064            * ContentChecker.pm: Noted that those returned in |table| are
1065            no longer table elements, but table objects returned
1066            by Whatpm::HTMLTable.
1067    
1068            * HTMLTable.pm (form_table): Return table element node
1069            as |$table->{element}|.
1070            (assign_header): Support for the |headers=""| attribute.
1071    
1072    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
1073    
1074            * HTMLTable.pm (assign_header): New function; first version
1075            with no support for headers="".
1076            (form_table): Include table width and height to the returned
1077            table object for covenience.  Indexing in column assignement was wrong.
1078            Set whether a data cell is empty or not for convenience.
1079    
1080    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1081    
1082            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
1083    
1084    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1085    
1086            * HTMLTable.pm: How table model errors are detected is
1087            changed (HTML5 revision 1387).
1088    
1089    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1090    
1091            * HTMLTable.pm: The algorithm now moves |tfoot| elements
1092            to the end of the table (HTML5 revision 1380).
1093    
1094    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1095    
1096            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
1097            of 1-based (HTML5 revision 1376).
1098    
1099    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1100    
1101            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
1102            and "Content-Encoding" no longer prevent sniffing (HTML5
1103            revision 1288).
1104    
1105    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1106    
1107            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
1108            1282).
1109    
1110    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
1111    
1112            * ContentChecker.pm: Support for global attributes.
1113            Status of XML specs are added.
1114    
1115    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
1116    
1117            * ContentChecker.pm (check_element): Support for |template=""|
1118            and |ref=""| attribute (referent element type checking).
1119    
1120    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
1121    
1122            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
1123            1500) is implemented (except for IDNA part and URI-scheme-specific
1124            knowledge).  Line number counting was wrong for LF-only
1125            documents.
1126    
1127    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1128    
1129            * HTML.pm.src: Raise an parse error for any disallowed
1130            character (HTML5 revision 1263).
1131    
1132    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1133    
1134            * mkentitylist.pl: Support for new HTML5 entity table format (the
1135            definition for |AElig;| was missing).
1136    
1137    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1138    
1139            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
1140            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
1141    
1142    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1143    
1144            * HTML.pm.src: List of element names that close foreign content
1145            insertion mode is added (HTML5 revisions 1412 and 1418).
1146    
1147    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1148    
1149            * HTML.pm.src: Support for |mglyph| and |malignmark|
1150            elements (HTML5 revision 1410).
1151    
1152    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1153    
1154            * HTML.pm.src: Support for new long MathML entities (HTML5
1155            revision 1406).
1156    
1157    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1158    
1159            * HTML.pm.src: CDATA section support for MathML and SVG
1160            elements (HTML5 revision 1404 and 1420).
1161    
1162    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
1163    
1164            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
1165            elements (HTML5 revision 1404).  Unused !!!macro definitions
1166            are removed.
1167    
1168    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
1169    
1170            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
1171            are raised is changed (HTML5 revision 1404).
1172    
1173    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
1174    
1175            * HTML.pm.src: Category sets that are no longer used are removed.
1176    
1177    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
1178    
1179            * HTML.pm.src: The ->[1] property of stack entries are now
1180            replaced by constants representing element category.
1181    
1182    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
1183    
1184            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
1185            for error reporting.  (This is a preparation for using constant
1186            value for ->[1].)
1187    
1188    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
1189    
1190            * RDFXML.pm: Typo fixed.
1191    
1192    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
1193    
1194            * ContentChecker.pm: |fact_level| is now treated
1195            as same as |must_level|, i.e. level = |m|.
1196            (check_element): Make list of URIs in the DOM.
1197    
1198    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1199    
1200            * RDFXML.pm: Language accessor implemented.
1201            Local (null-namespace) attribute support.
1202    
1203    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1204    
1205            * RDFXML.pm: Factored out ID checking code.
1206    
1207    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1208    
1209            * RDFXML.pm: TODO items noted.  Validation of ID
1210            and URI attributes is implemented.  Warn if unknown
1211            value is used in rdf:parseType="" attribute.
1212    
1213            * URIChecker.pm (check_rdf_uri_reference): New function.
1214    
1215    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1216    
1217            * RDFXML.pm: bnodeid implemented.  Relative references
1218            are now resolved.
1219    
1220    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1221    
1222            * ContentChecker.pm: RDF reification implemented.
1223    
1224            * RDFXML.pm: undef vs false bug fixed.
1225            Reification implemented.
1226    
1227    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1228    
1229            * RDFXML.pm: s/id/ID/ for attribute name.
1230            The |node| arguments are added for |ontriple| calls.
1231            Too many "attribute not allowed" errors were raised.
1232    
1233            * ContentChecker.pm: Initial experimental support for rdf:RDF
1234            element.
1235    
1236    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1237    
1238            * RDFXML.pm: New module.
1239    
1240    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1241    
1242            * HTML.pm.src (set_inner_html): Line/column number
1243            code was old one yet.
1244    
1245    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1246    
1247            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
1248            errors.  Line/column markings for DOCTYPE, comment, and
1249            character tokens are reintroduced; otherwise,
1250            error location for "not HTML5" error and errors
1251            for implied elements are not attached.
1252    
1253    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1254    
1255            * HTML.pm.src: Set line/column numbers to attributes.
1256    
1257            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
1258            (value): Setter implemented.
1259    
1260            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
1261    
1262    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1263    
1264            * HTML.pm.src: Unused line/column markings are removed.
1265    
1266    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1267    
1268            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
1269            flag, which is no longer used.
1270    
1271    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
1272    
1273            * HTML.pm.src: Set line/column information to element nodes.
1274    
1275            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
1276            and !!!insert-element-t): Set line/column information to
1277            element nodes.
1278    
1279    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
1280    
1281            * HTML.pm.src (_get_next_token): The first "<" character
1282            in "<?", "<>", or "</>" should be the error point.
1283    
1284    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1285    
1286            * HTML.pm.src: Some more fixes on error position reporting.
1287    
1288    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1289    
1290            * HTML.pm.src: Token-level precious error reporting.
1291    
1292    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1293    
1294            * HTML.pm.src: Preparation for more precious error point
1295            reporting.
1296    
1297    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1298    
1299            * HTML.pm.src: Error type revised.
1300    
1301    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1302    
1303            * HTML.pm.src: Similar codes are merged together, again.
1304    
1305    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1306    
1307            * HTML.pm.src: Similar codes are merged together.
1308    
1309    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
1310    
1311            * mkhtmlparser.pl: Set "level" parameter to parse errors.
1312    
1313            * HTML.pm.src: Code refined.
1314    
1315    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1316    
1317            * HTML.pm.src: |</body>| treatement has been changed (HTML5
1318            revision 1348).  Note that I really don't know this makes
1319            any difference in the black-box behavior of the parser.
1320    
1321    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1322    
1323            * HTML.pm.src: New end-of-file token implementation (HTML5
1324            revision 1348).
1325    
1326    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1327    
1328            * HTML.pm.src: |applet| support (HTML5 revision 1347).
1329    
1330    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1331    
1332            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1333    
1334    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1335    
1336            * HTML.pm.src: Support for |<input>| in the "in select" insertion
1337            mode and support for the "in select in table" insertion mode (HTML5
1338            revision 1342).
1339    
1340    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1341    
1342            * HTML.pm.src: No foster parenting for <script> and <script>
1343            in non-tainted <table>s (HTML5 revision 1336).
1344    
1345    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1346    
1347            * HTML.pm.src: Ignore white space characters between <html>
1348            and <head> (HTML5 revision 1332).
1349    
1350    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1351    
1352            * HTML.pm.src: Treat <input type=hidden> as if it were a
1353            white space (HTML5 revision 1331).
1354    
1355    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1356    
1357            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1358            element (HTML5 revision 1330).
1359    
1360    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1361    
1362            * HTML.pm.src: <title> is always appended to the current
1363            element (HTML5 revision 1328).
1364    
1365    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1366    
1367            * HTML.pm.src: White space in tainted tables are moved
1368            into foster parents (HTML5 revision 1326).
1369    
1370    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1371    
1372            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1373            revision 1321).
1374    
1375    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1376    
1377            * HTML.pm.src: |</h/n/>| case code rearranged to align with
1378            the spec (HTML5 revision 1320).  Note that we finally complete
1379            all of HTML5 revision 1320 changes.
1380    
1381    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1382    
1383            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1384            tags (HTML5 revision 1320).
1385    
1386    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1387    
1388            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1389    
1390    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1391    
1392            * HTML.pm.src: A "generate implied end tags" code (t409.1)
1393            could not be reached so that it is now removed (HTML5 revision
1394            1320).
1395    
1396    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1397    
1398            * HTML.pm.src: Code for the case of |</div>| and so on are revised
1399            to align with new spec text (HTML5 revision 1320).
1400    
1401    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1402    
1403            * HTML.pm.src: Remove strange |if| condition; however, it should
1404            have had no harm in theory.
1405    
1406    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1407    
1408            * HTML.pm.src (_tree_construction_main): '</p>' in body
1409            case is split from other end tags for the preparation
1410            of implementing HTML revision 1320.
1411    
1412    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1413    
1414            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1415            revision 1320).
1416    
1417    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1418    
1419            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1420            and "div" (HTML5 revision 1318).  Add comments to where
1421            |form| pointer association codes should be inserted (HTML5 revision
1422            1319).
1423    
1424    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
1425    
1426            * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1427    
1428    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1429    
1430            * HTML.pm.src: Since the case t268 should never be reached (no
1431            other token type, there are), it is replaced by a |die| statement.
1432    
1433    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1434    
1435            * HTML.pm.src: Typo fixed.
1436    
1437    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
1438    
1439            * HTML.pm.src (_tree_construction_initial): Some limited quirks
1440            doctypes were not uppercased for comparison.
1441    
1442    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1443    
1444            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1445            are added.
1446    
1447    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1448    
1449            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1450            are set.  Cases that are unlikely reached are noted as so.
1451    
1452    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1453    
1454            * HTML.pm.src: Checkpoints for debugging are added.
1455    
1456            * mkhtmlparser.pl: Support for |!!!cp| syntax.
1457    
1458    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1459    
1460            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1461            for simplicity.
1462    
1463    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1464    
1465            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1466            revision 1307).
1467    
1468    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1469    
1470            * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1471            is set are changed (HTML5 revision 1305).
1472    
1473    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1474    
1475            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1476            1292).  Entities are not parsed in comment-like part in RCDATA
1477            elements (HTML5 revision 1294).  Allow bare & at the end
1478            of attribute value literals (HTML5 revision 1296).  More
1479            quirks mode doctypes (HTML5 revision 1302).  Requires spaces
1480            between attributes and ban attribute names or unquoted
1481            attribute values containing single or double quotes (HTML5
1482            revision 1303).
1483    
1484    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1485    
1486            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
1487            and related errors unless it is an HTML document (though the spec
1488            is unclear on whether it is applied to XHTML document).
1489    
1490            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1491            are added.
1492    
1493    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1494    
1495            * HTML.pm.src: s/local_name/manakai_local_name/g.
1496    
1497    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
1498    
1499            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1500    
1501            * HTML.pm.src: |charset| in |content| attribute is
1502            case-insensitive (HTML5 revision 1270).
1503    
1504    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
1505    
1506            * ContentChecker.pm: New status constants are added.
1507            ($ElementDefault): |status| added.
1508            (check_element): Err for non-standard or deprecated elements.
1509            (_attr_status_info): For non-standard or deprecated attributes.
1510    
1511    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1512    
1513            * ContentChecker.pm (_attr_status_info): New internal method.
1514    
1515    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1516    
1517            * ContentChecker.pm (check_element): Element standardized
1518            status information is now dispatched.
1519    
1520    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1521    
1522            * ContentChecker.pm (check_element): Fix |del|-and-significant
1523            problem by adding some more arguments.
1524    
1525    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1526    
1527            * ContentChecker.pm (check_element): Use context of
1528            container-for-the-purpose-of-content-model element (not transparent
1529            element) for |check_child_element| calling and significant
1530            text flag marking.  This reintroduces |<del>|-and-significant
1531            problem again.
1532    
1533    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1534    
1535            * ContentChecker.pm (check_element): Make semi-transparent
1536            elements ignored for the purpose of phase changes in
1537            content model checking.
1538    
1539    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1540    
1541            * ContentChecker.pm (check_element): In-element state
1542            was not properly managed for transparent cases.
1543    
1544    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1545    
1546            * ContentChecker.pm (check_element): Support for |video|
1547            and |audio| as semi-transparent elements.
1548    
1549    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1550    
1551            * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1552            (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1553            Support for |html:object| as a semi-transparent element.
1554    
1555    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1556    
1557            * ContentChecker.pm (check_element): The way to traverse
1558            the tree is entirely revised to make it easier to track
1559            the state of ancestors/descendants.  As a result of this
1560            revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1561            support for content model checking for HTML elements |figure|,
1562            |object|, |video|, and |audio| and checking for XML elements (and
1563            some XMLNS checkings) are dropped for now.  They will be
1564            reimplemented in due cource.
1565    
1566    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1567    
1568            * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1569            token (HTML5 revision 1225).
1570    
1571    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1572    
1573            * ContentChecker.pm ({unsupported_level}): New value.
1574    
1575            * HTML.pm.src: Save whether |meta| |content| attribute
1576            contains character references or not.
1577    
1578    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1579    
1580            * ContentChecker.pm (_get_children): (Incompleted) attempt to
1581            imlement significant content checking for contents
1582            with |del| elements.
1583    
1584    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1585    
1586            * ContenteChecker.pm ($HTMLTransparentElements): More
1587            elements are added.
1588            (_get_children): HTML |object| elements are now semi-transparent.
1589    
1590            * NanoDOM.pm (manakai_html, manakai_head): New methods.
1591    
1592    2008-02-16  Wakaba  <wakaba@suika.fam.cx>
1593    
1594            * CacheManifest.pm: HTML5 revision 1211 implemented.
1595    
1596            * CacheManifest.pod: Updated.
1597    
1598    2008-02-10  Wakaba  <wakaba@suika.fam.cx>
1599    
1600            * ContentChecker.pm (check_document, check_element): Support
1601            for second argument ($onsubdoc).
1602            (_get_css_parser): Removed (now it is part of WDCC).
1603    
1604    2008-02-09  Wakaba  <wakaba@suika.fam.cx>
1605    
1606            * ContentChecker.pm (_get_css_parser): New.
1607    
1608    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1609    
1610            * ContentChecker.pm ($AnyChecker): Old way to add child elements
1611            for checking had been used.
1612    
1613    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1614    
1615            * ContentChecker.pm (check_element): New todo item type |descendant|.
1616    
1617    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1618    
1619            * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1620            specced in RFC 2046 and RFC 4288.
1621            (application/atom+xml): Definition added.
1622    
1623    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1624    
1625            * URIChecker.pm: Make RFC 3986 should-level errors
1626            warnings (rather than SHOULD-level errors).
1627    
1628    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1629    
1630            * NanoDOM.pm (get_user_data, set_user_data): New methods.
1631    
1632            * HTML.pm.src: A flag for character references in attribute
1633            values are added.  Set |manakai_has_reference| user data
1634            to |charset| attribute.
1635    
1636    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1637    
1638            * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1639            attributes.
1640    
1641            * ContentChecker.pm (check_document): Warn if charset requirements
1642            cannot be tested.
1643    
1644    2007-11-19  Wakaba  <wakaba@suika.fam.cx>
1645    
1646            * HTML.pm.src (parse_byte_string): Detect charset
1647            by universalchardet if charset parameter is not specified.
1648    
1649            * Makefile (Charset-all, Charset-clean): New rules.
1650    
1651    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1652    
1653            * ContentChecker.pm (check_document): Check the existence
1654            of character encoding declaration and selection of encoding
1655            for HTML document.
1656    
1657    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1658    
1659            * ContentType.pm (get_sniffed_type): Return also the official
1660            type in list context.
1661    
1662    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1663    
1664            * ContentType.pm: Sniffing with leading white space
1665            ignoring (HTML5 revisions 1013 and 1016).
1666    
1667    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1668    
1669            * ContentType.pm: HTML5 revision 1013 changes, except for leading
1670            white spaces, are implemented.
1671    
1672    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1673    
1674            * HTML.pm.src (parse_char_string): Set |inner_encoding|
1675            attribute if possible.
1676    
1677    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1678    
1679            * HTML.pm.src (parse_byte_string): New method.
1680            (parse_char_string): New alias for |parse_string|.
1681            (main phase): Invoking "change the encoding" algorithm if desired.
1682    
1683            * HTML.pod: Updated.
1684    
1685    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1686    
1687            * HTML.pod (get_inner_html): Removed.
1688    
1689            * Makefile (HTML-all, HTML-clean): New.
1690    
1691    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1692    
1693            * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1694    
1695    2007-11-08  Wakaba  <wakaba@suika.fam.cx>
1696    
1697            * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1698    
1699    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1700    
1701            * Makefile: |CacheManifest.html| is added.
1702    
1703            * CacheManifest.pod: New file.
1704    
1705    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1706    
1707            * CacheManifest.pm: New module.
1708    
1709    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1710    
1711            * HTML.pm.src: Support for application cache selection algorithm
1712            callback.
1713    
1714    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1715    
1716            * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1717    
1718    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1719    
1720            * Makefile (clean): New rule.
1721    
1722            * NanoDOM.pm (public_id, system_id): New attributes.
1723    
1724    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1725    
1726            * Makefile (CSS-all, CSS-clean, clean): New rules.
1727    
1728    2007-10-14  Wakaba  <wakaba@suika.fam.cx>
1729    
1730            * ContentChecker.pm (check_document): Support for
1731            new |is_xml_root| flag.
1732            (check_element): Support for new |pluses| state.
1733            (_add_pluses): New method.
1734            (_remove_minuses): Support for new |minus| item.
1735    
1736    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1737    
1738            * ContentChecker.pm: Raise specific error for invalid
1739            root element.
1740    
1741    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1742    
1743            * ContentChecker.pm: Set level values for later uses.
1744    
1745    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1746    
1747            * ContentChecker.pm: Support for language tag validation.
1748    
1749    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1750    
1751            * LangTag.pm (check_rfc3066_language_tag): New method.
1752    
1753    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1754    
1755            * LangTag.pm: New module.
1756    
1757    2007-09-04  Wakaba  <wakaba@suika.fam.cx>
1758    
1759            * HTML.pm.src: Some error types were wrong.
1760    
1761    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1762    
1763            * CSS/: New directory.
1764    
1765    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1766    
1767            * ContentChecker.pm (_check_get_children): Support
1768            for |noscript| in |head|.
1769    
1770    2007-08-12  Wakaba  <wakaba@suika.fam.cx>
1771    
1772            * URI/: New directory.
1773    
1774    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1775    
1776            * HTML.pm.src: Tokenizer's states are now represented in
1777            number.
1778    
1779    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1780    
1781            * HTML.pm.src: |or|s for insertion modes are replaced
1782            by |&|s.
1783    
1784    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1785    
1786            * HTML.pm.src: Token types are now represented in number.
1787    
1788    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1789    
1790            * ContentType.pm (SEE ALSO): Updated.
1791    
1792            * HTML.pm.src: Insertion modes are now represented in number.
1793    
1794    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1795    
1796            * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1797            999) is implemented.
1798    
1799    2007-08-08  Wakaba  <wakaba@suika.fam.cx>
1800    
1801            * ContentType.pm: Sniffing for |<script| (HTML5 revision
1802            983) is implemented.
1803    
1804    2007-08-06  Wakaba  <wakaba@suika.fam.cx>
1805    
1806            * ContentChecker.pod: New documentation.
1807    
1808            * Makefile: A rule for |ContentChecker.html| is added.
1809    
1810            * ContentChecker.pm: A pod "LICENSE" section is added.
1811    
1812            * NanoDOM.pm ($VERSION): New variable.
1813    
1814    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1815    
1816            * H2H.pm: |b|, |i|, and |sub| are added to the
1817            list of allowed HTML elements.
1818    
1819    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1820    
1821            * H2H.pm: |samp| is added to the list of allowed
1822            HTML elements.
1823    
1824            * URIChecker.pm (check_iri): New.
1825            (check_iri_reference): Error type for IRI reference
1826            syntax error is changed.
1827    
1828    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1829    
1830            * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1831            is added.
1832            (check_document): Load appropriate module before validation.
1833    
1834    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1835    
1836            * ContentChecker/: New directory.
1837    
1838    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1839    
1840            * ContentChecker.pm: HTML |time| element is implemented.
1841    
1842            * HTMLTable.pm: Comments are updated as HTML5 is revised.
1843    
1844    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1845    
1846            * ContentChecker.pm (check_document): Return value
1847            even if no document element is found.
1848    
1849    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1850    
1851            * HTML.pm.src: |$in_body| is no longer a function.
1852    
1853    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1854    
1855            * HTML.pm.src: The |$in_body| code has been moved down.
1856    
1857    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1858    
1859            * HTML.pm.src: The "trailing end" insertion mode
1860            is split into "after html body" and "after html frameset"
1861            insertion modes.  Their codes are merged with "after body"
1862            and "after frameset" codes.  |$previous_insertion_mode|
1863            has been removed.  "after frameset" code is
1864            merged with "in frameset" code.
1865    
1866    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1867    
1868            * HTML.pm.src: The "before head" insertion mode is
1869            merged with the "in head" insertion mode.
1870    
1871    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1872    
1873            * HTML.pm.src: Cases in "in head" insertion mode are
1874            reorganized.
1875    
1876    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1877    
1878            * HTML.pm.src: Some cases in "in table" insertion mode
1879            are merged.
1880    
1881    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1882    
1883            * HTML.pm.src: The "in row" insertion mode is merged
1884            with "in table" insertion mode.
1885    
1886    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1887    
1888            * HTML.pm.src: The "in table" and "in table body" insertion
1889            modes are merged.
1890    
1891    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1892    
1893            * HTML.pm.src: There is no "in table head" or "in table foot"
1894            insertion mode!
1895    
1896    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1897    
1898            * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1899            now directly invoke the handler.
1900    
1901    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1902    
1903            * HTML.pm.src: Codes for "in cell" insertion mode
1904            is merged to the "in body" insertion mode code.
1905    
1906    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1907    
1908            * HTML.pm.src: Codes for "in body" and "in caption"
1909            insertion modes are merged.
1910    
1911    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1912    
1913            * HTML.pm.src: Two |!!!next-token|s were missing.
1914    
1915    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1916    
1917            * HTML.pm.src: Use numeric constant for |{content_mode}|
1918            instead of string constant for |{content_model_flag}|.
1919    
1920    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1921    
1922            * HTML.pm.src: Add the name of the attribute
1923            to the "duplicate attribute" error.
1924    
1925    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
1926    
1927            * ContentChecker.pm: Return the |class| node list.
1928    
1929    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
1930    
1931            * ContentChecker.pm: Return the |id| node list.
1932    
1933            * HTML.pm.src: A typo is fixed.
1934    
1935    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1936    
1937            * ContentChecker.pm: Drop wrong |level => 'error'| specification
1938            from "in HTML:xml:lang" error.  Character position
1939            is now the last part of the error type in the URI error
1940            description.  Report "unsupported" status for language
1941            tags, media queries, script codes, and style sheets.
1942    
1943    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1944    
1945            * ContentChecker.pm: Report error if |xml:lang|
1946            in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1947            in XML.
1948    
1949            * NanoDOM.pm (Attr.owner_document): New attribute.
1950    
1951    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1952    
1953            * HTML.pm.src: The character immediately following
1954            a bare |hcro| was discarded.  Fix handling of
1955            entity references in attribute values.
1956    
1957    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1958    
1959            * HTML.pm.src (main and trailin end phases): Token
1960            types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1961            factored out.  Error types |in html:#DOCTYPE|
1962            and |after html:#DOCTYPE| are merged
1963            into |DOCTYPE in the middle|.  |</frameset>|
1964            in fragment parsing mode changed the
1965            insertion mode.
1966    
1967    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1968    
1969            * HTML.pm.src: |$phase| has been removed; The |trailing end|
1970            phase is now an insertion mode.  Treatments for white
1971            space character tokens were incorrect for some
1972            insertion modes.  An old |meta| case was not removed.
1973    
1974    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1975    
1976            * HTML.pm.src: |meta| charset declaration extraction
1977            implemented (but changing the encoding is not yet:-).
1978    
1979    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1980    
1981            * Charset/: New directory.
1982    
1983    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1984    
1985            * H2H.pm: New Perl module (created from
1986            manakai's H2H.dis).
1987    
1988    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1989    
1990            * XMLSerializer.pm: New Perl module (created from
1991            manakai's SimpleLS.dis).
1992    
1993    2007-07-07  Wakaba  <wakaba@suika.fam.cx>
1994    
1995            * HTML.pm.src: |<!---x-->| was not processed correctly.
1996    
1997    2007-07-01  Wakaba  <wakaba@suika.fam.cx>
1998    
1999            * HTML.pm.src: Report correct error message
2000            for |<body></div></body>|.
2001    
2002    2007-07-01  Wakaba  <wakaba@suika.fam.cx>
2003    
2004            * HTMLTable.pm: An error description was incorrect.
2005    
2006    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
2007    
2008            * ContentChecker.pm: Return |{term}| list.
2009    
2010    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
2011    
2012            * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
2013            nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
2014            and <title> outside of head).
2015    
2016    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
2017    
2018            * IMTChecker.pm: Report warning for unregistered
2019            and private types/subtypes.
2020    
2021            * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
2022            URIChecker.pm, HTMLTable.pm: Error messages are now
2023            consistent; they are all listed in
2024            <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
2025    
2026    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
2027    
2028            * ContentChecker.pm: |<img ismap>| not in |<a></a>|
2029            is now erred.  |<datalist>| is implemented.
2030            Attribute checker for |<command>| and |<menu>| are
2031            added.  Support for |contextmenu| global attribute
2032            is added.
2033    
2034    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
2035    
2036            * HTML.pm.src (_reset_insertion_mode): Interpretation
2037            of Step 3 has been changed.
2038    
2039    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
2040    
2041            * HTML.pm.src: Late |<html>| parse error is implemented.
2042    
2043    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
2044    
2045            * URIChecker.pm (check_iri_reference): A |decode| method name was
2046            incorrect.
2047    
2048            * ContentChecker.pm: Support for the |footer| element.
2049            Check URI syntax for space-separated URI attributes.
2050            Support for the |tabindex| attribute.  Support
2051            for |datetime| attribute.
2052    
2053    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
2054    
2055            * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
2056            character references).  HTML5 revision 1.146 (white space
2057            characters before root start tag).  HTML5 revision
2058            1.148 (named character references in attribute values).
2059            HTML5 revision 1.152 (<plaintext>.innerHTML get).
2060    
2061    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
2062    
2063            * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
2064            in <head>).
2065    
2066  2007-06-24  Wakaba  <wakaba@suika.fam.cx>  2007-06-24  Wakaba  <wakaba@suika.fam.cx>
2067    
2068          * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>          * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>

Legend:
Removed from v.1.48  
changed lines
  Added in v.1.402

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24