Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.78 by wakaba, Sat Jul 21 08:52:59 2007 UTC revision 1.380 by wakaba, Sun Aug 16 03:41:52 2009 UTC
# Line 1  Line 1 
1    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src (_tree_construction_main): Improved error reporting
4            on "select" start tag in "in select" and "in select in table"
5            parsing modes (cf. HTML5 revision 3604).
6    
7    2009-07-26  Wakaba  <wakaba@suika.fam.cx>
8    
9            * mkhtmlclass.pl: Made xmlns:xlink with wrong namespace URL
10            invalid (HTML5 revision 2701).
11    
12    2009-07-26  Wakaba  <wakaba@suika.fam.cx>
13    
14            * HTML.pm.src: </rt>, </rp>, </option>, </optgroup> become
15            optional before </body> (HTML5 revision 2806).
16    
17    2009-07-25  Wakaba  <wakaba@suika.fam.cx>
18    
19            * HTML.pm.src: Marked where SVG |script| execution should be
20            implemented (HTML5 revision 2904).
21    
22    2009-07-25  Wakaba  <wakaba@suika.fam.cx>
23    
24            * HTML.pm.src (set_inner_html): The fragment parsing algorithm
25            inherits the document quirksness flag of the main document (HTML5
26            revision 2935).
27    
28    2009-07-05  Wakaba  <wakaba@suika.fam.cx>
29    
30            * HTML.pm.src (_tree_construction_main): Put |table| in |p| in
31            quirks mode (The Hixie Quirk, HTML5 revision 3132).
32    
33    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
34    
35            * HTML.pm.src: <keygen> breaks out of <select> (HTML5 revision
36            3163).
37    
38    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
39    
40            * HTML.pm.src: Don't change encoding to UTF-8 if the input
41            encoding is UTF-16 (HTML5 revision 3204).
42    
43    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
44    
45            * HTML.pm.src (parse_byte_stream): Don't change to another
46            encoding if the input is UTF-16 (HTML5 revision 3205).
47    
48    2009-06-28  Wakaba  <wakaba@suika.fam.cx>
49    
50            * HTML.pm.src: |isindex| should no longer generate |p| element
51            node (HTML5 revision 3236).
52    
53    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
54    
55            * ContentChecker.pm: Introduced new flag |has_autofocus|.  Added
56            support for new |id_type| value of |any|, which means any kind of
57            element is allowed as a referenced element.
58    
59    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
60    
61            * ContentChecker.pm: Introduced |check_attrs2| method to ease
62            defining code that checks required attributes and the like.
63    
64    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
65    
66            * IMTChecker.pm: Added more definitions for subtypes.
67    
68    2008-12-11  Wakaba  <wakaba@suika.fam.cx>
69    
70            * URIChecker.pm: Some of |pos_end| values were wrong.
71    
72    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
73    
74            * ContentChecker.pm (check_element): Added support for "no
75            referenced datalist" error.
76    
77    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
78    
79            * URIChecker.pm: Bug fix: It did not work unless
80            Message::DOM::DOMImplementation had been |require|d.
81    
82    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
83    
84            * NanoDOM.pm (document_uri): New attribute.
85    
86            * ContentChecker.pm: Don't use methods not implemented by NanoDOM.
87    
88    2008-11-07  Wakaba  <wakaba@suika.fam.cx>
89    
90            * NanoDOM.pm (text_content): Don't create a Text node if the new
91            value is empty.
92    
93    2008-11-06  Wakaba  <wakaba@suika.fam.cx>
94    
95            * SWML/: New directory.
96    
97    2008-10-20  Wakaba  <wakaba@suika.fam.cx>
98    
99            * NanoDOM.pm (specified, all_declarations_processed,
100            manakai_attribute_type): New attributes.
101    
102    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
103    
104            * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
105            array.
106    
107    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
108    
109            * NanoDOM.pm (notation_name): New attribute.
110    
111    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
112    
113            * NanoDOM.pm (public_id, system_id): New attributes.
114    
115    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
116    
117            * NanoDOM.pm (text_content): Moved to Node from Element.  Setter
118            implemented.
119            (allowed_tokens, default_type, declared_type): Implemented.
120    
121    2008-10-17  Wakaba  <wakaba@suika.fam.cx>
122    
123            * NanoDOM.pm (node_name): New attribute.
124            (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
125            constants.
126            (create_element_type_definition_node, create_attribute_definition,
127            create_notation, create_general_entity,
128            get_element_type_definition_node,
129            set_element_type_definition_node, get_general_entity_node,
130            set_general_entity_node, get_notation_node, set_notation_node,
131            get_attribute_definition_node, set_attribute_definition_node): New
132            methods.
133            (element_types, entities, notations, attribute_definitions): New
134            attributes.
135            (DocumentType): Support for child nodes, entities, notations, and
136            element types.
137            (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
138            New classes.
139    
140            * Dumper.pm: Support for general entities, notations, element type
141            definitions, and attribute definitions.
142    
143    2008-10-15  Wakaba  <wakaba@suika.fam.cx>
144    
145            * NanoDOM.pm (create_processing_instruction): New method.
146            (xml_version, xml_encoding, xml_standalone): New attributes.
147            (ProcessingInstruction): New class.
148    
149    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
150    
151            * HTML.pm.src: Handling of end tags in the foreign content
152            insertion mode was partially wrong, because of wrong bit
153            operations.
154    
155    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
156    
157            * NanoDOM.pm (dom_config): New attribute (do nothing), for
158            Whatpm::XML::Parser support.
159    
160    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
161    
162            * Makefile: Added new rule to make HTML/Tokenizer.pm.
163    
164            * HTML.pm.src: Tokenizer part moved to another file.
165    
166    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
167    
168            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
169    
170    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
171    
172            * HTML.pm.src: Element category constants redefined.
173            
174    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
175    
176            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
177            construction stage synced with the spec (HTML5 revisions 2139 and
178            2302).
179    
180    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
181    
182            * ContentChecker.pm: New error level "html5_fact" added, which
183            should be tentatively used until all requirements are properly
184            specced as RFC 2119 "MUST" in HTML5.
185    
186    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
187    
188            * ContentChecker.pod: Note on internal flags is added.
189    
190    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
191    
192            * HTML.pm.src: An AAA bug fixed.
193    
194    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
195    
196            * HTML.pm.src: If another node is inserted by the parser, don't
197            reuse existing Text node to append a character (HTML5 revision
198            2124).
199    
200    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
201    
202            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
203            revisions 1731 and 2128).
204    
205    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
206    
207            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
208            Support for end tags of camelCase SVG elements was broken.  A
209            wrong error type text fixed.
210    
211    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
212    
213            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
214    
215    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
216    
217            * HTML.pm.src: Support for new definition of |param| and |source|
218            start tag parsing (HTML5 revision 1731).
219    
220    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
221    
222            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
223    
224    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
225    
226            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
227            revisions 1731 and 1831).
228    
229    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
230    
231            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
232            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
233            revision 1731).
234    
235    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
236    
237            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
238            revision 1731).  End tags of |option| and |optgroup| elements are
239            now optional (HTML5 revision 1731).
240    
241    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
242    
243            * HTML.pm.src: New "special" elements added to the list (HTML5
244            revision 1778).  "strile" -> "strike".
245    
246    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
247    
248            * ContentType.pm (get_sniffed_type): Support for the "better"
249            content sniffing (HTML5 revision 1927).  In some cases the official
250            type was not returned when the method was invoked in list
251            context.
252    
253    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
254    
255            * HTML.pm.src: Character references for non-space C0 characters,
256            including U+000B VT, DEL character, noncharacter code points, are
257            now converted to the U+FFFD character (cf. HTML5 revision 2138).
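
The replacement rule in the entry above can be sketched as follows. This is only an illustration in Python, not the Perl code in HTML.pm.src; the names `resolve_char_ref` and `is_noncharacter` are hypothetical, and the sketch covers only the categories the entry names (non-space C0 characters, DEL, and noncharacter code points).

```python
REPLACEMENT = 0xFFFD

# C0 characters that still count as space characters.
# U+000B VT is deliberately absent, as the entry describes.
C0_SPACE = {0x09, 0x0A, 0x0C, 0x0D}


def is_noncharacter(cp: int) -> bool:
    """True for U+FDD0..U+FDEF and for U+nFFFE / U+nFFFF in any plane."""
    return 0xFDD0 <= cp <= 0xFDEF or (cp & 0xFFFE) == 0xFFFE


def resolve_char_ref(cp: int) -> int:
    """Map the code point of a numeric character reference to the code
    point that is actually emitted (hypothetical helper)."""
    if cp < 0x20 and cp not in C0_SPACE:
        return REPLACEMENT  # non-space C0 character, including U+000B
    if cp == 0x7F:
        return REPLACEMENT  # DEL
    if is_noncharacter(cp):
        return REPLACEMENT
    return cp
```

Under these assumptions, `&#x0B;` and `&#xFFFE;` both yield U+FFFD, while `&#x0A;` and `&#x41;` pass through unchanged.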
258    
259    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
260    
261            * ContentChecker.pm: |form=""| check support added.
262    
263    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
264    
265            * ContentChecker.pm: |contextmenu| validness is now checked using
266            |id| and |id_type| properties, and |menu| property is removed.
267    
268    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
269    
270            * ContentChecker.pm: Prepare for |form| |name| attribute's
271            duplication checking.
272    
273    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
274    
275            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
276            unsupported charset="" parameter value (HTML5 revision 2131).
277    
278    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
279    
280            * HTML.pm.src: Remaining places where U+000B was allowed as a space
281            character are fixed (cf. HTML5 revision 1738).
282    
283            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
284            space characters (HTML5 revision 1738).
285    
286    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
287    
288            * HTML.pm.src: The "anything else" case for the "after after body"
289            insertion mode was not updated to switch to the "in body"
290            insertion mode.  U+000B is no longer a space character for the
291            purpose of tree construction phase (HTML5 revision 1738).
292    
293    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
294    
295            * HTML.pm.src: U+000B is no longer a space character (HTML5
296            revision 1738).
297    
298    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
299    
300            * ContentType.pm: 0x0B is no longer a space character (HTML5
301            revision 1738).
302    
303            * HTML.pm.src: U+000B is no longer a space character for the
304            algorithm for extracting an encoding from a Content-Type (HTML5
305            revision 1738).
306    
307    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
308    
309            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
310    
311    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
312    
313            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
314    
315            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
316            matching.
317    
318    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
319    
320            * LangTag.pm: Warn for private use language subtags.  Error level
321            typos fixed.  Support for Suppress-Script field.
322    
323            * mklangreg.pl: Support for dumping of nested structure.
324    
325    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
326    
327            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
328            recommended case as per RFC 4646.
329    
330    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
331    
332            * LangTag.pm (check_rfc4646_langtag): New method.
333    
334    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
335    
336            * mklangreg.pl: New script.
337    
338            * Makefile: Updated for creation of the module for language subtag
339            registry.
340            
341    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
342    
343            * Makefile: WebIDL.html added.
344    
345            * WebIDL.pod: New documentation.
346    
347    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
348    
349            * WebIDL.pm: Checker's error types are redefined.
350    
351    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
352    
353            * WebIDL.pm: Parser's error types are redefined.  Some forward
354            compatible parsing bugs are fixed.  Some unreachable code is
355            commented out.
356    
357    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
358    
359            * WebIDL.pm: Support for the remaining extended attributes is
360            added.  It does not satisfy the definition that a forward
361            interface declaration has an extended attribute.  It seems that
362            unless explicitly allowed multiple extended attributes with the
363            same name are not allowed, though it is not explicitly mentioned in
364            the spec.
365    
366    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
367    
368            * WebIDL.pm: Unescapes extended attribute names and extended
369            attribute identifiers.  Preserve whether an extended attribute has
370            an argument list or not.  Support for extended attributes:
371            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
372            NameGetter, NameSetter, and Null.
373            (has_argument_list): New attribute.
374            (idl_text): Stringifies argument lists, if any, even if it is
375            empty.
376    
377    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
378    
379            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
380            |{s_kwd}| in DATA_STATE as default.
381    
382    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
383    
384            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
385            by |{s_kwd}| in DATA_STATE.
386    
387    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
388    
389            * HTML.pm.src: Shorten keys.
390    
391    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
392    
393            * HTML.pm.src: Remove checking for control character, surrogate
394            pair, or noncharacter code points and non-Unicode code
395            points (they should be handled by Whatpm::Charset::UnicodeChecker).
396            (parse_char_stream): Support for the |$get_wrapper| argument and
397            character stream error handlers.
398    
399    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
400    
401            * ContentChecker.pm: Don't call |load_ns_module|
402            for null-namespace elements/attributes.
403    
404            * HTML.pm.src: Factor out $disallowed_control_chars
405            as a hash.
406    
407    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
408    
409            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
410            and |{next_char}| initializations are moved to initialization
411            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
412            with |parse_char_stream|.
413    
414    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
415    
416            * HTML.pm.src (parse_char_stream): Make |set_next_char|
417            invoke |manakai_read_until|, not only |read|, where
418            possible, to decrease the number of |read| method calls.
419    
420            * mkhtmlparser.pl: Related changes to the aforementioned
421            modification.
422    
423    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
424    
425            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
426            now reports character errors.
427    
428    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
429    
430            * HTML.pm.src: White-space-led non-white-space character
431            tokens in the "before head" insertion mode were not
432            correctly handled.
433            (set_inner_html): Reimplemented using CharString decodehandle
434            class.  Support for $get_wrapper argument.  Support
435            for |{read_until}| feature.
436    
437    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
438    
439            * HTML.pm.src: Make a "bare ero" error for unknown
440            entities point to the "&" character.
441    
442    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
443    
444            * HTML.pm.src: It turns out that U+FFFD doesn't have to
445            be added to the list of excluded characters.
446    
447    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
448    
449            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
450            and |column| a higher priority than the ones set by the
451            tokenizer's input handler.
452            ($self->{read_until}): Exclude U+FFFD (but this might
453            not be necessary, since now we do line/column fixup in
454            the character decode handle).
455    
456    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
457    
458            * HTML.pm.src: Use |{read_until}| where possible.
459    
460    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
461    
462            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
463            and |manakai_getc_until| to |manakai_read_until| to
464            reduce the number of string copies.
465    
466    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
467    
468            * HTML.pm.src (parse_char_string): Use newly created
469            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
470            standard feature to |open| a string as a filehandle,
471            since Perl's string filehandle does not seem to support the |ungetc|
472            method correctly.
473            (parse_char_stream): Define |{getc_until}| method.
474            (DATA_STATE): Experimental support for |getc_until| feature.
475    
476    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
477    
478            * HTML.pm.src: Check points added to newly added branches.
479    
480    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
481    
482            * HTML.pm.src: Remove |{char}|, which is no longer used.
483            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
484            and replaced by |{prev_state}|.
485    
486            * mkhtmlparser.pl: Remove |{char}| feature.
487            Remove |!!!back-next-input-character;| macro.
488    
489    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
490    
491            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
492            entity related tokenizer states in favor of new states
493            implementing the consume character reference algorithm.
494    
495    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
496    
497            * HTML.pm.src: "Consume a character reference" algorithm is
498            now implemented as a tokenizer's state, rather than
499            a method, with minimum changes (more changes will
500            be made, in due course).  "Bogus comment state"'s inner
501            loop gets removed.
502    
503    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
504    
505            * HTML.pm.src: Move |PUBLIC| and |SYSTEM| keyword tokenizing
506            into their own tokenizer states.
507    
508    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
509    
510            * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
511            is split into three states.
512    
513    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
514    
515            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
516            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
517            no longer does the tokenizer have to push back next input
518            characters in those states.
519    
520    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
521    
522            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
523            into four states so that no longer does the tokenizer have to push
524            back next input characters in that state.
525    
526    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
527    
528            * HTML.pm.src: Methods now accept an additional parameter, $get_wrapper,
529            which can be used to insert some wrapper between the character
530            stream handle and the tokenizer.  (It is currently not supported
531            for |set_inner_html| for |Element|s).
532    
533    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
534    
535            * HTML.pm.src: Ignore punctuation in charset names.
536    
537    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
538    
539            * ContentChecker.pm: Support for charset-layer error levels.
540    
541            * HTML.pm.src: Don't specify |text| argument for the
542            |chardecode:fallback| error, since it is not the encoding
543            being used alternatively.
544    
545    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
546    
547            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
548    
549    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
550    
551            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
552    
553    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
554    
555            * HTML.pm.src: Bug fix and sync with the spec with regard
556            to after after frameset insertion mode processing (HTML5
557            revision 1909).  Note that the implementation was wrong
558            per the old spec before the r1909 changes.
559    
560    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
561    
562            * HTMLTable.pm: scope=auto algorithm fix synced with the
563            spec (HTML5 revision 2093).
564            ($process_row): Algorithm step numbers synced with the
565            spec (HTML5 revision 2092).
566    
567    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
568    
569            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
570            revision 2094).
571    
572    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
573    
574            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
575    
576    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
577    
578            * HTML.pm.src: '"' and "'" at the end of attribute
579            name (after another attribute) now raise parse error (HTML5
580            revision 2123).  Empty unquoted attribute values are no
581            longer allowed (HTML5 revision 2122).
582    
583    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
584    
585            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
586            revision 2130).
587    
588    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
589    
590            * ContentChecker.pm: |xml:lang| attribute value must be the same
591            as |lang| attribute value for HTML elements (HTML5 revision 2062
592            and so on).
593    
594    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
595    
596            * ContentChecker.pm: Error level definition for |xml_id_error|
597            was missing.
598    
599            * URIChecker.pm: The end of the URL should be marked as the
600            error location for an empty path error.  The position
601            between the userinfo and the port components should be
602            marked as the error location for an empty host error.
603    
604    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
605    
606            * URIChecker.pm: For each error, set parameters indicating where
607            in the value the error occurs.  Report unknown
608            address format error in warning level, since address
609            formats are rarely added.  Path segments starting with "/.."
610            were misinterpreted as a dot-segment.
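
The dot-segment fix in the entry above can be illustrated with a small sketch. It is written in Python rather than the Perl of URIChecker.pm, and both function names are hypothetical. The first function reproduces the kind of prefix test that would cause the reported bug (an assumption about how the bug arose); the second compares whole segments, which is how RFC 3986 defines "." and "..".

```python
def has_dot_segment_buggy(path: str) -> bool:
    """Prefix-style check: wrongly flags any segment starting with '..'."""
    return "/.." in path or "/." in path


def dot_segments(path: str) -> list[str]:
    """Return only the complete path segments equal to '.' or '..'."""
    return [seg for seg in path.split("/") if seg in (".", "..")]
```

For the path `/a/..b/c` the prefix check reports a dot-segment although none exists, while `dot_segments` returns an empty list; for `/a/../b/./c` it returns `['..', '.']`.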
611    
612    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
613    
614            * URIChecker.pm (check_iri_reference): Requires
615            |Message::DOM::DOMImplementation|.
616    
617    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
618    
619            * IMTChecker.pm: Updated for the new error reporting architecture.
620    
621            * ContentChecker.pm: Error levels for IMTs are added.
622    
623    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
624    
625            * H2H.pm (_shift_token): Support for unquoted HTML attribute
626            values.
627    
628    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
629    
630            * CacheManifest.pm: Support for new style of error
631            reports.
632    
633            * HTML.pm.src: Set line=1, column=1 to the document node.
634    
635    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
636    
637            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
638            and URL checkers.  Support for more error levels for bogus
639            language tag and URL "standards".
640    
641            * LangTag.pm, URIChecker.pm: Support for new style error
642            level reporting.
643    
644    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
645    
646            * ContentChecker.pm: Support for RDF/XML error levels.
647    
648            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
649            specifying.  Error types are revised.
650    
651    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
652    
653            * ContentChecker.pm: All error reporting method calls are
654            renewed.
655    
656    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
657    
658            * HTML.pm.src: All error type names and "text" parameters
659            are revised.  Use new style for "level" specification.
660    
661            * mkhtmlparser.pl: Use new style for "level" specification.
662    
663    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
664    
665            * WebIDL.pm (parse_char_string): Simplified error
666            reporting process for broken ignored valuetype definition.
667            (Valuetype idl_text): Support for special "DOMString" name.
668    
669    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
670    
671            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
672            terminal of the ScopedName is "DOMString", so that whether
673            the last part of the scoped name is "DOMString" or "_DOMString"
674            can be determined later.  This is necessary to determine whether
675            a |typedef| definition should be ignored or not.
676            (parse_char_string): Unescape the identifier of
677            exception members.
678            ($resolve): Return undef for builtin types and sequence<T>
679            types (we might not have to do this, however...).
680            (check): Support checking for Exceptions, Valuetypes,
681            and Typedefs.
682            ($serialize_type): Support for "DOMString::::" syntax.
683            (Typedef idl_text): Output Type as "DOMString" if it
684            is really "DOMString" (i.e. its internal representation
685            is "::DOMString::").
686    
687    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
688    
689            * WebIDL.pm ($resolve): New code, based on resolve code
690            for constant types in the |check| method.
691            (check): Support for checking of attributes, operations, and
692            arguments.
693            (Attribute/Operation idl_text): Exception names in getraises,
694            setraises, and raises clauses are serialized by the |$serialize_type|
695            code.
696    
697    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
698    
699            * WebIDL.pm ($integer): Order of alternatives is changed to match
700            hexadecimal numbers (the original pattern, taken from the spec,
701            did not work for hexadecimal numbers, because after the "0" prefix
702            the [0-7]* part matches an empty string and therefore
703            the remaining "x..." part of a "0x..." integer literal is
704            left unmatched).
705            ($get_type): It now returns a string, not an array reference,
706            for regular types and |sequence| types (i.e. it in any case
707            returns a string).
708            ($get_next_token): The second item in the array that represents
709            an integer or float token is now a Perl number value, not the
710            original string representation of the number.
711            (check): Support for const value consistency checking.
712            No extended attribute is defined for constants.
713            (Node subclasses): Use simple strings rather than array references
714            for default data type values.
715            ($serialize_type): Type values are now simple strings.
716            (value): If the new attribute value is a false value, then
717            a FALSE value is set to the attribute.
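
The alternation-order problem described for |$integer| in the entry above can be reproduced with a short sketch. It uses Python's `re` module, which tries alternatives left to right in the same way as Perl's regex engine. Both patterns are simplified assumptions modelled on the entry's description, not the actual patterns in WebIDL.pm.

```python
import re

# Octal branch first: after "0", [0-7]* matches the empty string,
# so the match stops before the "x" of a "0x..." literal.
OCTAL_FIRST = re.compile(r"-?(?:0(?:[0-7]*|[Xx][0-9A-Fa-f]+)|[1-9][0-9]*)")

# Hexadecimal branch first: "0x..." is tried before the octal branch.
HEX_FIRST = re.compile(r"-?(?:0(?:[Xx][0-9A-Fa-f]+|[0-7]*)|[1-9][0-9]*)")


def lex_integer(pattern: re.Pattern, text: str) -> str:
    """Return the integer token matched at the start of text
    (hypothetical helper; returns '' if nothing matches)."""
    m = pattern.match(text)
    return m.group(0) if m else ""
```

With these patterns, `lex_integer(OCTAL_FIRST, "0x1F")` yields only `"0"`, leaving `"x1F"` unconsumed, whereas `lex_integer(HEX_FIRST, "0x1F")` yields `"0x1F"`. Octal and decimal literals such as `"017"` and `"42"` are matched in full by both orderings.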
718    
719    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
720    
721            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
722            in its stringified format ("scoped name" as defined in the
723            spec).  Note that future version of this module should not use
724            array references for type values and the |type_text| attribute
725            should be made obsolete.
726            (parse_char_string): Unescape attribute names.
727            (check): Support for checking of whether inherited interfaces
728            are actually defined or not.  Support for checking of whether
729            interface member identifiers are duplicated or not.
730            ($serialize_type): Scoped names are returned as is.  A future
731            version of this code should escape identifiers other than "DOMString",
732            otherwise the idl_text would be non-conforming.
733    
734    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
735    
736            * WebIDL.pm (parse_char_string): Set line/column numbers
737            to generated nodes.  Unescape identifiers.  Extended attributes
738            for Definition's were ignored.
739            (append_child): Set |parent_node| attribute.
740            (parent_node): New attribute.
741            (check): Support interface/exception members.  Support
742            extended attributes.  Support definition identifier uniqueness
743            constraint.
744            (qualified_name): New attribute.
745            (Interface/Exception idl_text): Extended attributes were
746            not prepended to the returned text.
747    
748    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
749    
750            * WebIDL.pm (parse_char_string): Set line/column numbers
751            to interface object experimentally.  s/shift/pop/g, shift
752            would make things wrong.  Support for interface forward
753            declarations was missing.  Broken interface declarations
754            with no block were not ignored entirely.
755            (Whatpm::WebIDL::Node): New abstract class.  This class
756            makes things easier.
757            (child_nodes): New attribute.  Unlike DOM's attribute with the
758            same name, this attribute returns a dead list of nodes for
759            simplicity.
760            (get_user_data, set_user_data): New methods.
761            (Module idl_text): A SPACE character should be inserted
762            before the |{| character.
763            (Interface idl_text): Support for interface forward declarations.
764            (is_forward_declaration): New attribute.
765    
766    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
767    
768            * WebIDL.pm (type_text): Better serializer.
769    
770    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
771    
772            * WebIDL.pm: Revise forward-compatible parsing so that
773            it can now handle broken extended attributes and the like.
774    
775    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
776    
777            * WebIDL.pm: Real support for extended attributes.
778            Support for extended attributes with arguments.
779    
780    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
781    
782            * WebIDL.pm: Support for |exception| syntax.
783            (Interface->idl_text): Tentative support for inheritances.
784    
785    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
786    
787            * WebIDL.pm: Hierarchical scoped name support was broken.
788            Support for raises, setraises, and getraises syntaxes.
789    
790    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
791    
792            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
793            proper support for types, extended attributes, and exceptions yet).
794            WebIDL parser, version 1 (no support for exceptions yet,
795            no proper support for extended attributes yet).
796    
797    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
798    
799            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
800    
801    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
802    
803            * WebIDL.pm: Support for valuetype and const.
804    
805    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
806            
807            * WebIDL.pm: New module.
808    
809    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
810    
811            * Makefile (Entities.html): URI changed.
812    
813    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
814    
815            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
816    
817    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
818    
819            * HTML.pm.src (_get_next_token): A parse error was missing.
820    
821    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
822    
823            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
824            spec (commented out). (HTML5 revision 1711).
825    
826    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
827    
828            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
829    
830            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
831            as UTF-16 (HTML5 revision 1701).
832    
833    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
834    
835            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
836            1692).
837    
838    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
839    
840            * HTML.pm.src: The secondary insertion mode used when switching
841            to foreign content is the "in body" insertion mode (HTML5 revision
842            1696).
843    
844    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
845    
846            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
847            revision 1697).
848    
849    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
850    
851            * HTML.pm.src: Support for end-of-file token in foreign content
852            insertion mode (HTML5 revision 1693).  Update SVG camelCase
853            attribute list (HTML5 revision 1700).  <textarea> closes
854            </select> (HTML5 revision 1699).  More start tags close in
855            foreign content insertion mode (HTML5 revision 1698).
856    
857    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
858    
859            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
860    
861    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
862    
863            * HTML.pm.src: More robust charset parameter detection (HTML5
864            revision 1674).
865    
866    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
867    
868            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
869            revision 1676).
870    
871    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
872    
873            * HTML.pm.src: Ignore language part of public identifiers for
874            quirks mode detection (HTML5 revision 1679).
875    
876    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
877    
878            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
879            revision 1685).
880    
881    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
882    
883            * HTML.pm.src: Support for EOF in new states for tags (HTML5
884            revision 1684).
885    
886    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
887    
888            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
889            work (HTML5 revision 1690).
890    
891    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
892    
893            * HTML.pm.src (_tree_construction_main): Change handling of
894            end tags in head insertion modes (HTML5 revision 1686).
895            (parse_char_string): Bug fix for non-utf8 character string handling.
896            (parse_char_stream): |ungetc| does not work well for this context.
897    
898    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
899    
900            * HTML.pm.src (parse_byte_string): Redefined to invoke
901            |parse_byte_stream|.
902            (parse_byte_stream): New method.
903    
904    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
905    
906            * HTML.pm.src (parse_byte_string): Fix the column number reported
907            by encoding layer error reporter.
908    
909    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
910    
911            * HTML.pm.src (parse_byte_string): Use streaming decoder
912            rather than converting the whole byte string and then parsing.
913            Propagate errors in character encoding layer.
914            (get_next_token): Precise error reporting for |bare stago| error.
915    
916    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
917    
918            * HTML.pm.src (parse_char_stream): New method.
919            (parse_char_string): This method is now defined as an invocation
920            of the |parse_char_stream| method.
921    
922    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
923    
924            * HTML.pm.src (parse_byte_string): Report various status
925            of the sniffing as info-level errors.  Support for new
926            decoding framework in parser resetting.
927            (new): Various default error levels were not set.
928    
929    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
930    
931            * HTML.pm.src (parse_byte_string): HTML5 encoding sniffing
932            algorithm, except for the actual sniffing, is implemented
933            with new framework with Message::Charset::Info.
934    
935    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
936    
937            * CacheManifest.pm (_parse): Drop fragment identifiers from
938            URIs in fallback section (HTML5 revision 1596).
939    
940    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
941    
942            * Makefile (Entities.html): URI has changed.
943    
944    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
945    
946            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
947            1553).
948    
949    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
950    
951            * ContentChecker.pm: Noted that those returned in |table| are
952            no longer table elements, but table objects returned
953            by Whatpm::HTMLTable.
954    
955            * HTMLTable.pm (form_table): Return table element node
956            as |$table->{element}|.
957            (assign_header): Support for the |headers=""| attribute.
958    
959    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
960    
961            * HTMLTable.pm (assign_header): New function; first version
962            with no support for headers="".
963            (form_table): Include table width and height in the returned
964            table object for convenience.  Indexing in column assignment was wrong.
965            Set whether a data cell is empty or not for convenience.
966    
967    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
968    
969            * HTMLTable.pm: More robust caption support (HTML5 revision 1393).
970    
971    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
972    
973            * HTMLTable.pm: How table model errors are detected is
974            changed (HTML5 revision 1387).
975    
976    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
977    
978            * HTMLTable.pm: The algorithm now moves |tfoot| elements
979            to the end of the table (HTML5 revision 1380).
980    
981    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
982    
983            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
984            of 1-based (HTML5 revision 1376).
985    
986    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
987    
988            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
989            and "Content-Encoding" no longer prevent sniffing (HTML5
990            revision 1288).
991    
992    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
993    
994            * ContentType.pm: Skip BOMs in feed or HTML algorithm (HTML5 revision
995            1282).
996    
997    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
998    
999            * ContentChecker.pm: Support for global attributes.
1000            Status of XML specs is added.
1001    
1002    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
1003    
1004            * ContentChecker.pm (check_element): Support for |template=""|
1005            and |ref=""| attribute (referent element type checking).
1006    
1007    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
1008    
1009            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
1010            1500) is implemented (except for IDNA part and URI-scheme-specific
1011            knowledge).  Line number counting was wrong for LF-only
1012            documents.
1013    
1014    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1015    
1016            * HTML.pm.src: Raise a parse error for any disallowed
1017            character (HTML5 revision 1263).
1018    
1019    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1020    
1021            * mkentitylist.pl: Support for new HTML5 entity table format (the
1022            definition for |AElig;| was missing).
1023    
1024    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1025    
1026            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
1027            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
1028    
1029    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1030    
1031            * HTML.pm.src: List of element names that close foreign content
1032            insertion mode is added (HTML5 revisions 1412 and 1418).
1033    
1034    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1035    
1036            * HTML.pm.src: Support for |mglyph| and |malignmark|
1037            elements (HTML5 revision 1410).
1038    
1039    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1040    
1041            * HTML.pm.src: Support for new long MathML entities (HTML5
1042            revision 1406).
1043    
1044    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1045    
1046            * HTML.pm.src: CDATA section support for MathML and SVG
1047            elements (HTML5 revision 1404 and 1420).
1048    
1049    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
1050    
1051            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
1052            elements (HTML5 revision 1404).  Unused !!!macro definitions
1053            are removed.
1054    
1055    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
1056    
1057            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
1058            are raised is changed (HTML5 revision 1404).
1059    
1060    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
1061    
1062            * HTML.pm.src: Category sets that are no longer used are removed.
1063    
1064    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
1065    
1066            * HTML.pm.src: The ->[1] property of stack entries is now
1067            replaced by constants representing element category.
1068    
1069    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
1070    
1071            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
1072            for error reporting.  (This is a preparation for using constant
1073            value for ->[1].)
1074    
1075    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
1076    
1077            * RDFXML.pm: Typo fixed.
1078    
1079    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
1080    
1081            * ContentChecker.pm: |fact_level| is now treated
1082            the same as |must_level|, i.e. level = |m|.
1083            (check_element): Make list of URIs in the DOM.
1084    
1085    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1086    
1087            * RDFXML.pm: Language accessor implemented.
1088            Local (null-namespace) attribute support.
1089    
1090    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1091    
1092            * RDFXML.pm: Factored out ID checking code.
1093    
1094    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1095    
1096            * RDFXML.pm: TODO items noted.  Validation of ID
1097            and URI attributes is implemented.  Warn if unknown
1098            value is used in rdf:parseType="" attribute.
1099    
1100            * URIChecker.pm (check_rdf_uri_reference): New function.
1101    
1102    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1103    
1104            * RDFXML.pm: bnodeid implemented.  Relative references
1105            are now resolved.
1106    
1107    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1108    
1109            * ContentChecker.pm: RDF reification implemented.
1110    
1111            * RDFXML.pm: undef vs false bug fixed.
1112            Reification implemented.
1113    
1114    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1115    
1116            * RDFXML.pm: s/id/ID/ for attribute name.
1117            The |node| arguments are added for |ontriple| calls.
1118            Too many "attribute not allowed" errors were raised.
1119    
1120            * ContentChecker.pm: Initial experimental support for rdf:RDF
1121            element.
1122    
1123    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1124    
1125            * RDFXML.pm: New module.
1126    
1127    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1128    
1129            * HTML.pm.src (set_inner_html): Line/column number
1130            code was still the old one.
1131    
1132    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1133    
1134            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
1135            errors.  Line/column markings for DOCTYPE, comment, and
1136            character tokens are reintroduced; otherwise,
1137            error location for "not HTML5" error and errors
1138            for implied elements are not attached.
1139    
1140    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1141    
1142            * HTML.pm.src: Set line/column numbers to attributes.
1143    
1144            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
1145            (value): Setter implemented.
1146    
1147            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
1148    
1149    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1150    
1151            * HTML.pm.src: Unused line/column markings are removed.
1152    
1153    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1154    
1155            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
1156            flag, which is no longer used.
1157    
1158    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
1159    
1160            * HTML.pm.src: Set line/column information to element nodes.
1161    
1162            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
1163            and !!!insert-element-t): Set line/column information to
1164            element nodes.
1165    
1166    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
1167    
1168            * HTML.pm.src (_get_next_token): The first "<" character
1169            in "<?", "<>", or "</>" should be the error point.
1170    
1171    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1172    
1173            * HTML.pm.src: Some more fixes on error position reporting.
1174    
1175    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1176    
1177            * HTML.pm.src: Token-level precise error reporting.
1178    
1179    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1180    
1181            * HTML.pm.src: Preparation for more precise error point
1182            reporting.
1183    
1184    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1185    
1186            * HTML.pm.src: Error type revised.
1187    
1188    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1189    
1190            * HTML.pm.src: Similar codes are merged together, again.
1191    
1192    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1193    
1194            * HTML.pm.src: Similar codes are merged together.
1195    
1196    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
1197    
1198            * mkhtmlparser.pl: Set "level" parameter to parse errors.
1199    
1200            * HTML.pm.src: Code refined.
1201    
1202    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1203    
1204            * HTML.pm.src: |</body>| treatment has been changed (HTML5
1205            revision 1348).  Note that I really don't know whether this makes
1206            any difference in the black-box behavior of the parser.
1207    
1208    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1209    
1210            * HTML.pm.src: New end-of-file token implementation (HTML5
1211            revision 1348).
1212    
1213    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1214    
1215            * HTML.pm.src: |applet| support (HTML5 revision 1347).
1216    
1217    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1218    
1219            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1220    
1221    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1222    
1223            * HTML.pm.src: Support for |<input>| in the "in select" insertion
1224            mode and support for the "in select in table" insertion mode (HTML5
1225            revision 1342).
1226    
1227    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1228    
1229            * HTML.pm.src: No foster parenting for <script> and <script>
1230            in non-tainted <table>s (HTML5 revision 1336).
1231    
1232    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1233    
1234            * HTML.pm.src: Ignore white space characters between <html>
1235            and <head> (HTML5 revision 1332).
1236    
1237    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1238    
1239            * HTML.pm.src: Treat <input type=hidden> as if it were a
1240            white space (HTML5 revision 1331).
1241    
1242    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1243    
1244            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1245            element (HTML5 revision 1330).
1246    
1247    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1248    
1249            * HTML.pm.src: <title> is always appended to the current
1250            element (HTML5 revision 1328).
1251    
1252    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1253    
1254            * HTML.pm.src: White space in tainted tables is moved
1255            into foster parents (HTML5 revision 1326).
1256    
1257    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1258    
1259            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1260            revision 1321).
1261    
1262    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1263    
1264            * HTML.pm.src: |</h/n/>| case code rearranged to align with
1265            the spec (HTML5 revision 1320).  Note that we finally complete
1266            all of HTML5 revision 1320 changes.
1267    
1268    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1269    
1270            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1271            tags (HTML5 revision 1320).
1272    
1273    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1274    
1275            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1276    
1277    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1278    
1279            * HTML.pm.src: A "generate implied end tags" code (t409.1)
1280            could not be reached so that it is now removed (HTML5 revision
1281            1320).
1282    
1283    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1284    
1285            * HTML.pm.src: Code for the case of |</div>| and so on are revised
1286            to align with new spec text (HTML5 revision 1320).
1287    
1288    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1289    
1290            * HTML.pm.src: Remove strange |if| condition; however, it should
1291            have had no harm in theory.
1292    
1293    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1294    
1295            * HTML.pm.src (_tree_construction_main): '</p>' in body
1296            case is split from other end tags for the preparation
1297            of implementing HTML5 revision 1320.
1298    
1299    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1300    
1301            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1302            revision 1320).
1303    
1304    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1305    
1306            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1307            and "div" (HTML5 revision 1318).  Add comments to where
1308            |form| pointer association codes should be inserted (HTML5 revision
1309            1319).
1310    
1311    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
1312    
1313            * HTML.pm.src: <html> treatment refined (HTML5 revision 1314).
1314    
1315    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1316    
1317            * HTML.pm.src: Since the case t268 should never be reached (there
1318            are no other token types), it is replaced by a |die| statement.
1319    
1320    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1321    
1322            * HTML.pm.src: Typo fixed.
1323    
1324    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
1325    
1326            * HTML.pm.src (_tree_construction_initial): Some limited quirks
1327            doctypes were not uppercased for comparison.
1328    
1329    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1330    
1331            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1332            are added.
1333    
1334    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1335    
1336            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1337            are set.  Cases that are unlikely to be reached are noted as such.
1338    
1339    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1340    
1341            * HTML.pm.src: Checkpoints for debugging are added.
1342    
1343            * mkhtmlparser.pl: Support for |!!!cp| syntax.
1344    
1345    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1346    
1347            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1348            for simplicity.
1349    
1350    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1351    
1352            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1353            revision 1307).
1354    
1355    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1356    
1357            * HTML.pm.src (_get_next_token): The places where the /incorrect/
1358            flag is set are changed (HTML5 revision 1305).
1359    
1360    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1361    
1362            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1363            1292).  Entities are not parsed in comment-like part in RCDATA
1364            elements (HTML5 revision 1294).  Allow bare & at the end
1365            of attribute value literals (HTML5 revision 1296).  More
1366            quirks mode doctypes (HTML5 revision 1302).  Require spaces
1367            between attributes and ban attribute names or unquoted
1368            attribute values containing single or double quotes (HTML5
1369            revision 1303).
1370    
1371    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1372    
1373            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
1374            and related errors unless it is an HTML document (though the spec
1375            is unclear on whether it applies to XHTML documents).
1376    
1377            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1378            are added.
1379    
1380    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1381    
1382            * HTML.pm.src: s/local_name/manakai_local_name/g.
1383    
1384    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
1385    
1386            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1387    
1388            * HTML.pm.src: |charset| in |content| attribute is
1389            case-insensitive (HTML5 revision 1270).
1390    
1391    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
1392    
1393            * ContentChecker.pm: New status constants are added.
1394            ($ElementDefault): |status| added.
1395            (check_element): Err for non-standard or deprecated elements.
1396            (_attr_status_info): For non-standard or deprecated attributes.
1397    
1398    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1399    
1400            * ContentChecker.pm (_attr_status_info): New internal method.
1401    
1402    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1403    
1404            * ContentChecker.pm (check_element): Element standardized
1405            status information is now dispatched.
1406    
1407    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1408    
1409            * ContentChecker.pm (check_element): Fix |del|-and-significant
1410            problem by adding some more arguments.
1411    
1412    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1413    
1414            * ContentChecker.pm (check_element): Use context of
1415            container-for-the-purpose-of-content-model element (not transparent
1416            element) for |check_child_element| calling and significant
1417            text flag marking.  This reintroduces |<del>|-and-significant
1418            problem again.
1419    
1420    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1421    
1422            * ContentChecker.pm (check_element): Make semi-transparent
1423            elements ignored for the purpose of phase changes in
1424            content model checking.
1425    
1426    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1427    
1428            * ContentChecker.pm (check_element): In-element state
1429            was not properly managed for transparent cases.
1430    
1431    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1432    
1433            * ContentChecker.pm (check_element): Support for |video|
1434            and |audio| as semi-transparent elements.
1435    
1436    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1437    
1438            * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1439            (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1440            Support for |html:object| as a semi-transparent element.
1441    
1442    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1443    
1444            * ContentChecker.pm (check_element): The way to traverse
1445            the tree is entirely revised to make it easier to track
1446            the state of ancestors/descendants.  As a result of this
1447            revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1448            support for content model checking for HTML elements |figure|,
1449            |object|, |video|, and |audio| and checking for XML elements (and
1450            some XMLNS checkings) are dropped for now.  They will be
1451            reimplemented in due course.
1452    
1453    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1454    
1455            * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1456            token (HTML5 revision 1225).
1457    
1458    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1459    
1460            * ContentChecker.pm ({unsupported_level}): New value.
1461    
1462            * HTML.pm.src: Save whether |meta| |content| attribute
1463            contains character references or not.
1464    
1465    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1466    
1467            * ContentChecker.pm (_get_children): (Incomplete) attempt to
1468            implement significant content checking for contents
1469            with |del| elements.
1470    
1471    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1472    
1473            * ContentChecker.pm ($HTMLTransparentElements): More
1474            elements are added.
1475            (_get_children): HTML |object| elements are now semi-transparent.
1476    
1477            * NanoDOM.pm (manakai_html, manakai_head): New methods.
1478    
1479    2008-02-16  Wakaba  <wakaba@suika.fam.cx>
1480    
1481            * CacheManifest.pm: HTML5 revision 1211 implemented.
1482    
1483            * CacheManifest.pod: Updated.
1484    
1485    2008-02-10  Wakaba  <wakaba@suika.fam.cx>
1486    
1487            * ContentChecker.pm (check_document, check_element): Support
1488            for second argument ($onsubdoc).
1489            (_get_css_parser): Removed (now it is part of WDCC).
1490    
1491    2008-02-09  Wakaba  <wakaba@suika.fam.cx>
1492    
1493            * ContentChecker.pm (_get_css_parser): New.
1494    
1495    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1496    
1497            * ContentChecker.pm ($AnyChecker): Old way to add child elements
1498            for checking had been used.
1499    
1500    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1501    
1502            * ContentChecker.pm (check_element): New todo item type |descendant|.
1503    
1504    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1505    
1506            * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1507            specced in RFC 2046 and RFC 4288.
1508            (application/atom+xml): Definition added.
1509    
1510    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1511    
1512            * URIChecker.pm: Make RFC 3986 should-level errors
1513            warnings (rather than SHOULD-level errors).
1514    
1515    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1516    
1517            * NanoDOM.pm (get_user_data, set_user_data): New methods.
1518    
1519            * HTML.pm.src: A flag for character references in attribute
1520            values is added.  Set |manakai_has_reference| user data
1521            to |charset| attribute.
1522    
1523    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1524    
1525            * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1526            attributes.
1527    
1528            * ContentChecker.pm (check_document): Warn if charset requirements
1529            cannot be tested.
1530    
1531    2007-11-19  Wakaba  <wakaba@suika.fam.cx>
1532    
1533            * HTML.pm.src (parse_byte_string): Detect charset
1534            by universalchardet if charset parameter is not specified.
1535    
1536            * Makefile (Charset-all, Charset-clean): New rules.
1537    
1538    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1539    
1540            * ContentChecker.pm (check_document): Check the existence
1541            of a character encoding declaration and the selection of
1542            encoding for HTML documents.
1543    
1544    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1545    
1546            * ContentType.pm (get_sniffed_type): Return also the official
1547            type in list context.
1548    
1549    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1550    
1551            * ContentType.pm: Sniffing with leading white space
1552            ignoring (HTML5 revisions 1013 and 1016).
1553    
1554    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1555    
1556            * ContentType.pm: HTML5 revision 1013 changes, except for leading
1557            white spaces, are implemented.
1558    
1559    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1560    
1561            * HTML.pm.src (parse_char_string): Set |inner_encoding|
1562            attribute if possible.
1563    
1564    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1565    
1566            * HTML.pm.src (parse_byte_string): New method.
1567            (parse_char_string): New alias for |parse_string|.
1568            (main phase): Invoking "change the encoding" algorithm if desired.
1569    
1570            * HTML.pod: Updated.
1571    
1572    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1573    
1574            * HTML.pod (get_inner_html): Removed.
1575    
1576            * Makefile (HTML-all, HTML-clean): New.
1577    
1578    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1579    
1580            * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1581    
1582    2007-11-08  Wakaba  <wakaba@suika.fam.cx>
1583    
1584            * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1585    
1586    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1587    
1588            * Makefile: |CacheManifest.html| is added.
1589    
1590            * CacheManifest.pod: New file.
1591    
1592    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1593    
1594            * CacheManifest.pm: New module.
1595    
1596    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1597    
1598            * HTML.pm.src: Support for application cache selection algorithm
1599            callback.
1600    
1601    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1602    
1603            * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1604    
1605    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1606    
1607            * Makefile (clean): New rule.
1608    
1609            * NanoDOM.pm (public_id, system_id): New attributes.
1610    
1611    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1612    
1613            * Makefile (CSS-all, CSS-clean, clean): New rules.
1614    
1615    2007-10-14  Wakaba  <wakaba@suika.fam.cx>
1616    
1617            * ContentChecker.pm (check_document): Support for
1618            new |is_xml_root| flag.
1619            (check_element): Support for new |pluses| state.
1620            (_add_pluses): New method.
1621            (_remove_minuses): Support for new |minus| item.
1622    
1623    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1624    
1625            * ContentChecker.pm: Raise specific error for invalid
1626            root element.
1627    
1628    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1629    
1630            * ContentChecker.pm: Set level values for later uses.
1631    
1632    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1633    
1634            * ContentChecker.pm: Support for language tag validation.
1635    
1636    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1637    
1638            * LangTag.pm (check_rfc3066_language_tag): New method.
1639    
1640    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1641    
1642            * LangTag.pm: New module.
1643    
1644    2007-09-04  Wakaba  <wakaba@suika.fam.cx>
1645    
1646            * HTML.pm.src: Some error types were wrong.
1647    
1648    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1649    
1650            * CSS/: New directory.
1651    
1652    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1653    
1654            * ContentChecker.pm (_check_get_children): Support
1655            for |noscript| in |head|.
1656    
1657    2007-08-12  Wakaba  <wakaba@suika.fam.cx>
1658    
1659            * URI/: New directory.
1660    
1661    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1662    
1663            * HTML.pm.src: Tokenizer states are now represented as
1664            numbers.
1665    
1666    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1667    
1668            * HTML.pm.src: |or|s for insertion modes are replaced
1669            by |&|s.
1670    
1671    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1672    
1673            * HTML.pm.src: Token types are now represented as numbers.
1674    
1675    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1676    
1677            * ContentType.pm (SEE ALSO): Updated.
1678    
1679            * HTML.pm.src: Insertion modes are now represented as numbers.
1680    
1681    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1682    
1683            * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1684            999) is implemented.
1685    
1686    2007-08-08  Wakaba  <wakaba@suika.fam.cx>
1687    
1688            * ContentType.pm: Sniffing for |<script| (HTML5 revision
1689            983) is implemented.
1690    
1691    2007-08-06  Wakaba  <wakaba@suika.fam.cx>
1692    
1693            * ContentChecker.pod: New documentation.
1694    
1695            * Makefile: A rule for |ContentChecker.html| is added.
1696    
1697            * ContentChecker.pm: A pod "LICENSE" section is added.
1698    
1699            * NanoDOM.pm ($VERSION): New variable.
1700    
1701    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1702    
1703            * H2H.pm: |b|, |i|, and |sub| are added to the
1704            list of allowed HTML elements.
1705    
1706    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1707    
1708            * H2H.pm: |samp| is added to the list of allowed
1709            HTML elements.
1710    
1711            * URIChecker.pm (check_iri): New.
1712            (check_iri_reference): Error type for IRI reference
1713            syntax error is changed.
1714    
1715    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1716    
1717            * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1718            is added.
1719            (check_document): Load appropriate module before validation.
1720    
1721    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1722    
1723            * ContentChecker/: New directory.
1724    
1725    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1726    
1727            * ContentChecker.pm: HTML |time| element is implemented.
1728    
1729            * HTMLTable.pm: Comments are updated as HTML5 is revised.
1730    
1731    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1732    
1733            * ContentChecker.pm (check_document): Return value
1734            even if no document element is found.
1735    
1736    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1737    
1738            * HTML.pm.src: |$in_body| is no longer a function.
1739    
1740    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1741    
1742            * HTML.pm.src: The |$in_body| code has been moved down.
1743    
1744    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1745    
1746            * HTML.pm.src: The "trailing end" insertion mode
1747            is split into "after html body" and "after html frameset"
1748            insertion modes.  Their code is merged with the "after body"
1749            and "after frameset" code.  |$previous_insertion_mode|
1750            has been removed.  "after frameset" code is
1751            merged with "in frameset" code.
1752    
1753    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1754    
1755            * HTML.pm.src: The "before head" insertion mode is
1756            merged with the "in head" insertion mode.
1757    
1758    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1759    
1760            * HTML.pm.src: Cases in "in head" insertion mode are
1761            reorganized.
1762    
1763    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1764    
1765            * HTML.pm.src: Some cases in "in table" insertion mode
1766            are merged.
1767    
1768    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1769    
1770            * HTML.pm.src: The "in row" insertion mode is merged
1771            with "in table" insertion mode.
1772    
1773    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1774    
1775            * HTML.pm.src: The "in table" and "in table body" insertion

