/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.78 by wakaba, Sat Jul 21 08:52:59 2007 UTC revision 1.360 by wakaba, Sun Oct 19 07:19:00 2008 UTC
# Line 1  Line 1 
1    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
2    
3            * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
4            array.
5    
6    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
7    
8            * NanoDOM.pm (notation_name): New attribute.
9    
10    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
11    
12            * NanoDOM.pm (public_id, system_id): New attributes.a
13    
14    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
15    
16            * NanoDOM.pm (text_content): Moved to Node from Element.  Setter
17            implemented.
18            (allowed_tokens, default_type, declared_type): Implemented.
19    
20    2008-10-17  Wakaba  <wakaba@suika.fam.cx>
21    
22            * NanoDOM.pm (node_name): New attribute.
23            (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
24            constants.
25            (create_element_type_definition_node, create_attribute_definition,
26            create_notation, create_general_entity,
27            get_element_type_definition_node,
28            set_element_type_definition_node, get_general_entity_node,
29            set_general_entity_node, get_notation_node, set_notation_node,
30            get_attribute_definition_node, set_attribute_definition_node): New
31            methods.
32            (element_types, entities, notations, attribute_definitions): New
33            attributes.
34            (DocumentType): Support for child nodes, entities, notations, and
35            element types.
36            (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
37            New classes.
38    
39            * Dumper.pm: Support for general entities, notations, element type
40            definitions, and attribute definitions.
41    
42    2008-10-15  Wakaba  <wakaba@suika.fam.cx>
43    
44            * NanoDOM.pm (create_processing_instruction): New method.
45            (xml_version, xml_encoding, xml_standalone): New attributes.
46            (ProcessingInstruction): New class.
47    
48    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
49    
50            * HTML.pm.src: Handling of end tags in the foreign content
51            insertion mode was partially wrong, because of wrong bit
52            operations.
53    
54    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
55    
56            * NanoDOM.pm (dom_config): New attribute (do nothing), for
57            Whatpm::XML::Parser support.
58    
59    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
60    
61            * Makefile: New rule to make HTML/Tokenizer.pm is added.
62    
63            * HTML.pm.src: Tokenizer part moved to another file.
64    
65    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
66    
67            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
68    
69    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
70    
71            * HTML.pm.src: Element category constants redefined.
72            
73    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
74    
75            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
76            construction stage synced with the spec (HTML5 revisions 2139 and
77            2302).
78    
79    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
80    
81            * ContentChecker.pm: New error level "html5_fact" added, which
82            should be tentatively used until all of requirements are properly
83            specced as RFC 2119 "MUST" in HTML5.
84    
85    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
86    
87            * ContentChecker.pod: Note on internal flags is added.
88    
89    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
90    
91            * HTML.pm.src: An AAA bug fixed.
92    
93    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
94    
95            * HTML.pm.src: If another node is inserted by the parser, don't
96            reuse existing Text node to append a character (HTML5 revision
97            2124).
98    
99    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
100    
101            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
102            revisions 1731 and 2128).
103    
104    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
105    
106            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
107            Support for end tags of camelCase SVG elements were broken.  A
108            wrong error type text fixed.
109    
110    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
111    
112            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
113    
114    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
115    
116            * HTML.pm.src: Support for new definition of |param| and |source|
117            start tag parsing (HTML5 revision 1731).
118    
119    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
120    
121            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
122    
123    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
124    
125            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
126            revisions 1731 and 1831).
127    
128    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
129    
130            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
131            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
132            revision 1731).
133    
134    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
135    
136            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
137            revision 1731).  End tags of |option| and |optgroup| elements are
138            now optional (HTML5 revision 1731).
139    
140    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
141    
142            * HTML.pm.src: New "special" elements added to the list (HTML5
143            revision 1778).  "strile" -> "strike".
144    
145    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
146    
147            * ContentType.pm (get_sniffed_type): Support for the "better"
148            content sniffing (HTML5 revision 1927).  In a case the official
149            type was not returned when the method is invoked in the list
150            context.
151    
152    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
153    
154            * HTML.pm.src: Character references for non-space C0 characters,
155            including U+000B VT, DEL character, noncharacter code points, are
156            now converted to the U+FFFD character (cf. HTML5 revision 2138).
157    
158    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
159    
160            * ContentChecker.pm: |form=""| check support added.
161    
162    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
163    
164            * ContentChecker.pm: |contextmenu| validness is now checked using
165            |id| and |id_type| properties, and |menu| property is removed.
166    
167    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
168    
169            * ContentChecker.pm: Prepare for |form| |name| attribute's
170            duplication checking.
171    
172    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
173    
174            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
175            unsupported charset="" parameter value (HTML5 revision 2131).
176    
177    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
178    
179            * HTML.pm.src: Reminding places where U+000B is allowed as a space
180            character is fixed (cf. HTML5 revision 1738).
181    
182            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
183            space characters (HTML5 revision 1738).
184    
185    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
186    
187            * HTML.pm.src: The "anything else" case for the "after after body"
188            insertion mode was not updated to swtich to the "in body"
189            insertion mode.  U+000B is no longer a space character for the
190            purpose of tree construction phase (HTML5 revision 1738).
191    
192    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
193    
194            * HTML.pm.src: U+000B is no longer a space character (HTML5
195            revision 1738).
196    
197    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
198    
199            * ContentType.pm: 0x0B is no longer a space character (HTML5
200            revision 1738).
201    
202            * HTML.pm.src: U+000B is no longer a space character for the
203            algorithm for extracting an encoding from a Content-Type (HTML5
204            revision 1738).
205    
206    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
207    
208            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
209    
210    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
211    
212            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
213    
214            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
215            matching.
216    
217    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
218    
219            * LangTag.pm: Warn for private use language subtags.  Error level
220            typos fixed.  Support for Suppress-Script field.
221    
222            * mklangreg.pl: Support for dumping of nested structure.
223    
224    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
225    
226            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
227            recommended case as per RFC 4646.
228    
229    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
230    
231            * LangTag.pm (check_rfc4646_langtag): New method.
232    
233    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
234    
235            * mklangreg.pl: New script.
236    
237            * Makefile: Updated for creation of the module for language subtag
238            registry.
239            
240    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
241    
242            * Makefile: WebIDL.html added.
243    
244            * WebIDL.pod: New documentation.
245    
246    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
247    
248            * WebIDL.pm: Checker's error types are redefined.
249    
250    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
251    
252            * WebIDL.pm: Parser's error types are redefined.  Some forward
253            compatible parsing bugs are fixed.  Some unreachable codes are
254            commented out.
255    
256    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
257    
258            * WebIDL.pm: Support for the reminding extended attributes are
259            added.  It does not satisfy the definition that a forward
260            interface declaration has an extended attribute.  It seems that
261            unless explicitly allowed multiple extended attributes with the
262            same name is not allowed, though it is not explicitly mentioned in
263            the spec.
264    
265    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
266    
267            * WebIDL.pm: Unescapes extended attribute names and extended
268            attribute identifiers.  Preserve whether an extended attribute has
269            an argument list of not.  Support for extended attributes:
270            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
271            NameGetter, NameSetter, and Null.
272            (has_argument_list): New attribute.
273            (idl_text): Stringifies argument lists, if any, even if it is
274            empty.
275    
276    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
277    
278            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
279            |{s_kwd}| in DATA_STATE as default.
280    
281    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
282    
283            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
284            by |{s_kwd}| in DATA_STATE.
285    
286    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
287    
288            * HTML.pm.src: Shorten keys.
289    
290    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
291    
292            * HTML.pm.src: Remove checking for control character, surrogate
293            pair, or noncharacter code points and non-Unicode code
294            points (they should be handled by Whatpm::Charset::UnicodeChecker).
295            (parse_char_stream): Support for the |$get_wrapper| argument and
296            character stream error handlers.
297    
298    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
299    
300            * ContentChecker.pm: Don't call |loda_ns_module|
301            for null-namespace elements/attributes.
302    
303            * HTML.pm.src: Fact out $disallowed_control_chars
304            as a hash.
305    
306    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
307    
308            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
309            and |{next_char}| initializations are moved to initialization
310            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
311            with |parse_char_stream|.
312    
313    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
314    
315            * HTML.pm.src (parse_char_stream): Make |set_next_char|
316            invoke |manakai_read_until|, not only |read|, where
317            possible, to decrease the number of |read| method calls.
318    
319            * mkhtmlparser.pl: Related changes to the aforementioned
320            modification.
321    
322    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
323    
324            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
325            would report character error from now.
326    
327    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
328    
329            * HTML.pm.src: White-space-leaded non-white-space character
330            tokens in "before head insertion mode" was not
331            correctly handled.
332            (set_inner_html): Reimplemented using CharString decodehandle
333            class.  Support for $get_wrapper argument.  Support
334            for |{read_until}| feature.
335    
336    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
337    
338            * HTML.pm.src: Make a "bare ero" error for unknown
339            entities point the "&" character.
340    
341    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
342    
343            * HTML.pm.src: It turns out that U+FFFD don't have to
344            be added to the list of excluded characters.
345    
346    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
347    
348            * HTML.pm.src ($char_onerror): Have character decoder's |line|
349            and |column| a higher priority than the one set by the
350            tokenizer's input handler.
351            ($self->{read_until}): Exclude U+FFFD (but this might
352            not be necessary, since now we do line/column fixup in
353            the character decode handle).
354    
355    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
356    
357            * HTML.pm.src: Use |{read_until}| where possible.
358    
359    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
360    
361            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
362            and |manakai_getc_until| to |manakai_read_until| to
363            reduce the number of string copies.
364    
365    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
366    
367            * HTML.pm.src (parse_char_string): Use newly created
368            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
369            standard feature to |open| a string as a filehandle,
370            since Perl's string filehandle seems not supporting |ungetc|
371            method correctly.
372            (parse_char_stream): Define |{getc_until}| method.
373            (DATA_STATE): Experimental support for |getc_until| feature.
374    
375    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
376    
377            * HTML.pm.src: Check points added to newly added branches.
378    
379    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
380    
381            * HTML.pm.src: Remove |{char}|, which is no longer used.
382            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
383            and replaced by |{prev_state}|.
384    
385            * mkhtmlparser.pl: Remove |{char}| feature.
386            Remove |!!!back-next-input-character;| macro.
387    
388    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
389    
390            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
391            entity related tokenizer states in favor of new states
392            implementing the consume character reference algorithm.
393    
394    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
395    
396            * HTML.pm.src: "Consume a character reference" algorithm is
397            now implemented as a tokenizer's state, rather than
398            a method, with minimum changes (more changes will
399            be made, in due course).  "Bogus comment state"'s inner
400            loop gets removed.
401    
402    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
403    
404            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
405            into their own tokenizer states.
406    
407    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
408    
409            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
410            is split into three states.
411    
412    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
413    
414            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
415            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
416            no longer does the tokenizer have to push back next input
417            characters in those states.
418    
419    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
420    
421            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
422            into four states so that no longer does the tokenizer have to push
423            back next input characters in that state.
424    
425    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
426    
427            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
428            which can be used to insert some wrapper between the character
429            stream handle and the tokenizer.  (It is currently not supported
430            for |set_inner_html| for |Element|s).
431    
432    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
433    
434            * HTML.pm.src: Ignore punctuations in charset names.
435    
436    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
437    
438            * ContentChecker.pm: Support for charset-layer error levels.
439    
440            * HTML.pm.src: Don't specify |text| argument for the
441            |chardecode:fallback| error, since it is not the encoding
442            being used alternatively.
443    
444    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
445    
446            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
447    
448    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
449    
450            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
451    
452    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
453    
454            * HTML.pm.src: Bug fix and sync with the spec with regard
455            to after after frameset insertion mode processing (HTML5
456            revision 1909).  Note that the implementation was wrong
457            per the old spec before the r1909 changes.
458    
459    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
460    
461            * HTMLTable.pm: scope=auto algorithm fix synced with the
462            spec (HTML5 revision 2093).
463            ($process_row): Algorithm step numbers synced with the
464            spec (HTML5 revision 2092).
465    
466    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
467    
468            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
469            revision 2094).
470    
471    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
472    
473            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
474    
475    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
476    
477            * HTML.pm.src: '"' and "'" at the end of attribute
478            name (after another attribute) now raise parse error (HTML5
479            revision 2123).  Empty unquoted attribute values are no
480            longer allowed (HTML5 revision 2122).
481    
482    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
483    
484            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
485            revision 2130).
486    
487    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
488    
489            * ContentChecker.pm: |xml:lang| attribute value must be same
490            as |lang| attribute value for HTML elements (HTML5 revision 2062
491            and so on).
492    
493    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
494    
495            * ContentChecker.pm: Error level definition for |xml_id_error|
496            was missing.
497    
498            * URIChecker.pm: The end of the URL should be marked as the
499            error location for an empty path error.  The position
500            between the userinfo and the port components should be
501            marked as the error location for an empty host error.
502    
503    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
504    
505            * URIChecker.pm: Set parameters representing where in the
506            value the error occurs for errors.  Report unknown
507            address format error in warning level, since address
508            formats are rarely added.  Path segments starting with "/.."
509            were misinterpreted as a dot-segment.
510    
511    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
512    
513            * URIChecker.pm (check_iri_reference): Requires
514            |Message::DOM::DOMImplementation|.
515    
516    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
517    
518            * IMTChecker.pm: Updated for the new error reporting architecture.
519    
520            * ContentChecker.pm: Error levels for IMTs are added.
521    
522    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
523    
524            * H2H.pm (_shift_token): Support for unquoted HTML attribute
525            values.
526    
527    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
528    
529            * CacheManifest.pm: Support for new style of error
530            reports.
531    
532            * HTML.pm.src: Set line=1, column=1 to the document node.
533    
534    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
535    
536            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
537            and URL checkers.  Support for more error levels for bogus
538            langauge tag and URL "standards".
539    
540            * LangTag.pm, URIChecker.pm: Support for new style error
541            level reporting.
542    
543    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
544    
545            * ContentChecker.pm: Support for RDF/XML error levels.
546    
547            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
548            specifying.  Error types are revised.
549    
550    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
551    
552            * ContentChecker.pm: All error reporting method calls are
553            renewed.
554    
555    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
556    
557            * HTML.pm.src: All error type names and "text" parameters
558            are revised.  Use new style for "level" specification.
559    
560            * mkhtmlparser.pl: Use new style for "level" specification.
561    
562    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
563    
564            * WebIDL.pm (parse_char_string): Simplified error
565            reporting process for broken ignored valuetype definition.
566            (Valuetype idl_text): Support for special "DOMString" name.
567    
568    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
569    
570            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
571            terminal of the ScopedName is "DOMString", such that whether
572            the last part of the scoped name is "DOMString" or "_DOMString"
573            later.  It is necessary to determine whether a |typedef|
574            definition should be ignored or not.
575            (parse_char_string): Unescape the identifier of
576            exception members.
577            ($resolve): Return undef for builtin types and sequence<T>
578            types (we might not have to do this, however...).
579            (check): Support checking for Exceptions, Valuetypes,
580            and Typedefs.
581            ($serialize_type): Support for "DOMString::::" syntax.
582            (Typedef idl_text): Output Type as "DOMString" if it
583            is really "DOMString" (i.e. its internal representation
584            is "::DOMString::").
585    
586    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
587    
588            * WebIDL.pm ($resolve): New code, based on resolve code
589            for constant types in the |check| method.
590            (check): Support for checking of attributes, operations, and
591            arguments.
592            (Attribute/Operation idl_text): Exception names in getraises,
593            setraises, and raises clauses is serizlied by |$serialize_type|
594            code.
595    
596    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
597    
598            * WebIDL.pm ($integer): Order of selections are changed to match
599            hexadecimal numbers (the original pattern, taken from the spec,
600            was not work for hexadecimal numbers, because the "0" prefix
601            matches to the [0-7]* part (as an empty string) and therefore
602            it does not match with remaining "x..." part of a "0x..." integer
603            literal.
604            ($get_type): It now returns a string, not an array reference,
605            for regular types and |sequence| types (i.e. it in any case
606            returns a string).
607            ($get_next_token): The second item in the array that represents
608            a integer or float token is now a Perl number value, not the
609            original string representation of the number.
610            (check): Support for const value consistency checking.
611            No extended attribute is defined for constants.
612            (Node subclasses): Use simple strings rather than array references
613            for default data type values.
614            ($serialize_type): Type values are now simple strings.
615            (value): If the new attribute value is a false value, then
616            a FALSE value is set to the attribute.
617    
618    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
619    
620            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
621            in its stringified format ("scoped name" as defined in the
622            spec).  Note that future version of this module should not use
623            array references for type values and the |type_text| attribute
624            should be made obsolete.
625            (parse_char_string): Unescape attribute names.
626            (check): Support for checking of whether inherited interfaces
627            are actually defined or not.  Support for checking of whether
628            interface member identifiers are duplicated or not.
629            ($serialize_type): Scoped names are returned as is.  A future
630            version of this code should escape identifiers other than "DOMString",
631            otherwise the idl_text would be non-conforming.
632    
633    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
634    
635            * WebIDL.pm (parse_char_string): Set line/column numbers
636            to generated nodes.  Unescape identifiers.  Extended attributes
637            for Definition's were ignored.
638            (append_child): Set |parent_node| attribute.
639            (parent_node): New attribute.
640            (check): Support interface/exception members.  Support
641            extended attributes.  Support definition identifier uniqueness
642            constraint.
643            (qualified_name): New attribute.
644            (Interface/Exception idl_text): Extended attributes were
645            not prepended to the returned text.
646    
647    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
648    
649            * WebIDL.pm (parse_char_string): Set line/column numbers
650            to interface object experimentally.  s/shift/pop/g, shift
651            would make things wrong.  Support for interface forward
652            declarations was missing.  Broken interface declarations
653            with no block were not ignored entirely.
654            (Whatpm::WebIDL::Node): New abstract class.  This class
655            makes things easier.
656            (child_nodes): New attribute.  Unlike DOM's attribute with
657            same name, this attribute returns a dead list of nodes for
658            simplicity.
659            (get_user_data, set_user_data): New methods.
660            (Module idl_text): A SPACE character should be inserted
661            before the |{| character.
662            (Interface idl_text): Support for interface forward declarations.
663            (is_forward_declaration): New attribute.
664    
665    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
666    
667            * WebIDL.pm (type_text): Better serializer.
668    
669    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
670    
671            * WebIDL.pm: Revise forward-compatible parsing so that
672            it now can handle broken extended attributes and as such.
673    
674    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
675    
676            * WebIDL.pm: Real support for extended attributes.
677            Support for extended attributes with arguments.
678    
679    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
680    
681            * WebIDL.pm: Support for |exception| syntax.
682            (Interface->idl_text): Tentative support for inheritances.
683    
684    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
685    
686            * WebIDL.pm: Hierarchical scoped name support was broken.
687            Support for raises, setraises, and getraises syntaxes.
688    
689    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
690    
691            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
692            proper support for types, extended attributes, and exceptions yet).
693            WebIDL parser, version 1 (no support for exceptions yet,
694            no proper support for extended attributes yet).
695    
696    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
697    
698            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
699    
700    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
701    
702            * WebIDL.pm: Support for valuetype and const.
703    
704    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
705            
706            * WebIDL.pm: New module.
707    
708    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
709    
710            * Makefile (Entities.html): URI changed.
711    
712    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
713    
714            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
715    
716    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
717    
718            * HTML.pm.src (_get_next_token): A parse error was missing.
719    
720    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
721    
722            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
723            spec (commented out). (HTML5 revision 1711).
724    
725    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
726    
727            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
728    
729            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
730            as UTF-16 (HTML5 revision 1701).
731    
732    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
733    
734            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
735            1692).
736    
737    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
738    
739            * HTML.pm.src: The secondary insertion mode used when switching
740            to foreign content is the "in body" insertion mode (HTML5 revision
741            1696).
742    
743    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
744    
745            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
746            revision 1697).
747    
748    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
749    
750            * HTML.pm.src: Support for end-of-file token in foreign content
751            insertion mode (HTML5 revision 1693).  Update SVG camelCase
752            attribute list (HTML5 revision 1700).  <textarea> closes
753            </select> (HTML5 revision 1699).  More start tags close in
754            foreign content insertion mode (HTML5 revision 1698).
755    
756    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
757    
758            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
759    
760    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
761    
762            * HTML.pm.src: More robust charset parameter detection (HTML5
763            revision 1674).
764    
765    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
766    
767            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
768            revision 1676).
769    
770    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
771    
772            * HTML.pm.src: Ignore language part of public identifiers for
773            quriks mode detection (HTML5 revision 1679).
774    
775    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
776    
777            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
778            revision 1685).
779    
780    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
781    
782            * HTML.pm.src: Support for EOF in new states for tags (HTML5
783            revision 1684).
784    
785    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
786    
787            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
788            work (HTML5 revision 1690).
789    
790    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
791    
792            * HTML.pm.src (_tree_construction_main): Change handling of
793            end tags in head insertion modes (HTML5 revision 1686).
794            (parse_char_string): Bug fix for non-utf8 character string handlings.
795            (parse_char_stream): |ungetc| does not work well for this context.
796    
797    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
798    
799            * HTML.pm.src (parse_byte_string): Redefined to invoke
800            |parse_byte_stream|.
801            (parse_byte_stream): New method.
802    
803    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
804    
805            * HTML.pm.src (parse_byte_string): Fix the column number reported
806            by encoding layer error reporter.
807    
808    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
809    
810            * HTML.pm.src (parse_byte_string): Use streaming decoder
811            rather than converting the whole byte string and then parsing.
812            Propagate errors in character encoding layer.
813            (get_next_token): Precise error reporting for |bare stago| error.
814    
815    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
816    
817            * HTML.pm.src (parse_char_stream): New method.
818            (parse_char_string): This method is now defined as an invocation
819            of the |parse_char_stream| method.
820    
821    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
822    
823            * HTML.pm.src (parse_byte_string): Report various status
824            of the sniffing as info-level errors.  Support for new
825            decoding framework in parser resestting.
826            (new): Various default error levels were not set.
827    
828    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
829    
830            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
831            algorithm, except for the actual sniffing, is implemented
832            with new framework with Message::Charset::Info.
833    
834    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
835    
836            * CacheManifest.pm (_parse): Drop fragment identifiers from
837            URIs in fallback section (HTML5 revision 1596).
838    
839    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
840    
841            * Makefile (Entities.html): URI has changed.
842    
843    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
844    
845            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
846            1553).
847    
848    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
849    
850            * ContentChecker.pm: Noted that those returned in |table| are
851            no longer table elements, but table objects returned
852            by Whatpm::HTMLTable.
853    
854            * HTMLTable.pm (form_table): Return table element node
855            as |$table->{element}|.
856            (assign_header): Support for the |headers=""| attribute.
857    
858    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
859    
860            * HTMLTable.pm (assign_header): New function; first version
861            with no support for headers="".
862            (form_table): Include table width and height to the returned
863            table object for covenience.  Indexing in column assignement was wrong.
864            Set whether a data cell is empty or not for convenience.
865    
866    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
867    
868            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
869    
870    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
871    
872            * HTMLTable.pm: How table model errors are detected is
873            changed (HTML5 revision 1387).
874    
875    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
876    
877            * HTMLTable.pm: The algorithm now moves |tfoot| elements
878            to the end of the table (HTML5 revision 1380).
879    
880    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
881    
882            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
883            of 1-based (HTML5 revision 1376).
884    
885    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
886    
887            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
888            and "Content-Encoding" no longer prevent sniffing (HTML5
889            revision 1288).
890    
891    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
892    
893            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
894            1282).
895    
896    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
897    
898            * ContentChecker.pm: Support for global attributes.
899            Status of XML specs are added.
900    
901    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
902    
903            * ContentChecker.pm (check_element): Support for |template=""|
904            and |ref=""| attribute (referent element type checking).
905    
906    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
907    
908            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
909            1500) is implemented (except for IDNA part and URI-scheme-specific
910            knowledge).  Line number counting was wrong for LF-only
911            documents.
912    
913    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
914    
915            * HTML.pm.src: Raise an parse error for any disallowed
916            character (HTML5 revision 1263).
917    
918    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
919    
920            * mkentitylist.pl: Support for new HTML5 entity table format (the
921            definition for |AElig;| was missing).
922    
923    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
924    
925            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
926            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
927    
928    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
929    
930            * HTML.pm.src: List of element names that close foreign content
931            insertion mode is added (HTML5 revisions 1412 and 1418).
932    
933    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
934    
935            * HTML.pm.src: Support for |mglyph| and |malignmark|
936            elements (HTML5 revision 1410).
937    
938    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
939    
940            * HTML.pm.src: Support for new long MathML entities (HTML5
941            revision 1406).
942    
943    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
944    
945            * HTML.pm.src: CDATA section support for MathML and SVG
946            elements (HTML5 revision 1404 and 1420).
947    
948    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
949    
950            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
951            elements (HTML5 revision 1404).  Unused !!!macro definitions
952            are removed.
953    
954    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
955    
956            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
957            are raised is changed (HTML5 revision 1404).
958    
959    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
960    
961            * HTML.pm.src: Category sets that are no longer used are removed.
962    
963    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
964    
965            * HTML.pm.src: The ->[1] property of stack entries are now
966            replaced by constants representing element category.
967    
968    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
969    
970            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
971            for error reporting.  (This is a preparation for using constant
972            value for ->[1].)
973    
974    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
975    
976            * RDFXML.pm: Typo fixed.
977    
978    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
979    
980            * ContentChecker.pm: |fact_level| is now treated
981            as same as |must_level|, i.e. level = |m|.
982            (check_element): Make list of URIs in the DOM.
983    
984    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
985    
986            * RDFXML.pm: Language accessor implemented.
987            Local (null-namespace) attribute support.
988    
989    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
990    
991            * RDFXML.pm: Factored out ID checking code.
992    
993    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
994    
995            * RDFXML.pm: TODO items noted.  Validation of ID
996            and URI attributes is implemented.  Warn if unknown
997            value is used in rdf:parseType="" attribute.
998    
999            * URIChecker.pm (check_rdf_uri_reference): New function.
1000    
1001    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1002    
1003            * RDFXML.pm: bnodeid implemented.  Relative references
1004            are now resolved.
1005    
1006    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1007    
1008            * ContentChecker.pm: RDF reification implemented.
1009    
1010            * RDFXML.pm: undef vs false bug fixed.
1011            Reification implemented.
1012    
1013    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1014    
1015            * RDFXML.pm: s/id/ID/ for attribute name.
1016            The |node| arguments are added for |ontriple| calls.
1017            Too many "attribute not allowed" errors were raised.
1018    
1019            * ContentChecker.pm: Initial experimental support for rdf:RDF
1020            element.
1021    
1022    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1023    
1024            * RDFXML.pm: New module.
1025    
1026    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1027    
1028            * HTML.pm.src (set_inner_html): Line/column number
1029            code was old one yet.
1030    
1031    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1032    
1033            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
1034            errors.  Line/column markings for DOCTYPE, comment, and
1035            character tokens are reintroduced; otherwise,
1036            error location for "not HTML5" error and errors
1037            for implied elements are not attached.
1038    
1039    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1040    
1041            * HTML.pm.src: Set line/column numbers to attributes.
1042    
1043            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
1044            (value): Setter implemented.
1045    
1046            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
1047    
1048    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1049    
1050            * HTML.pm.src: Unused line/column markings are removed.
1051    
1052    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1053    
1054            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
1055            flag, which is no longer used.
1056    
1057    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
1058    
1059            * HTML.pm.src: Set line/column information to element nodes.
1060    
1061            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
1062            and !!!insert-element-t): Set line/column information to
1063            element nodes.
1064    
1065    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
1066    
1067            * HTML.pm.src (_get_next_token): The first "<" character
1068            in "<?", "<>", or "</>" should be the error point.
1069    
1070    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1071    
1072            * HTML.pm.src: Some more fixes on error position reporting.
1073    
1074    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1075    
1076            * HTML.pm.src: Token-level precious error reporting.
1077    
1078    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1079    
1080            * HTML.pm.src: Preparation for more precious error point
1081            reporting.
1082    
1083    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1084    
1085            * HTML.pm.src: Error type revised.
1086    
1087    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1088    
1089            * HTML.pm.src: Similar codes are merged together, again.
1090    
1091    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1092    
1093            * HTML.pm.src: Similar codes are merged together.
1094    
1095    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
1096    
1097            * mkhtmlparser.pl: Set "level" parameter to parse errors.
1098    
1099            * HTML.pm.src: Code refined.
1100    
1101    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1102    
1103            * HTML.pm.src: |</body>| treatement has been changed (HTML5
1104            revision 1348).  Note that I really don't know this makes
1105            any difference in the black-box behavior of the parser.
1106    
1107    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1108    
1109            * HTML.pm.src: New end-of-file token implementation (HTML5
1110            revision 1348).
1111    
1112    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1113    
1114            * HTML.pm.src: |applet| support (HTML5 revision 1347).
1115    
1116    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1117    
1118            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1119    
1120    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1121    
1122            * HTML.pm.src: Support for |<input>| in the "in select" insertion
1123            mode and support for the "in select in table" insertion mode (HTML5
1124            revision 1342).
1125    
1126    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1127    
1128            * HTML.pm.src: No foster parenting for <script> and <script>
1129            in non-tainted <table>s (HTML5 revision 1336).
1130    
1131    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1132    
1133            * HTML.pm.src: Ignore white space characters between <html>
1134            and <head> (HTML5 revision 1332).
1135    
1136    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1137    
1138            * HTML.pm.src: Treat <input type=hidden> as if it were a
1139            white space (HTML5 revision 1331).
1140    
1141    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1142    
1143            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1144            element (HTML5 revision 1330).
1145    
1146    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1147    
1148            * HTML.pm.src: <title> is always appended to the current
1149            element (HTML5 revision 1328).
1150    
1151    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1152    
1153            * HTML.pm.src: White space in tainted tables are moved
1154            into foster parents (HTML5 revision 1326).
1155    
1156    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1157    
1158            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1159            revision 1321).
1160    
1161    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1162    
1163            * HTML.pm.src: |</h/n/>| case code rearranged to align with
1164            the spec (HTML5 revision 1320).  Note that we finally complete
1165            all of HTML5 revision 1320 changes.
1166    
1167    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1168    
1169            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1170            tags (HTML5 revision 1320).
1171    
1172    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1173    
1174            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1175    
1176    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1177    
1178            * HTML.pm.src: A "generate implied end tags" code (t409.1)
1179            could not be reached so that it is now removed (HTML5 revision
1180            1320).
1181    
1182    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1183    
1184            * HTML.pm.src: Code for the case of |</div>| and so on are revised
1185            to align with new spec text (HTML5 revision 1320).
1186    
1187    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1188    
1189            * HTML.pm.src: Remove strange |if| condition; however, it should
1190            have had no harm in theory.
1191    
1192    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1193    
1194            * HTML.pm.src (_tree_construction_main): '</p>' in body
1195            case is split from other end tags for the preparation
1196            of implementing HTML revision 1320.
1197    
1198    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1199    
1200            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1201            revision 1320).
1202    
1203    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1204    
1205            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1206            and "div" (HTML5 revision 1318).  Add comments to where
1207            |form| pointer association codes should be inserted (HTML5 revision
1208            1319).
1209    
1210    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
1211    
1212            * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1213    
1214    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1215    
1216            * HTML.pm.src: Since the case t268 should never be reached (no
1217            other token type, there are), it is replaced by a |die| statement.
1218    
1219    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1220    
1221            * HTML.pm.src: Typo fixed.
1222    
1223    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
1224    
1225            * HTML.pm.src (_tree_construction_initial): Some limited quirks
1226            doctypes were not uppercased for comparison.
1227    
1228    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1229    
1230            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1231            are added.
1232    
1233    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1234    
1235            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1236            are set.  Cases that are unlikely reached are noted as so.
1237    
1238    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1239    
1240            * HTML.pm.src: Checkpoints for debugging are added.
1241    
1242            * mkhtmlparser.pl: Support for |!!!cp| syntax.
1243    
1244    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1245    
1246            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1247            for simplicity.
1248    
1249    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1250    
1251            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1252            revision 1307).
1253    
1254    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1255    
1256            * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1257            is set are changed (HTML5 revision 1305).
1258    
1259    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1260    
1261            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1262            1292).  Entities are not parsed in comment-like part in RCDATA
1263            elements (HTML5 revision 1294).  Allow bare & at the end
1264            of attribute value literals (HTML5 revision 1296).  More
1265            quirks mode doctypes (HTML5 revision 1302).  Requires spaces
1266            between attributes and ban attribute names or unquoted
1267            attribute values containing single or double quotes (HTML5
1268            revision 1303).
1269    
1270    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1271    
1272            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
1273            and related errors unless it is an HTML document (though the spec
1274            is unclear on whether it is applied to XHTML document).
1275    
1276            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1277            are added.
1278    
1279    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1280    
1281            * HTML.pm.src: s/local_name/manakai_local_name/g.
1282    
1283    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
1284    
1285            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1286    
1287            * HTML.pm.src: |charset| in |content| attribute is
1288            case-insensitive (HTML5 revision 1270).
1289    
1290    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
1291    
1292            * ContentChecker.pm: New status constants are added.
1293            ($ElementDefault): |status| added.
1294            (check_element): Err for non-standard or deprecated elements.
1295            (_attr_status_info): For non-standard or deprecated attributes.
1296    
1297    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1298    
1299            * ContentChecker.pm (_attr_status_info): New internal method.
1300    
1301    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1302    
1303            * ContentChecker.pm (check_element): Element standardized
1304            status information is now dispatched.
1305    
1306    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1307    
1308            * ContentChecker.pm (check_element): Fix |del|-and-significant
1309            problem by adding some more arguments.
1310    
1311    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1312    
1313            * ContentChecker.pm (check_element): Use context of
1314            container-for-the-purpose-of-content-model element (not transparent
1315            element) for |check_child_element| calling and significant
1316            text flag marking.  This reintroduces |<del>|-and-significant
1317            problem again.
1318    
1319    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1320    
1321            * ContentChecker.pm (check_element): Make semi-transparent
1322            elements ignored for the purpose of phase changes in
1323            content model checking.
1324    
1325    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1326    
1327            * ContentChecker.pm (check_element): In-element state
1328            was not properly managed for transparent cases.
1329    
1330    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1331    
1332            * ContentChecker.pm (check_element): Support for |video|
1333            and |audio| as semi-transparent elements.
1334    
1335    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1336    
1337            * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1338            (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1339            Support for |html:object| as a semi-transparent element.
1340    
1341    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1342    
1343            * ContentChecker.pm (check_element): The way to traverse
1344            the tree is entirely revised to make it easier to track
1345            the state of ancestors/descendants.  As a result of this
1346            revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1347            support for content model checking for HTML elements |figure|,
1348            |object|, |video|, and |audio| and checking for XML elements (and
1349            some XMLNS checkings) are dropped for now.  They will be
1350            reimplemented in due cource.
1351    
1352    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1353    
1354            * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1355            token (HTML5 revision 1225).
1356    
1357    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1358    
1359            * ContentChecker.pm ({unsupported_level}): New value.
1360    
1361            * HTML.pm.src: Save whether |meta| |content| attribute
1362            contains character references or not.
1363    
1364    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1365    
1366            * ContentChecker.pm (_get_children): (Incompleted) attempt to
1367            imlement significant content checking for contents
1368            with |del| elements.
1369    
1370    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1371    
1372            * ContenteChecker.pm ($HTMLTransparentElements): More
1373            elements are added.
1374            (_get_children): HTML |object| elements are now semi-transparent.
1375    
1376            * NanoDOM.pm (manakai_html, manakai_head): New methods.
1377    
1378    2008-02-16  Wakaba  <wakaba@suika.fam.cx>
1379    
1380            * CacheManifest.pm: HTML5 revision 1211 implemented.
1381    
1382            * CacheManifest.pod: Updated.
1383    
1384    2008-02-10  Wakaba  <wakaba@suika.fam.cx>
1385    
1386            * ContentChecker.pm (check_document, check_element): Support
1387            for second argument ($onsubdoc).
1388            (_get_css_parser): Removed (now it is part of WDCC).
1389    
1390    2008-02-09  Wakaba  <wakaba@suika.fam.cx>
1391    
1392            * ContentChecker.pm (_get_css_parser): New.
1393    
1394    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1395    
1396            * ContentChecker.pm ($AnyChecker): Old way to add child elements
1397            for checking had been used.
1398    
1399    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1400    
1401            * ContentChecker.pm (check_element): New todo item type |descendant|.
1402    
1403    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1404    
1405            * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1406            specced in RFC 2046 and RFC 4288.
1407            (application/atom+xml): Definition added.
1408    
1409    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1410    
1411            * URIChecker.pm: Make RFC 3986 should-level errors
1412            warnings (rather than SHOULD-level errors).
1413    
1414    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1415    
1416            * NanoDOM.pm (get_user_data, set_user_data): New methods.
1417    
1418            * HTML.pm.src: A flag for character references in attribute
1419            values are added.  Set |manakai_has_reference| user data
1420            to |charset| attribute.
1421    
1422    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1423    
1424            * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1425            attributes.
1426    
1427            * ContentChecker.pm (check_document): Warn if charset requirements
1428            cannot be tested.
1429    
1430    2007-11-19  Wakaba  <wakaba@suika.fam.cx>
1431    
1432            * HTML.pm.src (parse_byte_string): Detect charset
1433            by universalchardet if charset parameter is not specified.
1434    
1435            * Makefile (Charset-all, Charset-clean): New rules.
1436    
1437    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1438    
1439            * ContentChecker.pm (check_document): Check the existence
1440            of character encoding declaration and selection of encoding
1441            for HTML document.
1442    
1443    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1444    
1445            * ContentType.pm (get_sniffed_type): Return also the official
1446            type in list context.
1447    
1448    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1449    
1450            * ContentType.pm: Sniffing with leading white space
1451            ignoring (HTML5 revisions 1013 and 1016).
1452    
1453    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1454    
1455            * ContentType.pm: HTML5 revision 1013 changes, except for leading
1456            white spaces, are implemented.
1457    
1458    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1459    
1460            * HTML.pm.src (parse_char_string): Set |inner_encoding|
1461            attribute if possible.
1462    
1463    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1464    
1465            * HTML.pm.src (parse_byte_string): New method.
1466            (parse_char_string): New alias for |parse_string|.
1467            (main phase): Invoking "change the encoding" algorithm if desired.
1468    
1469            * HTML.pod: Updated.
1470    
1471    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1472    
1473            * HTML.pod (get_inner_html): Removed.
1474    
1475            * Makefile (HTML-all, HTML-clean): New.
1476    
1477    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1478    
1479            * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1480    
1481    2007-11-08  Wakaba  <wakaba@suika.fam.cx>
1482    
1483            * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1484    
1485    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1486    
1487            * Makefile: |CacheManifest.html| is added.
1488    
1489            * CacheManifest.pod: New file.
1490    
1491    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1492    
1493            * CacheManifest.pm: New module.
1494    
1495    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1496    
1497            * HTML.pm.src: Support for application cache selection algorithm
1498            callback.
1499    
1500    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1501    
1502            * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1503    
1504    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1505    
1506            * Makefile (clean): New rule.
1507    
1508            * NanoDOM.pm (public_id, system_id): New attributes.
1509    
1510    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1511    
1512            * Makefile (CSS-all, CSS-clean, clean): New rules.
1513    
1514    2007-10-14  Wakaba  <wakaba@suika.fam.cx>
1515    
1516            * ContentChecker.pm (check_document): Support for
1517            new |is_xml_root| flag.
1518            (check_element): Support for new |pluses| state.
1519            (_add_pluses): New method.
1520            (_remove_minuses): Support for new |minus| item.
1521    
1522    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1523    
1524            * ContentChecker.pm: Raise specific error for invalid
1525            root element.
1526    
1527    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1528    
1529            * ContentChecker.pm: Set level values for later uses.
1530    
1531    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1532    
1533            * ContentChecker.pm: Support for language tag validation.
1534    
1535    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1536    
1537            * LangTag.pm (check_rfc3066_language_tag): New method.
1538    
1539    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1540    
1541            * LangTag.pm: New module.
1542    
1543    2007-09-04  Wakaba  <wakaba@suika.fam.cx>
1544    
1545            * HTML.pm.src: Some error types were wrong.
1546    
1547    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1548    
1549            * CSS/: New directory.
1550    
1551    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1552    
1553            * ContentChecker.pm (_check_get_children): Support
1554            for |noscript| in |head|.
1555    
1556    2007-08-12  Wakaba  <wakaba@suika.fam.cx>
1557    
1558            * URI/: New directory.
1559    
1560    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1561    
1562            * HTML.pm.src: Tokenizer's states are now represented in
1563            number.
1564    
1565    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1566    
1567            * HTML.pm.src: |or|s for insertion modes are replaced
1568            by |&|s.
1569    
1570    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1571    
1572            * HTML.pm.src: Token types are now represented in number.
1573    
1574    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1575    
1576            * ContentType.pm (SEE ALSO): Updated.
1577    
1578            * HTML.pm.src: Insertion modes are now represented in number.
1579    
1580    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1581    
1582            * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1583            999) is implemented.
1584    
1585    2007-08-08  Wakaba  <wakaba@suika.fam.cx>
1586    
1587            * ContentType.pm: Sniffing for |<script| (HTML5 revision
1588            983) is implemented.
1589    
1590    2007-08-06  Wakaba  <wakaba@suika.fam.cx>
1591    
1592            * ContentChecker.pod: New documentation.
1593    
1594            * Makefile: A rule for |ContentChecker.html| is added.
1595    
1596            * ContentChecker.pm: A pod "LICENSE" section is added.
1597    
1598            * NanoDOM.pm ($VERSION): New variable.
1599    
1600    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1601    
1602            * H2H.pm: |b|, |i|, and |sub| are added to the
1603            list of allowed HTML elements.
1604    
1605    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1606    
1607            * H2H.pm: |samp| is added to the list of allowed
1608            HTML elements.
1609    
1610            * URIChecker.pm (check_iri): New.
1611            (check_iri_reference): Error type for IRI reference
1612            syntax error is changed.
1613    
1614    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1615    
1616            * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1617            is added.
1618            (check_document): Load appropriate module before validation.
1619    
1620    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1621    
1622            * ContentChecker/: New directory.
1623    
1624    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1625    
1626            * ContentChecker.pm: HTML |time| element is implemented.
1627    
1628            * HTMLTable.pm: Comments are updated as HTML5 is revised.
1629    
1630    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1631    
1632            * ContentChecker.pm (check_document): Return value
1633            even if no document element is found.
1634    
1635    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1636    
1637            * HTML.pm.src: |$in_body| is no longer a function.
1638    
1639    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1640    
1641            * HTML.pm.src: The |$in_body| code has been moved down.
1642    
1643    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1644    
1645            * HTML.pm.src: The "trailing end" insertion mode
1646            is split into "after html body" and "after html frameset"
1647            insertion modes.  Their codes are merged with "after body"
1648            and "after frameset" codes.  |$previous_insertion_mode|
1649            has been removed.  "after frameset" code is
1650            merged with "in frameset" code.
1651    
1652    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1653    
1654            * HTML.pm.src: The "before head" insertion mode is
1655            merged with the "in head" insertion mode.
1656    
1657    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1658    
1659            * HTML.pm.src: Cases in "in head" insertion mode are
1660            reorganized.
1661    
1662    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1663    
1664            * HTML.pm.src: Some cases in "in table" insertion mode
1665            are merged.
1666    
1667    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1668    
1669            * HTML.pm.src: The "in row" insertion mode is merged
1670            with "in table" insertion mode.
1671    
1672  2007-07-21  Wakaba  <wakaba@suika.fam.cx>  2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1673    
1674          * HTML.pm.src: The "in table" and "in table body" insertion          * HTML.pm.src: The "in table" and "in table body" insertion

Legend:
Removed from v.1.78  
changed lines
  Added in v.1.360

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24