/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.118 by wakaba, Sun Nov 11 06:54:36 2007 UTC revision 1.354 by wakaba, Tue Oct 14 13:24:52 2008 UTC
# Line 1  Line 1 
1    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Handling of end tags in the foreign content
4            insertion mode was partially wrong, because of wrong bit
5            operations.
6    
7    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
8    
9            * NanoDOM.pm (dom_config): New attribute (do nothing), for
10            Whatpm::XML::Parser support.
11    
12    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
13    
14            * Makefile: New rule to make HTML/Tokenizer.pm is added.
15    
16            * HTML.pm.src: Tokenizer part moved to another file.
17    
18    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
19    
20            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
21    
22    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
23    
24            * HTML.pm.src: Element category constants redefined.
25            
26    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
27    
28            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
29            construction stage synced with the spec (HTML5 revisions 2139 and
30            2302).
31    
32    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
33    
34            * ContentChecker.pm: New error level "html5_fact" added, which
35            should be tentatively used until all of requirements are properly
36            specced as RFC 2119 "MUST" in HTML5.
37    
38    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
39    
40            * ContentChecker.pod: Note on internal flags is added.
41    
42    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
43    
44            * HTML.pm.src: An AAA bug fixed.
45    
46    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
47    
48            * HTML.pm.src: If another node is inserted by the parser, don't
49            reuse existing Text node to append a character (HTML5 revision
50            2124).
51    
52    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
53    
54            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
55            revisions 1731 and 2128).
56    
57    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
58    
59            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
60            Support for end tags of camelCase SVG elements were broken.  A
61            wrong error type text fixed.
62    
63    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
64    
65            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
66    
67    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
68    
69            * HTML.pm.src: Support for new definition of |param| and |source|
70            start tag parsing (HTML5 revision 1731).
71    
72    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
73    
74            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
75    
76    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
77    
78            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
79            revisions 1731 and 1831).
80    
81    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
82    
83            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
84            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
85            revision 1731).
86    
87    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
88    
89            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
90            revision 1731).  End tags of |option| and |optgroup| elements are
91            now optional (HTML5 revision 1731).
92    
93    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
94    
95            * HTML.pm.src: New "special" elements added to the list (HTML5
96            revision 1778).  "strile" -> "strike".
97    
98    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
99    
100            * ContentType.pm (get_sniffed_type): Support for the "better"
101            content sniffing (HTML5 revision 1927).  In a case the official
102            type was not returned when the method is invoked in the list
103            context.
104    
105    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
106    
107            * HTML.pm.src: Character references for non-space C0 characters,
108            including U+000B VT, DEL character, noncharacter code points, are
109            now converted to the U+FFFD character (cf. HTML5 revision 2138).
110    
111    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
112    
113            * ContentChecker.pm: |form=""| check support added.
114    
115    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
116    
117            * ContentChecker.pm: |contextmenu| validness is now checked using
118            |id| and |id_type| properties, and |menu| property is removed.
119    
120    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
121    
122            * ContentChecker.pm: Prepare for |form| |name| attribute's
123            duplication checking.
124    
125    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
126    
127            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
128            unsupported charset="" parameter value (HTML5 revision 2131).
129    
130    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
131    
132            * HTML.pm.src: Reminding places where U+000B is allowed as a space
133            character is fixed (cf. HTML5 revision 1738).
134    
135            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
136            space characters (HTML5 revision 1738).
137    
138    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
139    
140            * HTML.pm.src: The "anything else" case for the "after after body"
141            insertion mode was not updated to swtich to the "in body"
142            insertion mode.  U+000B is no longer a space character for the
143            purpose of tree construction phase (HTML5 revision 1738).
144    
145    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
146    
147            * HTML.pm.src: U+000B is no longer a space character (HTML5
148            revision 1738).
149    
150    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
151    
152            * ContentType.pm: 0x0B is no longer a space character (HTML5
153            revision 1738).
154    
155            * HTML.pm.src: U+000B is no longer a space character for the
156            algorithm for extracting an encoding from a Content-Type (HTML5
157            revision 1738).
158    
159    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
160    
161            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
162    
163    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
164    
165            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
166    
167            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
168            matching.
169    
170    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
171    
172            * LangTag.pm: Warn for private use language subtags.  Error level
173            typos fixed.  Support for Suppress-Script field.
174    
175            * mklangreg.pl: Support for dumping of nested structure.
176    
177    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
178    
179            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
180            recommended case as per RFC 4646.
181    
182    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
183    
184            * LangTag.pm (check_rfc4646_langtag): New method.
185    
186    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
187    
188            * mklangreg.pl: New script.
189    
190            * Makefile: Updated for creation of the module for language subtag
191            registry.
192            
193    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
194    
195            * Makefile: WebIDL.html added.
196    
197            * WebIDL.pod: New documentation.
198    
199    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
200    
201            * WebIDL.pm: Checker's error types are redefined.
202    
203    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
204    
205            * WebIDL.pm: Parser's error types are redefined.  Some forward
206            compatible parsing bugs are fixed.  Some unreachable codes are
207            commented out.
208    
209    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
210    
211            * WebIDL.pm: Support for the reminding extended attributes are
212            added.  It does not satisfy the definition that a forward
213            interface declaration has an extended attribute.  It seems that
214            unless explicitly allowed multiple extended attributes with the
215            same name is not allowed, though it is not explicitly mentioned in
216            the spec.
217    
218    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
219    
220            * WebIDL.pm: Unescapes extended attribute names and extended
221            attribute identifiers.  Preserve whether an extended attribute has
222            an argument list of not.  Support for extended attributes:
223            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
224            NameGetter, NameSetter, and Null.
225            (has_argument_list): New attribute.
226            (idl_text): Stringifies argument lists, if any, even if it is
227            empty.
228    
229    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
230    
231            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
232            |{s_kwd}| in DATA_STATE as default.
233    
234    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
235    
236            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
237            by |{s_kwd}| in DATA_STATE.
238    
239    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
240    
241            * HTML.pm.src: Shorten keys.
242    
243    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
244    
245            * HTML.pm.src: Remove checking for control character, surrogate
246            pair, or noncharacter code points and non-Unicode code
247            points (they should be handled by Whatpm::Charset::UnicodeChecker).
248            (parse_char_stream): Support for the |$get_wrapper| argument and
249            character stream error handlers.
250    
251    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
252    
253            * ContentChecker.pm: Don't call |loda_ns_module|
254            for null-namespace elements/attributes.
255    
256            * HTML.pm.src: Fact out $disallowed_control_chars
257            as a hash.
258    
259    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
260    
261            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
262            and |{next_char}| initializations are moved to initialization
263            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
264            with |parse_char_stream|.
265    
266    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
267    
268            * HTML.pm.src (parse_char_stream): Make |set_next_char|
269            invoke |manakai_read_until|, not only |read|, where
270            possible, to decrease the number of |read| method calls.
271    
272            * mkhtmlparser.pl: Related changes to the aforementioned
273            modification.
274    
275    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
276    
277            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
278            would report character error from now.
279    
280    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
281    
282            * HTML.pm.src: White-space-leaded non-white-space character
283            tokens in "before head insertion mode" was not
284            correctly handled.
285            (set_inner_html): Reimplemented using CharString decodehandle
286            class.  Support for $get_wrapper argument.  Support
287            for |{read_until}| feature.
288    
289    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
290    
291            * HTML.pm.src: Make a "bare ero" error for unknown
292            entities point the "&" character.
293    
294    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
295    
296            * HTML.pm.src: It turns out that U+FFFD don't have to
297            be added to the list of excluded characters.
298    
299    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
300    
301            * HTML.pm.src ($char_onerror): Have character decoder's |line|
302            and |column| a higher priority than the one set by the
303            tokenizer's input handler.
304            ($self->{read_until}): Exclude U+FFFD (but this might
305            not be necessary, since now we do line/column fixup in
306            the character decode handle).
307    
308    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
309    
310            * HTML.pm.src: Use |{read_until}| where possible.
311    
312    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
313    
314            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
315            and |manakai_getc_until| to |manakai_read_until| to
316            reduce the number of string copies.
317    
318    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
319    
320            * HTML.pm.src (parse_char_string): Use newly created
321            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
322            standard feature to |open| a string as a filehandle,
323            since Perl's string filehandle seems not supporting |ungetc|
324            method correctly.
325            (parse_char_stream): Define |{getc_until}| method.
326            (DATA_STATE): Experimental support for |getc_until| feature.
327    
328    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
329    
330            * HTML.pm.src: Check points added to newly added branches.
331    
332    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
333    
334            * HTML.pm.src: Remove |{char}|, which is no longer used.
335            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
336            and replaced by |{prev_state}|.
337    
338            * mkhtmlparser.pl: Remove |{char}| feature.
339            Remove |!!!back-next-input-character;| macro.
340    
341    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
342    
343            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
344            entity related tokenizer states in favor of new states
345            implementing the consume character reference algorithm.
346    
347    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
348    
349            * HTML.pm.src: "Consume a character reference" algorithm is
350            now implemented as a tokenizer's state, rather than
351            a method, with minimum changes (more changes will
352            be made, in due course).  "Bogus comment state"'s inner
353            loop gets removed.
354    
355    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
356    
357            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
358            into their own tokenizer states.
359    
360    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
361    
362            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
363            is split into three states.
364    
365    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
366    
367            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
368            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
369            no longer does the tokenizer have to push back next input
370            characters in those states.
371    
372    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
373    
374            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
375            into four states so that no longer does the tokenizer have to push
376            back next input characters in that state.
377    
378    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
379    
380            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
381            which can be used to insert some wrapper between the character
382            stream handle and the tokenizer.  (It is currently not supported
383            for |set_inner_html| for |Element|s).
384    
385    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
386    
387            * HTML.pm.src: Ignore punctuations in charset names.
388    
389    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
390    
391            * ContentChecker.pm: Support for charset-layer error levels.
392    
393            * HTML.pm.src: Don't specify |text| argument for the
394            |chardecode:fallback| error, since it is not the encoding
395            being used alternatively.
396    
397    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
398    
399            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
400    
401    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
402    
403            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
404    
405    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
406    
407            * HTML.pm.src: Bug fix and sync with the spec with regard
408            to after after frameset insertion mode processing (HTML5
409            revision 1909).  Note that the implementation was wrong
410            per the old spec before the r1909 changes.
411    
412    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
413    
414            * HTMLTable.pm: scope=auto algorithm fix synced with the
415            spec (HTML5 revision 2093).
416            ($process_row): Algorithm step numbers synced with the
417            spec (HTML5 revision 2092).
418    
419    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
420    
421            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
422            revision 2094).
423    
424    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
425    
426            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
427    
428    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
429    
430            * HTML.pm.src: '"' and "'" at the end of attribute
431            name (after another attribute) now raise parse error (HTML5
432            revision 2123).  Empty unquoted attribute values are no
433            longer allowed (HTML5 revision 2122).
434    
435    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
436    
437            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
438            revision 2130).
439    
440    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
441    
442            * ContentChecker.pm: |xml:lang| attribute value must be same
443            as |lang| attribute value for HTML elements (HTML5 revision 2062
444            and so on).
445    
446    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
447    
448            * ContentChecker.pm: Error level definition for |xml_id_error|
449            was missing.
450    
451            * URIChecker.pm: The end of the URL should be marked as the
452            error location for an empty path error.  The position
453            between the userinfo and the port components should be
454            marked as the error location for an empty host error.
455    
456    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
457    
458            * URIChecker.pm: Set parameters representing where in the
459            value the error occurs for errors.  Report unknown
460            address format error in warning level, since address
461            formats are rarely added.  Path segments starting with "/.."
462            were misinterpreted as a dot-segment.
463    
464    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
465    
466            * URIChecker.pm (check_iri_reference): Requires
467            |Message::DOM::DOMImplementation|.
468    
469    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
470    
471            * IMTChecker.pm: Updated for the new error reporting architecture.
472    
473            * ContentChecker.pm: Error levels for IMTs are added.
474    
475    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
476    
477            * H2H.pm (_shift_token): Support for unquoted HTML attribute
478            values.
479    
480    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
481    
482            * CacheManifest.pm: Support for new style of error
483            reports.
484    
485            * HTML.pm.src: Set line=1, column=1 to the document node.
486    
487    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
488    
489            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
490            and URL checkers.  Support for more error levels for bogus
491            langauge tag and URL "standards".
492    
493            * LangTag.pm, URIChecker.pm: Support for new style error
494            level reporting.
495    
496    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
497    
498            * ContentChecker.pm: Support for RDF/XML error levels.
499    
500            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
501            specifying.  Error types are revised.
502    
503    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
504    
505            * ContentChecker.pm: All error reporting method calls are
506            renewed.
507    
508    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
509    
510            * HTML.pm.src: All error type names and "text" parameters
511            are revised.  Use new style for "level" specification.
512    
513            * mkhtmlparser.pl: Use new style for "level" specification.
514    
515    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
516    
517            * WebIDL.pm (parse_char_string): Simplified error
518            reporting process for broken ignored valuetype definition.
519            (Valuetype idl_text): Support for special "DOMString" name.
520    
521    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
522    
523            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
524            terminal of the ScopedName is "DOMString", such that whether
525            the last part of the scoped name is "DOMString" or "_DOMString"
526            later.  It is necessary to determine whether a |typedef|
527            definition should be ignored or not.
528            (parse_char_string): Unescape the identifier of
529            exception members.
530            ($resolve): Return undef for builtin types and sequence<T>
531            types (we might not have to do this, however...).
532            (check): Support checking for Exceptions, Valuetypes,
533            and Typedefs.
534            ($serialize_type): Support for "DOMString::::" syntax.
535            (Typedef idl_text): Output Type as "DOMString" if it
536            is really "DOMString" (i.e. its internal representation
537            is "::DOMString::").
538    
539    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
540    
541            * WebIDL.pm ($resolve): New code, based on resolve code
542            for constant types in the |check| method.
543            (check): Support for checking of attributes, operations, and
544            arguments.
545            (Attribute/Operation idl_text): Exception names in getraises,
546            setraises, and raises clauses is serizlied by |$serialize_type|
547            code.
548    
549    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
550    
551            * WebIDL.pm ($integer): Order of selections are changed to match
552            hexadecimal numbers (the original pattern, taken from the spec,
553            was not work for hexadecimal numbers, because the "0" prefix
554            matches to the [0-7]* part (as an empty string) and therefore
555            it does not match with remaining "x..." part of a "0x..." integer
556            literal.
557            ($get_type): It now returns a string, not an array reference,
558            for regular types and |sequence| types (i.e. it in any case
559            returns a string).
560            ($get_next_token): The second item in the array that represents
561            a integer or float token is now a Perl number value, not the
562            original string representation of the number.
563            (check): Support for const value consistency checking.
564            No extended attribute is defined for constants.
565            (Node subclasses): Use simple strings rather than array references
566            for default data type values.
567            ($serialize_type): Type values are now simple strings.
568            (value): If the new attribute value is a false value, then
569            a FALSE value is set to the attribute.
570    
571    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
572    
573            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
574            in its stringified format ("scoped name" as defined in the
575            spec).  Note that future version of this module should not use
576            array references for type values and the |type_text| attribute
577            should be made obsolete.
578            (parse_char_string): Unescape attribute names.
579            (check): Support for checking of whether inherited interfaces
580            are actually defined or not.  Support for checking of whether
581            interface member identifiers are duplicated or not.
582            ($serialize_type): Scoped names are returned as is.  A future
583            version of this code should escape identifiers other than "DOMString",
584            otherwise the idl_text would be non-conforming.
585    
586    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
587    
588            * WebIDL.pm (parse_char_string): Set line/column numbers
589            to generated nodes.  Unescape identifiers.  Extended attributes
590            for Definition's were ignored.
591            (append_child): Set |parent_node| attribute.
592            (parent_node): New attribute.
593            (check): Support interface/exception members.  Support
594            extended attributes.  Support definition identifier uniqueness
595            constraint.
596            (qualified_name): New attribute.
597            (Interface/Exception idl_text): Extended attributes were
598            not prepended to the returned text.
599    
600    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
601    
602            * WebIDL.pm (parse_char_string): Set line/column numbers
603            to interface object experimentally.  s/shift/pop/g, shift
604            would make things wrong.  Support for interface forward
605            declarations was missing.  Broken interface declarations
606            with no block were not ignored entirely.
607            (Whatpm::WebIDL::Node): New abstract class.  This class
608            makes things easier.
609            (child_nodes): New attribute.  Unlike DOM's attribute with
610            same name, this attribute returns a dead list of nodes for
611            simplicity.
612            (get_user_data, set_user_data): New methods.
613            (Module idl_text): A SPACE character should be inserted
614            before the |{| character.
615            (Interface idl_text): Support for interface forward declarations.
616            (is_forward_declaration): New attribute.
617    
618    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
619    
620            * WebIDL.pm (type_text): Better serializer.
621    
622    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
623    
624            * WebIDL.pm: Revise forward-compatible parsing so that
625            it now can handle broken extended attributes and as such.
626    
627    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
628    
629            * WebIDL.pm: Real support for extended attributes.
630            Support for extended attributes with arguments.
631    
632    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
633    
634            * WebIDL.pm: Support for |exception| syntax.
635            (Interface->idl_text): Tentative support for inheritances.
636    
637    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
638    
639            * WebIDL.pm: Hierarchical scoped name support was broken.
640            Support for raises, setraises, and getraises syntaxes.
641    
642    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
643    
644            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
645            proper support for types, extended attributes, and exceptions yet).
646            WebIDL parser, version 1 (no support for exceptions yet,
647            no proper support for extended attributes yet).
648    
649    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
650    
651            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
652    
653    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
654    
655            * WebIDL.pm: Support for valuetype and const.
656    
657    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
658            
659            * WebIDL.pm: New module.
660    
661    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
662    
663            * Makefile (Entities.html): URI changed.
664    
665    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
666    
667            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
668    
669    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
670    
671            * HTML.pm.src (_get_next_token): A parse error was missing.
672    
673    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
674    
675            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
676            spec (commented out). (HTML5 revision 1711).
677    
678    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
679    
680            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
681    
682            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
683            as UTF-16 (HTML5 revision 1701).
684    
685    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
686    
687            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
688            1692).
689    
690    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
691    
692            * HTML.pm.src: The secondary insertion mode used when switching
693            to foreign content is the "in body" insertion mode (HTML5 revision
694            1696).
695    
696    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
697    
698            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
699            revision 1697).
700    
701    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
702    
703            * HTML.pm.src: Support for end-of-file token in foreign content
704            insertion mode (HTML5 revision 1693).  Update SVG camelCase
705            attribute list (HTML5 revision 1700).  <textarea> closes
706            </select> (HTML5 revision 1699).  More start tags close in
707            foreign content insertion mode (HTML5 revision 1698).
708    
709    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
710    
711            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
712    
713    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
714    
715            * HTML.pm.src: More robust charset parameter detection (HTML5
716            revision 1674).
717    
718    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
719    
720            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
721            revision 1676).
722    
723    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
724    
725            * HTML.pm.src: Ignore language part of public identifiers for
726            quriks mode detection (HTML5 revision 1679).
727    
728    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
729    
730            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
731            revision 1685).
732    
733    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
734    
735            * HTML.pm.src: Support for EOF in new states for tags (HTML5
736            revision 1684).
737    
738    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
739    
740            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
741            work (HTML5 revision 1690).
742    
743    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
744    
745            * HTML.pm.src (_tree_construction_main): Change handling of
746            end tags in head insertion modes (HTML5 revision 1686).
747            (parse_char_string): Bug fix for non-utf8 character string handlings.
748            (parse_char_stream): |ungetc| does not work well for this context.
749    
750    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
751    
752            * HTML.pm.src (parse_byte_string): Redefined to invoke
753            |parse_byte_stream|.
754            (parse_byte_stream): New method.
755    
756    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
757    
758            * HTML.pm.src (parse_byte_string): Fix the column number reported
759            by encoding layer error reporter.
760    
761    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
762    
763            * HTML.pm.src (parse_byte_string): Use streaming decoder
764            rather than converting the whole byte string and then parsing.
765            Propagate errors in character encoding layer.
766            (get_next_token): Precise error reporting for |bare stago| error.
767    
768    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
769    
770            * HTML.pm.src (parse_char_stream): New method.
771            (parse_char_string): This method is now defined as an invocation
772            of the |parse_char_stream| method.
773    
774    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
775    
776            * HTML.pm.src (parse_byte_string): Report various status
777            of the sniffing as info-level errors.  Support for new
778            decoding framework in parser resestting.
779            (new): Various default error levels were not set.
780    
781    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
782    
783            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
784            algorithm, except for the actual sniffing, is implemented
785            with new framework with Message::Charset::Info.
786    
787    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
788    
789            * CacheManifest.pm (_parse): Drop fragment identifiers from
790            URIs in fallback section (HTML5 revision 1596).
791    
792    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
793    
794            * Makefile (Entities.html): URI has changed.
795    
796    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
797    
798            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
799            1553).
800    
801    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
802    
803            * ContentChecker.pm: Noted that those returned in |table| are
804            no longer table elements, but table objects returned
805            by Whatpm::HTMLTable.
806    
807            * HTMLTable.pm (form_table): Return table element node
808            as |$table->{element}|.
809            (assign_header): Support for the |headers=""| attribute.
810    
811    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
812    
813            * HTMLTable.pm (assign_header): New function; first version
814            with no support for headers="".
815            (form_table): Include table width and height to the returned
816            table object for covenience.  Indexing in column assignement was wrong.
817            Set whether a data cell is empty or not for convenience.
818    
819    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
820    
821            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
822    
823    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
824    
825            * HTMLTable.pm: How table model errors are detected is
826            changed (HTML5 revision 1387).
827    
828    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
829    
830            * HTMLTable.pm: The algorithm now moves |tfoot| elements
831            to the end of the table (HTML5 revision 1380).
832    
833    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
834    
835            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
836            of 1-based (HTML5 revision 1376).
837    
838    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
839    
840            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
841            and "Content-Encoding" no longer prevent sniffing (HTML5
842            revision 1288).
843    
844    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
845    
846            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
847            1282).
848    
849    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
850    
851            * ContentChecker.pm: Support for global attributes.
852            Status of XML specs are added.
853    
854    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
855    
856            * ContentChecker.pm (check_element): Support for |template=""|
857            and |ref=""| attribute (referent element type checking).
858    
859    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
860    
861            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
862            1500) is implemented (except for IDNA part and URI-scheme-specific
863            knowledge).  Line number counting was wrong for LF-only
864            documents.
865    
866    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
867    
868            * HTML.pm.src: Raise an parse error for any disallowed
869            character (HTML5 revision 1263).
870    
871    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
872    
873            * mkentitylist.pl: Support for new HTML5 entity table format (the
874            definition for |AElig;| was missing).
875    
876    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
877    
878            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
879            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
880    
881    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
882    
883            * HTML.pm.src: List of element names that close foreign content
884            insertion mode is added (HTML5 revisions 1412 and 1418).
885    
886    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
887    
888            * HTML.pm.src: Support for |mglyph| and |malignmark|
889            elements (HTML5 revision 1410).
890    
891    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
892    
893            * HTML.pm.src: Support for new long MathML entities (HTML5
894            revision 1406).
895    
896    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
897    
898            * HTML.pm.src: CDATA section support for MathML and SVG
899            elements (HTML5 revision 1404 and 1420).
900    
901    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
902    
903            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
904            elements (HTML5 revision 1404).  Unused !!!macro definitions
905            are removed.
906    
907    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
908    
909            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
910            are raised is changed (HTML5 revision 1404).
911    
912    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
913    
914            * HTML.pm.src: Category sets that are no longer used are removed.
915    
916    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
917    
918            * HTML.pm.src: The ->[1] property of stack entries are now
919            replaced by constants representing element category.
920    
921    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
922    
923            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
924            for error reporting.  (This is a preparation for using constant
925            value for ->[1].)
926    
927    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
928    
929            * RDFXML.pm: Typo fixed.
930    
931    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
932    
933            * ContentChecker.pm: |fact_level| is now treated
934            as same as |must_level|, i.e. level = |m|.
935            (check_element): Make list of URIs in the DOM.
936    
937    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
938    
939            * RDFXML.pm: Language accessor implemented.
940            Local (null-namespace) attribute support.
941    
942    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
943    
944            * RDFXML.pm: Factored out ID checking code.
945    
946    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
947    
948            * RDFXML.pm: TODO items noted.  Validation of ID
949            and URI attributes is implemented.  Warn if unknown
950            value is used in rdf:parseType="" attribute.
951    
952            * URIChecker.pm (check_rdf_uri_reference): New function.
953    
954    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
955    
956            * RDFXML.pm: bnodeid implemented.  Relative references
957            are now resolved.
958    
959    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
960    
961            * ContentChecker.pm: RDF reification implemented.
962    
963            * RDFXML.pm: undef vs false bug fixed.
964            Reification implemented.
965    
966    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
967    
968            * RDFXML.pm: s/id/ID/ for attribute name.
969            The |node| arguments are added for |ontriple| calls.
970            Too many "attribute not allowed" errors were raised.
971    
972            * ContentChecker.pm: Initial experimental support for rdf:RDF
973            element.
974    
975    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
976    
977            * RDFXML.pm: New module.
978    
979    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
980    
981            * HTML.pm.src (set_inner_html): Line/column number
982            code was old one yet.
983    
984    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
985    
986            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
987            errors.  Line/column markings for DOCTYPE, comment, and
988            character tokens are reintroduced; otherwise,
989            error location for "not HTML5" error and errors
990            for implied elements are not attached.
991    
992    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
993    
994            * HTML.pm.src: Set line/column numbers to attributes.
995    
996            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
997            (value): Setter implemented.
998    
999            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
1000    
1001    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1002    
1003            * HTML.pm.src: Unused line/column markings are removed.
1004    
1005    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1006    
1007            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
1008            flag, which is no longer used.
1009    
1010    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
1011    
1012            * HTML.pm.src: Set line/column information to element nodes.
1013    
1014            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
1015            and !!!insert-element-t): Set line/column information to
1016            element nodes.
1017    
1018    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
1019    
1020            * HTML.pm.src (_get_next_token): The first "<" character
1021            in "<?", "<>", or "</>" should be the error point.
1022    
1023    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1024    
1025            * HTML.pm.src: Some more fixes on error position reporting.
1026    
1027    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1028    
1029            * HTML.pm.src: Token-level precious error reporting.
1030    
1031    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1032    
1033            * HTML.pm.src: Preparation for more precious error point
1034            reporting.
1035    
1036    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1037    
1038            * HTML.pm.src: Error type revised.
1039    
1040    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1041    
1042            * HTML.pm.src: Similar codes are merged together, again.
1043    
1044    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1045    
1046            * HTML.pm.src: Similar codes are merged together.
1047    
1048    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
1049    
1050            * mkhtmlparser.pl: Set "level" parameter to parse errors.
1051    
1052            * HTML.pm.src: Code refined.
1053    
1054    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1055    
1056            * HTML.pm.src: |</body>| treatement has been changed (HTML5
1057            revision 1348).  Note that I really don't know this makes
1058            any difference in the black-box behavior of the parser.
1059    
1060    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1061    
1062            * HTML.pm.src: New end-of-file token implementation (HTML5
1063            revision 1348).
1064    
1065    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1066    
1067            * HTML.pm.src: |applet| support (HTML5 revision 1347).
1068    
1069    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1070    
1071            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1072    
1073    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1074    
1075            * HTML.pm.src: Support for |<input>| in the "in select" insertion
1076            mode and support for the "in select in table" insertion mode (HTML5
1077            revision 1342).
1078    
1079    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1080    
1081            * HTML.pm.src: No foster parenting for <script> and <script>
1082            in non-tainted <table>s (HTML5 revision 1336).
1083    
1084    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1085    
1086            * HTML.pm.src: Ignore white space characters between <html>
1087            and <head> (HTML5 revision 1332).
1088    
1089    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1090    
1091            * HTML.pm.src: Treat <input type=hidden> as if it were a
1092            white space (HTML5 revision 1331).
1093    
1094    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1095    
1096            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1097            element (HTML5 revision 1330).
1098    
1099    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1100    
1101            * HTML.pm.src: <title> is always appended to the current
1102            element (HTML5 revision 1328).
1103    
1104    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1105    
1106            * HTML.pm.src: White space in tainted tables are moved
1107            into foster parents (HTML5 revision 1326).
1108    
1109    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1110    
1111            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1112            revision 1321).
1113    
1114    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1115    
1116            * HTML.pm.src: |</h/n/>| case code rearranged to align with
1117            the spec (HTML5 revision 1320).  Note that we finally complete
1118            all of HTML5 revision 1320 changes.
1119    
1120    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1121    
1122            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1123            tags (HTML5 revision 1320).
1124    
1125    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1126    
1127            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1128    
1129    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1130    
1131            * HTML.pm.src: A "generate implied end tags" code (t409.1)
1132            could not be reached so that it is now removed (HTML5 revision
1133            1320).
1134    
1135    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1136    
1137            * HTML.pm.src: Code for the case of |</div>| and so on are revised
1138            to align with new spec text (HTML5 revision 1320).
1139    
1140    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1141    
1142            * HTML.pm.src: Remove strange |if| condition; however, it should
1143            have had no harm in theory.
1144    
1145    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1146    
1147            * HTML.pm.src (_tree_construction_main): '</p>' in body
1148            case is split from other end tags for the preparation
1149            of implementing HTML revision 1320.
1150    
1151    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1152    
1153            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1154            revision 1320).
1155    
1156    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1157    
1158            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1159            and "div" (HTML5 revision 1318).  Add comments to where
1160            |form| pointer association codes should be inserted (HTML5 revision
1161            1319).
1162    
1163    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
1164    
1165            * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1166    
1167    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1168    
1169            * HTML.pm.src: Since the case t268 should never be reached (no
1170            other token type, there are), it is replaced by a |die| statement.
1171    
1172    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1173    
1174            * HTML.pm.src: Typo fixed.
1175    
1176    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
1177    
1178            * HTML.pm.src (_tree_construction_initial): Some limited quirks
1179            doctypes were not uppercased for comparison.
1180    
1181    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1182    
1183            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1184            are added.
1185    
1186    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1187    
1188            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1189            are set.  Cases that are unlikely reached are noted as so.
1190    
1191    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1192    
1193            * HTML.pm.src: Checkpoints for debugging are added.
1194    
1195            * mkhtmlparser.pl: Support for |!!!cp| syntax.
1196    
1197    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1198    
1199            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1200            for simplicity.
1201    
1202    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1203    
1204            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1205            revision 1307).
1206    
1207    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1208    
1209            * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1210            is set are changed (HTML5 revision 1305).
1211    
1212    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1213    
1214            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1215            1292).  Entities are not parsed in comment-like part in RCDATA
1216            elements (HTML5 revision 1294).  Allow bare & at the end
1217            of attribute value literals (HTML5 revision 1296).  More
1218            quirks mode doctypes (HTML5 revision 1302).  Requires spaces
1219            between attributes and ban attribute names or unquoted
1220            attribute values containing single or double quotes (HTML5
1221            revision 1303).
1222    
1223    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1224    
1225            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
1226            and related errors unless it is an HTML document (though the spec
1227            is unclear on whether it is applied to XHTML document).
1228    
1229            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1230            are added.
1231    
1232    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1233    
1234            * HTML.pm.src: s/local_name/manakai_local_name/g.
1235    
1236    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
1237    
1238            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1239    
1240            * HTML.pm.src: |charset| in |content| attribute is
1241            case-insensitive (HTML5 revision 1270).
1242    
1243    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
1244    
1245            * ContentChecker.pm: New status constants are added.
1246            ($ElementDefault): |status| added.
1247            (check_element): Err for non-standard or deprecated elements.
1248            (_attr_status_info): For non-standard or deprecated attributes.
1249    
1250    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1251    
1252            * ContentChecker.pm (_attr_status_info): New internal method.
1253    
1254    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1255    
1256            * ContentChecker.pm (check_element): Element standardized
1257            status information is now dispatched.
1258    
1259    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1260    
1261            * ContentChecker.pm (check_element): Fix |del|-and-significant
1262            problem by adding some more arguments.
1263    
1264    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1265    
1266            * ContentChecker.pm (check_element): Use context of
1267            container-for-the-purpose-of-content-model element (not transparent
1268            element) for |check_child_element| calling and significant
1269            text flag marking.  This reintroduces |<del>|-and-significant
1270            problem again.
1271    
1272    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1273    
1274            * ContentChecker.pm (check_element): Make semi-transparent
1275            elements ignored for the purpose of phase changes in
1276            content model checking.
1277    
1278    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1279    
1280            * ContentChecker.pm (check_element): In-element state
1281            was not properly managed for transparent cases.
1282    
1283    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1284    
1285            * ContentChecker.pm (check_element): Support for |video|
1286            and |audio| as semi-transparent elements.
1287    
1288    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1289    
1290            * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1291            (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1292            Support for |html:object| as a semi-transparent element.
1293    
1294    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1295    
1296            * ContentChecker.pm (check_element): The way to traverse
1297            the tree is entirely revised to make it easier to track
1298            the state of ancestors/descendants.  As a result of this
1299            revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1300            support for content model checking for HTML elements |figure|,
1301            |object|, |video|, and |audio| and checking for XML elements (and
1302            some XMLNS checkings) are dropped for now.  They will be
1303            reimplemented in due cource.
1304    
1305    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1306    
1307            * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1308            token (HTML5 revision 1225).
1309    
1310    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1311    
1312            * ContentChecker.pm ({unsupported_level}): New value.
1313    
1314            * HTML.pm.src: Save whether |meta| |content| attribute
1315            contains character references or not.
1316    
1317    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1318    
1319            * ContentChecker.pm (_get_children): (Incompleted) attempt to
1320            imlement significant content checking for contents
1321            with |del| elements.
1322    
1323    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1324    
1325            * ContenteChecker.pm ($HTMLTransparentElements): More
1326            elements are added.
1327            (_get_children): HTML |object| elements are now semi-transparent.
1328    
1329            * NanoDOM.pm (manakai_html, manakai_head): New methods.
1330    
1331    2008-02-16  Wakaba  <wakaba@suika.fam.cx>
1332    
1333            * CacheManifest.pm: HTML5 revision 1211 implemented.
1334    
1335            * CacheManifest.pod: Updated.
1336    
1337    2008-02-10  Wakaba  <wakaba@suika.fam.cx>
1338    
1339            * ContentChecker.pm (check_document, check_element): Support
1340            for second argument ($onsubdoc).
1341            (_get_css_parser): Removed (now it is part of WDCC).
1342    
1343    2008-02-09  Wakaba  <wakaba@suika.fam.cx>
1344    
1345            * ContentChecker.pm (_get_css_parser): New.
1346    
1347    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1348    
1349            * ContentChecker.pm ($AnyChecker): Old way to add child elements
1350            for checking had been used.
1351    
1352    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1353    
1354            * ContentChecker.pm (check_element): New todo item type |descendant|.
1355    
1356    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1357    
1358            * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1359            specced in RFC 2046 and RFC 4288.
1360            (application/atom+xml): Definition added.
1361    
1362    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1363    
1364            * URIChecker.pm: Make RFC 3986 should-level errors
1365            warnings (rather than SHOULD-level errors).
1366    
1367    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1368    
1369            * NanoDOM.pm (get_user_data, set_user_data): New methods.
1370    
1371            * HTML.pm.src: A flag for character references in attribute
1372            values are added.  Set |manakai_has_reference| user data
1373            to |charset| attribute.
1374    
1375    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1376    
1377            * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1378            attributes.
1379    
1380            * ContentChecker.pm (check_document): Warn if charset requirements
1381            cannot be tested.
1382    
1383    2007-11-19  Wakaba  <wakaba@suika.fam.cx>
1384    
1385            * HTML.pm.src (parse_byte_string): Detect charset
1386            by universalchardet if charset parameter is not specified.
1387    
1388            * Makefile (Charset-all, Charset-clean): New rules.
1389    
1390    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1391    
1392            * ContentChecker.pm (check_document): Check the existence
1393            of character encoding declaration and selection of encoding
1394            for HTML document.
1395    
1396    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1397    
1398            * ContentType.pm (get_sniffed_type): Return also the official
1399            type in list context.
1400    
1401    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1402    
1403            * ContentType.pm: Sniffing with leading white space
1404            ignoring (HTML5 revisions 1013 and 1016).
1405    
1406    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1407    
1408            * ContentType.pm: HTML5 revision 1013 changes, except for leading
1409            white spaces, are implemented.
1410    
1411    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1412    
1413            * HTML.pm.src (parse_char_string): Set |inner_encoding|
1414            attribute if possible.
1415    
1416  2007-11-11  Wakaba  <wakaba@suika.fam.cx>  2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1417    
1418          * HTML.pm.src (parse_byte_string): New method.          * HTML.pm.src (parse_byte_string): New method.

Legend:
Removed from v.1.118  
changed lines
  Added in v.1.354

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24