/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.45 by wakaba, Sat Jun 23 14:34:39 2007 UTC revision 1.316 by wakaba, Tue Sep 16 10:43:18 2008 UTC
# Line 1  Line 1 
1    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
2    
3            * WebIDL.pm: Support for the reminding extended attributes are
4            added.  It does not satisfy the definition that a forward
5            interface declaration has an extended attribute.  It seems that
6            unless explicitly allowed multiple extended attributes with the
7            same name is not allowed, though it is not explicitly mentioned in
8            the spec.
9    
10    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
11    
12            * WebIDL.pm: Unescapes extended attribute names and extended
13            attribute identifiers.  Preserve whether an extended attribute has
14            an argument list of not.  Support for extended attributes:
15            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
16            NameGetter, NameSetter, and Null.
17            (has_argument_list): New attribute.
18            (idl_text): Stringifies argument lists, if any, even if it is
19            empty.
20    
21    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
22    
23            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
24            |{s_kwd}| in DATA_STATE as default.
25    
26    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
27    
28            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
29            by |{s_kwd}| in DATA_STATE.
30    
31    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
32    
33            * HTML.pm.src: Shorten keys.
34    
35    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
36    
37            * HTML.pm.src: Remove checking for control character, surrogate
38            pair, or noncharacter code points and non-Unicode code
39            points (they should be handled by Whatpm::Charset::UnicodeChecker).
40            (parse_char_stream): Support for the |$get_wrapper| argument and
41            character stream error handlers.
42    
43    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
44    
45            * ContentChecker.pm: Don't call |loda_ns_module|
46            for null-namespace elements/attributes.
47    
48            * HTML.pm.src: Fact out $disallowed_control_chars
49            as a hash.
50    
51    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
52    
53            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
54            and |{next_char}| initializations are moved to initialization
55            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
56            with |parse_char_stream|.
57    
58    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
59    
60            * HTML.pm.src (parse_char_stream): Make |set_next_char|
61            invoke |manakai_read_until|, not only |read|, where
62            possible, to decrease the number of |read| method calls.
63    
64            * mkhtmlparser.pl: Related changes to the aforementioned
65            modification.
66    
67    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
68    
69            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
70            would report character error from now.
71    
72    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
73    
74            * HTML.pm.src: White-space-leaded non-white-space character
75            tokens in "before head insertion mode" was not
76            correctly handled.
77            (set_inner_html): Reimplemented using CharString decodehandle
78            class.  Support for $get_wrapper argument.  Support
79            for |{read_until}| feature.
80    
81    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
82    
83            * HTML.pm.src: Make a "bare ero" error for unknown
84            entities point the "&" character.
85    
86    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
87    
88            * HTML.pm.src: It turns out that U+FFFD don't have to
89            be added to the list of excluded characters.
90    
91    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
92    
93            * HTML.pm.src ($char_onerror): Have character decoder's |line|
94            and |column| a higher priority than the one set by the
95            tokenizer's input handler.
96            ($self->{read_until}): Exclude U+FFFD (but this might
97            not be necessary, since now we do line/column fixup in
98            the character decode handle).
99    
100    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
101    
102            * HTML.pm.src: Use |{read_until}| where possible.
103    
104    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
105    
106            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
107            and |manakai_getc_until| to |manakai_read_until| to
108            reduce the number of string copies.
109    
110    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
111    
112            * HTML.pm.src (parse_char_string): Use newly created
113            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
114            standard feature to |open| a string as a filehandle,
115            since Perl's string filehandle seems not supporting |ungetc|
116            method correctly.
117            (parse_char_stream): Define |{getc_until}| method.
118            (DATA_STATE): Experimental support for |getc_until| feature.
119    
120    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
121    
122            * HTML.pm.src: Check points added to newly added branches.
123    
124    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
125    
126            * HTML.pm.src: Remove |{char}|, which is no longer used.
127            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
128            and replaced by |{prev_state}|.
129    
130            * mkhtmlparser.pl: Remove |{char}| feature.
131            Remove |!!!back-next-input-character;| macro.
132    
133    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
134    
135            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
136            entity related tokenizer states in favor of new states
137            implementing the consume character reference algorithm.
138    
139    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
140    
141            * HTML.pm.src: "Consume a character reference" algorithm is
142            now implemented as a tokenizer's state, rather than
143            a method, with minimum changes (more changes will
144            be made, in due course).  "Bogus comment state"'s inner
145            loop gets removed.
146    
147    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
148    
149            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
150            into their own tokenizer states.
151    
152    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
153    
154            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
155            is split into three states.
156    
157    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
158    
159            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
160            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
161            no longer does the tokenizer have to push back next input
162            characters in those states.
163    
164    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
165    
166            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
167            into four states so that no longer does the tokenizer have to push
168            back next input characters in that state.
169    
170    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
171    
172            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
173            which can be used to insert some wrapper between the character
174            stream handle and the tokenizer.  (It is currently not supported
175            for |set_inner_html| for |Element|s).
176    
177    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
178    
179            * HTML.pm.src: Ignore punctuations in charset names.
180    
181    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
182    
183            * ContentChecker.pm: Support for charset-layer error levels.
184    
185            * HTML.pm.src: Don't specify |text| argument for the
186            |chardecode:fallback| error, since it is not the encoding
187            being used alternatively.
188    
189    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
190    
191            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
192    
193    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
194    
195            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
196    
197    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
198    
199            * HTML.pm.src: Bug fix and sync with the spec with regard
200            to after after frameset insertion mode processing (HTML5
201            revision 1909).  Note that the implementation was wrong
202            per the old spec before the r1909 changes.
203    
204    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
205    
206            * HTMLTable.pm: scope=auto algorithm fix synced with the
207            spec (HTML5 revision 2093).
208            ($process_row): Algorithm step numbers synced with the
209            spec (HTML5 revision 2092).
210    
211    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
212    
213            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
214            revision 2094).
215    
216    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
217    
218            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
219    
220    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
221    
222            * HTML.pm.src: '"' and "'" at the end of attribute
223            name (after another attribute) now raise parse error (HTML5
224            revision 2123).  Empty unquoted attribute values are no
225            longer allowed (HTML5 revision 2122).
226    
227    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
228    
229            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
230            revision 2130).
231    
232    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
233    
234            * ContentChecker.pm: |xml:lang| attribute value must be same
235            as |lang| attribute value for HTML elements (HTML5 revision 2062
236            and so on).
237    
238    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
239    
240            * ContentChecker.pm: Error level definition for |xml_id_error|
241            was missing.
242    
243            * URIChecker.pm: The end of the URL should be marked as the
244            error location for an empty path error.  The position
245            between the userinfo and the port components should be
246            marked as the error location for an empty host error.
247    
248    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
249    
250            * URIChecker.pm: Set parameters representing where in the
251            value the error occurs for errors.  Report unknown
252            address format error in warning level, since address
253            formats are rarely added.  Path segments starting with "/.."
254            were misinterpreted as a dot-segment.
255    
256    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
257    
258            * URIChecker.pm (check_iri_reference): Requires
259            |Message::DOM::DOMImplementation|.
260    
261    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
262    
263            * IMTChecker.pm: Updated for the new error reporting architecture.
264    
265            * ContentChecker.pm: Error levels for IMTs are added.
266    
267    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
268    
269            * H2H.pm (_shift_token): Support for unquoted HTML attribute
270            values.
271    
272    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
273    
274            * CacheManifest.pm: Support for new style of error
275            reports.
276    
277            * HTML.pm.src: Set line=1, column=1 to the document node.
278    
279    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
280    
281            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
282            and URL checkers.  Support for more error levels for bogus
283            langauge tag and URL "standards".
284    
285            * LangTag.pm, URIChecker.pm: Support for new style error
286            level reporting.
287    
288    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
289    
290            * ContentChecker.pm: Support for RDF/XML error levels.
291    
292            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
293            specifying.  Error types are revised.
294    
295    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
296    
297            * ContentChecker.pm: All error reporting method calls are
298            renewed.
299    
300    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
301    
302            * HTML.pm.src: All error type names and "text" parameters
303            are revised.  Use new style for "level" specification.
304    
305            * mkhtmlparser.pl: Use new style for "level" specification.
306    
307    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
308    
309            * WebIDL.pm (parse_char_string): Simplified error
310            reporting process for broken ignored valuetype definition.
311            (Valuetype idl_text): Support for special "DOMString" name.
312    
313    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
314    
315            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
316            terminal of the ScopedName is "DOMString", such that whether
317            the last part of the scoped name is "DOMString" or "_DOMString"
318            later.  It is necessary to determine whether a |typedef|
319            definition should be ignored or not.
320            (parse_char_string): Unescape the identifier of
321            exception members.
322            ($resolve): Return undef for builtin types and sequence<T>
323            types (we might not have to do this, however...).
324            (check): Support checking for Exceptions, Valuetypes,
325            and Typedefs.
326            ($serialize_type): Support for "DOMString::::" syntax.
327            (Typedef idl_text): Output Type as "DOMString" if it
328            is really "DOMString" (i.e. its internal representation
329            is "::DOMString::").
330    
331    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
332    
333            * WebIDL.pm ($resolve): New code, based on resolve code
334            for constant types in the |check| method.
335            (check): Support for checking of attributes, operations, and
336            arguments.
337            (Attribute/Operation idl_text): Exception names in getraises,
338            setraises, and raises clauses is serizlied by |$serialize_type|
339            code.
340    
341    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
342    
343            * WebIDL.pm ($integer): Order of selections are changed to match
344            hexadecimal numbers (the original pattern, taken from the spec,
345            was not work for hexadecimal numbers, because the "0" prefix
346            matches to the [0-7]* part (as an empty string) and therefore
347            it does not match with remaining "x..." part of a "0x..." integer
348            literal.
349            ($get_type): It now returns a string, not an array reference,
350            for regular types and |sequence| types (i.e. it in any case
351            returns a string).
352            ($get_next_token): The second item in the array that represents
353            a integer or float token is now a Perl number value, not the
354            original string representation of the number.
355            (check): Support for const value consistency checking.
356            No extended attribute is defined for constants.
357            (Node subclasses): Use simple strings rather than array references
358            for default data type values.
359            ($serialize_type): Type values are now simple strings.
360            (value): If the new attribute value is a false value, then
361            a FALSE value is set to the attribute.
362    
363    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
364    
365            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
366            in its stringified format ("scoped name" as defined in the
367            spec).  Note that future version of this module should not use
368            array references for type values and the |type_text| attribute
369            should be made obsolete.
370            (parse_char_string): Unescape attribute names.
371            (check): Support for checking of whether inherited interfaces
372            are actually defined or not.  Support for checking of whether
373            interface member identifiers are duplicated or not.
374            ($serialize_type): Scoped names are returned as is.  A future
375            version of this code should escape identifiers other than "DOMString",
376            otherwise the idl_text would be non-conforming.
377    
378    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
379    
380            * WebIDL.pm (parse_char_string): Set line/column numbers
381            to generated nodes.  Unescape identifiers.  Extended attributes
382            for Definition's were ignored.
383            (append_child): Set |parent_node| attribute.
384            (parent_node): New attribute.
385            (check): Support interface/exception members.  Support
386            extended attributes.  Support definition identifier uniqueness
387            constraint.
388            (qualified_name): New attribute.
389            (Interface/Exception idl_text): Extended attributes were
390            not prepended to the returned text.
391    
392    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
393    
394            * WebIDL.pm (parse_char_string): Set line/column numbers
395            to interface object experimentally.  s/shift/pop/g, shift
396            would make things wrong.  Support for interface forward
397            declarations was missing.  Broken interface declarations
398            with no block were not ignored entirely.
399            (Whatpm::WebIDL::Node): New abstract class.  This class
400            makes things easier.
401            (child_nodes): New attribute.  Unlike DOM's attribute with
402            same name, this attribute returns a dead list of nodes for
403            simplicity.
404            (get_user_data, set_user_data): New methods.
405            (Module idl_text): A SPACE character should be inserted
406            before the |{| character.
407            (Interface idl_text): Support for interface forward declarations.
408            (is_forward_declaration): New attribute.
409    
410    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
411    
412            * WebIDL.pm (type_text): Better serializer.
413    
414    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
415    
416            * WebIDL.pm: Revise forward-compatible parsing so that
417            it now can handle broken extended attributes and as such.
418    
419    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
420    
421            * WebIDL.pm: Real support for extended attributes.
422            Support for extended attributes with arguments.
423    
424    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
425    
426            * WebIDL.pm: Support for |exception| syntax.
427            (Interface->idl_text): Tentative support for inheritances.
428    
429    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
430    
431            * WebIDL.pm: Hierarchical scoped name support was broken.
432            Support for raises, setraises, and getraises syntaxes.
433    
434    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
435    
436            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
437            proper support for types, extended attributes, and exceptions yet).
438            WebIDL parser, version 1 (no support for exceptions yet,
439            no proper support for extended attributes yet).
440    
441    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
442    
443            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
444    
445    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
446    
447            * WebIDL.pm: Support for valuetype and const.
448    
449    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
450            
451            * WebIDL.pm: New module.
452    
453    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
454    
455            * Makefile (Entities.html): URI changed.
456    
457    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
458    
459            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
460    
461    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
462    
463            * HTML.pm.src (_get_next_token): A parse error was missing.
464    
465    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
466    
467            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
468            spec (commented out). (HTML5 revision 1711).
469    
470    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
471    
472            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
473    
474            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
475            as UTF-16 (HTML5 revision 1701).
476    
477    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
478    
479            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
480            1692).
481    
482    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
483    
484            * HTML.pm.src: The secondary insertion mode used when switching
485            to foreign content is the "in body" insertion mode (HTML5 revision
486            1696).
487    
488    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
489    
490            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
491            revision 1697).
492    
493    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
494    
495            * HTML.pm.src: Support for end-of-file token in foreign content
496            insertion mode (HTML5 revision 1693).  Update SVG camelCase
497            attribute list (HTML5 revision 1700).  <textarea> closes
498            </select> (HTML5 revision 1699).  More start tags close in
499            foreign content insertion mode (HTML5 revision 1698).
500    
501    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
502    
503            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
504    
505    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
506    
507            * HTML.pm.src: More robust charset parameter detection (HTML5
508            revision 1674).
509    
510    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
511    
512            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
513            revision 1676).
514    
515    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
516    
517            * HTML.pm.src: Ignore language part of public identifiers for
518            quriks mode detection (HTML5 revision 1679).
519    
520    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
521    
522            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
523            revision 1685).
524    
525    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
526    
527            * HTML.pm.src: Support for EOF in new states for tags (HTML5
528            revision 1684).
529    
530    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
531    
532            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
533            work (HTML5 revision 1690).
534    
535    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
536    
537            * HTML.pm.src (_tree_construction_main): Change handling of
538            end tags in head insertion modes (HTML5 revision 1686).
539            (parse_char_string): Bug fix for non-utf8 character string handlings.
540            (parse_char_stream): |ungetc| does not work well for this context.
541    
542    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
543    
544            * HTML.pm.src (parse_byte_string): Redefined to invoke
545            |parse_byte_stream|.
546            (parse_byte_stream): New method.
547    
548    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
549    
550            * HTML.pm.src (parse_byte_string): Fix the column number reported
551            by encoding layer error reporter.
552    
553    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
554    
555            * HTML.pm.src (parse_byte_string): Use streaming decoder
556            rather than converting the whole byte string and then parsing.
557            Propagate errors in character encoding layer.
558            (get_next_token): Precise error reporting for |bare stago| error.
559    
560    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
561    
562            * HTML.pm.src (parse_char_stream): New method.
563            (parse_char_string): This method is now defined as an invocation
564            of the |parse_char_stream| method.
565    
566    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
567    
568            * HTML.pm.src (parse_byte_string): Report various status
569            of the sniffing as info-level errors.  Support for new
570            decoding framework in parser resestting.
571            (new): Various default error levels were not set.
572    
573    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
574    
575            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
576            algorithm, except for the actual sniffing, is implemented
577            with new framework with Message::Charset::Info.
578    
579    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
580    
581            * CacheManifest.pm (_parse): Drop fragment identifiers from
582            URIs in fallback section (HTML5 revision 1596).
583    
584    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
585    
586            * Makefile (Entities.html): URI has changed.
587    
588    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
589    
590            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
591            1553).
592    
593    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
594    
595            * ContentChecker.pm: Noted that those returned in |table| are
596            no longer table elements, but table objects returned
597            by Whatpm::HTMLTable.
598    
599            * HTMLTable.pm (form_table): Return table element node
600            as |$table->{element}|.
601            (assign_header): Support for the |headers=""| attribute.
602    
603    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
604    
605            * HTMLTable.pm (assign_header): New function; first version
606            with no support for headers="".
607            (form_table): Include table width and height to the returned
608            table object for covenience.  Indexing in column assignement was wrong.
609            Set whether a data cell is empty or not for convenience.
610    
611    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
612    
613            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
614    
615    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
616    
617            * HTMLTable.pm: How table model errors are detected is
618            changed (HTML5 revision 1387).
619    
620    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
621    
622            * HTMLTable.pm: The algorithm now moves |tfoot| elements
623            to the end of the table (HTML5 revision 1380).
624    
625    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
626    
627            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
628            of 1-based (HTML5 revision 1376).
629    
630    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
631    
632            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
633            and "Content-Encoding" no longer prevent sniffing (HTML5
634            revision 1288).
635    
636    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
637    
638            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
639            1282).
640    
641    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
642    
643            * ContentChecker.pm: Support for global attributes.
644            Status of XML specs are added.
645    
646    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
647    
648            * ContentChecker.pm (check_element): Support for |template=""|
649            and |ref=""| attribute (referent element type checking).
650    
651    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
652    
653            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
654            1500) is implemented (except for IDNA part and URI-scheme-specific
655            knowledge).  Line number counting was wrong for LF-only
656            documents.
657    
658    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
659    
660            * HTML.pm.src: Raise an parse error for any disallowed
661            character (HTML5 revision 1263).
662    
663    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
664    
665            * mkentitylist.pl: Support for new HTML5 entity table format (the
666            definition for |AElig;| was missing).
667    
668    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
669    
670            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
671            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
672    
673    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
674    
675            * HTML.pm.src: List of element names that close foreign content
676            insertion mode is added (HTML5 revisions 1412 and 1418).
677    
678    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
679    
680            * HTML.pm.src: Support for |mglyph| and |malignmark|
681            elements (HTML5 revision 1410).
682    
683    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
684    
685            * HTML.pm.src: Support for new long MathML entities (HTML5
686            revision 1406).
687    
688    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
689    
690            * HTML.pm.src: CDATA section support for MathML and SVG
691            elements (HTML5 revision 1404 and 1420).
692    
693    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
694    
695            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
696            elements (HTML5 revision 1404).  Unused !!!macro definitions
697            are removed.
698    
699    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
700    
701            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
702            are raised is changed (HTML5 revision 1404).
703    
704    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
705    
706            * HTML.pm.src: Category sets that are no longer used are removed.
707    
708    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
709    
710            * HTML.pm.src: The ->[1] property of stack entries are now
711            replaced by constants representing element category.
712    
713    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
714    
715            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
716            for error reporting.  (This is a preparation for using constant
717            value for ->[1].)
718    
719    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
720    
721            * RDFXML.pm: Typo fixed.
722    
723    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
724    
725            * ContentChecker.pm: |fact_level| is now treated
726            as same as |must_level|, i.e. level = |m|.
727            (check_element): Make list of URIs in the DOM.
728    
729    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
730    
731            * RDFXML.pm: Language accessor implemented.
732            Local (null-namespace) attribute support.
733    
734    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
735    
736            * RDFXML.pm: Factored out ID checking code.
737    
738    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
739    
740            * RDFXML.pm: TODO items noted.  Validation of ID
741            and URI attributes is implemented.  Warn if unknown
742            value is used in rdf:parseType="" attribute.
743    
744            * URIChecker.pm (check_rdf_uri_reference): New function.
745    
746    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
747    
748            * RDFXML.pm: bnodeid implemented.  Relative references
749            are now resolved.
750    
751    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
752    
753            * ContentChecker.pm: RDF reification implemented.
754    
755            * RDFXML.pm: undef vs false bug fixed.
756            Reification implemented.
757    
758    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
759    
760            * RDFXML.pm: s/id/ID/ for attribute name.
761            The |node| arguments are added for |ontriple| calls.
762            Too many "attribute not allowed" errors were raised.
763    
764            * ContentChecker.pm: Initial experimental support for rdf:RDF
765            element.
766    
767    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
768    
769            * RDFXML.pm: New module.
770    
771    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
772    
773            * HTML.pm.src (set_inner_html): Line/column number
774            code was old one yet.
775    
776    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
777    
778            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
779            errors.  Line/column markings for DOCTYPE, comment, and
780            character tokens are reintroduced; otherwise,
781            error location for "not HTML5" error and errors
782            for implied elements are not attached.
783    
784    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
785    
786            * HTML.pm.src: Set line/column numbers to attributes.
787    
788            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
789            (value): Setter implemented.
790    
791            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
792    
793    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
794    
795            * HTML.pm.src: Unused line/column markings are removed.
796    
797    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
798    
799            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
800            flag, which is no longer used.
801    
802    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
803    
804            * HTML.pm.src: Set line/column information to element nodes.
805    
806            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
807            and !!!insert-element-t): Set line/column information to
808            element nodes.
809    
810    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
811    
812            * HTML.pm.src (_get_next_token): The first "<" character
813            in "<?", "<>", or "</>" should be the error point.
814    
815    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
816    
817            * HTML.pm.src: Some more fixes on error position reporting.
818    
819    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
820    
821            * HTML.pm.src: Token-level precious error reporting.
822    
823    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
824    
825            * HTML.pm.src: Preparation for more precious error point
826            reporting.
827    
828    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
829    
830            * HTML.pm.src: Error type revised.
831    
832    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
833    
834            * HTML.pm.src: Similar codes are merged together, again.
835    
836    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
837    
838            * HTML.pm.src: Similar codes are merged together.
839    
840    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
841    
842            * mkhtmlparser.pl: Set "level" parameter to parse errors.
843    
844            * HTML.pm.src: Code refined.
845    
846    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
847    
848            * HTML.pm.src: |</body>| treatement has been changed (HTML5
849            revision 1348).  Note that I really don't know this makes
850            any difference in the black-box behavior of the parser.
851    
852    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
853    
854            * HTML.pm.src: New end-of-file token implementation (HTML5
855            revision 1348).
856    
857    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
858    
859            * HTML.pm.src: |applet| support (HTML5 revision 1347).
860    
861    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
862    
863            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
864    
865    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
866    
867            * HTML.pm.src: Support for |<input>| in the "in select" insertion
868            mode and support for the "in select in table" insertion mode (HTML5
869            revision 1342).
870    
871    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
872    
873            * HTML.pm.src: No foster parenting for <script> and <script>
874            in non-tainted <table>s (HTML5 revision 1336).
875    
876    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
877    
878            * HTML.pm.src: Ignore white space characters between <html>
879            and <head> (HTML5 revision 1332).
880    
881    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
882    
883            * HTML.pm.src: Treat <input type=hidden> as if it were a
884            white space (HTML5 revision 1331).
885    
886    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
887    
888            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
889            element (HTML5 revision 1330).
890    
891    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
892    
893            * HTML.pm.src: <title> is always appended to the current
894            element (HTML5 revision 1328).
895    
896    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
897    
898            * HTML.pm.src: White space in tainted tables are moved
899            into foster parents (HTML5 revision 1326).
900    
901    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
902    
903            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
904            revision 1321).
905    
906    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
907    
908            * HTML.pm.src: |</h/n/>| case code rearranged to align with
909            the spec (HTML5 revision 1320).  Note that we finally complete
910            all of HTML5 revision 1320 changes.
911    
912    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
913    
914            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
915            tags (HTML5 revision 1320).
916    
917    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
918    
919            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
920    
921    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
922    
923            * HTML.pm.src: A "generate implied end tags" code (t409.1)
924            could not be reached so that it is now removed (HTML5 revision
925            1320).
926    
927    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
928    
929            * HTML.pm.src: Code for the case of |</div>| and so on are revised
930            to align with new spec text (HTML5 revision 1320).
931    
932    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
933    
934            * HTML.pm.src: Remove strange |if| condition; however, it should
935            have had no harm in theory.
936    
937    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
938    
939            * HTML.pm.src (_tree_construction_main): '</p>' in body
940            case is split from other end tags for the preparation
941            of implementing HTML revision 1320.
942    
943    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
944    
945            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
946            revision 1320).
947    
948    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
949    
950            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
951            and "div" (HTML5 revision 1318).  Add comments to where
952            |form| pointer association codes should be inserted (HTML5 revision
953            1319).
954    
955    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
956    
957            * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
958    
959    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
960    
961            * HTML.pm.src: Since the case t268 should never be reached (no
962            other token type, there are), it is replaced by a |die| statement.
963    
964    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
965    
966            * HTML.pm.src: Typo fixed.
967    
968    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
969    
970            * HTML.pm.src (_tree_construction_initial): Some limited quirks
971            doctypes were not uppercased for comparison.
972    
973    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
974    
975            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
976            are added.
977    
978    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
979    
980            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
981            are set.  Cases that are unlikely reached are noted as so.
982    
983    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
984    
985            * HTML.pm.src: Checkpoints for debugging are added.
986    
987            * mkhtmlparser.pl: Support for |!!!cp| syntax.
988    
989    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
990    
991            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
992            for simplicity.
993    
994    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
995    
996            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
997            revision 1307).
998    
999    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1000    
1001            * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1002            is set are changed (HTML5 revision 1305).
1003    
1004    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1005    
1006            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1007            1292).  Entities are not parsed in comment-like part in RCDATA
1008            elements (HTML5 revision 1294).  Allow bare & at the end
1009            of attribute value literals (HTML5 revision 1296).  More
1010            quirks mode doctypes (HTML5 revision 1302).  Requires spaces
1011            between attributes and ban attribute names or unquoted
1012            attribute values containing single or double quotes (HTML5
1013            revision 1303).
1014    
1015    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1016    
1017            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
1018            and related errors unless it is an HTML document (though the spec
1019            is unclear on whether it is applied to XHTML document).
1020    
1021            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1022            are added.
1023    
1024    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1025    
1026            * HTML.pm.src: s/local_name/manakai_local_name/g.
1027    
1028    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
1029    
1030            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1031    
1032            * HTML.pm.src: |charset| in |content| attribute is
1033            case-insensitive (HTML5 revision 1270).
1034    
1035    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
1036    
1037            * ContentChecker.pm: New status constants are added.
1038            ($ElementDefault): |status| added.
1039            (check_element): Err for non-standard or deprecated elements.
1040            (_attr_status_info): For non-standard or deprecated attributes.
1041    
1042    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1043    
1044            * ContentChecker.pm (_attr_status_info): New internal method.
1045    
1046    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1047    
1048            * ContentChecker.pm (check_element): Element standardized
1049            status information is now dispatched.
1050    
1051    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1052    
1053            * ContentChecker.pm (check_element): Fix |del|-and-significant
1054            problem by adding some more arguments.
1055    
1056    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1057    
1058            * ContentChecker.pm (check_element): Use context of
1059            container-for-the-purpose-of-content-model element (not transparent
1060            element) for |check_child_element| calling and significant
1061            text flag marking.  This reintroduces |<del>|-and-significant
1062            problem again.
1063    
1064    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1065    
1066            * ContentChecker.pm (check_element): Make semi-transparent
1067            elements ignored for the purpose of phase changes in
1068            content model checking.
1069    
1070    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1071    
1072            * ContentChecker.pm (check_element): In-element state
1073            was not properly managed for transparent cases.
1074    
1075    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1076    
1077            * ContentChecker.pm (check_element): Support for |video|
1078            and |audio| as semi-transparent elements.
1079    
1080    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1081    
1082            * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1083            (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1084            Support for |html:object| as a semi-transparent element.
1085    
1086    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1087    
1088            * ContentChecker.pm (check_element): The way to traverse
1089            the tree is entirely revised to make it easier to track
1090            the state of ancestors/descendants.  As a result of this
1091            revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1092            support for content model checking for HTML elements |figure|,
1093            |object|, |video|, and |audio| and checking for XML elements (and
1094            some XMLNS checkings) are dropped for now.  They will be
1095            reimplemented in due cource.
1096    
1097    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1098    
1099            * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1100            token (HTML5 revision 1225).
1101    
1102    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1103    
1104            * ContentChecker.pm ({unsupported_level}): New value.
1105    
1106            * HTML.pm.src: Save whether |meta| |content| attribute
1107            contains character references or not.
1108    
1109    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1110    
1111            * ContentChecker.pm (_get_children): (Incompleted) attempt to
1112            imlement significant content checking for contents
1113            with |del| elements.
1114    
1115    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1116    
1117            * ContenteChecker.pm ($HTMLTransparentElements): More
1118            elements are added.
1119            (_get_children): HTML |object| elements are now semi-transparent.
1120    
1121            * NanoDOM.pm (manakai_html, manakai_head): New methods.
1122    
1123    2008-02-16  Wakaba  <wakaba@suika.fam.cx>
1124    
1125            * CacheManifest.pm: HTML5 revision 1211 implemented.
1126    
1127            * CacheManifest.pod: Updated.
1128    
1129    2008-02-10  Wakaba  <wakaba@suika.fam.cx>
1130    
1131            * ContentChecker.pm (check_document, check_element): Support
1132            for second argument ($onsubdoc).
1133            (_get_css_parser): Removed (now it is part of WDCC).
1134    
1135    2008-02-09  Wakaba  <wakaba@suika.fam.cx>
1136    
1137            * ContentChecker.pm (_get_css_parser): New.
1138    
1139    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1140    
1141            * ContentChecker.pm ($AnyChecker): Old way to add child elements
1142            for checking had been used.
1143    
1144    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1145    
1146            * ContentChecker.pm (check_element): New todo item type |descendant|.
1147    
1148    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1149    
1150            * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1151            specced in RFC 2046 and RFC 4288.
1152            (application/atom+xml): Definition added.
1153    
1154    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1155    
1156            * URIChecker.pm: Make RFC 3986 should-level errors
1157            warnings (rather than SHOULD-level errors).
1158    
1159    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1160    
1161            * NanoDOM.pm (get_user_data, set_user_data): New methods.
1162    
1163            * HTML.pm.src: A flag for character references in attribute
1164            values are added.  Set |manakai_has_reference| user data
1165            to |charset| attribute.
1166    
1167    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1168    
1169            * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1170            attributes.
1171    
1172            * ContentChecker.pm (check_document): Warn if charset requirements
1173            cannot be tested.
1174    
1175    2007-11-19  Wakaba  <wakaba@suika.fam.cx>
1176    
1177            * HTML.pm.src (parse_byte_string): Detect charset
1178            by universalchardet if charset parameter is not specified.
1179    
1180            * Makefile (Charset-all, Charset-clean): New rules.
1181    
1182    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1183    
1184            * ContentChecker.pm (check_document): Check the existence
1185            of character encoding declaration and selection of encoding
1186            for HTML document.
1187    
1188    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1189    
1190            * ContentType.pm (get_sniffed_type): Return also the official
1191            type in list context.
1192    
1193    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1194    
1195            * ContentType.pm: Sniffing with leading white space
1196            ignoring (HTML5 revisions 1013 and 1016).
1197    
1198    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1199    
1200            * ContentType.pm: HTML5 revision 1013 changes, except for leading
1201            white spaces, are implemented.
1202    
1203    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1204    
1205            * HTML.pm.src (parse_char_string): Set |inner_encoding|
1206            attribute if possible.
1207    
1208    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1209    
1210            * HTML.pm.src (parse_byte_string): New method.
1211            (parse_char_string): New alias for |parse_string|.
1212            (main phase): Invoking "change the encoding" algorithm if desired.
1213    
1214            * HTML.pod: Updated.
1215    
1216    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1217    
1218            * HTML.pod (get_inner_html): Removed.
1219    
1220            * Makefile (HTML-all, HTML-clean): New.
1221    
1222    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1223    
1224            * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1225    
1226    2007-11-08  Wakaba  <wakaba@suika.fam.cx>
1227    
1228            * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1229    
1230    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1231    
1232            * Makefile: |CacheManifest.html| is added.
1233    
1234            * CacheManifest.pod: New file.
1235    
1236    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1237    
1238            * CacheManifest.pm: New module.
1239    
1240    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1241    
1242            * HTML.pm.src: Support for application cache selection algorithm
1243            callback.
1244    
1245    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1246    
1247            * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1248    
1249    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1250    
1251            * Makefile (clean): New rule.
1252    
1253            * NanoDOM.pm (public_id, system_id): New attributes.
1254    
1255    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1256    
1257            * Makefile (CSS-all, CSS-clean, clean): New rules.
1258    
1259    2007-10-14  Wakaba  <wakaba@suika.fam.cx>
1260    
1261            * ContentChecker.pm (check_document): Support for
1262            new |is_xml_root| flag.
1263            (check_element): Support for new |pluses| state.
1264            (_add_pluses): New method.
1265            (_remove_minuses): Support for new |minus| item.
1266    
1267    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1268    
1269            * ContentChecker.pm: Raise specific error for invalid
1270            root element.
1271    
1272    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1273    
1274            * ContentChecker.pm: Set level values for later uses.
1275    
1276    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1277    
1278            * ContentChecker.pm: Support for language tag validation.
1279    
1280    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1281    
1282            * LangTag.pm (check_rfc3066_language_tag): New method.
1283    
1284    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1285    
1286            * LangTag.pm: New module.
1287    
1288    2007-09-04  Wakaba  <wakaba@suika.fam.cx>
1289    
1290            * HTML.pm.src: Some error types were wrong.
1291    
1292    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1293    
1294            * CSS/: New directory.
1295    
1296    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1297    
1298            * ContentChecker.pm (_check_get_children): Support
1299            for |noscript| in |head|.
1300    
1301    2007-08-12  Wakaba  <wakaba@suika.fam.cx>
1302    
1303            * URI/: New directory.
1304    
1305    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1306    
1307            * HTML.pm.src: Tokenizer's states are now represented in
1308            number.
1309    
1310    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1311    
1312            * HTML.pm.src: |or|s for insertion modes are replaced
1313            by |&|s.
1314    
1315    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1316    
1317            * HTML.pm.src: Token types are now represented in number.
1318    
1319    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1320    
1321            * ContentType.pm (SEE ALSO): Updated.
1322    
1323            * HTML.pm.src: Insertion modes are now represented in number.
1324    
1325    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1326    
1327            * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1328            999) is implemented.
1329    
1330    2007-08-08  Wakaba  <wakaba@suika.fam.cx>
1331    
1332            * ContentType.pm: Sniffing for |<script| (HTML5 revision
1333            983) is implemented.
1334    
1335    2007-08-06  Wakaba  <wakaba@suika.fam.cx>
1336    
1337            * ContentChecker.pod: New documentation.
1338    
1339            * Makefile: A rule for |ContentChecker.html| is added.
1340    
1341            * ContentChecker.pm: A pod "LICENSE" section is added.
1342    
1343            * NanoDOM.pm ($VERSION): New variable.
1344    
1345    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1346    
1347            * H2H.pm: |b|, |i|, and |sub| are added to the
1348            list of allowed HTML elements.
1349    
1350    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1351    
1352            * H2H.pm: |samp| is added to the list of allowed
1353            HTML elements.
1354    
1355            * URIChecker.pm (check_iri): New.
1356            (check_iri_reference): Error type for IRI reference
1357            syntax error is changed.
1358    
1359    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1360    
1361            * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1362            is added.
1363            (check_document): Load appropriate module before validation.
1364    
1365    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1366    
1367            * ContentChecker/: New directory.
1368    
1369    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1370    
1371            * ContentChecker.pm: HTML |time| element is implemented.
1372    
1373            * HTMLTable.pm: Comments are updated as HTML5 is revised.
1374    
1375    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1376    
1377            * ContentChecker.pm (check_document): Return value
1378            even if no document element is found.
1379    
1380    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1381    
1382            * HTML.pm.src: |$in_body| is no longer a function.
1383    
1384    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1385    
1386            * HTML.pm.src: The |$in_body| code has been moved down.
1387    
1388    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1389    
1390            * HTML.pm.src: The "trailing end" insertion mode
1391            is split into "after html body" and "after html frameset"
1392            insertion modes.  Their codes are merged with "after body"
1393            and "after frameset" codes.  |$previous_insertion_mode|
1394            has been removed.  "after frameset" code is
1395            merged with "in frameset" code.
1396    
1397    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1398    
1399            * HTML.pm.src: The "before head" insertion mode is
1400            merged with the "in head" insertion mode.
1401    
1402    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1403    
1404            * HTML.pm.src: Cases in "in head" insertion mode are
1405            reorganized.
1406    
1407    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1408    
1409            * HTML.pm.src: Some cases in "in table" insertion mode
1410            are merged.
1411    
1412    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1413    
1414            * HTML.pm.src: The "in row" insertion mode is merged
1415            with "in table" insertion mode.
1416    
1417    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1418    
1419            * HTML.pm.src: The "in table" and "in table body" insertion
1420            modes are merged.
1421    
1422    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1423    
1424            * HTML.pm.src: There is no "in table head" or "in table foot"
1425            insertion mode!
1426    
1427    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1428    
1429            * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1430            now directly invoke the handler.
1431    
1432    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1433    
1434            * HTML.pm.src: Codes for "in cell" insertion mode
1435            is merged to the "in body" insertion mode code.
1436    
1437    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1438    
1439            * HTML.pm.src: Codes for "in body" and "in caption"
1440            insertion modes are merged.
1441    
1442    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1443    
1444            * HTML.pm.src: Two |!!!next-token|s were missing.
1445    
1446    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1447    
1448            * HTML.pm.src: Use numeric constant for |{content_mode}|
1449            instead of string constant for |{content_model_flag}|.
1450    
1451    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1452    
1453            * HTML.pm.src: Add the name of the attribute
1454            to the "duplicate attribute" error.
1455    
1456    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
1457    
1458            * ContentChecker.pm: Return the |class| node list.
1459    
1460    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
1461    
1462            * ContentChecker.pm: Return the |id| node list.
1463    
1464            * HTML.pm.src: A typo is fixed.
1465    
1466    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1467    
1468            * ContentChecker.pm: Drop wrong |level => 'error'| specification
1469            from "in HTML:xml:lang" error.  Character position
1470            is now the last part of the error type in the URI error
1471            description.  Report "unsupported" status for language
1472            tags, media queries, script codes, and style sheets.
1473    
1474    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1475    
1476            * ContentChecker.pm: Report error if |xml:lang|
1477            in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1478            in XML.
1479    
1480            * NanoDOM.pm (Attr.owner_document): New attribute.
1481    
1482    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1483    
1484            * HTML.pm.src: The character immediately following
1485            a bare |hcro| was discarded.  Fix handling of
1486            entity references in attribute values.
1487    
1488    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1489    
1490            * HTML.pm.src (main and trailin end phases): Token
1491            types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1492            factored out.  Error types |in html:#DOCTYPE|
1493            and |after html:#DOCTYPE| are merged
1494            into |DOCTYPE in the middle|.  |</frameset>|
1495            in fragment parsing mode changed the
1496            insertion mode.
1497    
1498    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1499    
1500            * HTML.pm.src: |$phase| has been removed; The |trailing end|
1501            phase is now an insertion mode.  Treatments for white
1502            space character tokens were incorrect for some
1503            insertion modes.  An old |meta| case was not removed.
1504    
1505    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1506    
1507            * HTML.pm.src: |meta| charset declaration extraction
1508            implemented (but changing the encoding is not yet:-).
1509    
1510    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1511    
1512            * Charset/: New directory.
1513    
1514    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1515    
1516            * H2H.pm: New Perl module (created from
1517            manakai's H2H.dis).
1518    
1519    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1520    
1521            * XMLSerializer.pm: New Perl module (created from
1522            manakai's SimpleLS.dis).
1523    
1524    2007-07-07  Wakaba  <wakaba@suika.fam.cx>
1525    
1526            * HTML.pm.src: |<!---x-->| was not processed correctly.
1527    
1528    2007-07-01  Wakaba  <wakaba@suika.fam.cx>
1529    
1530            * HTML.pm.src: Report correct error message
1531            for |<body></div></body>|.
1532    
1533    2007-07-01  Wakaba  <wakaba@suika.fam.cx>
1534    
1535            * HTMLTable.pm: An error description was incorrect.
1536    
1537    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
1538    
1539            * ContentChecker.pm: Return |{term}| list.
1540    
1541    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
1542    
1543            * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1544            nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1545            and <title> outside of head).
1546    
1547    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
1548    
1549            * IMTChecker.pm: Report warning for unregistered
1550            and private types/subtypes.
1551    
1552            * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1553            URIChecker.pm, HTMLTable.pm: Error messages are now
1554            consistent; they are all listed in
1555            <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1556    
1557    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
1558    
1559            * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1560            is now erred.  |<datalist>| is implemented.
1561            Attribute checker for |<command>| and |<menu>| are
1562            added.  Support for |contextmenu| global attribute
1563            is added.
1564    
1565    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
1566    
1567            * HTML.pm.src (_reset_insertion_mode): Interpretation
1568            of Step 3 has been changed.
1569    
1570    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
1571    
1572            * HTML.pm.src: Late |<html>| parse error is implemented.
1573    
1574    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1575    
1576            * URIChecker.pm (check_iri_reference): A |decode| method name was
1577            incorrect.
1578    
1579            * ContentChecker.pm: Support for the |footer| element.
1580            Check URI syntax for space-separated URI attributes.
1581            Support for the |tabindex| attribute.  Support
1582            for |datetime| attribute.
1583    
1584    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1585    
1586            * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1587            character references).  HTML5 revision 1.146 (white space
1588            characters before root start tag).  HTML5 revision
1589            1.148 (named character references in attribute values).
1590            HTML5 revision 1.152 (<plaintext>.innerHTML get).
1591    
1592    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1593    
1594            * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1595            in <head>).
1596    
1597    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1598    
1599            * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1600            in body).
1601    
1602            * ContentChecker.pm: HTML5 revision 938 (scoped="").
1603    
1604    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1605    
1606            * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1607            or RCDATA in fragment parsing mode).  HTML5 revision
1608            924 (<!--> and <!--->).  HTML5 revision 926 (hn in hn).
1609    
1610    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1611    
1612            * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1613            for <pre> and <textarea>).
1614    
1615    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1616    
1617            * HTML.pm.src: HTML5 revision 920 (<isindex>).
1618    
1619  2007-06-23  Wakaba  <wakaba@suika.fam.cx>  2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1620    
1621          * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).          * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).

Legend:
Removed from v.1.45  
changed lines
  Added in v.1.316

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24