/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.45 by wakaba, Sat Jun 23 14:34:39 2007 UTC revision 1.324 by wakaba, Thu Sep 18 14:32:48 2008 UTC
# Line 1  Line 1 
1    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
2    
3            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
4    
5            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
6            matching.
7    
8    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
9    
10            * LangTag.pm: Warn for private use language subtags.  Error level
11            typos fixed.  Support for Suppress-Script field.
12    
13            * mklangreg.pl: Support for dumping of nested structure.
14    
15    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
16    
17            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
18            recommended case as per RFC 4646.
19    
20    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
21    
22            * LangTag.pm (check_rfc4646_langtag): New method.
23    
24    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
25    
26            * mklangreg.pl: New script.
27    
28            * Makefile: Updated for creation of the module for language subtag
29            registry.
30            
31    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
32    
33            * Makefile: WebIDL.html added.
34    
35            * WebIDL.pod: New documentation.
36    
37    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
38    
39            * WebIDL.pm: Checker's error types are redefined.
40    
41    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
42    
43            * WebIDL.pm: Parser's error types are redefined.  Some forward
44            compatible parsing bugs are fixed.  Some unreachable codes are
45            commented out.
46    
47    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
48    
49            * WebIDL.pm: Support for the reminding extended attributes are
50            added.  It does not satisfy the definition that a forward
51            interface declaration has an extended attribute.  It seems that
52            unless explicitly allowed multiple extended attributes with the
53            same name is not allowed, though it is not explicitly mentioned in
54            the spec.
55    
56    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
57    
58            * WebIDL.pm: Unescapes extended attribute names and extended
59            attribute identifiers.  Preserve whether an extended attribute has
60            an argument list of not.  Support for extended attributes:
61            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
62            NameGetter, NameSetter, and Null.
63            (has_argument_list): New attribute.
64            (idl_text): Stringifies argument lists, if any, even if it is
65            empty.
66    
67    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
68    
69            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
70            |{s_kwd}| in DATA_STATE as default.
71    
72    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
73    
74            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
75            by |{s_kwd}| in DATA_STATE.
76    
77    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
78    
79            * HTML.pm.src: Shorten keys.
80    
81    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
82    
83            * HTML.pm.src: Remove checking for control character, surrogate
84            pair, or noncharacter code points and non-Unicode code
85            points (they should be handled by Whatpm::Charset::UnicodeChecker).
86            (parse_char_stream): Support for the |$get_wrapper| argument and
87            character stream error handlers.
88    
89    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
90    
91            * ContentChecker.pm: Don't call |loda_ns_module|
92            for null-namespace elements/attributes.
93    
94            * HTML.pm.src: Fact out $disallowed_control_chars
95            as a hash.
96    
97    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
98    
99            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
100            and |{next_char}| initializations are moved to initialization
101            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
102            with |parse_char_stream|.
103    
104    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
105    
106            * HTML.pm.src (parse_char_stream): Make |set_next_char|
107            invoke |manakai_read_until|, not only |read|, where
108            possible, to decrease the number of |read| method calls.
109    
110            * mkhtmlparser.pl: Related changes to the aforementioned
111            modification.
112    
113    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
114    
115            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
116            would report character error from now.
117    
118    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
119    
120            * HTML.pm.src: White-space-leaded non-white-space character
121            tokens in "before head insertion mode" was not
122            correctly handled.
123            (set_inner_html): Reimplemented using CharString decodehandle
124            class.  Support for $get_wrapper argument.  Support
125            for |{read_until}| feature.
126    
127    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
128    
129            * HTML.pm.src: Make a "bare ero" error for unknown
130            entities point the "&" character.
131    
132    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
133    
134            * HTML.pm.src: It turns out that U+FFFD don't have to
135            be added to the list of excluded characters.
136    
137    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
138    
139            * HTML.pm.src ($char_onerror): Have character decoder's |line|
140            and |column| a higher priority than the one set by the
141            tokenizer's input handler.
142            ($self->{read_until}): Exclude U+FFFD (but this might
143            not be necessary, since now we do line/column fixup in
144            the character decode handle).
145    
146    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
147    
148            * HTML.pm.src: Use |{read_until}| where possible.
149    
150    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
151    
152            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
153            and |manakai_getc_until| to |manakai_read_until| to
154            reduce the number of string copies.
155    
156    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
157    
158            * HTML.pm.src (parse_char_string): Use newly created
159            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
160            standard feature to |open| a string as a filehandle,
161            since Perl's string filehandle seems not supporting |ungetc|
162            method correctly.
163            (parse_char_stream): Define |{getc_until}| method.
164            (DATA_STATE): Experimental support for |getc_until| feature.
165    
166    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
167    
168            * HTML.pm.src: Check points added to newly added branches.
169    
170    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
171    
172            * HTML.pm.src: Remove |{char}|, which is no longer used.
173            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
174            and replaced by |{prev_state}|.
175    
176            * mkhtmlparser.pl: Remove |{char}| feature.
177            Remove |!!!back-next-input-character;| macro.
178    
179    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
180    
181            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
182            entity related tokenizer states in favor of new states
183            implementing the consume character reference algorithm.
184    
185    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
186    
187            * HTML.pm.src: "Consume a character reference" algorithm is
188            now implemented as a tokenizer's state, rather than
189            a method, with minimum changes (more changes will
190            be made, in due course).  "Bogus comment state"'s inner
191            loop gets removed.
192    
193    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
194    
195            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
196            into their own tokenizer states.
197    
198    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
199    
200            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
201            is split into three states.
202    
203    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
204    
205            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
206            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
207            no longer does the tokenizer have to push back next input
208            characters in those states.
209    
210    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
211    
212            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
213            into four states so that no longer does the tokenizer have to push
214            back next input characters in that state.
215    
216    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
217    
218            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
219            which can be used to insert some wrapper between the character
220            stream handle and the tokenizer.  (It is currently not supported
221            for |set_inner_html| for |Element|s).
222    
223    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
224    
225            * HTML.pm.src: Ignore punctuations in charset names.
226    
227    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
228    
229            * ContentChecker.pm: Support for charset-layer error levels.
230    
231            * HTML.pm.src: Don't specify |text| argument for the
232            |chardecode:fallback| error, since it is not the encoding
233            being used alternatively.
234    
235    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
236    
237            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
238    
239    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
240    
241            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
242    
243    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
244    
245            * HTML.pm.src: Bug fix and sync with the spec with regard
246            to after after frameset insertion mode processing (HTML5
247            revision 1909).  Note that the implementation was wrong
248            per the old spec before the r1909 changes.
249    
250    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
251    
252            * HTMLTable.pm: scope=auto algorithm fix synced with the
253            spec (HTML5 revision 2093).
254            ($process_row): Algorithm step numbers synced with the
255            spec (HTML5 revision 2092).
256    
257    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
258    
259            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
260            revision 2094).
261    
262    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
263    
264            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
265    
266    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
267    
268            * HTML.pm.src: '"' and "'" at the end of attribute
269            name (after another attribute) now raise parse error (HTML5
270            revision 2123).  Empty unquoted attribute values are no
271            longer allowed (HTML5 revision 2122).
272    
273    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
274    
275            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
276            revision 2130).
277    
278    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
279    
280            * ContentChecker.pm: |xml:lang| attribute value must be same
281            as |lang| attribute value for HTML elements (HTML5 revision 2062
282            and so on).
283    
284    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
285    
286            * ContentChecker.pm: Error level definition for |xml_id_error|
287            was missing.
288    
289            * URIChecker.pm: The end of the URL should be marked as the
290            error location for an empty path error.  The position
291            between the userinfo and the port components should be
292            marked as the error location for an empty host error.
293    
294    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
295    
296            * URIChecker.pm: Set parameters representing where in the
297            value the error occurs for errors.  Report unknown
298            address format error in warning level, since address
299            formats are rarely added.  Path segments starting with "/.."
300            were misinterpreted as a dot-segment.
301    
302    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
303    
304            * URIChecker.pm (check_iri_reference): Requires
305            |Message::DOM::DOMImplementation|.
306    
307    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
308    
309            * IMTChecker.pm: Updated for the new error reporting architecture.
310    
311            * ContentChecker.pm: Error levels for IMTs are added.
312    
313    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
314    
315            * H2H.pm (_shift_token): Support for unquoted HTML attribute
316            values.
317    
318    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
319    
320            * CacheManifest.pm: Support for new style of error
321            reports.
322    
323            * HTML.pm.src: Set line=1, column=1 to the document node.
324    
325    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
326    
327            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
328            and URL checkers.  Support for more error levels for bogus
329            langauge tag and URL "standards".
330    
331            * LangTag.pm, URIChecker.pm: Support for new style error
332            level reporting.
333    
334    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
335    
336            * ContentChecker.pm: Support for RDF/XML error levels.
337    
338            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
339            specifying.  Error types are revised.
340    
341    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
342    
343            * ContentChecker.pm: All error reporting method calls are
344            renewed.
345    
346    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
347    
348            * HTML.pm.src: All error type names and "text" parameters
349            are revised.  Use new style for "level" specification.
350    
351            * mkhtmlparser.pl: Use new style for "level" specification.
352    
353    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
354    
355            * WebIDL.pm (parse_char_string): Simplified error
356            reporting process for broken ignored valuetype definition.
357            (Valuetype idl_text): Support for special "DOMString" name.
358    
359    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
360    
361            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
362            terminal of the ScopedName is "DOMString", such that whether
363            the last part of the scoped name is "DOMString" or "_DOMString"
364            later.  It is necessary to determine whether a |typedef|
365            definition should be ignored or not.
366            (parse_char_string): Unescape the identifier of
367            exception members.
368            ($resolve): Return undef for builtin types and sequence<T>
369            types (we might not have to do this, however...).
370            (check): Support checking for Exceptions, Valuetypes,
371            and Typedefs.
372            ($serialize_type): Support for "DOMString::::" syntax.
373            (Typedef idl_text): Output Type as "DOMString" if it
374            is really "DOMString" (i.e. its internal representation
375            is "::DOMString::").
376    
377    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
378    
379            * WebIDL.pm ($resolve): New code, based on resolve code
380            for constant types in the |check| method.
381            (check): Support for checking of attributes, operations, and
382            arguments.
383            (Attribute/Operation idl_text): Exception names in getraises,
384            setraises, and raises clauses is serizlied by |$serialize_type|
385            code.
386    
387    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
388    
389            * WebIDL.pm ($integer): Order of selections are changed to match
390            hexadecimal numbers (the original pattern, taken from the spec,
391            was not work for hexadecimal numbers, because the "0" prefix
392            matches to the [0-7]* part (as an empty string) and therefore
393            it does not match with remaining "x..." part of a "0x..." integer
394            literal.
395            ($get_type): It now returns a string, not an array reference,
396            for regular types and |sequence| types (i.e. it in any case
397            returns a string).
398            ($get_next_token): The second item in the array that represents
399            a integer or float token is now a Perl number value, not the
400            original string representation of the number.
401            (check): Support for const value consistency checking.
402            No extended attribute is defined for constants.
403            (Node subclasses): Use simple strings rather than array references
404            for default data type values.
405            ($serialize_type): Type values are now simple strings.
406            (value): If the new attribute value is a false value, then
407            a FALSE value is set to the attribute.
408    
409    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
410    
411            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
412            in its stringified format ("scoped name" as defined in the
413            spec).  Note that future version of this module should not use
414            array references for type values and the |type_text| attribute
415            should be made obsolete.
416            (parse_char_string): Unescape attribute names.
417            (check): Support for checking of whether inherited interfaces
418            are actually defined or not.  Support for checking of whether
419            interface member identifiers are duplicated or not.
420            ($serialize_type): Scoped names are returned as is.  A future
421            version of this code should escape identifiers other than "DOMString",
422            otherwise the idl_text would be non-conforming.
423    
424    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
425    
426            * WebIDL.pm (parse_char_string): Set line/column numbers
427            to generated nodes.  Unescape identifiers.  Extended attributes
428            for Definition's were ignored.
429            (append_child): Set |parent_node| attribute.
430            (parent_node): New attribute.
431            (check): Support interface/exception members.  Support
432            extended attributes.  Support definition identifier uniqueness
433            constraint.
434            (qualified_name): New attribute.
435            (Interface/Exception idl_text): Extended attributes were
436            not prepended to the returned text.
437    
438    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
439    
440            * WebIDL.pm (parse_char_string): Set line/column numbers
441            to interface object experimentally.  s/shift/pop/g, shift
442            would make things wrong.  Support for interface forward
443            declarations was missing.  Broken interface declarations
444            with no block were not ignored entirely.
445            (Whatpm::WebIDL::Node): New abstract class.  This class
446            makes things easier.
447            (child_nodes): New attribute.  Unlike DOM's attribute with
448            same name, this attribute returns a dead list of nodes for
449            simplicity.
450            (get_user_data, set_user_data): New methods.
451            (Module idl_text): A SPACE character should be inserted
452            before the |{| character.
453            (Interface idl_text): Support for interface forward declarations.
454            (is_forward_declaration): New attribute.
455    
456    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
457    
458            * WebIDL.pm (type_text): Better serializer.
459    
460    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
461    
462            * WebIDL.pm: Revise forward-compatible parsing so that
463            it now can handle broken extended attributes and as such.
464    
465    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
466    
467            * WebIDL.pm: Real support for extended attributes.
468            Support for extended attributes with arguments.
469    
470    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
471    
472            * WebIDL.pm: Support for |exception| syntax.
473            (Interface->idl_text): Tentative support for inheritances.
474    
475    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
476    
477            * WebIDL.pm: Hierarchical scoped name support was broken.
478            Support for raises, setraises, and getraises syntaxes.
479    
480    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
481    
482            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
483            proper support for types, extended attributes, and exceptions yet).
484            WebIDL parser, version 1 (no support for exceptions yet,
485            no proper support for extended attributes yet).
486    
487    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
488    
489            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
490    
491    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
492    
493            * WebIDL.pm: Support for valuetype and const.
494    
495    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
496            
497            * WebIDL.pm: New module.
498    
499    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
500    
501            * Makefile (Entities.html): URI changed.
502    
503    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
504    
505            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
506    
507    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
508    
509            * HTML.pm.src (_get_next_token): A parse error was missing.
510    
511    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
512    
513            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
514            spec (commented out). (HTML5 revision 1711).
515    
516    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
517    
518            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
519    
520            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
521            as UTF-16 (HTML5 revision 1701).
522    
523    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
524    
525            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
526            1692).
527    
528    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
529    
530            * HTML.pm.src: The secondary insertion mode used when switching
531            to foreign content is the "in body" insertion mode (HTML5 revision
532            1696).
533    
534    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
535    
536            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
537            revision 1697).
538    
539    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
540    
541            * HTML.pm.src: Support for end-of-file token in foreign content
542            insertion mode (HTML5 revision 1693).  Update SVG camelCase
543            attribute list (HTML5 revision 1700).  <textarea> closes
544            </select> (HTML5 revision 1699).  More start tags close in
545            foreign content insertion mode (HTML5 revision 1698).
546    
547    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
548    
549            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
550    
551    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
552    
553            * HTML.pm.src: More robust charset parameter detection (HTML5
554            revision 1674).
555    
556    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
557    
558            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
559            revision 1676).
560    
561    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
562    
563            * HTML.pm.src: Ignore language part of public identifiers for
564            quriks mode detection (HTML5 revision 1679).
565    
566    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
567    
568            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
569            revision 1685).
570    
571    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
572    
573            * HTML.pm.src: Support for EOF in new states for tags (HTML5
574            revision 1684).
575    
576    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
577    
578            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
579            work (HTML5 revision 1690).
580    
581    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
582    
583            * HTML.pm.src (_tree_construction_main): Change handling of
584            end tags in head insertion modes (HTML5 revision 1686).
585            (parse_char_string): Bug fix for non-utf8 character string handlings.
586            (parse_char_stream): |ungetc| does not work well for this context.
587    
588    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
589    
590            * HTML.pm.src (parse_byte_string): Redefined to invoke
591            |parse_byte_stream|.
592            (parse_byte_stream): New method.
593    
594    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
595    
596            * HTML.pm.src (parse_byte_string): Fix the column number reported
597            by encoding layer error reporter.
598    
599    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
600    
601            * HTML.pm.src (parse_byte_string): Use streaming decoder
602            rather than converting the whole byte string and then parsing.
603            Propagate errors in character encoding layer.
604            (get_next_token): Precise error reporting for |bare stago| error.
605    
606    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
607    
608            * HTML.pm.src (parse_char_stream): New method.
609            (parse_char_string): This method is now defined as an invocation
610            of the |parse_char_stream| method.
611    
612    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
613    
614            * HTML.pm.src (parse_byte_string): Report various status
615            of the sniffing as info-level errors.  Support for new
616            decoding framework in parser resestting.
617            (new): Various default error levels were not set.
618    
619    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
620    
621            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
622            algorithm, except for the actual sniffing, is implemented
623            with new framework with Message::Charset::Info.
624    
625    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
626    
627            * CacheManifest.pm (_parse): Drop fragment identifiers from
628            URIs in fallback section (HTML5 revision 1596).
629    
630    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
631    
632            * Makefile (Entities.html): URI has changed.
633    
634    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
635    
636            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
637            1553).
638    
639    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
640    
641            * ContentChecker.pm: Noted that those returned in |table| are
642            no longer table elements, but table objects returned
643            by Whatpm::HTMLTable.
644    
645            * HTMLTable.pm (form_table): Return table element node
646            as |$table->{element}|.
647            (assign_header): Support for the |headers=""| attribute.
648    
649    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
650    
651            * HTMLTable.pm (assign_header): New function; first version
652            with no support for headers="".
653            (form_table): Include table width and height to the returned
654            table object for covenience.  Indexing in column assignement was wrong.
655            Set whether a data cell is empty or not for convenience.
656    
657    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
658    
659            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
660    
661    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
662    
663            * HTMLTable.pm: How table model errors are detected is
664            changed (HTML5 revision 1387).
665    
666    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
667    
668            * HTMLTable.pm: The algorithm now moves |tfoot| elements
669            to the end of the table (HTML5 revision 1380).
670    
671    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
672    
673            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
674            of 1-based (HTML5 revision 1376).
675    
676    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
677    
678            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
679            and "Content-Encoding" no longer prevent sniffing (HTML5
680            revision 1288).
681    
682    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
683    
684            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
685            1282).
686    
687    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
688    
689            * ContentChecker.pm: Support for global attributes.
690            Status of XML specs are added.
691    
692    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
693    
694            * ContentChecker.pm (check_element): Support for |template=""|
695            and |ref=""| attribute (referent element type checking).
696    
697    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
698    
699            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
700            1500) is implemented (except for IDNA part and URI-scheme-specific
701            knowledge).  Line number counting was wrong for LF-only
702            documents.
703    
704    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
705    
706            * HTML.pm.src: Raise an parse error for any disallowed
707            character (HTML5 revision 1263).
708    
709    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
710    
711            * mkentitylist.pl: Support for new HTML5 entity table format (the
712            definition for |AElig;| was missing).
713    
714    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
715    
716            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
717            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
718    
719    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
720    
721            * HTML.pm.src: List of element names that close foreign content
722            insertion mode is added (HTML5 revisions 1412 and 1418).
723    
724    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
725    
726            * HTML.pm.src: Support for |mglyph| and |malignmark|
727            elements (HTML5 revision 1410).
728    
729    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
730    
731            * HTML.pm.src: Support for new long MathML entities (HTML5
732            revision 1406).
733    
734    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
735    
736            * HTML.pm.src: CDATA section support for MathML and SVG
737            elements (HTML5 revision 1404 and 1420).
738    
739    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
740    
741            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
742            elements (HTML5 revision 1404).  Unused !!!macro definitions
743            are removed.
744    
745    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
746    
747            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
748            are raised is changed (HTML5 revision 1404).
749    
750    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
751    
752            * HTML.pm.src: Category sets that are no longer used are removed.
753    
754    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
755    
756            * HTML.pm.src: The ->[1] property of stack entries are now
757            replaced by constants representing element category.
758    
759    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
760    
761            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
762            for error reporting.  (This is a preparation for using constant
763            value for ->[1].)
764    
765    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
766    
767            * RDFXML.pm: Typo fixed.
768    
769    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
770    
771            * ContentChecker.pm: |fact_level| is now treated
772            as same as |must_level|, i.e. level = |m|.
773            (check_element): Make list of URIs in the DOM.
774    
775    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
776    
777            * RDFXML.pm: Language accessor implemented.
778            Local (null-namespace) attribute support.
779    
780    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
781    
782            * RDFXML.pm: Factored out ID checking code.
783    
784    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
785    
786            * RDFXML.pm: TODO items noted.  Validation of ID
787            and URI attributes is implemented.  Warn if unknown
788            value is used in rdf:parseType="" attribute.
789    
790            * URIChecker.pm (check_rdf_uri_reference): New function.
791    
792    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
793    
794            * RDFXML.pm: bnodeid implemented.  Relative references
795            are now resolved.
796    
797    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
798    
799            * ContentChecker.pm: RDF reification implemented.
800    
801            * RDFXML.pm: undef vs false bug fixed.
802            Reification implemented.
803    
804    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
805    
806            * RDFXML.pm: s/id/ID/ for attribute name.
807            The |node| arguments are added for |ontriple| calls.
808            Too many "attribute not allowed" errors were raised.
809    
810            * ContentChecker.pm: Initial experimental support for rdf:RDF
811            element.
812    
813    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
814    
815            * RDFXML.pm: New module.
816    
817    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
818    
819            * HTML.pm.src (set_inner_html): Line/column number
820            code was old one yet.
821    
822    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
823    
824            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
825            errors.  Line/column markings for DOCTYPE, comment, and
826            character tokens are reintroduced; otherwise,
827            error location for "not HTML5" error and errors
828            for implied elements are not attached.
829    
830    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
831    
832            * HTML.pm.src: Set line/column numbers to attributes.
833    
834            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
835            (value): Setter implemented.
836    
837            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
838    
839    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
840    
841            * HTML.pm.src: Unused line/column markings are removed.
842    
843    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
844    
845            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
846            flag, which is no longer used.
847    
848    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
849    
850            * HTML.pm.src: Set line/column information to element nodes.
851    
852            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
853            and !!!insert-element-t): Set line/column information to
854            element nodes.
855    
856    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
857    
858            * HTML.pm.src (_get_next_token): The first "<" character
859            in "<?", "<>", or "</>" should be the error point.
860    
861    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
862    
863            * HTML.pm.src: Some more fixes on error position reporting.
864    
865    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
866    
867            * HTML.pm.src: Token-level precious error reporting.
868    
869    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
870    
871            * HTML.pm.src: Preparation for more precious error point
872            reporting.
873    
874    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
875    
876            * HTML.pm.src: Error type revised.
877    
878    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
879    
880            * HTML.pm.src: Similar codes are merged together, again.
881    
882    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
883    
884            * HTML.pm.src: Similar codes are merged together.
885    
886    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
887    
888            * mkhtmlparser.pl: Set "level" parameter to parse errors.
889    
890            * HTML.pm.src: Code refined.
891    
892    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
893    
894            * HTML.pm.src: |</body>| treatement has been changed (HTML5
895            revision 1348).  Note that I really don't know this makes
896            any difference in the black-box behavior of the parser.
897    
898    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
899    
900            * HTML.pm.src: New end-of-file token implementation (HTML5
901            revision 1348).
902    
903    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
904    
905            * HTML.pm.src: |applet| support (HTML5 revision 1347).
906    
907    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
908    
909            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
910    
911    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
912    
913            * HTML.pm.src: Support for |<input>| in the "in select" insertion
914            mode and support for the "in select in table" insertion mode (HTML5
915            revision 1342).
916    
917    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
918    
919            * HTML.pm.src: No foster parenting for <script> and <script>
920            in non-tainted <table>s (HTML5 revision 1336).
921    
922    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
923    
924            * HTML.pm.src: Ignore white space characters between <html>
925            and <head> (HTML5 revision 1332).
926    
927    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
928    
929            * HTML.pm.src: Treat <input type=hidden> as if it were a
930            white space (HTML5 revision 1331).
931    
932    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
933    
934            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
935            element (HTML5 revision 1330).
936    
937    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
938    
939            * HTML.pm.src: <title> is always appended to the current
940            element (HTML5 revision 1328).
941    
942    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
943    
944            * HTML.pm.src: White space in tainted tables are moved
945            into foster parents (HTML5 revision 1326).
946    
947    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
948    
949            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
950            revision 1321).
951    
952    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
953    
954            * HTML.pm.src: |</h/n/>| case code rearranged to align with
955            the spec (HTML5 revision 1320).  Note that we finally complete
956            all of HTML5 revision 1320 changes.
957    
958    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
959    
960            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
961            tags (HTML5 revision 1320).
962    
963    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
964    
965            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
966    
967    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
968    
969            * HTML.pm.src: A "generate implied end tags" code (t409.1)
970            could not be reached so that it is now removed (HTML5 revision
971            1320).
972    
973    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
974    
975            * HTML.pm.src: Code for the case of |</div>| and so on are revised
976            to align with new spec text (HTML5 revision 1320).
977    
978    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
979    
980            * HTML.pm.src: Remove strange |if| condition; however, it should
981            have had no harm in theory.
982    
983    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
984    
985            * HTML.pm.src (_tree_construction_main): '</p>' in body
986            case is split from other end tags for the preparation
987            of implementing HTML revision 1320.
988    
989    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
990    
991            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
992            revision 1320).
993    
994    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
995    
996            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
997            and "div" (HTML5 revision 1318).  Add comments to where
998            |form| pointer association codes should be inserted (HTML5 revision
999            1319).
1000    
1001    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
1002    
1003            * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1004    
1005    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1006    
1007            * HTML.pm.src: Since the case t268 should never be reached (no
1008            other token type, there are), it is replaced by a |die| statement.
1009    
1010    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1011    
1012            * HTML.pm.src: Typo fixed.
1013    
1014    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
1015    
1016            * HTML.pm.src (_tree_construction_initial): Some limited quirks
1017            doctypes were not uppercased for comparison.
1018    
1019    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1020    
1021            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1022            are added.
1023    
1024    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1025    
1026            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1027            are set.  Cases that are unlikely reached are noted as so.
1028    
1029    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1030    
1031            * HTML.pm.src: Checkpoints for debugging are added.
1032    
1033            * mkhtmlparser.pl: Support for |!!!cp| syntax.
1034    
1035    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1036    
1037            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1038            for simplicity.
1039    
1040    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1041    
1042            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1043            revision 1307).
1044    
1045    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1046    
1047            * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1048            is set are changed (HTML5 revision 1305).
1049    
1050    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1051    
1052            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1053            1292).  Entities are not parsed in comment-like part in RCDATA
1054            elements (HTML5 revision 1294).  Allow bare & at the end
1055            of attribute value literals (HTML5 revision 1296).  More
1056            quirks mode doctypes (HTML5 revision 1302).  Requires spaces
1057            between attributes and ban attribute names or unquoted
1058            attribute values containing single or double quotes (HTML5
1059            revision 1303).
1060    
1061    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1062    
1063            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
1064            and related errors unless it is an HTML document (though the spec
1065            is unclear on whether it is applied to XHTML document).
1066    
1067            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1068            are added.
1069    
1070    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1071    
1072            * HTML.pm.src: s/local_name/manakai_local_name/g.
1073    
1074    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
1075    
1076            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1077    
1078            * HTML.pm.src: |charset| in |content| attribute is
1079            case-insensitive (HTML5 revision 1270).
1080    
1081    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
1082    
1083            * ContentChecker.pm: New status constants are added.
1084            ($ElementDefault): |status| added.
1085            (check_element): Err for non-standard or deprecated elements.
1086            (_attr_status_info): For non-standard or deprecated attributes.
1087    
1088    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1089    
1090            * ContentChecker.pm (_attr_status_info): New internal method.
1091    
1092    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1093    
1094            * ContentChecker.pm (check_element): Element standardized
1095            status information is now dispatched.
1096    
1097    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1098    
1099            * ContentChecker.pm (check_element): Fix |del|-and-significant
1100            problem by adding some more arguments.
1101    
1102    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1103    
1104            * ContentChecker.pm (check_element): Use context of
1105            container-for-the-purpose-of-content-model element (not transparent
1106            element) for |check_child_element| calling and significant
1107            text flag marking.  This reintroduces |<del>|-and-significant
1108            problem again.
1109    
1110    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1111    
1112            * ContentChecker.pm (check_element): Make semi-transparent
1113            elements ignored for the purpose of phase changes in
1114            content model checking.
1115    
1116    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1117    
1118            * ContentChecker.pm (check_element): In-element state
1119            was not properly managed for transparent cases.
1120    
1121    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1122    
1123            * ContentChecker.pm (check_element): Support for |video|
1124            and |audio| as semi-transparent elements.
1125    
1126    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1127    
1128            * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1129            (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1130            Support for |html:object| as a semi-transparent element.
1131    
1132    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1133    
1134            * ContentChecker.pm (check_element): The way to traverse
1135            the tree is entirely revised to make it easier to track
1136            the state of ancestors/descendants.  As a result of this
1137            revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1138            support for content model checking for HTML elements |figure|,
1139            |object|, |video|, and |audio| and checking for XML elements (and
1140            some XMLNS checkings) are dropped for now.  They will be
1141            reimplemented in due cource.
1142    
1143    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1144    
1145            * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1146            token (HTML5 revision 1225).
1147    
1148    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1149    
1150            * ContentChecker.pm ({unsupported_level}): New value.
1151    
1152            * HTML.pm.src: Save whether |meta| |content| attribute
1153            contains character references or not.
1154    
1155    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1156    
1157            * ContentChecker.pm (_get_children): (Incompleted) attempt to
1158            imlement significant content checking for contents
1159            with |del| elements.
1160    
1161    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1162    
1163            * ContenteChecker.pm ($HTMLTransparentElements): More
1164            elements are added.
1165            (_get_children): HTML |object| elements are now semi-transparent.
1166    
1167            * NanoDOM.pm (manakai_html, manakai_head): New methods.
1168    
1169    2008-02-16  Wakaba  <wakaba@suika.fam.cx>
1170    
1171            * CacheManifest.pm: HTML5 revision 1211 implemented.
1172    
1173            * CacheManifest.pod: Updated.
1174    
1175    2008-02-10  Wakaba  <wakaba@suika.fam.cx>
1176    
1177            * ContentChecker.pm (check_document, check_element): Support
1178            for second argument ($onsubdoc).
1179            (_get_css_parser): Removed (now it is part of WDCC).
1180    
1181    2008-02-09  Wakaba  <wakaba@suika.fam.cx>
1182    
1183            * ContentChecker.pm (_get_css_parser): New.
1184    
1185    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1186    
1187            * ContentChecker.pm ($AnyChecker): Old way to add child elements
1188            for checking had been used.
1189    
1190    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1191    
1192            * ContentChecker.pm (check_element): New todo item type |descendant|.
1193    
1194    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1195    
1196            * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1197            specced in RFC 2046 and RFC 4288.
1198            (application/atom+xml): Definition added.
1199    
1200    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1201    
1202            * URIChecker.pm: Make RFC 3986 should-level errors
1203            warnings (rather than SHOULD-level errors).
1204    
1205    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1206    
1207            * NanoDOM.pm (get_user_data, set_user_data): New methods.
1208    
1209            * HTML.pm.src: A flag for character references in attribute
1210            values are added.  Set |manakai_has_reference| user data
1211            to |charset| attribute.
1212    
1213    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1214    
1215            * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1216            attributes.
1217    
1218            * ContentChecker.pm (check_document): Warn if charset requirements
1219            cannot be tested.
1220    
1221    2007-11-19  Wakaba  <wakaba@suika.fam.cx>
1222    
1223            * HTML.pm.src (parse_byte_string): Detect charset
1224            by universalchardet if charset parameter is not specified.
1225    
1226            * Makefile (Charset-all, Charset-clean): New rules.
1227    
1228    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1229    
1230            * ContentChecker.pm (check_document): Check the existence
1231            of character encoding declaration and selection of encoding
1232            for HTML document.
1233    
1234    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1235    
1236            * ContentType.pm (get_sniffed_type): Return also the official
1237            type in list context.
1238    
1239    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1240    
1241            * ContentType.pm: Sniffing with leading white space
1242            ignoring (HTML5 revisions 1013 and 1016).
1243    
1244    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1245    
1246            * ContentType.pm: HTML5 revision 1013 changes, except for leading
1247            white spaces, are implemented.
1248    
1249    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1250    
1251            * HTML.pm.src (parse_char_string): Set |inner_encoding|
1252            attribute if possible.
1253    
1254    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1255    
1256            * HTML.pm.src (parse_byte_string): New method.
1257            (parse_char_string): New alias for |parse_string|.
1258            (main phase): Invoking "change the encoding" algorithm if desired.
1259    
1260            * HTML.pod: Updated.
1261    
1262    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1263    
1264            * HTML.pod (get_inner_html): Removed.
1265    
1266            * Makefile (HTML-all, HTML-clean): New.
1267    
1268    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1269    
1270            * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1271    
1272    2007-11-08  Wakaba  <wakaba@suika.fam.cx>
1273    
1274            * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1275    
1276    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1277    
1278            * Makefile: |CacheManifest.html| is added.
1279    
1280            * CacheManifest.pod: New file.
1281    
1282    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1283    
1284            * CacheManifest.pm: New module.
1285    
1286    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1287    
1288            * HTML.pm.src: Support for application cache selection algorithm
1289            callback.
1290    
1291    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1292    
1293            * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1294    
1295    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1296    
1297            * Makefile (clean): New rule.
1298    
1299            * NanoDOM.pm (public_id, system_id): New attributes.
1300    
1301    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1302    
1303            * Makefile (CSS-all, CSS-clean, clean): New rules.
1304    
1305    2007-10-14  Wakaba  <wakaba@suika.fam.cx>
1306    
1307            * ContentChecker.pm (check_document): Support for
1308            new |is_xml_root| flag.
1309            (check_element): Support for new |pluses| state.
1310            (_add_pluses): New method.
1311            (_remove_minuses): Support for new |minus| item.
1312    
1313    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1314    
1315            * ContentChecker.pm: Raise specific error for invalid
1316            root element.
1317    
1318    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1319    
1320            * ContentChecker.pm: Set level values for later uses.
1321    
1322    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1323    
1324            * ContentChecker.pm: Support for language tag validation.
1325    
1326    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1327    
1328            * LangTag.pm (check_rfc3066_language_tag): New method.
1329    
1330    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1331    
1332            * LangTag.pm: New module.
1333    
1334    2007-09-04  Wakaba  <wakaba@suika.fam.cx>
1335    
1336            * HTML.pm.src: Some error types were wrong.
1337    
1338    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1339    
1340            * CSS/: New directory.
1341    
1342    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1343    
1344            * ContentChecker.pm (_check_get_children): Support
1345            for |noscript| in |head|.
1346    
1347    2007-08-12  Wakaba  <wakaba@suika.fam.cx>
1348    
1349            * URI/: New directory.
1350    
1351    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1352    
1353            * HTML.pm.src: Tokenizer's states are now represented in
1354            number.
1355    
1356    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1357    
1358            * HTML.pm.src: |or|s for insertion modes are replaced
1359            by |&|s.
1360    
1361    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1362    
1363            * HTML.pm.src: Token types are now represented in number.
1364    
1365    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1366    
1367            * ContentType.pm (SEE ALSO): Updated.
1368    
1369            * HTML.pm.src: Insertion modes are now represented in number.
1370    
1371    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1372    
1373            * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1374            999) is implemented.
1375    
1376    2007-08-08  Wakaba  <wakaba@suika.fam.cx>
1377    
1378            * ContentType.pm: Sniffing for |<script| (HTML5 revision
1379            983) is implemented.
1380    
1381    2007-08-06  Wakaba  <wakaba@suika.fam.cx>
1382    
1383            * ContentChecker.pod: New documentation.
1384    
1385            * Makefile: A rule for |ContentChecker.html| is added.
1386    
1387            * ContentChecker.pm: A pod "LICENSE" section is added.
1388    
1389            * NanoDOM.pm ($VERSION): New variable.
1390    
1391    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1392    
1393            * H2H.pm: |b|, |i|, and |sub| are added to the
1394            list of allowed HTML elements.
1395    
1396    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1397    
1398            * H2H.pm: |samp| is added to the list of allowed
1399            HTML elements.
1400    
1401            * URIChecker.pm (check_iri): New.
1402            (check_iri_reference): Error type for IRI reference
1403            syntax error is changed.
1404    
1405    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1406    
1407            * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1408            is added.
1409            (check_document): Load appropriate module before validation.
1410    
1411    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1412    
1413            * ContentChecker/: New directory.
1414    
1415    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1416    
1417            * ContentChecker.pm: HTML |time| element is implemented.
1418    
1419            * HTMLTable.pm: Comments are updated as HTML5 is revised.
1420    
1421    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1422    
1423            * ContentChecker.pm (check_document): Return value
1424            even if no document element is found.
1425    
1426    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1427    
1428            * HTML.pm.src: |$in_body| is no longer a function.
1429    
1430    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1431    
1432            * HTML.pm.src: The |$in_body| code has been moved down.
1433    
1434    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1435    
1436            * HTML.pm.src: The "trailing end" insertion mode
1437            is split into "after html body" and "after html frameset"
1438            insertion modes.  Their codes are merged with "after body"
1439            and "after frameset" codes.  |$previous_insertion_mode|
1440            has been removed.  "after frameset" code is
1441            merged with "in frameset" code.
1442    
1443    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1444    
1445            * HTML.pm.src: The "before head" insertion mode is
1446            merged with the "in head" insertion mode.
1447    
1448    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1449    
1450            * HTML.pm.src: Cases in "in head" insertion mode are
1451            reorganized.
1452    
1453    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1454    
1455            * HTML.pm.src: Some cases in "in table" insertion mode
1456            are merged.
1457    
1458    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1459    
1460            * HTML.pm.src: The "in row" insertion mode is merged
1461            with "in table" insertion mode.
1462    
1463    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1464    
1465            * HTML.pm.src: The "in table" and "in table body" insertion
1466            modes are merged.
1467    
1468    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1469    
1470            * HTML.pm.src: There is no "in table head" or "in table foot"
1471            insertion mode!
1472    
1473    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1474    
1475            * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1476            now directly invoke the handler.
1477    
1478    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1479    
1480            * HTML.pm.src: Codes for "in cell" insertion mode
1481            is merged to the "in body" insertion mode code.
1482    
1483    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1484    
1485            * HTML.pm.src: Codes for "in body" and "in caption"
1486            insertion modes are merged.
1487    
1488    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1489    
1490            * HTML.pm.src: Two |!!!next-token|s were missing.
1491    
1492    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1493    
1494            * HTML.pm.src: Use numeric constant for |{content_mode}|
1495            instead of string constant for |{content_model_flag}|.
1496    
1497    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1498    
1499            * HTML.pm.src: Add the name of the attribute
1500            to the "duplicate attribute" error.
1501    
1502    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
1503    
1504            * ContentChecker.pm: Return the |class| node list.
1505    
1506    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
1507    
1508            * ContentChecker.pm: Return the |id| node list.
1509    
1510            * HTML.pm.src: A typo is fixed.
1511    
1512    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1513    
1514            * ContentChecker.pm: Drop wrong |level => 'error'| specification
1515            from "in HTML:xml:lang" error.  Character position
1516            is now the last part of the error type in the URI error
1517            description.  Report "unsupported" status for language
1518            tags, media queries, script codes, and style sheets.
1519    
1520    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1521    
1522            * ContentChecker.pm: Report error if |xml:lang|
1523            in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1524            in XML.
1525    
1526            * NanoDOM.pm (Attr.owner_document): New attribute.
1527    
1528    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1529    
1530            * HTML.pm.src: The character immediately following
1531            a bare |hcro| was discarded.  Fix handling of
1532            entity references in attribute values.
1533    
1534    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1535    
1536            * HTML.pm.src (main and trailin end phases): Token
1537            types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1538            factored out.  Error types |in html:#DOCTYPE|
1539            and |after html:#DOCTYPE| are merged
1540            into |DOCTYPE in the middle|.  |</frameset>|
1541            in fragment parsing mode changed the
1542            insertion mode.
1543    
1544    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1545    
1546            * HTML.pm.src: |$phase| has been removed; The |trailing end|
1547            phase is now an insertion mode.  Treatments for white
1548            space character tokens were incorrect for some
1549            insertion modes.  An old |meta| case was not removed.
1550    
1551    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1552    
1553            * HTML.pm.src: |meta| charset declaration extraction
1554            implemented (but changing the encoding is not yet:-).
1555    
1556    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1557    
1558            * Charset/: New directory.
1559    
1560    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1561    
1562            * H2H.pm: New Perl module (created from
1563            manakai's H2H.dis).
1564    
1565    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1566    
1567            * XMLSerializer.pm: New Perl module (created from
1568            manakai's SimpleLS.dis).
1569    
1570    2007-07-07  Wakaba  <wakaba@suika.fam.cx>
1571    
1572            * HTML.pm.src: |<!---x-->| was not processed correctly.
1573    
1574    2007-07-01  Wakaba  <wakaba@suika.fam.cx>
1575    
1576            * HTML.pm.src: Report correct error message
1577            for |<body></div></body>|.
1578    
1579    2007-07-01  Wakaba  <wakaba@suika.fam.cx>
1580    
1581            * HTMLTable.pm: An error description was incorrect.
1582    
1583    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
1584    
1585            * ContentChecker.pm: Return |{term}| list.
1586    
1587    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
1588    
1589            * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1590            nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1591            and <title> outside of head).
1592    
1593    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
1594    
1595            * IMTChecker.pm: Report warning for unregistered
1596            and private types/subtypes.
1597    
1598            * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1599            URIChecker.pm, HTMLTable.pm: Error messages are now
1600            consistent; they are all listed in
1601            <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1602    
1603    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
1604    
1605            * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1606            is now erred.  |<datalist>| is implemented.
1607            Attribute checker for |<command>| and |<menu>| are
1608            added.  Support for |contextmenu| global attribute
1609            is added.
1610    
1611    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
1612    
1613            * HTML.pm.src (_reset_insertion_mode): Interpretation
1614            of Step 3 has been changed.
1615    
1616    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
1617    
1618            * HTML.pm.src: Late |<html>| parse error is implemented.
1619    
1620    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1621    
1622            * URIChecker.pm (check_iri_reference): A |decode| method name was
1623            incorrect.
1624    
1625            * ContentChecker.pm: Support for the |footer| element.
1626            Check URI syntax for space-separated URI attributes.
1627            Support for the |tabindex| attribute.  Support
1628            for |datetime| attribute.
1629    
1630    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1631    
1632            * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1633            character references).  HTML5 revision 1.146 (white space
1634            characters before root start tag).  HTML5 revision
1635            1.148 (named character references in attribute values).
1636            HTML5 revision 1.152 (<plaintext>.innerHTML get).
1637    
1638    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1639    
1640            * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1641            in <head>).
1642    
1643    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1644    
1645            * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1646            in body).
1647    
1648            * ContentChecker.pm: HTML5 revision 938 (scoped="").
1649    
1650    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1651    
1652            * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1653            or RCDATA in fragment parsing mode).  HTML5 revision
1654            924 (<!--> and <!--->).  HTML5 revision 926 (hn in hn).
1655    
1656    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1657    
1658            * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1659            for <pre> and <textarea>).
1660    
1661    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1662    
1663            * HTML.pm.src: HTML5 revision 920 (<isindex>).
1664    
1665  2007-06-23  Wakaba  <wakaba@suika.fam.cx>  2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1666    
1667          * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).          * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).

Legend:
Removed from v.1.45  
changed lines
  Added in v.1.324

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24