/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.45 by wakaba, Sat Jun 23 14:34:39 2007 UTC revision 1.322 by wakaba, Thu Sep 18 07:42:57 2008 UTC
# Line 1  Line 1 
1    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
2    
3            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
4            recommended case as per RFC 4646.
5    
6    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
7    
8            * LangTag.pm (check_rfc4646_langtag): New method.
9    
10    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
11    
12            * mklangreg.pl: New script.
13    
14            * Makefile: Updated for creation of the module for language subtag
15            registry.
16            
17    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
18    
19            * Makefile: WebIDL.html added.
20    
21            * WebIDL.pod: New documentation.
22    
23    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
24    
25            * WebIDL.pm: Checker's error types are redefined.
26    
27    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
28    
29            * WebIDL.pm: Parser's error types are redefined.  Some forward
30            compatible parsing bugs are fixed.  Some unreachable codes are
31            commented out.
32    
33    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
34    
35            * WebIDL.pm: Support for the reminding extended attributes are
36            added.  It does not satisfy the definition that a forward
37            interface declaration has an extended attribute.  It seems that
38            unless explicitly allowed multiple extended attributes with the
39            same name is not allowed, though it is not explicitly mentioned in
40            the spec.
41    
42    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
43    
44            * WebIDL.pm: Unescapes extended attribute names and extended
45            attribute identifiers.  Preserve whether an extended attribute has
46            an argument list of not.  Support for extended attributes:
47            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
48            NameGetter, NameSetter, and Null.
49            (has_argument_list): New attribute.
50            (idl_text): Stringifies argument lists, if any, even if it is
51            empty.
52    
53    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
54    
55            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
56            |{s_kwd}| in DATA_STATE as default.
57    
58    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
59    
60            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
61            by |{s_kwd}| in DATA_STATE.
62    
63    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
64    
65            * HTML.pm.src: Shorten keys.
66    
67    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
68    
69            * HTML.pm.src: Remove checking for control character, surrogate
70            pair, or noncharacter code points and non-Unicode code
71            points (they should be handled by Whatpm::Charset::UnicodeChecker).
72            (parse_char_stream): Support for the |$get_wrapper| argument and
73            character stream error handlers.
74    
75    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
76    
77            * ContentChecker.pm: Don't call |loda_ns_module|
78            for null-namespace elements/attributes.
79    
80            * HTML.pm.src: Fact out $disallowed_control_chars
81            as a hash.
82    
83    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
84    
85            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
86            and |{next_char}| initializations are moved to initialization
87            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
88            with |parse_char_stream|.
89    
90    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
91    
92            * HTML.pm.src (parse_char_stream): Make |set_next_char|
93            invoke |manakai_read_until|, not only |read|, where
94            possible, to decrease the number of |read| method calls.
95    
96            * mkhtmlparser.pl: Related changes to the aforementioned
97            modification.
98    
99    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
100    
101            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
102            would report character error from now.
103    
104    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
105    
106            * HTML.pm.src: White-space-leaded non-white-space character
107            tokens in "before head insertion mode" was not
108            correctly handled.
109            (set_inner_html): Reimplemented using CharString decodehandle
110            class.  Support for $get_wrapper argument.  Support
111            for |{read_until}| feature.
112    
113    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
114    
115            * HTML.pm.src: Make a "bare ero" error for unknown
116            entities point the "&" character.
117    
118    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
119    
120            * HTML.pm.src: It turns out that U+FFFD don't have to
121            be added to the list of excluded characters.
122    
123    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
124    
125            * HTML.pm.src ($char_onerror): Have character decoder's |line|
126            and |column| a higher priority than the one set by the
127            tokenizer's input handler.
128            ($self->{read_until}): Exclude U+FFFD (but this might
129            not be necessary, since now we do line/column fixup in
130            the character decode handle).
131    
132    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
133    
134            * HTML.pm.src: Use |{read_until}| where possible.
135    
136    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
137    
138            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
139            and |manakai_getc_until| to |manakai_read_until| to
140            reduce the number of string copies.
141    
142    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
143    
144            * HTML.pm.src (parse_char_string): Use newly created
145            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
146            standard feature to |open| a string as a filehandle,
147            since Perl's string filehandle seems not supporting |ungetc|
148            method correctly.
149            (parse_char_stream): Define |{getc_until}| method.
150            (DATA_STATE): Experimental support for |getc_until| feature.
151    
152    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
153    
154            * HTML.pm.src: Check points added to newly added branches.
155    
156    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
157    
158            * HTML.pm.src: Remove |{char}|, which is no longer used.
159            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
160            and replaced by |{prev_state}|.
161    
162            * mkhtmlparser.pl: Remove |{char}| feature.
163            Remove |!!!back-next-input-character;| macro.
164    
165    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
166    
167            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
168            entity related tokenizer states in favor of new states
169            implementing the consume character reference algorithm.
170    
171    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
172    
173            * HTML.pm.src: "Consume a character reference" algorithm is
174            now implemented as a tokenizer's state, rather than
175            a method, with minimum changes (more changes will
176            be made, in due course).  "Bogus comment state"'s inner
177            loop gets removed.
178    
179    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
180    
181            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
182            into their own tokenizer states.
183    
184    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
185    
186            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
187            is split into three states.
188    
189    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
190    
191            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
192            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
193            no longer does the tokenizer have to push back next input
194            characters in those states.
195    
196    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
197    
198            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
199            into four states so that no longer does the tokenizer have to push
200            back next input characters in that state.
201    
202    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
203    
204            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
205            which can be used to insert some wrapper between the character
206            stream handle and the tokenizer.  (It is currently not supported
207            for |set_inner_html| for |Element|s).
208    
209    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
210    
211            * HTML.pm.src: Ignore punctuations in charset names.
212    
213    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
214    
215            * ContentChecker.pm: Support for charset-layer error levels.
216    
217            * HTML.pm.src: Don't specify |text| argument for the
218            |chardecode:fallback| error, since it is not the encoding
219            being used alternatively.
220    
221    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
222    
223            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
224    
225    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
226    
227            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
228    
229    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
230    
231            * HTML.pm.src: Bug fix and sync with the spec with regard
232            to after after frameset insertion mode processing (HTML5
233            revision 1909).  Note that the implementation was wrong
234            per the old spec before the r1909 changes.
235    
236    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
237    
238            * HTMLTable.pm: scope=auto algorithm fix synced with the
239            spec (HTML5 revision 2093).
240            ($process_row): Algorithm step numbers synced with the
241            spec (HTML5 revision 2092).
242    
243    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
244    
245            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
246            revision 2094).
247    
248    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
249    
250            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
251    
252    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
253    
254            * HTML.pm.src: '"' and "'" at the end of attribute
255            name (after another attribute) now raise parse error (HTML5
256            revision 2123).  Empty unquoted attribute values are no
257            longer allowed (HTML5 revision 2122).
258    
259    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
260    
261            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
262            revision 2130).
263    
264    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
265    
266            * ContentChecker.pm: |xml:lang| attribute value must be same
267            as |lang| attribute value for HTML elements (HTML5 revision 2062
268            and so on).
269    
270    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
271    
272            * ContentChecker.pm: Error level definition for |xml_id_error|
273            was missing.
274    
275            * URIChecker.pm: The end of the URL should be marked as the
276            error location for an empty path error.  The position
277            between the userinfo and the port components should be
278            marked as the error location for an empty host error.
279    
280    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
281    
282            * URIChecker.pm: Set parameters representing where in the
283            value the error occurs for errors.  Report unknown
284            address format error in warning level, since address
285            formats are rarely added.  Path segments starting with "/.."
286            were misinterpreted as a dot-segment.
287    
288    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
289    
290            * URIChecker.pm (check_iri_reference): Requires
291            |Message::DOM::DOMImplementation|.
292    
293    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
294    
295            * IMTChecker.pm: Updated for the new error reporting architecture.
296    
297            * ContentChecker.pm: Error levels for IMTs are added.
298    
299    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
300    
301            * H2H.pm (_shift_token): Support for unquoted HTML attribute
302            values.
303    
304    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
305    
306            * CacheManifest.pm: Support for new style of error
307            reports.
308    
309            * HTML.pm.src: Set line=1, column=1 to the document node.
310    
311    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
312    
313            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
314            and URL checkers.  Support for more error levels for bogus
315            langauge tag and URL "standards".
316    
317            * LangTag.pm, URIChecker.pm: Support for new style error
318            level reporting.
319    
320    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
321    
322            * ContentChecker.pm: Support for RDF/XML error levels.
323    
324            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
325            specifying.  Error types are revised.
326    
327    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
328    
329            * ContentChecker.pm: All error reporting method calls are
330            renewed.
331    
332    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
333    
334            * HTML.pm.src: All error type names and "text" parameters
335            are revised.  Use new style for "level" specification.
336    
337            * mkhtmlparser.pl: Use new style for "level" specification.
338    
339    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
340    
341            * WebIDL.pm (parse_char_string): Simplified error
342            reporting process for broken ignored valuetype definition.
343            (Valuetype idl_text): Support for special "DOMString" name.
344    
345    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
346    
347            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
348            terminal of the ScopedName is "DOMString", such that whether
349            the last part of the scoped name is "DOMString" or "_DOMString"
350            later.  It is necessary to determine whether a |typedef|
351            definition should be ignored or not.
352            (parse_char_string): Unescape the identifier of
353            exception members.
354            ($resolve): Return undef for builtin types and sequence<T>
355            types (we might not have to do this, however...).
356            (check): Support checking for Exceptions, Valuetypes,
357            and Typedefs.
358            ($serialize_type): Support for "DOMString::::" syntax.
359            (Typedef idl_text): Output Type as "DOMString" if it
360            is really "DOMString" (i.e. its internal representation
361            is "::DOMString::").
362    
363    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
364    
365            * WebIDL.pm ($resolve): New code, based on resolve code
366            for constant types in the |check| method.
367            (check): Support for checking of attributes, operations, and
368            arguments.
369            (Attribute/Operation idl_text): Exception names in getraises,
370            setraises, and raises clauses is serizlied by |$serialize_type|
371            code.
372    
373    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
374    
375            * WebIDL.pm ($integer): Order of selections are changed to match
376            hexadecimal numbers (the original pattern, taken from the spec,
377            was not work for hexadecimal numbers, because the "0" prefix
378            matches to the [0-7]* part (as an empty string) and therefore
379            it does not match with remaining "x..." part of a "0x..." integer
380            literal.
381            ($get_type): It now returns a string, not an array reference,
382            for regular types and |sequence| types (i.e. it in any case
383            returns a string).
384            ($get_next_token): The second item in the array that represents
385            a integer or float token is now a Perl number value, not the
386            original string representation of the number.
387            (check): Support for const value consistency checking.
388            No extended attribute is defined for constants.
389            (Node subclasses): Use simple strings rather than array references
390            for default data type values.
391            ($serialize_type): Type values are now simple strings.
392            (value): If the new attribute value is a false value, then
393            a FALSE value is set to the attribute.
394    
395    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
396    
397            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
398            in its stringified format ("scoped name" as defined in the
399            spec).  Note that future version of this module should not use
400            array references for type values and the |type_text| attribute
401            should be made obsolete.
402            (parse_char_string): Unescape attribute names.
403            (check): Support for checking of whether inherited interfaces
404            are actually defined or not.  Support for checking of whether
405            interface member identifiers are duplicated or not.
406            ($serialize_type): Scoped names are returned as is.  A future
407            version of this code should escape identifiers other than "DOMString",
408            otherwise the idl_text would be non-conforming.
409    
410    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
411    
412            * WebIDL.pm (parse_char_string): Set line/column numbers
413            to generated nodes.  Unescape identifiers.  Extended attributes
414            for Definition's were ignored.
415            (append_child): Set |parent_node| attribute.
416            (parent_node): New attribute.
417            (check): Support interface/exception members.  Support
418            extended attributes.  Support definition identifier uniqueness
419            constraint.
420            (qualified_name): New attribute.
421            (Interface/Exception idl_text): Extended attributes were
422            not prepended to the returned text.
423    
424    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
425    
426            * WebIDL.pm (parse_char_string): Set line/column numbers
427            to interface object experimentally.  s/shift/pop/g, shift
428            would make things wrong.  Support for interface forward
429            declarations was missing.  Broken interface declarations
430            with no block were not ignored entirely.
431            (Whatpm::WebIDL::Node): New abstract class.  This class
432            makes things easier.
433            (child_nodes): New attribute.  Unlike DOM's attribute with
434            same name, this attribute returns a dead list of nodes for
435            simplicity.
436            (get_user_data, set_user_data): New methods.
437            (Module idl_text): A SPACE character should be inserted
438            before the |{| character.
439            (Interface idl_text): Support for interface forward declarations.
440            (is_forward_declaration): New attribute.
441    
442    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
443    
444            * WebIDL.pm (type_text): Better serializer.
445    
446    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
447    
448            * WebIDL.pm: Revise forward-compatible parsing so that
449            it now can handle broken extended attributes and as such.
450    
451    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
452    
453            * WebIDL.pm: Real support for extended attributes.
454            Support for extended attributes with arguments.
455    
456    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
457    
458            * WebIDL.pm: Support for |exception| syntax.
459            (Interface->idl_text): Tentative support for inheritances.
460    
461    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
462    
463            * WebIDL.pm: Hierarchical scoped name support was broken.
464            Support for raises, setraises, and getraises syntaxes.
465    
466    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
467    
468            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
469            proper support for types, extended attributes, and exceptions yet).
470            WebIDL parser, version 1 (no support for exceptions yet,
471            no proper support for extended attributes yet).
472    
473    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
474    
475            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
476    
477    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
478    
479            * WebIDL.pm: Support for valuetype and const.
480    
481    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
482            
483            * WebIDL.pm: New module.
484    
485    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
486    
487            * Makefile (Entities.html): URI changed.
488    
489    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
490    
491            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
492    
493    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
494    
495            * HTML.pm.src (_get_next_token): A parse error was missing.
496    
497    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
498    
499            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
500            spec (commented out). (HTML5 revision 1711).
501    
502    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
503    
504            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
505    
506            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
507            as UTF-16 (HTML5 revision 1701).
508    
509    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
510    
511            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
512            1692).
513    
514    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
515    
516            * HTML.pm.src: The secondary insertion mode used when switching
517            to foreign content is the "in body" insertion mode (HTML5 revision
518            1696).
519    
520    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
521    
522            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
523            revision 1697).
524    
525    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
526    
527            * HTML.pm.src: Support for end-of-file token in foreign content
528            insertion mode (HTML5 revision 1693).  Update SVG camelCase
529            attribute list (HTML5 revision 1700).  <textarea> closes
530            </select> (HTML5 revision 1699).  More start tags close in
531            foreign content insertion mode (HTML5 revision 1698).
532    
533    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
534    
535            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
536    
537    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
538    
539            * HTML.pm.src: More robust charset parameter detection (HTML5
540            revision 1674).
541    
542    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
543    
544            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
545            revision 1676).
546    
547    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
548    
549            * HTML.pm.src: Ignore language part of public identifiers for
550            quriks mode detection (HTML5 revision 1679).
551    
552    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
553    
554            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
555            revision 1685).
556    
557    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
558    
559            * HTML.pm.src: Support for EOF in new states for tags (HTML5
560            revision 1684).
561    
562    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
563    
564            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
565            work (HTML5 revision 1690).
566    
567    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
568    
569            * HTML.pm.src (_tree_construction_main): Change handling of
570            end tags in head insertion modes (HTML5 revision 1686).
571            (parse_char_string): Bug fix for non-utf8 character string handlings.
572            (parse_char_stream): |ungetc| does not work well for this context.
573    
574    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
575    
576            * HTML.pm.src (parse_byte_string): Redefined to invoke
577            |parse_byte_stream|.
578            (parse_byte_stream): New method.
579    
580    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
581    
582            * HTML.pm.src (parse_byte_string): Fix the column number reported
583            by encoding layer error reporter.
584    
585    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
586    
587            * HTML.pm.src (parse_byte_string): Use streaming decoder
588            rather than converting the whole byte string and then parsing.
589            Propagate errors in character encoding layer.
590            (get_next_token): Precise error reporting for |bare stago| error.
591    
592    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
593    
594            * HTML.pm.src (parse_char_stream): New method.
595            (parse_char_string): This method is now defined as an invocation
596            of the |parse_char_stream| method.
597    
598    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
599    
600            * HTML.pm.src (parse_byte_string): Report various status
601            of the sniffing as info-level errors.  Support for new
602            decoding framework in parser resestting.
603            (new): Various default error levels were not set.
604    
605    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
606    
607            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
608            algorithm, except for the actual sniffing, is implemented
609            with new framework with Message::Charset::Info.
610    
611    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
612    
613            * CacheManifest.pm (_parse): Drop fragment identifiers from
614            URIs in fallback section (HTML5 revision 1596).
615    
616    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
617    
618            * Makefile (Entities.html): URI has changed.
619    
620    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
621    
622            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
623            1553).
624    
625    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
626    
627            * ContentChecker.pm: Noted that those returned in |table| are
628            no longer table elements, but table objects returned
629            by Whatpm::HTMLTable.
630    
631            * HTMLTable.pm (form_table): Return table element node
632            as |$table->{element}|.
633            (assign_header): Support for the |headers=""| attribute.
634    
635    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
636    
637            * HTMLTable.pm (assign_header): New function; first version
638            with no support for headers="".
639            (form_table): Include table width and height to the returned
640            table object for covenience.  Indexing in column assignement was wrong.
641            Set whether a data cell is empty or not for convenience.
642    
643    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
644    
645            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
646    
647    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
648    
649            * HTMLTable.pm: How table model errors are detected is
650            changed (HTML5 revision 1387).
651    
652    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
653    
654            * HTMLTable.pm: The algorithm now moves |tfoot| elements
655            to the end of the table (HTML5 revision 1380).
656    
657    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
658    
659            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
660            of 1-based (HTML5 revision 1376).
661    
662    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
663    
664            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
665            and "Content-Encoding" no longer prevent sniffing (HTML5
666            revision 1288).
667    
668    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
669    
670            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
671            1282).
672    
673    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
674    
675            * ContentChecker.pm: Support for global attributes.
676            Status of XML specs are added.
677    
678    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
679    
680            * ContentChecker.pm (check_element): Support for |template=""|
681            and |ref=""| attribute (referent element type checking).
682    
683    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
684    
685            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
686            1500) is implemented (except for IDNA part and URI-scheme-specific
687            knowledge).  Line number counting was wrong for LF-only
688            documents.
689    
690    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
691    
692            * HTML.pm.src: Raise an parse error for any disallowed
693            character (HTML5 revision 1263).
694    
695    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
696    
697            * mkentitylist.pl: Support for new HTML5 entity table format (the
698            definition for |AElig;| was missing).
699    
700    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
701    
702            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
703            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
704    
705    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
706    
707            * HTML.pm.src: List of element names that close foreign content
708            insertion mode is added (HTML5 revisions 1412 and 1418).
709    
710    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
711    
712            * HTML.pm.src: Support for |mglyph| and |malignmark|
713            elements (HTML5 revision 1410).
714    
715    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
716    
717            * HTML.pm.src: Support for new long MathML entities (HTML5
718            revision 1406).
719    
720    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
721    
722            * HTML.pm.src: CDATA section support for MathML and SVG
723            elements (HTML5 revision 1404 and 1420).
724    
725    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
726    
727            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
728            elements (HTML5 revision 1404).  Unused !!!macro definitions
729            are removed.
730    
731    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
732    
733            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
734            are raised is changed (HTML5 revision 1404).
735    
736    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
737    
738            * HTML.pm.src: Category sets that are no longer used are removed.
739    
740    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
741    
742            * HTML.pm.src: The ->[1] property of stack entries are now
743            replaced by constants representing element category.
744    
745    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
746    
747            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
748            for error reporting.  (This is a preparation for using constant
749            value for ->[1].)
750    
751    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
752    
753            * RDFXML.pm: Typo fixed.
754    
755    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
756    
757            * ContentChecker.pm: |fact_level| is now treated
758            as same as |must_level|, i.e. level = |m|.
759            (check_element): Make list of URIs in the DOM.
760    
761    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
762    
763            * RDFXML.pm: Language accessor implemented.
764            Local (null-namespace) attribute support.
765    
766    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
767    
768            * RDFXML.pm: Factored out ID checking code.
769    
770    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
771    
772            * RDFXML.pm: TODO items noted.  Validation of ID
773            and URI attributes is implemented.  Warn if unknown
774            value is used in rdf:parseType="" attribute.
775    
776            * URIChecker.pm (check_rdf_uri_reference): New function.
777    
778    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
779    
780            * RDFXML.pm: bnodeid implemented.  Relative references
781            are now resolved.
782    
783    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
784    
785            * ContentChecker.pm: RDF reification implemented.
786    
787            * RDFXML.pm: undef vs false bug fixed.
788            Reification implemented.
789    
790    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
791    
792            * RDFXML.pm: s/id/ID/ for attribute name.
793            The |node| arguments are added for |ontriple| calls.
794            Too many "attribute not allowed" errors were raised.
795    
796            * ContentChecker.pm: Initial experimental support for rdf:RDF
797            element.
798    
799    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
800    
801            * RDFXML.pm: New module.
802    
803    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
804    
805            * HTML.pm.src (set_inner_html): Line/column number
806            code was old one yet.
807    
808    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
809    
810            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
811            errors.  Line/column markings for DOCTYPE, comment, and
812            character tokens are reintroduced; otherwise,
813            error location for "not HTML5" error and errors
814            for implied elements are not attached.
815    
816    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
817    
818            * HTML.pm.src: Set line/column numbers to attributes.
819    
820            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
821            (value): Setter implemented.
822    
823            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
824    
825    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
826    
827            * HTML.pm.src: Unused line/column markings are removed.
828    
829    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
830    
831            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
832            flag, which is no longer used.
833    
834    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
835    
836            * HTML.pm.src: Set line/column information to element nodes.
837    
838            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
839            and !!!insert-element-t): Set line/column information to
840            element nodes.
841    
842    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
843    
844            * HTML.pm.src (_get_next_token): The first "<" character
845            in "<?", "<>", or "</>" should be the error point.
846    
847    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
848    
849            * HTML.pm.src: Some more fixes on error position reporting.
850    
851    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
852    
853            * HTML.pm.src: Token-level precious error reporting.
854    
855    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
856    
857            * HTML.pm.src: Preparation for more precious error point
858            reporting.
859    
860    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
861    
862            * HTML.pm.src: Error type revised.
863    
864    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
865    
866            * HTML.pm.src: Similar codes are merged together, again.
867    
868    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
869    
870            * HTML.pm.src: Similar codes are merged together.
871    
872    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
873    
874            * mkhtmlparser.pl: Set "level" parameter to parse errors.
875    
876            * HTML.pm.src: Code refined.
877    
878    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
879    
880            * HTML.pm.src: |</body>| treatement has been changed (HTML5
881            revision 1348).  Note that I really don't know this makes
882            any difference in the black-box behavior of the parser.
883    
884    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
885    
886            * HTML.pm.src: New end-of-file token implementation (HTML5
887            revision 1348).
888    
889    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
890    
891            * HTML.pm.src: |applet| support (HTML5 revision 1347).
892    
893    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
894    
895            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
896    
897    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
898    
899            * HTML.pm.src: Support for |<input>| in the "in select" insertion
900            mode and support for the "in select in table" insertion mode (HTML5
901            revision 1342).
902    
903    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
904    
905            * HTML.pm.src: No foster parenting for <script> and <script>
906            in non-tainted <table>s (HTML5 revision 1336).
907    
908    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
909    
910            * HTML.pm.src: Ignore white space characters between <html>
911            and <head> (HTML5 revision 1332).
912    
913    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
914    
915            * HTML.pm.src: Treat <input type=hidden> as if it were a
916            white space (HTML5 revision 1331).
917    
918    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
919    
920            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
921            element (HTML5 revision 1330).
922    
923    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
924    
925            * HTML.pm.src: <title> is always appended to the current
926            element (HTML5 revision 1328).
927    
928    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
929    
930            * HTML.pm.src: White space in tainted tables are moved
931            into foster parents (HTML5 revision 1326).
932    
933    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
934    
935            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
936            revision 1321).
937    
938    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
939    
940            * HTML.pm.src: |</h/n/>| case code rearranged to align with
941            the spec (HTML5 revision 1320).  Note that we finally complete
942            all of HTML5 revision 1320 changes.
943    
944    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
945    
946            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
947            tags (HTML5 revision 1320).
948    
949    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
950    
951            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
952    
953    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
954    
955            * HTML.pm.src: A "generate implied end tags" code (t409.1)
956            could not be reached so that it is now removed (HTML5 revision
957            1320).
958    
959    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
960    
961            * HTML.pm.src: Code for the case of |</div>| and so on are revised
962            to align with new spec text (HTML5 revision 1320).
963    
964    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
965    
966            * HTML.pm.src: Remove strange |if| condition; however, it should
967            have had no harm in theory.
968    
969    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
970    
971            * HTML.pm.src (_tree_construction_main): '</p>' in body
972            case is split from other end tags for the preparation
973            of implementing HTML revision 1320.
974    
975    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
976    
977            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
978            revision 1320).
979    
980    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
981    
982            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
983            and "div" (HTML5 revision 1318).  Add comments to where
984            |form| pointer association codes should be inserted (HTML5 revision
985            1319).
986    
987    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
988    
989            * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
990    
991    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
992    
993            * HTML.pm.src: Since the case t268 should never be reached (no
994            other token type, there are), it is replaced by a |die| statement.
995    
996    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
997    
998            * HTML.pm.src: Typo fixed.
999    
1000    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
1001    
1002            * HTML.pm.src (_tree_construction_initial): Some limited quirks
1003            doctypes were not uppercased for comparison.
1004    
1005    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1006    
1007            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1008            are added.
1009    
1010    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1011    
1012            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1013            are set.  Cases that are unlikely reached are noted as so.
1014    
1015    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1016    
1017            * HTML.pm.src: Checkpoints for debugging are added.
1018    
1019            * mkhtmlparser.pl: Support for |!!!cp| syntax.
1020    
1021    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1022    
1023            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1024            for simplicity.
1025    
1026    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1027    
1028            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1029            revision 1307).
1030    
1031    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1032    
1033            * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1034            is set are changed (HTML5 revision 1305).
1035    
1036    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1037    
1038            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1039            1292).  Entities are not parsed in comment-like part in RCDATA
1040            elements (HTML5 revision 1294).  Allow bare & at the end
1041            of attribute value literals (HTML5 revision 1296).  More
1042            quirks mode doctypes (HTML5 revision 1302).  Requires spaces
1043            between attributes and ban attribute names or unquoted
1044            attribute values containing single or double quotes (HTML5
1045            revision 1303).
1046    
1047    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1048    
1049            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
1050            and related errors unless it is an HTML document (though the spec
1051            is unclear on whether it is applied to XHTML document).
1052    
1053            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1054            are added.
1055    
1056    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1057    
1058            * HTML.pm.src: s/local_name/manakai_local_name/g.
1059    
1060    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
1061    
1062            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1063    
1064            * HTML.pm.src: |charset| in |content| attribute is
1065            case-insensitive (HTML5 revision 1270).
1066    
1067    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
1068    
1069            * ContentChecker.pm: New status constants are added.
1070            ($ElementDefault): |status| added.
1071            (check_element): Err for non-standard or deprecated elements.
1072            (_attr_status_info): For non-standard or deprecated attributes.
1073    
1074    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1075    
1076            * ContentChecker.pm (_attr_status_info): New internal method.
1077    
1078    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1079    
1080            * ContentChecker.pm (check_element): Element standardized
1081            status information is now dispatched.
1082    
1083    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1084    
1085            * ContentChecker.pm (check_element): Fix |del|-and-significant
1086            problem by adding some more arguments.
1087    
1088    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1089    
1090            * ContentChecker.pm (check_element): Use context of
1091            container-for-the-purpose-of-content-model element (not transparent
1092            element) for |check_child_element| calling and significant
1093            text flag marking.  This reintroduces |<del>|-and-significant
1094            problem again.
1095    
1096    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1097    
1098            * ContentChecker.pm (check_element): Make semi-transparent
1099            elements ignored for the purpose of phase changes in
1100            content model checking.
1101    
1102    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1103    
1104            * ContentChecker.pm (check_element): In-element state
1105            was not properly managed for transparent cases.
1106    
1107    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1108    
1109            * ContentChecker.pm (check_element): Support for |video|
1110            and |audio| as semi-transparent elements.
1111    
1112    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1113    
1114            * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1115            (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1116            Support for |html:object| as a semi-transparent element.
1117    
1118    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1119    
1120            * ContentChecker.pm (check_element): The way to traverse
1121            the tree is entirely revised to make it easier to track
1122            the state of ancestors/descendants.  As a result of this
1123            revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1124            support for content model checking for HTML elements |figure|,
1125            |object|, |video|, and |audio| and checking for XML elements (and
1126            some XMLNS checkings) are dropped for now.  They will be
1127            reimplemented in due cource.
1128    
1129    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1130    
1131            * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1132            token (HTML5 revision 1225).
1133    
1134    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1135    
1136            * ContentChecker.pm ({unsupported_level}): New value.
1137    
1138            * HTML.pm.src: Save whether |meta| |content| attribute
1139            contains character references or not.
1140    
1141    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1142    
1143            * ContentChecker.pm (_get_children): (Incompleted) attempt to
1144            imlement significant content checking for contents
1145            with |del| elements.
1146    
1147    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1148    
1149            * ContenteChecker.pm ($HTMLTransparentElements): More
1150            elements are added.
1151            (_get_children): HTML |object| elements are now semi-transparent.
1152    
1153            * NanoDOM.pm (manakai_html, manakai_head): New methods.
1154    
1155    2008-02-16  Wakaba  <wakaba@suika.fam.cx>
1156    
1157            * CacheManifest.pm: HTML5 revision 1211 implemented.
1158    
1159            * CacheManifest.pod: Updated.
1160    
1161    2008-02-10  Wakaba  <wakaba@suika.fam.cx>
1162    
1163            * ContentChecker.pm (check_document, check_element): Support
1164            for second argument ($onsubdoc).
1165            (_get_css_parser): Removed (now it is part of WDCC).
1166    
1167    2008-02-09  Wakaba  <wakaba@suika.fam.cx>
1168    
1169            * ContentChecker.pm (_get_css_parser): New.
1170    
1171    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1172    
1173            * ContentChecker.pm ($AnyChecker): Old way to add child elements
1174            for checking had been used.
1175    
1176    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1177    
1178            * ContentChecker.pm (check_element): New todo item type |descendant|.
1179    
1180    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1181    
1182            * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1183            specced in RFC 2046 and RFC 4288.
1184            (application/atom+xml): Definition added.
1185    
1186    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1187    
1188            * URIChecker.pm: Make RFC 3986 should-level errors
1189            warnings (rather than SHOULD-level errors).
1190    
1191    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1192    
1193            * NanoDOM.pm (get_user_data, set_user_data): New methods.
1194    
1195            * HTML.pm.src: A flag for character references in attribute
1196            values are added.  Set |manakai_has_reference| user data
1197            to |charset| attribute.
1198    
1199    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1200    
1201            * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1202            attributes.
1203    
1204            * ContentChecker.pm (check_document): Warn if charset requirements
1205            cannot be tested.
1206    
1207    2007-11-19  Wakaba  <wakaba@suika.fam.cx>
1208    
1209            * HTML.pm.src (parse_byte_string): Detect charset
1210            by universalchardet if charset parameter is not specified.
1211    
1212            * Makefile (Charset-all, Charset-clean): New rules.
1213    
1214    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1215    
1216            * ContentChecker.pm (check_document): Check the existence
1217            of character encoding declaration and selection of encoding
1218            for HTML document.
1219    
1220    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1221    
1222            * ContentType.pm (get_sniffed_type): Return also the official
1223            type in list context.
1224    
1225    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1226    
1227            * ContentType.pm: Sniffing with leading white space
1228            ignoring (HTML5 revisions 1013 and 1016).
1229    
1230    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1231    
1232            * ContentType.pm: HTML5 revision 1013 changes, except for leading
1233            white spaces, are implemented.
1234    
1235    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1236    
1237            * HTML.pm.src (parse_char_string): Set |inner_encoding|
1238            attribute if possible.
1239    
1240    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1241    
1242            * HTML.pm.src (parse_byte_string): New method.
1243            (parse_char_string): New alias for |parse_string|.
1244            (main phase): Invoking "change the encoding" algorithm if desired.
1245    
1246            * HTML.pod: Updated.
1247    
1248    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1249    
1250            * HTML.pod (get_inner_html): Removed.
1251    
1252            * Makefile (HTML-all, HTML-clean): New.
1253    
1254    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1255    
1256            * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1257    
1258    2007-11-08  Wakaba  <wakaba@suika.fam.cx>
1259    
1260            * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1261    
1262    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1263    
1264            * Makefile: |CacheManifest.html| is added.
1265    
1266            * CacheManifest.pod: New file.
1267    
1268    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1269    
1270            * CacheManifest.pm: New module.
1271    
1272    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1273    
1274            * HTML.pm.src: Support for application cache selection algorithm
1275            callback.
1276    
1277    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1278    
1279            * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1280    
1281    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1282    
1283            * Makefile (clean): New rule.
1284    
1285            * NanoDOM.pm (public_id, system_id): New attributes.
1286    
1287    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1288    
1289            * Makefile (CSS-all, CSS-clean, clean): New rules.
1290    
1291    2007-10-14  Wakaba  <wakaba@suika.fam.cx>
1292    
1293            * ContentChecker.pm (check_document): Support for
1294            new |is_xml_root| flag.
1295            (check_element): Support for new |pluses| state.
1296            (_add_pluses): New method.
1297            (_remove_minuses): Support for new |minus| item.
1298    
1299    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1300    
1301            * ContentChecker.pm: Raise specific error for invalid
1302            root element.
1303    
1304    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1305    
1306            * ContentChecker.pm: Set level values for later uses.
1307    
1308    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1309    
1310            * ContentChecker.pm: Support for language tag validation.
1311    
1312    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1313    
1314            * LangTag.pm (check_rfc3066_language_tag): New method.
1315    
1316    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1317    
1318            * LangTag.pm: New module.
1319    
1320    2007-09-04  Wakaba  <wakaba@suika.fam.cx>
1321    
1322            * HTML.pm.src: Some error types were wrong.
1323    
1324    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1325    
1326            * CSS/: New directory.
1327    
1328    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1329    
1330            * ContentChecker.pm (_check_get_children): Support
1331            for |noscript| in |head|.
1332    
1333    2007-08-12  Wakaba  <wakaba@suika.fam.cx>
1334    
1335            * URI/: New directory.
1336    
1337    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1338    
1339            * HTML.pm.src: Tokenizer's states are now represented in
1340            number.
1341    
1342    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1343    
1344            * HTML.pm.src: |or|s for insertion modes are replaced
1345            by |&|s.
1346    
1347    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1348    
1349            * HTML.pm.src: Token types are now represented in number.
1350    
1351    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1352    
1353            * ContentType.pm (SEE ALSO): Updated.
1354    
1355            * HTML.pm.src: Insertion modes are now represented in number.
1356    
1357    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1358    
1359            * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1360            999) is implemented.
1361    
1362    2007-08-08  Wakaba  <wakaba@suika.fam.cx>
1363    
1364            * ContentType.pm: Sniffing for |<script| (HTML5 revision
1365            983) is implemented.
1366    
1367    2007-08-06  Wakaba  <wakaba@suika.fam.cx>
1368    
1369            * ContentChecker.pod: New documentation.
1370    
1371            * Makefile: A rule for |ContentChecker.html| is added.
1372    
1373            * ContentChecker.pm: A pod "LICENSE" section is added.
1374    
1375            * NanoDOM.pm ($VERSION): New variable.
1376    
1377    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1378    
1379            * H2H.pm: |b|, |i|, and |sub| are added to the
1380            list of allowed HTML elements.
1381    
1382    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1383    
1384            * H2H.pm: |samp| is added to the list of allowed
1385            HTML elements.
1386    
1387            * URIChecker.pm (check_iri): New.
1388            (check_iri_reference): Error type for IRI reference
1389            syntax error is changed.
1390    
1391    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1392    
1393            * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1394            is added.
1395            (check_document): Load appropriate module before validation.
1396    
1397    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1398    
1399            * ContentChecker/: New directory.
1400    
1401    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1402    
1403            * ContentChecker.pm: HTML |time| element is implemented.
1404    
1405            * HTMLTable.pm: Comments are updated as HTML5 is revised.
1406    
1407    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1408    
1409            * ContentChecker.pm (check_document): Return value
1410            even if no document element is found.
1411    
1412    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1413    
1414            * HTML.pm.src: |$in_body| is no longer a function.
1415    
1416    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1417    
1418            * HTML.pm.src: The |$in_body| code has been moved down.
1419    
1420    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1421    
1422            * HTML.pm.src: The "trailing end" insertion mode
1423            is split into "after html body" and "after html frameset"
1424            insertion modes.  Their codes are merged with "after body"
1425            and "after frameset" codes.  |$previous_insertion_mode|
1426            has been removed.  "after frameset" code is
1427            merged with "in frameset" code.
1428    
1429    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1430    
1431            * HTML.pm.src: The "before head" insertion mode is
1432            merged with the "in head" insertion mode.
1433    
1434    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1435    
1436            * HTML.pm.src: Cases in "in head" insertion mode are
1437            reorganized.
1438    
1439    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1440    
1441            * HTML.pm.src: Some cases in "in table" insertion mode
1442            are merged.
1443    
1444    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1445    
1446            * HTML.pm.src: The "in row" insertion mode is merged
1447            with "in table" insertion mode.
1448    
1449    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1450    
1451            * HTML.pm.src: The "in table" and "in table body" insertion
1452            modes are merged.
1453    
1454    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1455    
1456            * HTML.pm.src: There is no "in table head" or "in table foot"
1457            insertion mode!
1458    
1459    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1460    
1461            * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1462            now directly invoke the handler.
1463    
1464    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1465    
1466            * HTML.pm.src: Codes for "in cell" insertion mode
1467            is merged to the "in body" insertion mode code.
1468    
1469    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1470    
1471            * HTML.pm.src: Codes for "in body" and "in caption"
1472            insertion modes are merged.
1473    
1474    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1475    
1476            * HTML.pm.src: Two |!!!next-token|s were missing.
1477    
1478    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1479    
1480            * HTML.pm.src: Use numeric constant for |{content_mode}|
1481            instead of string constant for |{content_model_flag}|.
1482    
1483    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1484    
1485            * HTML.pm.src: Add the name of the attribute
1486            to the "duplicate attribute" error.
1487    
1488    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
1489    
1490            * ContentChecker.pm: Return the |class| node list.
1491    
1492    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
1493    
1494            * ContentChecker.pm: Return the |id| node list.
1495    
1496            * HTML.pm.src: A typo is fixed.
1497    
1498    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1499    
1500            * ContentChecker.pm: Drop wrong |level => 'error'| specification
1501            from "in HTML:xml:lang" error.  Character position
1502            is now the last part of the error type in the URI error
1503            description.  Report "unsupported" status for language
1504            tags, media queries, script codes, and style sheets.
1505    
1506    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1507    
1508            * ContentChecker.pm: Report error if |xml:lang|
1509            in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1510            in XML.
1511    
1512            * NanoDOM.pm (Attr.owner_document): New attribute.
1513    
1514    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1515    
1516            * HTML.pm.src: The character immediately following
1517            a bare |hcro| was discarded.  Fix handling of
1518            entity references in attribute values.
1519    
1520    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1521    
1522            * HTML.pm.src (main and trailin end phases): Token
1523            types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1524            factored out.  Error types |in html:#DOCTYPE|
1525            and |after html:#DOCTYPE| are merged
1526            into |DOCTYPE in the middle|.  |</frameset>|
1527            in fragment parsing mode changed the
1528            insertion mode.
1529    
1530    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1531    
1532            * HTML.pm.src: |$phase| has been removed; The |trailing end|
1533            phase is now an insertion mode.  Treatments for white
1534            space character tokens were incorrect for some
1535            insertion modes.  An old |meta| case was not removed.
1536    
1537    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1538    
1539            * HTML.pm.src: |meta| charset declaration extraction
1540            implemented (but changing the encoding is not yet:-).
1541    
1542    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1543    
1544            * Charset/: New directory.
1545    
1546    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1547    
1548            * H2H.pm: New Perl module (created from
1549            manakai's H2H.dis).
1550    
1551    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1552    
1553            * XMLSerializer.pm: New Perl module (created from
1554            manakai's SimpleLS.dis).
1555    
1556    2007-07-07  Wakaba  <wakaba@suika.fam.cx>
1557    
1558            * HTML.pm.src: |<!---x-->| was not processed correctly.
1559    
1560    2007-07-01  Wakaba  <wakaba@suika.fam.cx>
1561    
1562            * HTML.pm.src: Report correct error message
1563            for |<body></div></body>|.
1564    
1565    2007-07-01  Wakaba  <wakaba@suika.fam.cx>
1566    
1567            * HTMLTable.pm: An error description was incorrect.
1568    
1569    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
1570    
1571            * ContentChecker.pm: Return |{term}| list.
1572    
1573    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
1574    
1575            * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1576            nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1577            and <title> outside of head).
1578    
1579    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
1580    
1581            * IMTChecker.pm: Report warning for unregistered
1582            and private types/subtypes.
1583    
1584            * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1585            URIChecker.pm, HTMLTable.pm: Error messages are now
1586            consistent; they are all listed in
1587            <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1588    
1589    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
1590    
1591            * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1592            is now erred.  |<datalist>| is implemented.
1593            Attribute checker for |<command>| and |<menu>| are
1594            added.  Support for |contextmenu| global attribute
1595            is added.
1596    
1597    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
1598    
1599            * HTML.pm.src (_reset_insertion_mode): Interpretation
1600            of Step 3 has been changed.
1601    
1602    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
1603    
1604            * HTML.pm.src: Late |<html>| parse error is implemented.
1605    
1606    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1607    
1608            * URIChecker.pm (check_iri_reference): A |decode| method name was
1609            incorrect.
1610    
1611            * ContentChecker.pm: Support for the |footer| element.
1612            Check URI syntax for space-separated URI attributes.
1613            Support for the |tabindex| attribute.  Support
1614            for |datetime| attribute.
1615    
1616    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1617    
1618            * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1619            character references).  HTML5 revision 1.146 (white space
1620            characters before root start tag).  HTML5 revision
1621            1.148 (named character references in attribute values).
1622            HTML5 revision 1.152 (<plaintext>.innerHTML get).
1623    
1624    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1625    
1626            * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1627            in <head>).
1628    
1629    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1630    
1631            * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1632            in body).
1633    
1634            * ContentChecker.pm: HTML5 revision 938 (scoped="").
1635    
1636    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1637    
1638            * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1639            or RCDATA in fragment parsing mode).  HTML5 revision
1640            924 (<!--> and <!--->).  HTML5 revision 926 (hn in hn).
1641    
1642    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1643    
1644            * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1645            for <pre> and <textarea>).
1646    
1647    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1648    
1649            * HTML.pm.src: HTML5 revision 920 (<isindex>).
1650    
1651  2007-06-23  Wakaba  <wakaba@suika.fam.cx>  2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1652    
1653          * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).          * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).

Legend:
Removed from v.1.45  
changed lines
  Added in v.1.322

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24