/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.57 by wakaba, Sun Jul 1 04:46:48 2007 UTC revision 1.335 by wakaba, Thu Oct 2 10:59:04 2008 UTC
# Line 1  Line 1 
1    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
2    
3            * ContentType.pm (get_sniffed_type): Support for the "better"
4            content sniffing (HTML5 revision 1927).  In a case the official
5            type was not returned when the method is invoked in the list
6            context.
7    
8    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
9    
10            * HTML.pm.src: Character references for non-space C0 characters,
11            including U+000B VT, DEL character, noncharacter code points, are
12            now converted to the U+FFFD character (cf. HTML5 revision 2138).
13    
14    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
15    
16            * ContentChecker.pm: |form=""| check support added.
17    
18    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
19    
20            * ContentChecker.pm: |contextmenu| validness is now checked using
21            |id| and |id_type| properties, and |menu| property is removed.
22    
23    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
24    
25            * ContentChecker.pm: Prepare for |form| |name| attribute's
26            duplication checking.
27    
28    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
29    
30            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
31            unsupported charset="" parameter value (HTML5 revision 2131).
32    
33    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
34    
35            * HTML.pm.src: Reminding places where U+000B is allowed as a space
36            character is fixed (cf. HTML5 revision 1738).
37    
38            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
39            space characters (HTML5 revision 1738).
40    
41    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
42    
43            * HTML.pm.src: The "anything else" case for the "after after body"
44            insertion mode was not updated to swtich to the "in body"
45            insertion mode.  U+000B is no longer a space character for the
46            purpose of tree construction phase (HTML5 revision 1738).
47    
48    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
49    
50            * HTML.pm.src: U+000B is no longer a space character (HTML5
51            revision 1738).
52    
53    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
54    
55            * ContentType.pm: 0x0B is no longer a space character (HTML5
56            revision 1738).
57    
58            * HTML.pm.src: U+000B is no longer a space character for the
59            algorithm for extracting an encoding from a Content-Type (HTML5
60            revision 1738).
61    
62    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
63    
64            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
65    
66    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
67    
68            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
69    
70            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
71            matching.
72    
73    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
74    
75            * LangTag.pm: Warn for private use language subtags.  Error level
76            typos fixed.  Support for Suppress-Script field.
77    
78            * mklangreg.pl: Support for dumping of nested structure.
79    
80    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
81    
82            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
83            recommended case as per RFC 4646.
84    
85    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
86    
87            * LangTag.pm (check_rfc4646_langtag): New method.
88    
89    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
90    
91            * mklangreg.pl: New script.
92    
93            * Makefile: Updated for creation of the module for language subtag
94            registry.
95            
96    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
97    
98            * Makefile: WebIDL.html added.
99    
100            * WebIDL.pod: New documentation.
101    
102    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
103    
104            * WebIDL.pm: Checker's error types are redefined.
105    
106    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
107    
108            * WebIDL.pm: Parser's error types are redefined.  Some forward
109            compatible parsing bugs are fixed.  Some unreachable codes are
110            commented out.
111    
112    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
113    
114            * WebIDL.pm: Support for the reminding extended attributes are
115            added.  It does not satisfy the definition that a forward
116            interface declaration has an extended attribute.  It seems that
117            unless explicitly allowed multiple extended attributes with the
118            same name is not allowed, though it is not explicitly mentioned in
119            the spec.
120    
121    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
122    
123            * WebIDL.pm: Unescapes extended attribute names and extended
124            attribute identifiers.  Preserve whether an extended attribute has
125            an argument list of not.  Support for extended attributes:
126            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
127            NameGetter, NameSetter, and Null.
128            (has_argument_list): New attribute.
129            (idl_text): Stringifies argument lists, if any, even if it is
130            empty.
131    
132    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
133    
134            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
135            |{s_kwd}| in DATA_STATE as default.
136    
137    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
138    
139            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
140            by |{s_kwd}| in DATA_STATE.
141    
142    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
143    
144            * HTML.pm.src: Shorten keys.
145    
146    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
147    
148            * HTML.pm.src: Remove checking for control character, surrogate
149            pair, or noncharacter code points and non-Unicode code
150            points (they should be handled by Whatpm::Charset::UnicodeChecker).
151            (parse_char_stream): Support for the |$get_wrapper| argument and
152            character stream error handlers.
153    
154    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
155    
156            * ContentChecker.pm: Don't call |loda_ns_module|
157            for null-namespace elements/attributes.
158    
159            * HTML.pm.src: Fact out $disallowed_control_chars
160            as a hash.
161    
162    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
163    
164            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
165            and |{next_char}| initializations are moved to initialization
166            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
167            with |parse_char_stream|.
168    
169    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
170    
171            * HTML.pm.src (parse_char_stream): Make |set_next_char|
172            invoke |manakai_read_until|, not only |read|, where
173            possible, to decrease the number of |read| method calls.
174    
175            * mkhtmlparser.pl: Related changes to the aforementioned
176            modification.
177    
178    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
179    
180            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
181            would report character error from now.
182    
183    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
184    
185            * HTML.pm.src: White-space-leaded non-white-space character
186            tokens in "before head insertion mode" was not
187            correctly handled.
188            (set_inner_html): Reimplemented using CharString decodehandle
189            class.  Support for $get_wrapper argument.  Support
190            for |{read_until}| feature.
191    
192    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
193    
194            * HTML.pm.src: Make a "bare ero" error for unknown
195            entities point the "&" character.
196    
197    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
198    
199            * HTML.pm.src: It turns out that U+FFFD don't have to
200            be added to the list of excluded characters.
201    
202    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
203    
204            * HTML.pm.src ($char_onerror): Have character decoder's |line|
205            and |column| a higher priority than the one set by the
206            tokenizer's input handler.
207            ($self->{read_until}): Exclude U+FFFD (but this might
208            not be necessary, since now we do line/column fixup in
209            the character decode handle).
210    
211    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
212    
213            * HTML.pm.src: Use |{read_until}| where possible.
214    
215    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
216    
217            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
218            and |manakai_getc_until| to |manakai_read_until| to
219            reduce the number of string copies.
220    
221    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
222    
223            * HTML.pm.src (parse_char_string): Use newly created
224            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
225            standard feature to |open| a string as a filehandle,
226            since Perl's string filehandle seems not supporting |ungetc|
227            method correctly.
228            (parse_char_stream): Define |{getc_until}| method.
229            (DATA_STATE): Experimental support for |getc_until| feature.
230    
231    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
232    
233            * HTML.pm.src: Check points added to newly added branches.
234    
235    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
236    
237            * HTML.pm.src: Remove |{char}|, which is no longer used.
238            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
239            and replaced by |{prev_state}|.
240    
241            * mkhtmlparser.pl: Remove |{char}| feature.
242            Remove |!!!back-next-input-character;| macro.
243    
244    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
245    
246            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
247            entity related tokenizer states in favor of new states
248            implementing the consume character reference algorithm.
249    
250    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
251    
252            * HTML.pm.src: "Consume a character reference" algorithm is
253            now implemented as a tokenizer's state, rather than
254            a method, with minimum changes (more changes will
255            be made, in due course).  "Bogus comment state"'s inner
256            loop gets removed.
257    
258    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
259    
260            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
261            into their own tokenizer states.
262    
263    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
264    
265            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
266            is split into three states.
267    
268    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
269    
270            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
271            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
272            no longer does the tokenizer have to push back next input
273            characters in those states.
274    
275    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
276    
277            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
278            into four states so that no longer does the tokenizer have to push
279            back next input characters in that state.
280    
281    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
282    
283            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
284            which can be used to insert some wrapper between the character
285            stream handle and the tokenizer.  (It is currently not supported
286            for |set_inner_html| for |Element|s).
287    
288    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
289    
290            * HTML.pm.src: Ignore punctuations in charset names.
291    
292    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
293    
294            * ContentChecker.pm: Support for charset-layer error levels.
295    
296            * HTML.pm.src: Don't specify |text| argument for the
297            |chardecode:fallback| error, since it is not the encoding
298            being used alternatively.
299    
300    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
301    
302            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
303    
304    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
305    
306            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
307    
308    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
309    
310            * HTML.pm.src: Bug fix and sync with the spec with regard
311            to after after frameset insertion mode processing (HTML5
312            revision 1909).  Note that the implementation was wrong
313            per the old spec before the r1909 changes.
314    
315    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
316    
317            * HTMLTable.pm: scope=auto algorithm fix synced with the
318            spec (HTML5 revision 2093).
319            ($process_row): Algorithm step numbers synced with the
320            spec (HTML5 revision 2092).
321    
322    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
323    
324            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
325            revision 2094).
326    
327    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
328    
329            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
330    
331    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
332    
333            * HTML.pm.src: '"' and "'" at the end of attribute
334            name (after another attribute) now raise parse error (HTML5
335            revision 2123).  Empty unquoted attribute values are no
336            longer allowed (HTML5 revision 2122).
337    
338    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
339    
340            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
341            revision 2130).
342    
343    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
344    
345            * ContentChecker.pm: |xml:lang| attribute value must be same
346            as |lang| attribute value for HTML elements (HTML5 revision 2062
347            and so on).
348    
349    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
350    
351            * ContentChecker.pm: Error level definition for |xml_id_error|
352            was missing.
353    
354            * URIChecker.pm: The end of the URL should be marked as the
355            error location for an empty path error.  The position
356            between the userinfo and the port components should be
357            marked as the error location for an empty host error.
358    
359    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
360    
361            * URIChecker.pm: Set parameters representing where in the
362            value the error occurs for errors.  Report unknown
363            address format error in warning level, since address
364            formats are rarely added.  Path segments starting with "/.."
365            were misinterpreted as a dot-segment.
366    
367    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
368    
369            * URIChecker.pm (check_iri_reference): Requires
370            |Message::DOM::DOMImplementation|.
371    
372    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
373    
374            * IMTChecker.pm: Updated for the new error reporting architecture.
375    
376            * ContentChecker.pm: Error levels for IMTs are added.
377    
378    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
379    
380            * H2H.pm (_shift_token): Support for unquoted HTML attribute
381            values.
382    
383    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
384    
385            * CacheManifest.pm: Support for new style of error
386            reports.
387    
388            * HTML.pm.src: Set line=1, column=1 to the document node.
389    
390    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
391    
392            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
393            and URL checkers.  Support for more error levels for bogus
394            langauge tag and URL "standards".
395    
396            * LangTag.pm, URIChecker.pm: Support for new style error
397            level reporting.
398    
399    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
400    
401            * ContentChecker.pm: Support for RDF/XML error levels.
402    
403            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
404            specifying.  Error types are revised.
405    
406    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
407    
408            * ContentChecker.pm: All error reporting method calls are
409            renewed.
410    
411    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
412    
413            * HTML.pm.src: All error type names and "text" parameters
414            are revised.  Use new style for "level" specification.
415    
416            * mkhtmlparser.pl: Use new style for "level" specification.
417    
418    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
419    
420            * WebIDL.pm (parse_char_string): Simplified error
421            reporting process for broken ignored valuetype definition.
422            (Valuetype idl_text): Support for special "DOMString" name.
423    
424    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
425    
426            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
427            terminal of the ScopedName is "DOMString", such that whether
428            the last part of the scoped name is "DOMString" or "_DOMString"
429            later.  It is necessary to determine whether a |typedef|
430            definition should be ignored or not.
431            (parse_char_string): Unescape the identifier of
432            exception members.
433            ($resolve): Return undef for builtin types and sequence<T>
434            types (we might not have to do this, however...).
435            (check): Support checking for Exceptions, Valuetypes,
436            and Typedefs.
437            ($serialize_type): Support for "DOMString::::" syntax.
438            (Typedef idl_text): Output Type as "DOMString" if it
439            is really "DOMString" (i.e. its internal representation
440            is "::DOMString::").
441    
442    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
443    
444            * WebIDL.pm ($resolve): New code, based on resolve code
445            for constant types in the |check| method.
446            (check): Support for checking of attributes, operations, and
447            arguments.
448            (Attribute/Operation idl_text): Exception names in getraises,
449            setraises, and raises clauses is serizlied by |$serialize_type|
450            code.
451    
452    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
453    
454            * WebIDL.pm ($integer): Order of selections are changed to match
455            hexadecimal numbers (the original pattern, taken from the spec,
456            was not work for hexadecimal numbers, because the "0" prefix
457            matches to the [0-7]* part (as an empty string) and therefore
458            it does not match with remaining "x..." part of a "0x..." integer
459            literal.
460            ($get_type): It now returns a string, not an array reference,
461            for regular types and |sequence| types (i.e. it in any case
462            returns a string).
463            ($get_next_token): The second item in the array that represents
464            a integer or float token is now a Perl number value, not the
465            original string representation of the number.
466            (check): Support for const value consistency checking.
467            No extended attribute is defined for constants.
468            (Node subclasses): Use simple strings rather than array references
469            for default data type values.
470            ($serialize_type): Type values are now simple strings.
471            (value): If the new attribute value is a false value, then
472            a FALSE value is set to the attribute.
473    
474    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
475    
476            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
477            in its stringified format ("scoped name" as defined in the
478            spec).  Note that future version of this module should not use
479            array references for type values and the |type_text| attribute
480            should be made obsolete.
481            (parse_char_string): Unescape attribute names.
482            (check): Support for checking of whether inherited interfaces
483            are actually defined or not.  Support for checking of whether
484            interface member identifiers are duplicated or not.
485            ($serialize_type): Scoped names are returned as is.  A future
486            version of this code should escape identifiers other than "DOMString",
487            otherwise the idl_text would be non-conforming.
488    
489    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
490    
491            * WebIDL.pm (parse_char_string): Set line/column numbers
492            to generated nodes.  Unescape identifiers.  Extended attributes
493            for Definition's were ignored.
494            (append_child): Set |parent_node| attribute.
495            (parent_node): New attribute.
496            (check): Support interface/exception members.  Support
497            extended attributes.  Support definition identifier uniqueness
498            constraint.
499            (qualified_name): New attribute.
500            (Interface/Exception idl_text): Extended attributes were
501            not prepended to the returned text.
502    
503    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
504    
505            * WebIDL.pm (parse_char_string): Set line/column numbers
506            to interface object experimentally.  s/shift/pop/g, shift
507            would make things wrong.  Support for interface forward
508            declarations was missing.  Broken interface declarations
509            with no block were not ignored entirely.
510            (Whatpm::WebIDL::Node): New abstract class.  This class
511            makes things easier.
512            (child_nodes): New attribute.  Unlike DOM's attribute with
513            same name, this attribute returns a dead list of nodes for
514            simplicity.
515            (get_user_data, set_user_data): New methods.
516            (Module idl_text): A SPACE character should be inserted
517            before the |{| character.
518            (Interface idl_text): Support for interface forward declarations.
519            (is_forward_declaration): New attribute.
520    
521    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
522    
523            * WebIDL.pm (type_text): Better serializer.
524    
525    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
526    
527            * WebIDL.pm: Revise forward-compatible parsing so that
528            it now can handle broken extended attributes and as such.
529    
530    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
531    
532            * WebIDL.pm: Real support for extended attributes.
533            Support for extended attributes with arguments.
534    
535    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
536    
537            * WebIDL.pm: Support for |exception| syntax.
538            (Interface->idl_text): Tentative support for inheritances.
539    
540    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
541    
542            * WebIDL.pm: Hierarchical scoped name support was broken.
543            Support for raises, setraises, and getraises syntaxes.
544    
545    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
546    
547            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
548            proper support for types, extended attributes, and exceptions yet).
549            WebIDL parser, version 1 (no support for exceptions yet,
550            no proper support for extended attributes yet).
551    
552    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
553    
554            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
555    
556    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
557    
558            * WebIDL.pm: Support for valuetype and const.
559    
560    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
561            
562            * WebIDL.pm: New module.
563    
564    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
565    
566            * Makefile (Entities.html): URI changed.
567    
568    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
569    
570            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
571    
572    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
573    
574            * HTML.pm.src (_get_next_token): A parse error was missing.
575    
576    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
577    
578            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
579            spec (commented out). (HTML5 revision 1711).
580    
581    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
582    
583            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
584    
585            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
586            as UTF-16 (HTML5 revision 1701).
587    
588    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
589    
590            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
591            1692).
592    
593    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
594    
595            * HTML.pm.src: The secondary insertion mode used when switching
596            to foreign content is the "in body" insertion mode (HTML5 revision
597            1696).
598    
599    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
600    
601            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
602            revision 1697).
603    
604    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
605    
606            * HTML.pm.src: Support for end-of-file token in foreign content
607            insertion mode (HTML5 revision 1693).  Update SVG camelCase
608            attribute list (HTML5 revision 1700).  <textarea> closes
609            </select> (HTML5 revision 1699).  More start tags close in
610            foreign content insertion mode (HTML5 revision 1698).
611    
612    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
613    
614            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
615    
616    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
617    
618            * HTML.pm.src: More robust charset parameter detection (HTML5
619            revision 1674).
620    
621    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
622    
623            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
624            revision 1676).
625    
626    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
627    
628            * HTML.pm.src: Ignore language part of public identifiers for
629            quriks mode detection (HTML5 revision 1679).
630    
631    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
632    
633            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
634            revision 1685).
635    
636    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
637    
638            * HTML.pm.src: Support for EOF in new states for tags (HTML5
639            revision 1684).
640    
641    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
642    
643            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
644            work (HTML5 revision 1690).
645    
646    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
647    
648            * HTML.pm.src (_tree_construction_main): Change handling of
649            end tags in head insertion modes (HTML5 revision 1686).
650            (parse_char_string): Bug fix for non-utf8 character string handlings.
651            (parse_char_stream): |ungetc| does not work well for this context.
652    
653    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
654    
655            * HTML.pm.src (parse_byte_string): Redefined to invoke
656            |parse_byte_stream|.
657            (parse_byte_stream): New method.
658    
659    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
660    
661            * HTML.pm.src (parse_byte_string): Fix the column number reported
662            by encoding layer error reporter.
663    
664    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
665    
666            * HTML.pm.src (parse_byte_string): Use streaming decoder
667            rather than converting the whole byte string and then parsing.
668            Propagate errors in character encoding layer.
669            (get_next_token): Precise error reporting for |bare stago| error.
670    
671    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
672    
673            * HTML.pm.src (parse_char_stream): New method.
674            (parse_char_string): This method is now defined as an invocation
675            of the |parse_char_stream| method.
676    
677    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
678    
679            * HTML.pm.src (parse_byte_string): Report various status
680            of the sniffing as info-level errors.  Support for new
681            decoding framework in parser resestting.
682            (new): Various default error levels were not set.
683    
684    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
685    
686            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
687            algorithm, except for the actual sniffing, is implemented
688            with new framework with Message::Charset::Info.
689    
690    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
691    
692            * CacheManifest.pm (_parse): Drop fragment identifiers from
693            URIs in fallback section (HTML5 revision 1596).
694    
695    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
696    
697            * Makefile (Entities.html): URI has changed.
698    
699    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
700    
701            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
702            1553).
703    
704    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
705    
706            * ContentChecker.pm: Noted that those returned in |table| are
707            no longer table elements, but table objects returned
708            by Whatpm::HTMLTable.
709    
710            * HTMLTable.pm (form_table): Return table element node
711            as |$table->{element}|.
712            (assign_header): Support for the |headers=""| attribute.
713    
714    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
715    
716            * HTMLTable.pm (assign_header): New function; first version
717            with no support for headers="".
718            (form_table): Include table width and height to the returned
719            table object for covenience.  Indexing in column assignement was wrong.
720            Set whether a data cell is empty or not for convenience.
721    
722    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
723    
724            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
725    
726    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
727    
728            * HTMLTable.pm: How table model errors are detected is
729            changed (HTML5 revision 1387).
730    
731    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
732    
733            * HTMLTable.pm: The algorithm now moves |tfoot| elements
734            to the end of the table (HTML5 revision 1380).
735    
736    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
737    
738            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
739            of 1-based (HTML5 revision 1376).
740    
741    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
742    
743            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
744            and "Content-Encoding" no longer prevent sniffing (HTML5
745            revision 1288).
746    
747    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
748    
749            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
750            1282).
751    
752    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
753    
754            * ContentChecker.pm: Support for global attributes.
755            Status of XML specs are added.
756    
757    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
758    
759            * ContentChecker.pm (check_element): Support for |template=""|
760            and |ref=""| attribute (referent element type checking).
761    
762    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
763    
764            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
765            1500) is implemented (except for IDNA part and URI-scheme-specific
766            knowledge).  Line number counting was wrong for LF-only
767            documents.
768    
769    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
770    
771            * HTML.pm.src: Raise an parse error for any disallowed
772            character (HTML5 revision 1263).
773    
774    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
775    
776            * mkentitylist.pl: Support for new HTML5 entity table format (the
777            definition for |AElig;| was missing).
778    
779    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
780    
781            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
782            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
783    
784    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
785    
786            * HTML.pm.src: List of element names that close foreign content
787            insertion mode is added (HTML5 revisions 1412 and 1418).
788    
789    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
790    
791            * HTML.pm.src: Support for |mglyph| and |malignmark|
792            elements (HTML5 revision 1410).
793    
794    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
795    
796            * HTML.pm.src: Support for new long MathML entities (HTML5
797            revision 1406).
798    
799    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
800    
801            * HTML.pm.src: CDATA section support for MathML and SVG
802            elements (HTML5 revision 1404 and 1420).
803    
804    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
805    
806            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
807            elements (HTML5 revision 1404).  Unused !!!macro definitions
808            are removed.
809    
810    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
811    
812            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
813            are raised is changed (HTML5 revision 1404).
814    
815    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
816    
817            * HTML.pm.src: Category sets that are no longer used are removed.
818    
819    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
820    
821            * HTML.pm.src: The ->[1] property of stack entries are now
822            replaced by constants representing element category.
823    
824    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
825    
826            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
827            for error reporting.  (This is a preparation for using constant
828            value for ->[1].)
829    
830    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
831    
832            * RDFXML.pm: Typo fixed.
833    
834    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
835    
836            * ContentChecker.pm: |fact_level| is now treated
837            as same as |must_level|, i.e. level = |m|.
838            (check_element): Make list of URIs in the DOM.
839    
840    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
841    
842            * RDFXML.pm: Language accessor implemented.
843            Local (null-namespace) attribute support.
844    
845    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
846    
847            * RDFXML.pm: Factored out ID checking code.
848    
849    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
850    
851            * RDFXML.pm: TODO items noted.  Validation of ID
852            and URI attributes is implemented.  Warn if unknown
853            value is used in rdf:parseType="" attribute.
854    
855            * URIChecker.pm (check_rdf_uri_reference): New function.
856    
857    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
858    
859            * RDFXML.pm: bnodeid implemented.  Relative references
860            are now resolved.
861    
862    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
863    
864            * ContentChecker.pm: RDF reification implemented.
865    
866            * RDFXML.pm: undef vs false bug fixed.
867            Reification implemented.
868    
869    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
870    
871            * RDFXML.pm: s/id/ID/ for attribute name.
872            The |node| arguments are added for |ontriple| calls.
873            Too many "attribute not allowed" errors were raised.
874    
875            * ContentChecker.pm: Initial experimental support for rdf:RDF
876            element.
877    
878    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
879    
880            * RDFXML.pm: New module.
881    
882    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
883    
884            * HTML.pm.src (set_inner_html): Line/column number
885            code was old one yet.
886    
887    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
888    
889            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
890            errors.  Line/column markings for DOCTYPE, comment, and
891            character tokens are reintroduced; otherwise,
892            error location for "not HTML5" error and errors
893            for implied elements are not attached.
894    
895    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
896    
897            * HTML.pm.src: Set line/column numbers to attributes.
898    
899            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
900            (value): Setter implemented.
901    
902            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
903    
904    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
905    
906            * HTML.pm.src: Unused line/column markings are removed.
907    
908    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
909    
910            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
911            flag, which is no longer used.
912    
913    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
914    
915            * HTML.pm.src: Set line/column information to element nodes.
916    
917            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
918            and !!!insert-element-t): Set line/column information to
919            element nodes.
920    
921    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
922    
923            * HTML.pm.src (_get_next_token): The first "<" character
924            in "<?", "<>", or "</>" should be the error point.
925    
926    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
927    
928            * HTML.pm.src: Some more fixes on error position reporting.
929    
930    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
931    
932            * HTML.pm.src: Token-level precious error reporting.
933    
934    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
935    
936            * HTML.pm.src: Preparation for more precious error point
937            reporting.
938    
939    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
940    
941            * HTML.pm.src: Error type revised.
942    
943    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
944    
945            * HTML.pm.src: Similar codes are merged together, again.
946    
947    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
948    
949            * HTML.pm.src: Similar codes are merged together.
950    
951    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
952    
953            * mkhtmlparser.pl: Set "level" parameter to parse errors.
954    
955            * HTML.pm.src: Code refined.
956    
957    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
958    
959            * HTML.pm.src: |</body>| treatement has been changed (HTML5
960            revision 1348).  Note that I really don't know this makes
961            any difference in the black-box behavior of the parser.
962    
963    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
964    
965            * HTML.pm.src: New end-of-file token implementation (HTML5
966            revision 1348).
967    
968    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
969    
970            * HTML.pm.src: |applet| support (HTML5 revision 1347).
971    
972    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
973    
974            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
975    
976    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
977    
978            * HTML.pm.src: Support for |<input>| in the "in select" insertion
979            mode and support for the "in select in table" insertion mode (HTML5
980            revision 1342).
981    
982    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
983    
984            * HTML.pm.src: No foster parenting for <script> and <script>
985            in non-tainted <table>s (HTML5 revision 1336).
986    
987    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
988    
989            * HTML.pm.src: Ignore white space characters between <html>
990            and <head> (HTML5 revision 1332).
991    
992    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
993    
994            * HTML.pm.src: Treat <input type=hidden> as if it were a
995            white space (HTML5 revision 1331).
996    
997    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
998    
999            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1000            element (HTML5 revision 1330).
1001    
1002    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1003    
1004            * HTML.pm.src: <title> is always appended to the current
1005            element (HTML5 revision 1328).
1006    
1007    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1008    
1009            * HTML.pm.src: White space in tainted tables are moved
1010            into foster parents (HTML5 revision 1326).
1011    
1012    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1013    
1014            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1015            revision 1321).
1016    
1017    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1018    
1019            * HTML.pm.src: |</h/n/>| case code rearranged to align with
1020            the spec (HTML5 revision 1320).  Note that we finally complete
1021            all of HTML5 revision 1320 changes.
1022    
1023    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1024    
1025            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1026            tags (HTML5 revision 1320).
1027    
1028    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1029    
1030            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1031    
1032    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1033    
1034            * HTML.pm.src: A "generate implied end tags" code (t409.1)
1035            could not be reached so that it is now removed (HTML5 revision
1036            1320).
1037    
1038    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1039    
1040            * HTML.pm.src: Code for the case of |</div>| and so on are revised
1041            to align with new spec text (HTML5 revision 1320).
1042    
1043    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1044    
1045            * HTML.pm.src: Remove strange |if| condition; however, it should
1046            have had no harm in theory.
1047    
1048    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1049    
1050            * HTML.pm.src (_tree_construction_main): '</p>' in body
1051            case is split from other end tags for the preparation
1052            of implementing HTML revision 1320.
1053    
1054    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1055    
1056            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1057            revision 1320).
1058    
1059    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1060    
1061            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1062            and "div" (HTML5 revision 1318).  Add comments to where
1063            |form| pointer association codes should be inserted (HTML5 revision
1064            1319).
1065    
1066    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
1067    
1068            * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1069    
1070    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1071    
1072            * HTML.pm.src: Since the case t268 should never be reached (no
1073            other token type, there are), it is replaced by a |die| statement.
1074    
1075    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1076    
1077            * HTML.pm.src: Typo fixed.
1078    
1079    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
1080    
1081            * HTML.pm.src (_tree_construction_initial): Some limited quirks
1082            doctypes were not uppercased for comparison.
1083    
1084    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1085    
1086            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1087            are added.
1088    
1089    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1090    
1091            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1092            are set.  Cases that are unlikely reached are noted as so.
1093    
1094    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1095    
1096            * HTML.pm.src: Checkpoints for debugging are added.
1097    
1098            * mkhtmlparser.pl: Support for |!!!cp| syntax.
1099    
1100    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1101    
1102            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1103            for simplicity.
1104    
1105    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1106    
1107            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1108            revision 1307).
1109    
1110    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1111    
1112            * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1113            is set are changed (HTML5 revision 1305).
1114    
1115    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1116    
1117            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1118            1292).  Entities are not parsed in comment-like part in RCDATA
1119            elements (HTML5 revision 1294).  Allow bare & at the end
1120            of attribute value literals (HTML5 revision 1296).  More
1121            quirks mode doctypes (HTML5 revision 1302).  Requires spaces
1122            between attributes and ban attribute names or unquoted
1123            attribute values containing single or double quotes (HTML5
1124            revision 1303).
1125    
1126    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1127    
1128            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
1129            and related errors unless it is an HTML document (though the spec
1130            is unclear on whether it is applied to XHTML document).
1131    
1132            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1133            are added.
1134    
1135    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1136    
1137            * HTML.pm.src: s/local_name/manakai_local_name/g.
1138    
1139    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
1140    
1141            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1142    
1143            * HTML.pm.src: |charset| in |content| attribute is
1144            case-insensitive (HTML5 revision 1270).
1145    
1146    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
1147    
1148            * ContentChecker.pm: New status constants are added.
1149            ($ElementDefault): |status| added.
1150            (check_element): Err for non-standard or deprecated elements.
1151            (_attr_status_info): For non-standard or deprecated attributes.
1152    
1153    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1154    
1155            * ContentChecker.pm (_attr_status_info): New internal method.
1156    
1157    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1158    
1159            * ContentChecker.pm (check_element): Element standardized
1160            status information is now dispatched.
1161    
1162    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1163    
1164            * ContentChecker.pm (check_element): Fix |del|-and-significant
1165            problem by adding some more arguments.
1166    
1167    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1168    
1169            * ContentChecker.pm (check_element): Use context of
1170            container-for-the-purpose-of-content-model element (not transparent
1171            element) for |check_child_element| calling and significant
1172            text flag marking.  This reintroduces |<del>|-and-significant
1173            problem again.
1174    
1175    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1176    
1177            * ContentChecker.pm (check_element): Make semi-transparent
1178            elements ignored for the purpose of phase changes in
1179            content model checking.
1180    
1181    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1182    
1183            * ContentChecker.pm (check_element): In-element state
1184            was not properly managed for transparent cases.
1185    
1186    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1187    
1188            * ContentChecker.pm (check_element): Support for |video|
1189            and |audio| as semi-transparent elements.
1190    
1191    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1192    
1193            * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1194            (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1195            Support for |html:object| as a semi-transparent element.
1196    
1197    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1198    
1199            * ContentChecker.pm (check_element): The way to traverse
1200            the tree is entirely revised to make it easier to track
1201            the state of ancestors/descendants.  As a result of this
1202            revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1203            support for content model checking for HTML elements |figure|,
1204            |object|, |video|, and |audio| and checking for XML elements (and
1205            some XMLNS checkings) are dropped for now.  They will be
1206            reimplemented in due cource.
1207    
1208    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1209    
1210            * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1211            token (HTML5 revision 1225).
1212    
1213    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1214    
1215            * ContentChecker.pm ({unsupported_level}): New value.
1216    
1217            * HTML.pm.src: Save whether |meta| |content| attribute
1218            contains character references or not.
1219    
1220    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1221    
1222            * ContentChecker.pm (_get_children): (Incompleted) attempt to
1223            imlement significant content checking for contents
1224            with |del| elements.
1225    
1226    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1227    
1228            * ContenteChecker.pm ($HTMLTransparentElements): More
1229            elements are added.
1230            (_get_children): HTML |object| elements are now semi-transparent.
1231    
1232            * NanoDOM.pm (manakai_html, manakai_head): New methods.
1233    
1234    2008-02-16  Wakaba  <wakaba@suika.fam.cx>
1235    
1236            * CacheManifest.pm: HTML5 revision 1211 implemented.
1237    
1238            * CacheManifest.pod: Updated.
1239    
1240    2008-02-10  Wakaba  <wakaba@suika.fam.cx>
1241    
1242            * ContentChecker.pm (check_document, check_element): Support
1243            for second argument ($onsubdoc).
1244            (_get_css_parser): Removed (now it is part of WDCC).
1245    
1246    2008-02-09  Wakaba  <wakaba@suika.fam.cx>
1247    
1248            * ContentChecker.pm (_get_css_parser): New.
1249    
1250    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1251    
1252            * ContentChecker.pm ($AnyChecker): Old way to add child elements
1253            for checking had been used.
1254    
1255    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1256    
1257            * ContentChecker.pm (check_element): New todo item type |descendant|.
1258    
1259    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1260    
1261            * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1262            specced in RFC 2046 and RFC 4288.
1263            (application/atom+xml): Definition added.
1264    
1265    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1266    
1267            * URIChecker.pm: Make RFC 3986 should-level errors
1268            warnings (rather than SHOULD-level errors).
1269    
1270    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1271    
1272            * NanoDOM.pm (get_user_data, set_user_data): New methods.
1273    
1274            * HTML.pm.src: A flag for character references in attribute
1275            values are added.  Set |manakai_has_reference| user data
1276            to |charset| attribute.
1277    
1278    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1279    
1280            * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1281            attributes.
1282    
1283            * ContentChecker.pm (check_document): Warn if charset requirements
1284            cannot be tested.
1285    
1286    2007-11-19  Wakaba  <wakaba@suika.fam.cx>
1287    
1288            * HTML.pm.src (parse_byte_string): Detect charset
1289            by universalchardet if charset parameter is not specified.
1290    
1291            * Makefile (Charset-all, Charset-clean): New rules.
1292    
1293    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1294    
1295            * ContentChecker.pm (check_document): Check the existence
1296            of character encoding declaration and selection of encoding
1297            for HTML document.
1298    
1299    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1300    
1301            * ContentType.pm (get_sniffed_type): Return also the official
1302            type in list context.
1303    
1304    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1305    
1306            * ContentType.pm: Sniffing with leading white space
1307            ignoring (HTML5 revisions 1013 and 1016).
1308    
1309    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1310    
1311            * ContentType.pm: HTML5 revision 1013 changes, except for leading
1312            white spaces, are implemented.
1313    
1314    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1315    
1316            * HTML.pm.src (parse_char_string): Set |inner_encoding|
1317            attribute if possible.
1318    
1319    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1320    
1321            * HTML.pm.src (parse_byte_string): New method.
1322            (parse_char_string): New alias for |parse_string|.
1323            (main phase): Invoking "change the encoding" algorithm if desired.
1324    
1325            * HTML.pod: Updated.
1326    
1327    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1328    
1329            * HTML.pod (get_inner_html): Removed.
1330    
1331            * Makefile (HTML-all, HTML-clean): New.
1332    
1333    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1334    
1335            * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1336    
1337    2007-11-08  Wakaba  <wakaba@suika.fam.cx>
1338    
1339            * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1340    
1341    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1342    
1343            * Makefile: |CacheManifest.html| is added.
1344    
1345            * CacheManifest.pod: New file.
1346    
1347    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1348    
1349            * CacheManifest.pm: New module.
1350    
1351    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1352    
1353            * HTML.pm.src: Support for application cache selection algorithm
1354            callback.
1355    
1356    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1357    
1358            * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1359    
1360    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1361    
1362            * Makefile (clean): New rule.
1363    
1364            * NanoDOM.pm (public_id, system_id): New attributes.
1365    
1366    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1367    
1368            * Makefile (CSS-all, CSS-clean, clean): New rules.
1369    
1370    2007-10-14  Wakaba  <wakaba@suika.fam.cx>
1371    
1372            * ContentChecker.pm (check_document): Support for
1373            new |is_xml_root| flag.
1374            (check_element): Support for new |pluses| state.
1375            (_add_pluses): New method.
1376            (_remove_minuses): Support for new |minus| item.
1377    
1378    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1379    
1380            * ContentChecker.pm: Raise specific error for invalid
1381            root element.
1382    
1383    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1384    
1385            * ContentChecker.pm: Set level values for later uses.
1386    
1387    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1388    
1389            * ContentChecker.pm: Support for language tag validation.
1390    
1391    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1392    
1393            * LangTag.pm (check_rfc3066_language_tag): New method.
1394    
1395    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1396    
1397            * LangTag.pm: New module.
1398    
1399    2007-09-04  Wakaba  <wakaba@suika.fam.cx>
1400    
1401            * HTML.pm.src: Some error types were wrong.
1402    
1403    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1404    
1405            * CSS/: New directory.
1406    
1407    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1408    
1409            * ContentChecker.pm (_check_get_children): Support
1410            for |noscript| in |head|.
1411    
1412    2007-08-12  Wakaba  <wakaba@suika.fam.cx>
1413    
1414            * URI/: New directory.
1415    
1416    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1417    
1418            * HTML.pm.src: Tokenizer's states are now represented in
1419            number.
1420    
1421    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1422    
1423            * HTML.pm.src: |or|s for insertion modes are replaced
1424            by |&|s.
1425    
1426    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1427    
1428            * HTML.pm.src: Token types are now represented in number.
1429    
1430    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1431    
1432            * ContentType.pm (SEE ALSO): Updated.
1433    
1434            * HTML.pm.src: Insertion modes are now represented in number.
1435    
1436    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1437    
1438            * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1439            999) is implemented.
1440    
1441    2007-08-08  Wakaba  <wakaba@suika.fam.cx>
1442    
1443            * ContentType.pm: Sniffing for |<script| (HTML5 revision
1444            983) is implemented.
1445    
1446    2007-08-06  Wakaba  <wakaba@suika.fam.cx>
1447    
1448            * ContentChecker.pod: New documentation.
1449    
1450            * Makefile: A rule for |ContentChecker.html| is added.
1451    
1452            * ContentChecker.pm: A pod "LICENSE" section is added.
1453    
1454            * NanoDOM.pm ($VERSION): New variable.
1455    
1456    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1457    
1458            * H2H.pm: |b|, |i|, and |sub| are added to the
1459            list of allowed HTML elements.
1460    
1461    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1462    
1463            * H2H.pm: |samp| is added to the list of allowed
1464            HTML elements.
1465    
1466            * URIChecker.pm (check_iri): New.
1467            (check_iri_reference): Error type for IRI reference
1468            syntax error is changed.
1469    
1470    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1471    
1472            * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1473            is added.
1474            (check_document): Load appropriate module before validation.
1475    
1476    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1477    
1478            * ContentChecker/: New directory.
1479    
1480    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1481    
1482            * ContentChecker.pm: HTML |time| element is implemented.
1483    
1484            * HTMLTable.pm: Comments are updated as HTML5 is revised.
1485    
1486    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1487    
1488            * ContentChecker.pm (check_document): Return value
1489            even if no document element is found.
1490    
1491    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1492    
1493            * HTML.pm.src: |$in_body| is no longer a function.
1494    
1495    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1496    
1497            * HTML.pm.src: The |$in_body| code has been moved down.
1498    
1499    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1500    
1501            * HTML.pm.src: The "trailing end" insertion mode
1502            is split into "after html body" and "after html frameset"
1503            insertion modes.  Their codes are merged with "after body"
1504            and "after frameset" codes.  |$previous_insertion_mode|
1505            has been removed.  "after frameset" code is
1506            merged with "in frameset" code.
1507    
1508    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1509    
1510            * HTML.pm.src: The "before head" insertion mode is
1511            merged with the "in head" insertion mode.
1512    
1513    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1514    
1515            * HTML.pm.src: Cases in "in head" insertion mode are
1516            reorganized.
1517    
1518    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1519    
1520            * HTML.pm.src: Some cases in "in table" insertion mode
1521            are merged.
1522    
1523    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1524    
1525            * HTML.pm.src: The "in row" insertion mode is merged
1526            with "in table" insertion mode.
1527    
1528    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1529    
1530            * HTML.pm.src: The "in table" and "in table body" insertion
1531            modes are merged.
1532    
1533    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1534    
1535            * HTML.pm.src: There is no "in table head" or "in table foot"
1536            insertion mode!
1537    
1538    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1539    
1540            * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1541            now directly invoke the handler.
1542    
1543    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1544    
1545            * HTML.pm.src: Codes for "in cell" insertion mode
1546            is merged to the "in body" insertion mode code.
1547    
1548    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1549    
1550            * HTML.pm.src: Codes for "in body" and "in caption"
1551            insertion modes are merged.
1552    
1553    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1554    
1555            * HTML.pm.src: Two |!!!next-token|s were missing.
1556    
1557    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1558    
1559            * HTML.pm.src: Use numeric constant for |{content_mode}|
1560            instead of string constant for |{content_model_flag}|.
1561    
1562    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1563    
1564            * HTML.pm.src: Add the name of the attribute
1565            to the "duplicate attribute" error.
1566    
1567    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
1568    
1569            * ContentChecker.pm: Return the |class| node list.
1570    
1571    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
1572    
1573            * ContentChecker.pm: Return the |id| node list.
1574    
1575            * HTML.pm.src: A typo is fixed.
1576    
1577    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1578    
1579            * ContentChecker.pm: Drop wrong |level => 'error'| specification
1580            from "in HTML:xml:lang" error.  Character position
1581            is now the last part of the error type in the URI error
1582            description.  Report "unsupported" status for language
1583            tags, media queries, script codes, and style sheets.
1584    
1585    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1586    
1587            * ContentChecker.pm: Report error if |xml:lang|
1588            in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1589            in XML.
1590    
1591            * NanoDOM.pm (Attr.owner_document): New attribute.
1592    
1593    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1594    
1595            * HTML.pm.src: The character immediately following
1596            a bare |hcro| was discarded.  Fix handling of
1597            entity references in attribute values.
1598    
1599    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1600    
1601            * HTML.pm.src (main and trailin end phases): Token
1602            types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1603            factored out.  Error types |in html:#DOCTYPE|
1604            and |after html:#DOCTYPE| are merged
1605            into |DOCTYPE in the middle|.  |</frameset>|
1606            in fragment parsing mode changed the
1607            insertion mode.
1608    
1609    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1610    
1611            * HTML.pm.src: |$phase| has been removed; The |trailing end|
1612            phase is now an insertion mode.  Treatments for white
1613            space character tokens were incorrect for some
1614            insertion modes.  An old |meta| case was not removed.
1615    
1616    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1617    
1618            * HTML.pm.src: |meta| charset declaration extraction
1619            implemented (but changing the encoding is not yet:-).
1620    
1621    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1622    
1623            * Charset/: New directory.
1624    
1625    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1626    
1627            * H2H.pm: New Perl module (created from
1628            manakai's H2H.dis).
1629    
1630    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1631    
1632            * XMLSerializer.pm: New Perl module (created from
1633            manakai's SimpleLS.dis).
1634    
1635    2007-07-07  Wakaba  <wakaba@suika.fam.cx>
1636    
1637            * HTML.pm.src: |<!---x-->| was not processed correctly.
1638    
1639    2007-07-01  Wakaba  <wakaba@suika.fam.cx>
1640    
1641            * HTML.pm.src: Report correct error message
1642            for |<body></div></body>|.
1643    
1644  2007-07-01  Wakaba  <wakaba@suika.fam.cx>  2007-07-01  Wakaba  <wakaba@suika.fam.cx>
1645    
1646          * HTMLTable.pm: An error description was incorrect.          * HTMLTable.pm: An error description was incorrect.

Legend:
Removed from v.1.57  
changed lines
  Added in v.1.335

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24