/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.78 by wakaba, Sat Jul 21 08:52:59 2007 UTC revision 1.342 by wakaba, Sat Oct 4 09:17:54 2008 UTC
# Line 1  Line 1 
1    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
4    
5    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
6    
7            * HTML.pm.src: Support for new definition of |param| and |source|
8            start tag parsing (HTML5 revision 1731).
9    
10    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
11    
12            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
13    
14    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
15    
16            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
17            revisions 1731 and 1831).
18    
19    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
20    
21            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
22            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
23            revision 1731).
24    
25    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
26    
27            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
28            revision 1731).  End tags of |option| and |optgroup| elements are
29            now optional (HTML5 revision 1731).
30    
31    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
32    
33            * HTML.pm.src: New "special" elements added to the list (HTML5
34            revision 1778).  "strile" -> "strike".
35    
36    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
37    
38            * ContentType.pm (get_sniffed_type): Support for the "better"
39            content sniffing (HTML5 revision 1927).  In a case the official
40            type was not returned when the method is invoked in the list
41            context.
42    
43    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
44    
45            * HTML.pm.src: Character references for non-space C0 characters,
46            including U+000B VT, DEL character, noncharacter code points, are
47            now converted to the U+FFFD character (cf. HTML5 revision 2138).
48    
49    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
50    
51            * ContentChecker.pm: |form=""| check support added.
52    
53    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
54    
55            * ContentChecker.pm: |contextmenu| validness is now checked using
56            |id| and |id_type| properties, and |menu| property is removed.
57    
58    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
59    
60            * ContentChecker.pm: Prepare for |form| |name| attribute's
61            duplication checking.
62    
63    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
64    
65            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
66            unsupported charset="" parameter value (HTML5 revision 2131).
67    
68    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
69    
70            * HTML.pm.src: Reminding places where U+000B is allowed as a space
71            character is fixed (cf. HTML5 revision 1738).
72    
73            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
74            space characters (HTML5 revision 1738).
75    
76    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
77    
78            * HTML.pm.src: The "anything else" case for the "after after body"
79            insertion mode was not updated to swtich to the "in body"
80            insertion mode.  U+000B is no longer a space character for the
81            purpose of tree construction phase (HTML5 revision 1738).
82    
83    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
84    
85            * HTML.pm.src: U+000B is no longer a space character (HTML5
86            revision 1738).
87    
88    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
89    
90            * ContentType.pm: 0x0B is no longer a space character (HTML5
91            revision 1738).
92    
93            * HTML.pm.src: U+000B is no longer a space character for the
94            algorithm for extracting an encoding from a Content-Type (HTML5
95            revision 1738).
96    
97    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
98    
99            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
100    
101    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
102    
103            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
104    
105            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
106            matching.
107    
108    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
109    
110            * LangTag.pm: Warn for private use language subtags.  Error level
111            typos fixed.  Support for Suppress-Script field.
112    
113            * mklangreg.pl: Support for dumping of nested structure.
114    
115    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
116    
117            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
118            recommended case as per RFC 4646.
119    
120    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
121    
122            * LangTag.pm (check_rfc4646_langtag): New method.
123    
124    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
125    
126            * mklangreg.pl: New script.
127    
128            * Makefile: Updated for creation of the module for language subtag
129            registry.
130            
131    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
132    
133            * Makefile: WebIDL.html added.
134    
135            * WebIDL.pod: New documentation.
136    
137    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
138    
139            * WebIDL.pm: Checker's error types are redefined.
140    
141    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
142    
143            * WebIDL.pm: Parser's error types are redefined.  Some forward
144            compatible parsing bugs are fixed.  Some unreachable codes are
145            commented out.
146    
147    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
148    
149            * WebIDL.pm: Support for the reminding extended attributes are
150            added.  It does not satisfy the definition that a forward
151            interface declaration has an extended attribute.  It seems that
152            unless explicitly allowed multiple extended attributes with the
153            same name is not allowed, though it is not explicitly mentioned in
154            the spec.
155    
156    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
157    
158            * WebIDL.pm: Unescapes extended attribute names and extended
159            attribute identifiers.  Preserve whether an extended attribute has
160            an argument list of not.  Support for extended attributes:
161            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
162            NameGetter, NameSetter, and Null.
163            (has_argument_list): New attribute.
164            (idl_text): Stringifies argument lists, if any, even if it is
165            empty.
166    
167    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
168    
169            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
170            |{s_kwd}| in DATA_STATE as default.
171    
172    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
173    
174            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
175            by |{s_kwd}| in DATA_STATE.
176    
177    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
178    
179            * HTML.pm.src: Shorten keys.
180    
181    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
182    
183            * HTML.pm.src: Remove checking for control character, surrogate
184            pair, or noncharacter code points and non-Unicode code
185            points (they should be handled by Whatpm::Charset::UnicodeChecker).
186            (parse_char_stream): Support for the |$get_wrapper| argument and
187            character stream error handlers.
188    
189    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
190    
191            * ContentChecker.pm: Don't call |loda_ns_module|
192            for null-namespace elements/attributes.
193    
194            * HTML.pm.src: Fact out $disallowed_control_chars
195            as a hash.
196    
197    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
198    
199            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
200            and |{next_char}| initializations are moved to initialization
201            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
202            with |parse_char_stream|.
203    
204    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
205    
206            * HTML.pm.src (parse_char_stream): Make |set_next_char|
207            invoke |manakai_read_until|, not only |read|, where
208            possible, to decrease the number of |read| method calls.
209    
210            * mkhtmlparser.pl: Related changes to the aforementioned
211            modification.
212    
213    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
214    
215            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
216            would report character error from now.
217    
218    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
219    
220            * HTML.pm.src: White-space-leaded non-white-space character
221            tokens in "before head insertion mode" was not
222            correctly handled.
223            (set_inner_html): Reimplemented using CharString decodehandle
224            class.  Support for $get_wrapper argument.  Support
225            for |{read_until}| feature.
226    
227    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
228    
229            * HTML.pm.src: Make a "bare ero" error for unknown
230            entities point the "&" character.
231    
232    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
233    
234            * HTML.pm.src: It turns out that U+FFFD don't have to
235            be added to the list of excluded characters.
236    
237    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
238    
239            * HTML.pm.src ($char_onerror): Have character decoder's |line|
240            and |column| a higher priority than the one set by the
241            tokenizer's input handler.
242            ($self->{read_until}): Exclude U+FFFD (but this might
243            not be necessary, since now we do line/column fixup in
244            the character decode handle).
245    
246    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
247    
248            * HTML.pm.src: Use |{read_until}| where possible.
249    
250    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
251    
252            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
253            and |manakai_getc_until| to |manakai_read_until| to
254            reduce the number of string copies.
255    
256    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
257    
258            * HTML.pm.src (parse_char_string): Use newly created
259            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
260            standard feature to |open| a string as a filehandle,
261            since Perl's string filehandle seems not supporting |ungetc|
262            method correctly.
263            (parse_char_stream): Define |{getc_until}| method.
264            (DATA_STATE): Experimental support for |getc_until| feature.
265    
266    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
267    
268            * HTML.pm.src: Check points added to newly added branches.
269    
270    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
271    
272            * HTML.pm.src: Remove |{char}|, which is no longer used.
273            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
274            and replaced by |{prev_state}|.
275    
276            * mkhtmlparser.pl: Remove |{char}| feature.
277            Remove |!!!back-next-input-character;| macro.
278    
279    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
280    
281            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
282            entity related tokenizer states in favor of new states
283            implementing the consume character reference algorithm.
284    
285    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
286    
287            * HTML.pm.src: "Consume a character reference" algorithm is
288            now implemented as a tokenizer's state, rather than
289            a method, with minimum changes (more changes will
290            be made, in due course).  "Bogus comment state"'s inner
291            loop gets removed.
292    
293    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
294    
295            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
296            into their own tokenizer states.
297    
298    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
299    
300            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
301            is split into three states.
302    
303    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
304    
305            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
306            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
307            no longer does the tokenizer have to push back next input
308            characters in those states.
309    
310    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
311    
312            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
313            into four states so that no longer does the tokenizer have to push
314            back next input characters in that state.
315    
316    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
317    
318            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
319            which can be used to insert some wrapper between the character
320            stream handle and the tokenizer.  (It is currently not supported
321            for |set_inner_html| for |Element|s).
322    
323    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
324    
325            * HTML.pm.src: Ignore punctuations in charset names.
326    
327    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
328    
329            * ContentChecker.pm: Support for charset-layer error levels.
330    
331            * HTML.pm.src: Don't specify |text| argument for the
332            |chardecode:fallback| error, since it is not the encoding
333            being used alternatively.
334    
335    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
336    
337            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
338    
339    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
340    
341            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
342    
343    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
344    
345            * HTML.pm.src: Bug fix and sync with the spec with regard
346            to after after frameset insertion mode processing (HTML5
347            revision 1909).  Note that the implementation was wrong
348            per the old spec before the r1909 changes.
349    
350    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
351    
352            * HTMLTable.pm: scope=auto algorithm fix synced with the
353            spec (HTML5 revision 2093).
354            ($process_row): Algorithm step numbers synced with the
355            spec (HTML5 revision 2092).
356    
357    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
358    
359            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
360            revision 2094).
361    
362    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
363    
364            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
365    
366    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
367    
368            * HTML.pm.src: '"' and "'" at the end of attribute
369            name (after another attribute) now raise parse error (HTML5
370            revision 2123).  Empty unquoted attribute values are no
371            longer allowed (HTML5 revision 2122).
372    
373    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
374    
375            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
376            revision 2130).
377    
378    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
379    
380            * ContentChecker.pm: |xml:lang| attribute value must be same
381            as |lang| attribute value for HTML elements (HTML5 revision 2062
382            and so on).
383    
384    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
385    
386            * ContentChecker.pm: Error level definition for |xml_id_error|
387            was missing.
388    
389            * URIChecker.pm: The end of the URL should be marked as the
390            error location for an empty path error.  The position
391            between the userinfo and the port components should be
392            marked as the error location for an empty host error.
393    
394    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
395    
396            * URIChecker.pm: Set parameters representing where in the
397            value the error occurs for errors.  Report unknown
398            address format error in warning level, since address
399            formats are rarely added.  Path segments starting with "/.."
400            were misinterpreted as a dot-segment.
401    
402    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
403    
404            * URIChecker.pm (check_iri_reference): Requires
405            |Message::DOM::DOMImplementation|.
406    
407    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
408    
409            * IMTChecker.pm: Updated for the new error reporting architecture.
410    
411            * ContentChecker.pm: Error levels for IMTs are added.
412    
413    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
414    
415            * H2H.pm (_shift_token): Support for unquoted HTML attribute
416            values.
417    
418    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
419    
420            * CacheManifest.pm: Support for new style of error
421            reports.
422    
423            * HTML.pm.src: Set line=1, column=1 to the document node.
424    
425    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
426    
427            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
428            and URL checkers.  Support for more error levels for bogus
429            langauge tag and URL "standards".
430    
431            * LangTag.pm, URIChecker.pm: Support for new style error
432            level reporting.
433    
434    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
435    
436            * ContentChecker.pm: Support for RDF/XML error levels.
437    
438            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
439            specifying.  Error types are revised.
440    
441    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
442    
443            * ContentChecker.pm: All error reporting method calls are
444            renewed.
445    
446    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
447    
448            * HTML.pm.src: All error type names and "text" parameters
449            are revised.  Use new style for "level" specification.
450    
451            * mkhtmlparser.pl: Use new style for "level" specification.
452    
453    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
454    
455            * WebIDL.pm (parse_char_string): Simplified error
456            reporting process for broken ignored valuetype definition.
457            (Valuetype idl_text): Support for special "DOMString" name.
458    
459    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
460    
461            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
462            terminal of the ScopedName is "DOMString", such that whether
463            the last part of the scoped name is "DOMString" or "_DOMString"
464            later.  It is necessary to determine whether a |typedef|
465            definition should be ignored or not.
466            (parse_char_string): Unescape the identifier of
467            exception members.
468            ($resolve): Return undef for builtin types and sequence<T>
469            types (we might not have to do this, however...).
470            (check): Support checking for Exceptions, Valuetypes,
471            and Typedefs.
472            ($serialize_type): Support for "DOMString::::" syntax.
473            (Typedef idl_text): Output Type as "DOMString" if it
474            is really "DOMString" (i.e. its internal representation
475            is "::DOMString::").
476    
477    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
478    
479            * WebIDL.pm ($resolve): New code, based on resolve code
480            for constant types in the |check| method.
481            (check): Support for checking of attributes, operations, and
482            arguments.
483            (Attribute/Operation idl_text): Exception names in getraises,
484            setraises, and raises clauses is serizlied by |$serialize_type|
485            code.
486    
487    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
488    
489            * WebIDL.pm ($integer): Order of selections are changed to match
490            hexadecimal numbers (the original pattern, taken from the spec,
491            was not work for hexadecimal numbers, because the "0" prefix
492            matches to the [0-7]* part (as an empty string) and therefore
493            it does not match with remaining "x..." part of a "0x..." integer
494            literal.
495            ($get_type): It now returns a string, not an array reference,
496            for regular types and |sequence| types (i.e. it in any case
497            returns a string).
498            ($get_next_token): The second item in the array that represents
499            a integer or float token is now a Perl number value, not the
500            original string representation of the number.
501            (check): Support for const value consistency checking.
502            No extended attribute is defined for constants.
503            (Node subclasses): Use simple strings rather than array references
504            for default data type values.
505            ($serialize_type): Type values are now simple strings.
506            (value): If the new attribute value is a false value, then
507            a FALSE value is set to the attribute.
508    
509    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
510    
511            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
512            in its stringified format ("scoped name" as defined in the
513            spec).  Note that future version of this module should not use
514            array references for type values and the |type_text| attribute
515            should be made obsolete.
516            (parse_char_string): Unescape attribute names.
517            (check): Support for checking of whether inherited interfaces
518            are actually defined or not.  Support for checking of whether
519            interface member identifiers are duplicated or not.
520            ($serialize_type): Scoped names are returned as is.  A future
521            version of this code should escape identifiers other than "DOMString",
522            otherwise the idl_text would be non-conforming.
523    
524    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
525    
526            * WebIDL.pm (parse_char_string): Set line/column numbers
527            to generated nodes.  Unescape identifiers.  Extended attributes
528            for Definition's were ignored.
529            (append_child): Set |parent_node| attribute.
530            (parent_node): New attribute.
531            (check): Support interface/exception members.  Support
532            extended attributes.  Support definition identifier uniqueness
533            constraint.
534            (qualified_name): New attribute.
535            (Interface/Exception idl_text): Extended attributes were
536            not prepended to the returned text.
537    
538    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
539    
540            * WebIDL.pm (parse_char_string): Set line/column numbers
541            to interface object experimentally.  s/shift/pop/g, shift
542            would make things wrong.  Support for interface forward
543            declarations was missing.  Broken interface declarations
544            with no block were not ignored entirely.
545            (Whatpm::WebIDL::Node): New abstract class.  This class
546            makes things easier.
547            (child_nodes): New attribute.  Unlike DOM's attribute with
548            same name, this attribute returns a dead list of nodes for
549            simplicity.
550            (get_user_data, set_user_data): New methods.
551            (Module idl_text): A SPACE character should be inserted
552            before the |{| character.
553            (Interface idl_text): Support for interface forward declarations.
554            (is_forward_declaration): New attribute.
555    
556    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
557    
558            * WebIDL.pm (type_text): Better serializer.
559    
560    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
561    
562            * WebIDL.pm: Revise forward-compatible parsing so that
563            it now can handle broken extended attributes and as such.
564    
565    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
566    
567            * WebIDL.pm: Real support for extended attributes.
568            Support for extended attributes with arguments.
569    
570    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
571    
572            * WebIDL.pm: Support for |exception| syntax.
573            (Interface->idl_text): Tentative support for inheritances.
574    
575    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
576    
577            * WebIDL.pm: Hierarchical scoped name support was broken.
578            Support for raises, setraises, and getraises syntaxes.
579    
580    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
581    
582            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
583            proper support for types, extended attributes, and exceptions yet).
584            WebIDL parser, version 1 (no support for exceptions yet,
585            no proper support for extended attributes yet).
586    
587    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
588    
589            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
590    
591    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
592    
593            * WebIDL.pm: Support for valuetype and const.
594    
595    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
596            
597            * WebIDL.pm: New module.
598    
599    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
600    
601            * Makefile (Entities.html): URI changed.
602    
603    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
604    
605            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
606    
607    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
608    
609            * HTML.pm.src (_get_next_token): A parse error was missing.
610    
611    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
612    
613            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
614            spec (commented out). (HTML5 revision 1711).
615    
616    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
617    
618            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
619    
620            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
621            as UTF-16 (HTML5 revision 1701).
622    
623    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
624    
625            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
626            1692).
627    
628    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
629    
630            * HTML.pm.src: The secondary insertion mode used when switching
631            to foreign content is the "in body" insertion mode (HTML5 revision
632            1696).
633    
634    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
635    
636            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
637            revision 1697).
638    
639    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
640    
641            * HTML.pm.src: Support for end-of-file token in foreign content
642            insertion mode (HTML5 revision 1693).  Update SVG camelCase
643            attribute list (HTML5 revision 1700).  <textarea> closes
644            </select> (HTML5 revision 1699).  More start tags close in
645            foreign content insertion mode (HTML5 revision 1698).
646    
647    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
648    
649            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
650    
651    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
652    
653            * HTML.pm.src: More robust charset parameter detection (HTML5
654            revision 1674).
655    
656    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
657    
658            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
659            revision 1676).
660    
661    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
662    
663            * HTML.pm.src: Ignore language part of public identifiers for
664            quriks mode detection (HTML5 revision 1679).
665    
666    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
667    
668            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
669            revision 1685).
670    
671    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
672    
673            * HTML.pm.src: Support for EOF in new states for tags (HTML5
674            revision 1684).
675    
676    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
677    
678            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
679            work (HTML5 revision 1690).
680    
681    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
682    
683            * HTML.pm.src (_tree_construction_main): Change handling of
684            end tags in head insertion modes (HTML5 revision 1686).
685            (parse_char_string): Bug fix for non-utf8 character string handlings.
686            (parse_char_stream): |ungetc| does not work well for this context.
687    
688    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
689    
690            * HTML.pm.src (parse_byte_string): Redefined to invoke
691            |parse_byte_stream|.
692            (parse_byte_stream): New method.
693    
694    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
695    
696            * HTML.pm.src (parse_byte_string): Fix the column number reported
697            by encoding layer error reporter.
698    
699    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
700    
701            * HTML.pm.src (parse_byte_string): Use streaming decoder
702            rather than converting the whole byte string and then parsing.
703            Propagate errors in character encoding layer.
704            (get_next_token): Precise error reporting for |bare stago| error.
705    
706    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
707    
708            * HTML.pm.src (parse_char_stream): New method.
709            (parse_char_string): This method is now defined as an invocation
710            of the |parse_char_stream| method.
711    
712    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
713    
714            * HTML.pm.src (parse_byte_string): Report various status
715            of the sniffing as info-level errors.  Support for new
716            decoding framework in parser resestting.
717            (new): Various default error levels were not set.
718    
719    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
720    
721            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
722            algorithm, except for the actual sniffing, is implemented
723            with new framework with Message::Charset::Info.
724    
725    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
726    
727            * CacheManifest.pm (_parse): Drop fragment identifiers from
728            URIs in fallback section (HTML5 revision 1596).
729    
730    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
731    
732            * Makefile (Entities.html): URI has changed.
733    
734    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
735    
736            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
737            1553).
738    
739    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
740    
741            * ContentChecker.pm: Noted that those returned in |table| are
742            no longer table elements, but table objects returned
743            by Whatpm::HTMLTable.
744    
745            * HTMLTable.pm (form_table): Return table element node
746            as |$table->{element}|.
747            (assign_header): Support for the |headers=""| attribute.
748    
749    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
750    
751            * HTMLTable.pm (assign_header): New function; first version
752            with no support for headers="".
753            (form_table): Include table width and height to the returned
754            table object for covenience.  Indexing in column assignement was wrong.
755            Set whether a data cell is empty or not for convenience.
756    
757    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
758    
759            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
760    
761    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
762    
763            * HTMLTable.pm: How table model errors are detected is
764            changed (HTML5 revision 1387).
765    
766    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
767    
768            * HTMLTable.pm: The algorithm now moves |tfoot| elements
769            to the end of the table (HTML5 revision 1380).
770    
771    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
772    
773            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
774            of 1-based (HTML5 revision 1376).
775    
776    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
777    
778            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
779            and "Content-Encoding" no longer prevent sniffing (HTML5
780            revision 1288).
781    
782    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
783    
784            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
785            1282).
786    
787    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
788    
789            * ContentChecker.pm: Support for global attributes.
790            Status of XML specs are added.
791    
792    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
793    
794            * ContentChecker.pm (check_element): Support for |template=""|
795            and |ref=""| attribute (referent element type checking).
796    
797    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
798    
799            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
800            1500) is implemented (except for IDNA part and URI-scheme-specific
801            knowledge).  Line number counting was wrong for LF-only
802            documents.
803    
804    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
805    
806            * HTML.pm.src: Raise an parse error for any disallowed
807            character (HTML5 revision 1263).
808    
809    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
810    
811            * mkentitylist.pl: Support for new HTML5 entity table format (the
812            definition for |AElig;| was missing).
813    
814    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
815    
816            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
817            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
818    
819    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
820    
821            * HTML.pm.src: List of element names that close foreign content
822            insertion mode is added (HTML5 revisions 1412 and 1418).
823    
824    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
825    
826            * HTML.pm.src: Support for |mglyph| and |malignmark|
827            elements (HTML5 revision 1410).
828    
829    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
830    
831            * HTML.pm.src: Support for new long MathML entities (HTML5
832            revision 1406).
833    
834    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
835    
836            * HTML.pm.src: CDATA section support for MathML and SVG
837            elements (HTML5 revision 1404 and 1420).
838    
839    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
840    
841            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
842            elements (HTML5 revision 1404).  Unused !!!macro definitions
843            are removed.
844    
845    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
846    
847            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
848            are raised is changed (HTML5 revision 1404).
849    
850    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
851    
852            * HTML.pm.src: Category sets that are no longer used are removed.
853    
854    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
855    
856            * HTML.pm.src: The ->[1] property of stack entries are now
857            replaced by constants representing element category.
858    
859    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
860    
861            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
862            for error reporting.  (This is a preparation for using constant
863            value for ->[1].)
864    
865    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
866    
867            * RDFXML.pm: Typo fixed.
868    
869    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
870    
871            * ContentChecker.pm: |fact_level| is now treated
872            as same as |must_level|, i.e. level = |m|.
873            (check_element): Make list of URIs in the DOM.
874    
875    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
876    
877            * RDFXML.pm: Language accessor implemented.
878            Local (null-namespace) attribute support.
879    
880    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
881    
882            * RDFXML.pm: Factored out ID checking code.
883    
884    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
885    
886            * RDFXML.pm: TODO items noted.  Validation of ID
887            and URI attributes is implemented.  Warn if unknown
888            value is used in rdf:parseType="" attribute.
889    
890            * URIChecker.pm (check_rdf_uri_reference): New function.
891    
892    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
893    
894            * RDFXML.pm: bnodeid implemented.  Relative references
895            are now resolved.
896    
897    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
898    
899            * ContentChecker.pm: RDF reification implemented.
900    
901            * RDFXML.pm: undef vs false bug fixed.
902            Reification implemented.
903    
904    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
905    
906            * RDFXML.pm: s/id/ID/ for attribute name.
907            The |node| arguments are added for |ontriple| calls.
908            Too many "attribute not allowed" errors were raised.
909    
910            * ContentChecker.pm: Initial experimental support for rdf:RDF
911            element.
912    
913    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
914    
915            * RDFXML.pm: New module.
916    
917    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
918    
919            * HTML.pm.src (set_inner_html): Line/column number
920            code was old one yet.
921    
922    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
923    
924            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
925            errors.  Line/column markings for DOCTYPE, comment, and
926            character tokens are reintroduced; otherwise,
927            error location for "not HTML5" error and errors
928            for implied elements are not attached.
929    
930    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
931    
932            * HTML.pm.src: Set line/column numbers to attributes.
933    
934            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
935            (value): Setter implemented.
936    
937            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
938    
939    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
940    
941            * HTML.pm.src: Unused line/column markings are removed.
942    
943    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
944    
945            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
946            flag, which is no longer used.
947    
948    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
949    
950            * HTML.pm.src: Set line/column information to element nodes.
951    
952            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
953            and !!!insert-element-t): Set line/column information to
954            element nodes.
955    
956    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
957    
958            * HTML.pm.src (_get_next_token): The first "<" character
959            in "<?", "<>", or "</>" should be the error point.
960    
961    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
962    
963            * HTML.pm.src: Some more fixes on error position reporting.
964    
965    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
966    
967            * HTML.pm.src: Token-level precious error reporting.
968    
969    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
970    
971            * HTML.pm.src: Preparation for more precious error point
972            reporting.
973    
974    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
975    
976            * HTML.pm.src: Error type revised.
977    
978    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
979    
980            * HTML.pm.src: Similar codes are merged together, again.
981    
982    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
983    
984            * HTML.pm.src: Similar codes are merged together.
985    
986    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
987    
988            * mkhtmlparser.pl: Set "level" parameter to parse errors.
989    
990            * HTML.pm.src: Code refined.
991    
992    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
993    
994            * HTML.pm.src: |</body>| treatement has been changed (HTML5
995            revision 1348).  Note that I really don't know this makes
996            any difference in the black-box behavior of the parser.
997    
998    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
999    
1000            * HTML.pm.src: New end-of-file token implementation (HTML5
1001            revision 1348).
1002    
1003    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1004    
1005            * HTML.pm.src: |applet| support (HTML5 revision 1347).
1006    
1007    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1008    
1009            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1010    
1011    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1012    
1013            * HTML.pm.src: Support for |<input>| in the "in select" insertion
1014            mode and support for the "in select in table" insertion mode (HTML5
1015            revision 1342).
1016    
1017    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1018    
1019            * HTML.pm.src: No foster parenting for <script> and <script>
1020            in non-tainted <table>s (HTML5 revision 1336).
1021    
1022    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1023    
1024            * HTML.pm.src: Ignore white space characters between <html>
1025            and <head> (HTML5 revision 1332).
1026    
1027    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1028    
1029            * HTML.pm.src: Treat <input type=hidden> as if it were a
1030            white space (HTML5 revision 1331).
1031    
1032    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1033    
1034            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1035            element (HTML5 revision 1330).
1036    
1037    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1038    
1039            * HTML.pm.src: <title> is always appended to the current
1040            element (HTML5 revision 1328).
1041    
1042    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1043    
1044            * HTML.pm.src: White space in tainted tables are moved
1045            into foster parents (HTML5 revision 1326).
1046    
1047    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1048    
1049            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1050            revision 1321).
1051    
1052    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1053    
1054            * HTML.pm.src: |</h/n/>| case code rearranged to align with
1055            the spec (HTML5 revision 1320).  Note that we finally complete
1056            all of HTML5 revision 1320 changes.
1057    
1058    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1059    
1060            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1061            tags (HTML5 revision 1320).
1062    
1063    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1064    
1065            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1066    
1067    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1068    
1069            * HTML.pm.src: A "generate implied end tags" code (t409.1)
1070            could not be reached so that it is now removed (HTML5 revision
1071            1320).
1072    
1073    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1074    
1075            * HTML.pm.src: Code for the case of |</div>| and so on are revised
1076            to align with new spec text (HTML5 revision 1320).
1077    
1078    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1079    
1080            * HTML.pm.src: Remove strange |if| condition; however, it should
1081            have had no harm in theory.
1082    
1083    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1084    
1085            * HTML.pm.src (_tree_construction_main): '</p>' in body
1086            case is split from other end tags for the preparation
1087            of implementing HTML revision 1320.
1088    
1089    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1090    
1091            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1092            revision 1320).
1093    
1094    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1095    
1096            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1097            and "div" (HTML5 revision 1318).  Add comments to where
1098            |form| pointer association codes should be inserted (HTML5 revision
1099            1319).
1100    
1101    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
1102    
1103            * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1104    
1105    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1106    
1107            * HTML.pm.src: Since the case t268 should never be reached (no
1108            other token type, there are), it is replaced by a |die| statement.
1109    
1110    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1111    
1112            * HTML.pm.src: Typo fixed.
1113    
1114    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
1115    
1116            * HTML.pm.src (_tree_construction_initial): Some limited quirks
1117            doctypes were not uppercased for comparison.
1118    
1119    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1120    
1121            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1122            are added.
1123    
1124    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1125    
1126            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1127            are set.  Cases that are unlikely reached are noted as so.
1128    
1129    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1130    
1131            * HTML.pm.src: Checkpoints for debugging are added.
1132    
1133            * mkhtmlparser.pl: Support for |!!!cp| syntax.
1134    
1135    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1136    
1137            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1138            for simplicity.
1139    
1140    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1141    
1142            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1143            revision 1307).
1144    
1145    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1146    
1147            * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1148            is set are changed (HTML5 revision 1305).
1149    
1150    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1151    
1152            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1153            1292).  Entities are not parsed in comment-like part in RCDATA
1154            elements (HTML5 revision 1294).  Allow bare & at the end
1155            of attribute value literals (HTML5 revision 1296).  More
1156            quirks mode doctypes (HTML5 revision 1302).  Requires spaces
1157            between attributes and ban attribute names or unquoted
1158            attribute values containing single or double quotes (HTML5
1159            revision 1303).
1160    
1161    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1162    
1163            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
1164            and related errors unless it is an HTML document (though the spec
1165            is unclear on whether it is applied to XHTML document).
1166    
1167            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1168            are added.
1169    
1170    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1171    
1172            * HTML.pm.src: s/local_name/manakai_local_name/g.
1173    
1174    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
1175    
1176            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1177    
1178            * HTML.pm.src: |charset| in |content| attribute is
1179            case-insensitive (HTML5 revision 1270).
1180    
1181    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
1182    
1183            * ContentChecker.pm: New status constants are added.
1184            ($ElementDefault): |status| added.
1185            (check_element): Err for non-standard or deprecated elements.
1186            (_attr_status_info): For non-standard or deprecated attributes.
1187    
1188    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1189    
1190            * ContentChecker.pm (_attr_status_info): New internal method.
1191    
1192    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1193    
1194            * ContentChecker.pm (check_element): Element standardized
1195            status information is now dispatched.
1196    
1197    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1198    
1199            * ContentChecker.pm (check_element): Fix |del|-and-significant
1200            problem by adding some more arguments.
1201    
1202    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1203    
1204            * ContentChecker.pm (check_element): Use context of
1205            container-for-the-purpose-of-content-model element (not transparent
1206            element) for |check_child_element| calling and significant
1207            text flag marking.  This reintroduces |<del>|-and-significant
1208            problem again.
1209    
1210    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1211    
1212            * ContentChecker.pm (check_element): Make semi-transparent
1213            elements ignored for the purpose of phase changes in
1214            content model checking.
1215    
1216    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1217    
1218            * ContentChecker.pm (check_element): In-element state
1219            was not properly managed for transparent cases.
1220    
1221    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1222    
1223            * ContentChecker.pm (check_element): Support for |video|
1224            and |audio| as semi-transparent elements.
1225    
1226    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1227    
1228            * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1229            (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1230            Support for |html:object| as a semi-transparent element.
1231    
1232    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1233    
1234            * ContentChecker.pm (check_element): The way to traverse
1235            the tree is entirely revised to make it easier to track
1236            the state of ancestors/descendants.  As a result of this
1237            revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1238            support for content model checking for HTML elements |figure|,
1239            |object|, |video|, and |audio| and checking for XML elements (and
1240            some XMLNS checkings) are dropped for now.  They will be
1241            reimplemented in due cource.
1242    
1243    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1244    
1245            * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1246            token (HTML5 revision 1225).
1247    
1248    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1249    
1250            * ContentChecker.pm ({unsupported_level}): New value.
1251    
1252            * HTML.pm.src: Save whether |meta| |content| attribute
1253            contains character references or not.
1254    
1255    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1256    
1257            * ContentChecker.pm (_get_children): (Incompleted) attempt to
1258            imlement significant content checking for contents
1259            with |del| elements.
1260    
1261    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1262    
1263            * ContenteChecker.pm ($HTMLTransparentElements): More
1264            elements are added.
1265            (_get_children): HTML |object| elements are now semi-transparent.
1266    
1267            * NanoDOM.pm (manakai_html, manakai_head): New methods.
1268    
1269    2008-02-16  Wakaba  <wakaba@suika.fam.cx>
1270    
1271            * CacheManifest.pm: HTML5 revision 1211 implemented.
1272    
1273            * CacheManifest.pod: Updated.
1274    
1275    2008-02-10  Wakaba  <wakaba@suika.fam.cx>
1276    
1277            * ContentChecker.pm (check_document, check_element): Support
1278            for second argument ($onsubdoc).
1279            (_get_css_parser): Removed (now it is part of WDCC).
1280    
1281    2008-02-09  Wakaba  <wakaba@suika.fam.cx>
1282    
1283            * ContentChecker.pm (_get_css_parser): New.
1284    
1285    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1286    
1287            * ContentChecker.pm ($AnyChecker): Old way to add child elements
1288            for checking had been used.
1289    
1290    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1291    
1292            * ContentChecker.pm (check_element): New todo item type |descendant|.
1293    
1294    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1295    
1296            * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1297            specced in RFC 2046 and RFC 4288.
1298            (application/atom+xml): Definition added.
1299    
1300    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1301    
1302            * URIChecker.pm: Make RFC 3986 should-level errors
1303            warnings (rather than SHOULD-level errors).
1304    
1305    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1306    
1307            * NanoDOM.pm (get_user_data, set_user_data): New methods.
1308    
1309            * HTML.pm.src: A flag for character references in attribute
1310            values are added.  Set |manakai_has_reference| user data
1311            to |charset| attribute.
1312    
1313    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1314    
1315            * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1316            attributes.
1317    
1318            * ContentChecker.pm (check_document): Warn if charset requirements
1319            cannot be tested.
1320    
1321    2007-11-19  Wakaba  <wakaba@suika.fam.cx>
1322    
1323            * HTML.pm.src (parse_byte_string): Detect charset
1324            by universalchardet if charset parameter is not specified.
1325    
1326            * Makefile (Charset-all, Charset-clean): New rules.
1327    
1328    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1329    
1330            * ContentChecker.pm (check_document): Check the existence
1331            of character encoding declaration and selection of encoding
1332            for HTML document.
1333    
1334    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1335    
1336            * ContentType.pm (get_sniffed_type): Return also the official
1337            type in list context.
1338    
1339    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1340    
1341            * ContentType.pm: Sniffing with leading white space
1342            ignoring (HTML5 revisions 1013 and 1016).
1343    
1344    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1345    
1346            * ContentType.pm: HTML5 revision 1013 changes, except for leading
1347            white spaces, are implemented.
1348    
1349    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1350    
1351            * HTML.pm.src (parse_char_string): Set |inner_encoding|
1352            attribute if possible.
1353    
1354    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1355    
1356            * HTML.pm.src (parse_byte_string): New method.
1357            (parse_char_string): New alias for |parse_string|.
1358            (main phase): Invoking "change the encoding" algorithm if desired.
1359    
1360            * HTML.pod: Updated.
1361    
1362    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1363    
1364            * HTML.pod (get_inner_html): Removed.
1365    
1366            * Makefile (HTML-all, HTML-clean): New.
1367    
1368    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1369    
1370            * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1371    
1372    2007-11-08  Wakaba  <wakaba@suika.fam.cx>
1373    
1374            * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1375    
1376    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1377    
1378            * Makefile: |CacheManifest.html| is added.
1379    
1380            * CacheManifest.pod: New file.
1381    
1382    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1383    
1384            * CacheManifest.pm: New module.
1385    
1386    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1387    
1388            * HTML.pm.src: Support for application cache selection algorithm
1389            callback.
1390    
1391    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1392    
1393            * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1394    
1395    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1396    
1397            * Makefile (clean): New rule.
1398    
1399            * NanoDOM.pm (public_id, system_id): New attributes.
1400    
1401    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1402    
1403            * Makefile (CSS-all, CSS-clean, clean): New rules.
1404    
1405    2007-10-14  Wakaba  <wakaba@suika.fam.cx>
1406    
1407            * ContentChecker.pm (check_document): Support for
1408            new |is_xml_root| flag.
1409            (check_element): Support for new |pluses| state.
1410            (_add_pluses): New method.
1411            (_remove_minuses): Support for new |minus| item.
1412    
1413    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1414    
1415            * ContentChecker.pm: Raise specific error for invalid
1416            root element.
1417    
1418    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1419    
1420            * ContentChecker.pm: Set level values for later uses.
1421    
1422    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1423    
1424            * ContentChecker.pm: Support for language tag validation.
1425    
1426    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1427    
1428            * LangTag.pm (check_rfc3066_language_tag): New method.
1429    
1430    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1431    
1432            * LangTag.pm: New module.
1433    
1434    2007-09-04  Wakaba  <wakaba@suika.fam.cx>
1435    
1436            * HTML.pm.src: Some error types were wrong.
1437    
1438    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1439    
1440            * CSS/: New directory.
1441    
1442    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1443    
1444            * ContentChecker.pm (_check_get_children): Support
1445            for |noscript| in |head|.
1446    
1447    2007-08-12  Wakaba  <wakaba@suika.fam.cx>
1448    
1449            * URI/: New directory.
1450    
1451    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1452    
1453            * HTML.pm.src: Tokenizer's states are now represented in
1454            number.
1455    
1456    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1457    
1458            * HTML.pm.src: |or|s for insertion modes are replaced
1459            by |&|s.
1460    
1461    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1462    
1463            * HTML.pm.src: Token types are now represented in number.
1464    
1465    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1466    
1467            * ContentType.pm (SEE ALSO): Updated.
1468    
1469            * HTML.pm.src: Insertion modes are now represented in number.
1470    
1471    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1472    
1473            * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1474            999) is implemented.
1475    
1476    2007-08-08  Wakaba  <wakaba@suika.fam.cx>
1477    
1478            * ContentType.pm: Sniffing for |<script| (HTML5 revision
1479            983) is implemented.
1480    
1481    2007-08-06  Wakaba  <wakaba@suika.fam.cx>
1482    
1483            * ContentChecker.pod: New documentation.
1484    
1485            * Makefile: A rule for |ContentChecker.html| is added.
1486    
1487            * ContentChecker.pm: A pod "LICENSE" section is added.
1488    
1489            * NanoDOM.pm ($VERSION): New variable.
1490    
1491    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1492    
1493            * H2H.pm: |b|, |i|, and |sub| are added to the
1494            list of allowed HTML elements.
1495    
1496    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1497    
1498            * H2H.pm: |samp| is added to the list of allowed
1499            HTML elements.
1500    
1501            * URIChecker.pm (check_iri): New.
1502            (check_iri_reference): Error type for IRI reference
1503            syntax error is changed.
1504    
1505    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1506    
1507            * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1508            is added.
1509            (check_document): Load appropriate module before validation.
1510    
1511    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1512    
1513            * ContentChecker/: New directory.
1514    
1515    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1516    
1517            * ContentChecker.pm: HTML |time| element is implemented.
1518    
1519            * HTMLTable.pm: Comments are updated as HTML5 is revised.
1520    
1521    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1522    
1523            * ContentChecker.pm (check_document): Return value
1524            even if no document element is found.
1525    
1526    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1527    
1528            * HTML.pm.src: |$in_body| is no longer a function.
1529    
1530    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1531    
1532            * HTML.pm.src: The |$in_body| code has been moved down.
1533    
1534    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1535    
1536            * HTML.pm.src: The "trailing end" insertion mode
1537            is split into "after html body" and "after html frameset"
1538            insertion modes.  Their codes are merged with "after body"
1539            and "after frameset" codes.  |$previous_insertion_mode|
1540            has been removed.  "after frameset" code is
1541            merged with "in frameset" code.
1542    
1543    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1544    
1545            * HTML.pm.src: The "before head" insertion mode is
1546            merged with the "in head" insertion mode.
1547    
1548    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1549    
1550            * HTML.pm.src: Cases in "in head" insertion mode are
1551            reorganized.
1552    
1553    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1554    
1555            * HTML.pm.src: Some cases in "in table" insertion mode
1556            are merged.
1557    
1558    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1559    
1560            * HTML.pm.src: The "in row" insertion mode is merged
1561            with "in table" insertion mode.
1562    
1563  2007-07-21  Wakaba  <wakaba@suika.fam.cx>  2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1564    
1565          * HTML.pm.src: The "in table" and "in table body" insertion          * HTML.pm.src: The "in table" and "in table body" insertion

Legend:
Removed from v.1.78  
changed lines
  Added in v.1.342

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24