/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.64 by wakaba, Mon Jul 16 03:21:04 2007 UTC revision 1.309 by wakaba, Sun Sep 14 14:35:43 2008 UTC
# Line 1  Line 1 
1    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
4            and |{next_char}| initializations are moved to initialization
5            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
6            with |parse_char_stream|.
7    
8    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
9    
10            * HTML.pm.src (parse_char_stream): Make |set_next_char|
11            invoke |manakai_read_until|, not only |read|, where
12            possible, to decrease the number of |read| method calls.
13    
14            * mkhtmlparser.pl: Related changes to the aforementioned
15            modification.
16    
17    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
18    
19            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
20            would report character error from now.
21    
22    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
23    
24            * HTML.pm.src: White-space-leaded non-white-space character
25            tokens in "before head insertion mode" was not
26            correctly handled.
27            (set_inner_html): Reimplemented using CharString decodehandle
28            class.  Support for $get_wrapper argument.  Support
29            for |{read_until}| feature.
30    
31    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
32    
33            * HTML.pm.src: Make a "bare ero" error for unknown
34            entities point the "&" character.
35    
36    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
37    
38            * HTML.pm.src: It turns out that U+FFFD don't have to
39            be added to the list of excluded characters.
40    
41    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
42    
43            * HTML.pm.src ($char_onerror): Have character decoder's |line|
44            and |column| a higher priority than the one set by the
45            tokenizer's input handler.
46            ($self->{read_until}): Exclude U+FFFD (but this might
47            not be necessary, since now we do line/column fixup in
48            the character decode handle).
49    
50    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
51    
52            * HTML.pm.src: Use |{read_until}| where possible.
53    
54    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
55    
56            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
57            and |manakai_getc_until| to |manakai_read_until| to
58            reduce the number of string copies.
59    
60    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
61    
62            * HTML.pm.src (parse_char_string): Use newly created
63            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
64            standard feature to |open| a string as a filehandle,
65            since Perl's string filehandle seems not supporting |ungetc|
66            method correctly.
67            (parse_char_stream): Define |{getc_until}| method.
68            (DATA_STATE): Experimental support for |getc_until| feature.
69    
70    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
71    
72            * HTML.pm.src: Check points added to newly added branches.
73    
74    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
75    
76            * HTML.pm.src: Remove |{char}|, which is no longer used.
77            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
78            and replaced by |{prev_state}|.
79    
80            * mkhtmlparser.pl: Remove |{char}| feature.
81            Remove |!!!back-next-input-character;| macro.
82    
83    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
84    
85            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
86            entity related tokenizer states in favor of new states
87            implementing the consume character reference algorithm.
88    
89    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
90    
91            * HTML.pm.src: "Consume a character reference" algorithm is
92            now implemented as a tokenizer's state, rather than
93            a method, with minimum changes (more changes will
94            be made, in due course).  "Bogus comment state"'s inner
95            loop gets removed.
96    
97    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
98    
99            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
100            into their own tokenizer states.
101    
102    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
103    
104            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
105            is split into three states.
106    
107    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
108    
109            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
110            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
111            no longer does the tokenizer have to push back next input
112            characters in those states.
113    
114    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
115    
116            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
117            into four states so that no longer does the tokenizer have to push
118            back next input characters in that state.
119    
120    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
121    
122            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
123            which can be used to insert some wrapper between the character
124            stream handle and the tokenizer.  (It is currently not supported
125            for |set_inner_html| for |Element|s).
126    
127    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
128    
129            * HTML.pm.src: Ignore punctuations in charset names.
130    
131    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
132    
133            * ContentChecker.pm: Support for charset-layer error levels.
134    
135            * HTML.pm.src: Don't specify |text| argument for the
136            |chardecode:fallback| error, since it is not the encoding
137            being used alternatively.
138    
139    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
140    
141            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
142    
143    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
144    
145            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
146    
147    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
148    
149            * HTML.pm.src: Bug fix and sync with the spec with regard
150            to after after frameset insertion mode processing (HTML5
151            revision 1909).  Note that the implementation was wrong
152            per the old spec before the r1909 changes.
153    
154    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
155    
156            * HTMLTable.pm: scope=auto algorithm fix synced with the
157            spec (HTML5 revision 2093).
158            ($process_row): Algorithm step numbers synced with the
159            spec (HTML5 revision 2092).
160    
161    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
162    
163            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
164            revision 2094).
165    
166    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
167    
168            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
169    
170    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
171    
172            * HTML.pm.src: '"' and "'" at the end of attribute
173            name (after another attribute) now raise parse error (HTML5
174            revision 2123).  Empty unquoted attribute values are no
175            longer allowed (HTML5 revision 2122).
176    
177    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
178    
179            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
180            revision 2130).
181    
182    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
183    
184            * ContentChecker.pm: |xml:lang| attribute value must be same
185            as |lang| attribute value for HTML elements (HTML5 revision 2062
186            and so on).
187    
188    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
189    
190            * ContentChecker.pm: Error level definition for |xml_id_error|
191            was missing.
192    
193            * URIChecker.pm: The end of the URL should be marked as the
194            error location for an empty path error.  The position
195            between the userinfo and the port components should be
196            marked as the error location for an empty host error.
197    
198    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
199    
200            * URIChecker.pm: Set parameters representing where in the
201            value the error occurs for errors.  Report unknown
202            address format error in warning level, since address
203            formats are rarely added.  Path segments starting with "/.."
204            were misinterpreted as a dot-segment.
205    
206    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
207    
208            * URIChecker.pm (check_iri_reference): Requires
209            |Message::DOM::DOMImplementation|.
210    
211    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
212    
213            * IMTChecker.pm: Updated for the new error reporting architecture.
214    
215            * ContentChecker.pm: Error levels for IMTs are added.
216    
217    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
218    
219            * H2H.pm (_shift_token): Support for unquoted HTML attribute
220            values.
221    
222    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
223    
224            * CacheManifest.pm: Support for new style of error
225            reports.
226    
227            * HTML.pm.src: Set line=1, column=1 to the document node.
228    
229    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
230    
231            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
232            and URL checkers.  Support for more error levels for bogus
233            langauge tag and URL "standards".
234    
235            * LangTag.pm, URIChecker.pm: Support for new style error
236            level reporting.
237    
238    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
239    
240            * ContentChecker.pm: Support for RDF/XML error levels.
241    
242            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
243            specifying.  Error types are revised.
244    
245    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
246    
247            * ContentChecker.pm: All error reporting method calls are
248            renewed.
249    
250    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
251    
252            * HTML.pm.src: All error type names and "text" parameters
253            are revised.  Use new style for "level" specification.
254    
255            * mkhtmlparser.pl: Use new style for "level" specification.
256    
257    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
258    
259            * WebIDL.pm (parse_char_string): Simplified error
260            reporting process for broken ignored valuetype definition.
261            (Valuetype idl_text): Support for special "DOMString" name.
262    
263    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
264    
265            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
266            terminal of the ScopedName is "DOMString", such that whether
267            the last part of the scoped name is "DOMString" or "_DOMString"
268            later.  It is necessary to determine whether a |typedef|
269            definition should be ignored or not.
270            (parse_char_string): Unescape the identifier of
271            exception members.
272            ($resolve): Return undef for builtin types and sequence<T>
273            types (we might not have to do this, however...).
274            (check): Support checking for Exceptions, Valuetypes,
275            and Typedefs.
276            ($serialize_type): Support for "DOMString::::" syntax.
277            (Typedef idl_text): Output Type as "DOMString" if it
278            is really "DOMString" (i.e. its internal representation
279            is "::DOMString::").
280    
281    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
282    
283            * WebIDL.pm ($resolve): New code, based on resolve code
284            for constant types in the |check| method.
285            (check): Support for checking of attributes, operations, and
286            arguments.
287            (Attribute/Operation idl_text): Exception names in getraises,
288            setraises, and raises clauses is serizlied by |$serialize_type|
289            code.
290    
291    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
292    
293            * WebIDL.pm ($integer): Order of selections are changed to match
294            hexadecimal numbers (the original pattern, taken from the spec,
295            was not work for hexadecimal numbers, because the "0" prefix
296            matches to the [0-7]* part (as an empty string) and therefore
297            it does not match with remaining "x..." part of a "0x..." integer
298            literal.
299            ($get_type): It now returns a string, not an array reference,
300            for regular types and |sequence| types (i.e. it in any case
301            returns a string).
302            ($get_next_token): The second item in the array that represents
303            a integer or float token is now a Perl number value, not the
304            original string representation of the number.
305            (check): Support for const value consistency checking.
306            No extended attribute is defined for constants.
307            (Node subclasses): Use simple strings rather than array references
308            for default data type values.
309            ($serialize_type): Type values are now simple strings.
310            (value): If the new attribute value is a false value, then
311            a FALSE value is set to the attribute.
312    
313    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
314    
315            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
316            in its stringified format ("scoped name" as defined in the
317            spec).  Note that future version of this module should not use
318            array references for type values and the |type_text| attribute
319            should be made obsolete.
320            (parse_char_string): Unescape attribute names.
321            (check): Support for checking of whether inherited interfaces
322            are actually defined or not.  Support for checking of whether
323            interface member identifiers are duplicated or not.
324            ($serialize_type): Scoped names are returned as is.  A future
325            version of this code should escape identifiers other than "DOMString",
326            otherwise the idl_text would be non-conforming.
327    
328    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
329    
330            * WebIDL.pm (parse_char_string): Set line/column numbers
331            to generated nodes.  Unescape identifiers.  Extended attributes
332            for Definition's were ignored.
333            (append_child): Set |parent_node| attribute.
334            (parent_node): New attribute.
335            (check): Support interface/exception members.  Support
336            extended attributes.  Support definition identifier uniqueness
337            constraint.
338            (qualified_name): New attribute.
339            (Interface/Exception idl_text): Extended attributes were
340            not prepended to the returned text.
341    
342    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
343    
344            * WebIDL.pm (parse_char_string): Set line/column numbers
345            to interface object experimentally.  s/shift/pop/g, shift
346            would make things wrong.  Support for interface forward
347            declarations was missing.  Broken interface declarations
348            with no block were not ignored entirely.
349            (Whatpm::WebIDL::Node): New abstract class.  This class
350            makes things easier.
351            (child_nodes): New attribute.  Unlike DOM's attribute with
352            same name, this attribute returns a dead list of nodes for
353            simplicity.
354            (get_user_data, set_user_data): New methods.
355            (Module idl_text): A SPACE character should be inserted
356            before the |{| character.
357            (Interface idl_text): Support for interface forward declarations.
358            (is_forward_declaration): New attribute.
359    
360    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
361    
362            * WebIDL.pm (type_text): Better serializer.
363    
364    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
365    
366            * WebIDL.pm: Revise forward-compatible parsing so that
367            it now can handle broken extended attributes and as such.
368    
369    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
370    
371            * WebIDL.pm: Real support for extended attributes.
372            Support for extended attributes with arguments.
373    
374    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
375    
376            * WebIDL.pm: Support for |exception| syntax.
377            (Interface->idl_text): Tentative support for inheritances.
378    
379    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
380    
381            * WebIDL.pm: Hierarchical scoped name support was broken.
382            Support for raises, setraises, and getraises syntaxes.
383    
384    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
385    
386            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
387            proper support for types, extended attributes, and exceptions yet).
388            WebIDL parser, version 1 (no support for exceptions yet,
389            no proper support for extended attributes yet).
390    
391    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
392    
393            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
394    
395    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
396    
397            * WebIDL.pm: Support for valuetype and const.
398    
399    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
400            
401            * WebIDL.pm: New module.
402    
403    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
404    
405            * Makefile (Entities.html): URI changed.
406    
407    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
408    
409            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
410    
411    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
412    
413            * HTML.pm.src (_get_next_token): A parse error was missing.
414    
415    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
416    
417            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
418            spec (commented out). (HTML5 revision 1711).
419    
420    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
421    
422            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
423    
424            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
425            as UTF-16 (HTML5 revision 1701).
426    
427    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
428    
429            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
430            1692).
431    
432    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
433    
434            * HTML.pm.src: The secondary insertion mode used when switching
435            to foreign content is the "in body" insertion mode (HTML5 revision
436            1696).
437    
438    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
439    
440            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
441            revision 1697).
442    
443    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
444    
445            * HTML.pm.src: Support for end-of-file token in foreign content
446            insertion mode (HTML5 revision 1693).  Update SVG camelCase
447            attribute list (HTML5 revision 1700).  <textarea> closes
448            </select> (HTML5 revision 1699).  More start tags close in
449            foreign content insertion mode (HTML5 revision 1698).
450    
451    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
452    
453            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
454    
455    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
456    
457            * HTML.pm.src: More robust charset parameter detection (HTML5
458            revision 1674).
459    
460    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
461    
462            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
463            revision 1676).
464    
465    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
466    
467            * HTML.pm.src: Ignore language part of public identifiers for
468            quriks mode detection (HTML5 revision 1679).
469    
470    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
471    
472            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
473            revision 1685).
474    
475    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
476    
477            * HTML.pm.src: Support for EOF in new states for tags (HTML5
478            revision 1684).
479    
480    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
481    
482            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
483            work (HTML5 revision 1690).
484    
485    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
486    
487            * HTML.pm.src (_tree_construction_main): Change handling of
488            end tags in head insertion modes (HTML5 revision 1686).
489            (parse_char_string): Bug fix for non-utf8 character string handlings.
490            (parse_char_stream): |ungetc| does not work well for this context.
491    
492    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
493    
494            * HTML.pm.src (parse_byte_string): Redefined to invoke
495            |parse_byte_stream|.
496            (parse_byte_stream): New method.
497    
498    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
499    
500            * HTML.pm.src (parse_byte_string): Fix the column number reported
501            by encoding layer error reporter.
502    
503    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
504    
505            * HTML.pm.src (parse_byte_string): Use streaming decoder
506            rather than converting the whole byte string and then parsing.
507            Propagate errors in character encoding layer.
508            (get_next_token): Precise error reporting for |bare stago| error.
509    
510    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
511    
512            * HTML.pm.src (parse_char_stream): New method.
513            (parse_char_string): This method is now defined as an invocation
514            of the |parse_char_stream| method.
515    
516    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
517    
518            * HTML.pm.src (parse_byte_string): Report various status
519            of the sniffing as info-level errors.  Support for new
520            decoding framework in parser resestting.
521            (new): Various default error levels were not set.
522    
523    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
524    
525            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
526            algorithm, except for the actual sniffing, is implemented
527            with new framework with Message::Charset::Info.
528    
529    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
530    
531            * CacheManifest.pm (_parse): Drop fragment identifiers from
532            URIs in fallback section (HTML5 revision 1596).
533    
534    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
535    
536            * Makefile (Entities.html): URI has changed.
537    
538    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
539    
540            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
541            1553).
542    
543    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
544    
545            * ContentChecker.pm: Noted that those returned in |table| are
546            no longer table elements, but table objects returned
547            by Whatpm::HTMLTable.
548    
549            * HTMLTable.pm (form_table): Return table element node
550            as |$table->{element}|.
551            (assign_header): Support for the |headers=""| attribute.
552    
553    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
554    
555            * HTMLTable.pm (assign_header): New function; first version
556            with no support for headers="".
557            (form_table): Include table width and height to the returned
558            table object for covenience.  Indexing in column assignement was wrong.
559            Set whether a data cell is empty or not for convenience.
560    
561    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
562    
563            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
564    
565    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
566    
567            * HTMLTable.pm: How table model errors are detected is
568            changed (HTML5 revision 1387).
569    
570    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
571    
572            * HTMLTable.pm: The algorithm now moves |tfoot| elements
573            to the end of the table (HTML5 revision 1380).
574    
575    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
576    
577            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
578            of 1-based (HTML5 revision 1376).
579    
580    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
581    
582            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
583            and "Content-Encoding" no longer prevent sniffing (HTML5
584            revision 1288).
585    
586    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
587    
588            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
589            1282).
590    
591    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
592    
593            * ContentChecker.pm: Support for global attributes.
594            Status of XML specs are added.
595    
596    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
597    
598            * ContentChecker.pm (check_element): Support for |template=""|
599            and |ref=""| attribute (referent element type checking).
600    
601    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
602    
603            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
604            1500) is implemented (except for IDNA part and URI-scheme-specific
605            knowledge).  Line number counting was wrong for LF-only
606            documents.
607    
608    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
609    
610            * HTML.pm.src: Raise an parse error for any disallowed
611            character (HTML5 revision 1263).
612    
613    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
614    
615            * mkentitylist.pl: Support for new HTML5 entity table format (the
616            definition for |AElig;| was missing).
617    
618    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
619    
620            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
621            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
622    
623    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
624    
625            * HTML.pm.src: List of element names that close foreign content
626            insertion mode is added (HTML5 revisions 1412 and 1418).
627    
628    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
629    
630            * HTML.pm.src: Support for |mglyph| and |malignmark|
631            elements (HTML5 revision 1410).
632    
633    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
634    
635            * HTML.pm.src: Support for new long MathML entities (HTML5
636            revision 1406).
637    
638    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
639    
640            * HTML.pm.src: CDATA section support for MathML and SVG
641            elements (HTML5 revision 1404 and 1420).
642    
643    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
644    
645            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
646            elements (HTML5 revision 1404).  Unused !!!macro definitions
647            are removed.
648    
649    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
650    
651            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
652            are raised is changed (HTML5 revision 1404).
653    
654    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
655    
656            * HTML.pm.src: Category sets that are no longer used are removed.
657    
658    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
659    
660            * HTML.pm.src: The ->[1] property of stack entries are now
661            replaced by constants representing element category.
662    
663    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
664    
665            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
666            for error reporting.  (This is a preparation for using constant
667            value for ->[1].)
668    
669    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
670    
671            * RDFXML.pm: Typo fixed.
672    
673    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
674    
675            * ContentChecker.pm: |fact_level| is now treated
676            as same as |must_level|, i.e. level = |m|.
677            (check_element): Make list of URIs in the DOM.
678    
679    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
680    
681            * RDFXML.pm: Language accessor implemented.
682            Local (null-namespace) attribute support.
683    
684    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
685    
686            * RDFXML.pm: Factored out ID checking code.
687    
688    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
689    
690            * RDFXML.pm: TODO items noted.  Validation of ID
691            and URI attributes is implemented.  Warn if unknown
692            value is used in rdf:parseType="" attribute.
693    
694            * URIChecker.pm (check_rdf_uri_reference): New function.
695    
696    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
697    
698            * RDFXML.pm: bnodeid implemented.  Relative references
699            are now resolved.
700    
701    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
702    
703            * ContentChecker.pm: RDF reification implemented.
704    
705            * RDFXML.pm: undef vs false bug fixed.
706            Reification implemented.
707    
708    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
709    
710            * RDFXML.pm: s/id/ID/ for attribute name.
711            The |node| arguments are added for |ontriple| calls.
712            Too many "attribute not allowed" errors were raised.
713    
714            * ContentChecker.pm: Initial experimental support for rdf:RDF
715            element.
716    
717    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
718    
719            * RDFXML.pm: New module.
720    
721    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
722    
723            * HTML.pm.src (set_inner_html): Line/column number
724            code was old one yet.
725    
726    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
727    
728            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
729            errors.  Line/column markings for DOCTYPE, comment, and
730            character tokens are reintroduced; otherwise,
731            error location for "not HTML5" error and errors
732            for implied elements are not attached.
733    
734    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
735    
736            * HTML.pm.src: Set line/column numbers to attributes.
737    
738            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
739            (value): Setter implemented.
740    
741            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
742    
743    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
744    
745            * HTML.pm.src: Unused line/column markings are removed.
746    
747    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
748    
749            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
750            flag, which is no longer used.
751    
752    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
753    
754            * HTML.pm.src: Set line/column information to element nodes.
755    
756            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
757            and !!!insert-element-t): Set line/column information to
758            element nodes.
759    
760    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
761    
762            * HTML.pm.src (_get_next_token): The first "<" character
763            in "<?", "<>", or "</>" should be the error point.
764    
765    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
766    
767            * HTML.pm.src: Some more fixes on error position reporting.
768    
769    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
770    
771            * HTML.pm.src: Token-level precious error reporting.
772    
773    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
774    
775            * HTML.pm.src: Preparation for more precious error point
776            reporting.
777    
778    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
779    
780            * HTML.pm.src: Error type revised.
781    
782    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
783    
784            * HTML.pm.src: Similar codes are merged together, again.
785    
786    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
787    
788            * HTML.pm.src: Similar codes are merged together.
789    
790    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
791    
792            * mkhtmlparser.pl: Set "level" parameter to parse errors.
793    
794            * HTML.pm.src: Code refined.
795    
796    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
797    
798            * HTML.pm.src: |</body>| treatement has been changed (HTML5
799            revision 1348).  Note that I really don't know this makes
800            any difference in the black-box behavior of the parser.
801    
802    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
803    
804            * HTML.pm.src: New end-of-file token implementation (HTML5
805            revision 1348).
806    
807    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
808    
809            * HTML.pm.src: |applet| support (HTML5 revision 1347).
810    
811    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
812    
813            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
814    
815    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
816    
817            * HTML.pm.src: Support for |<input>| in the "in select" insertion
818            mode and support for the "in select in table" insertion mode (HTML5
819            revision 1342).
820    
821    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
822    
823            * HTML.pm.src: No foster parenting for <script> and <script>
824            in non-tainted <table>s (HTML5 revision 1336).
825    
826    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
827    
828            * HTML.pm.src: Ignore white space characters between <html>
829            and <head> (HTML5 revision 1332).
830    
831    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
832    
833            * HTML.pm.src: Treat <input type=hidden> as if it were a
834            white space (HTML5 revision 1331).
835    
836    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
837    
838            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
839            element (HTML5 revision 1330).
840    
841    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
842    
843            * HTML.pm.src: <title> is always appended to the current
844            element (HTML5 revision 1328).
845    
846    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
847    
848            * HTML.pm.src: White space in tainted tables are moved
849            into foster parents (HTML5 revision 1326).
850    
851    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
852    
853            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
854            revision 1321).
855    
856    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
857    
858            * HTML.pm.src: |</h/n/>| case code rearranged to align with
859            the spec (HTML5 revision 1320).  Note that we finally complete
860            all of HTML5 revision 1320 changes.
861    
862    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
863    
864            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
865            tags (HTML5 revision 1320).
866    
867    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
868    
869            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
870    
871    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
872    
873            * HTML.pm.src: A "generate implied end tags" code (t409.1)
874            could not be reached so that it is now removed (HTML5 revision
875            1320).
876    
877    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
878    
879            * HTML.pm.src: Code for the case of |</div>| and so on are revised
880            to align with new spec text (HTML5 revision 1320).
881    
882    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
883    
884            * HTML.pm.src: Remove strange |if| condition; however, it should
885            have had no harm in theory.
886    
887    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
888    
889            * HTML.pm.src (_tree_construction_main): '</p>' in body
890            case is split from other end tags for the preparation
891            of implementing HTML revision 1320.
892    
893    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
894    
895            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
896            revision 1320).
897    
898    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
899    
900            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
901            and "div" (HTML5 revision 1318).  Add comments to where
902            |form| pointer association codes should be inserted (HTML5 revision
903            1319).
904    
905    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
906    
907            * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
908    
909    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
910    
911            * HTML.pm.src: Since the case t268 should never be reached (no
912            other token type, there are), it is replaced by a |die| statement.
913    
914    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
915    
916            * HTML.pm.src: Typo fixed.
917    
918    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
919    
920            * HTML.pm.src (_tree_construction_initial): Some limited quirks
921            doctypes were not uppercased for comparison.
922    
923    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
924    
925            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
926            are added.
927    
928    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
929    
930            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
931            are set.  Cases that are unlikely reached are noted as so.
932    
933    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
934    
935            * HTML.pm.src: Checkpoints for debugging are added.
936    
937            * mkhtmlparser.pl: Support for |!!!cp| syntax.
938    
939    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
940    
941            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
942            for simplicity.
943    
944    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
945    
946            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
947            revision 1307).
948    
949    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
950    
951            * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
952            is set are changed (HTML5 revision 1305).
953    
954    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
955    
956            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
957            1292).  Entities are not parsed in comment-like part in RCDATA
958            elements (HTML5 revision 1294).  Allow bare & at the end
959            of attribute value literals (HTML5 revision 1296).  More
960            quirks mode doctypes (HTML5 revision 1302).  Requires spaces
961            between attributes and ban attribute names or unquoted
962            attribute values containing single or double quotes (HTML5
963            revision 1303).
964    
965    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
966    
967            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
968            and related errors unless it is an HTML document (though the spec
969            is unclear on whether it is applied to XHTML document).
970    
971            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
972            are added.
973    
974    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
975    
976            * HTML.pm.src: s/local_name/manakai_local_name/g.
977    
978    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
979    
980            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
981    
982            * HTML.pm.src: |charset| in |content| attribute is
983            case-insensitive (HTML5 revision 1270).
984    
985    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
986    
987            * ContentChecker.pm: New status constants are added.
988            ($ElementDefault): |status| added.
989            (check_element): Err for non-standard or deprecated elements.
990            (_attr_status_info): For non-standard or deprecated attributes.
991    
992    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
993    
994            * ContentChecker.pm (_attr_status_info): New internal method.
995    
996    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
997    
998            * ContentChecker.pm (check_element): Element standardized
999            status information is now dispatched.
1000    
1001    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1002    
1003            * ContentChecker.pm (check_element): Fix |del|-and-significant
1004            problem by adding some more arguments.
1005    
1006    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1007    
1008            * ContentChecker.pm (check_element): Use context of
1009            container-for-the-purpose-of-content-model element (not transparent
1010            element) for |check_child_element| calling and significant
1011            text flag marking.  This reintroduces |<del>|-and-significant
1012            problem again.
1013    
1014    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1015    
1016            * ContentChecker.pm (check_element): Make semi-transparent
1017            elements ignored for the purpose of phase changes in
1018            content model checking.
1019    
1020    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1021    
1022            * ContentChecker.pm (check_element): In-element state
1023            was not properly managed for transparent cases.
1024    
1025    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1026    
1027            * ContentChecker.pm (check_element): Support for |video|
1028            and |audio| as semi-transparent elements.
1029    
1030    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1031    
1032            * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1033            (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1034            Support for |html:object| as a semi-transparent element.
1035    
1036    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1037    
1038            * ContentChecker.pm (check_element): The way to traverse
1039            the tree is entirely revised to make it easier to track
1040            the state of ancestors/descendants.  As a result of this
1041            revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1042            support for content model checking for HTML elements |figure|,
1043            |object|, |video|, and |audio| and checking for XML elements (and
1044            some XMLNS checkings) are dropped for now.  They will be
1045            reimplemented in due cource.
1046    
1047    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1048    
1049            * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1050            token (HTML5 revision 1225).
1051    
1052    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1053    
1054            * ContentChecker.pm ({unsupported_level}): New value.
1055    
1056            * HTML.pm.src: Save whether |meta| |content| attribute
1057            contains character references or not.
1058    
1059    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1060    
1061            * ContentChecker.pm (_get_children): (Incompleted) attempt to
1062            imlement significant content checking for contents
1063            with |del| elements.
1064    
1065    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1066    
1067            * ContenteChecker.pm ($HTMLTransparentElements): More
1068            elements are added.
1069            (_get_children): HTML |object| elements are now semi-transparent.
1070    
1071            * NanoDOM.pm (manakai_html, manakai_head): New methods.
1072    
1073    2008-02-16  Wakaba  <wakaba@suika.fam.cx>
1074    
1075            * CacheManifest.pm: HTML5 revision 1211 implemented.
1076    
1077            * CacheManifest.pod: Updated.
1078    
1079    2008-02-10  Wakaba  <wakaba@suika.fam.cx>
1080    
1081            * ContentChecker.pm (check_document, check_element): Support
1082            for second argument ($onsubdoc).
1083            (_get_css_parser): Removed (now it is part of WDCC).
1084    
1085    2008-02-09  Wakaba  <wakaba@suika.fam.cx>
1086    
1087            * ContentChecker.pm (_get_css_parser): New.
1088    
1089    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1090    
1091            * ContentChecker.pm ($AnyChecker): Old way to add child elements
1092            for checking had been used.
1093    
1094    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1095    
1096            * ContentChecker.pm (check_element): New todo item type |descendant|.
1097    
1098    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1099    
1100            * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1101            specced in RFC 2046 and RFC 4288.
1102            (application/atom+xml): Definition added.
1103    
1104    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1105    
1106            * URIChecker.pm: Make RFC 3986 should-level errors
1107            warnings (rather than SHOULD-level errors).
1108    
1109    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1110    
1111            * NanoDOM.pm (get_user_data, set_user_data): New methods.
1112    
1113            * HTML.pm.src: A flag for character references in attribute
1114            values are added.  Set |manakai_has_reference| user data
1115            to |charset| attribute.
1116    
1117    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1118    
1119            * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1120            attributes.
1121    
1122            * ContentChecker.pm (check_document): Warn if charset requirements
1123            cannot be tested.
1124    
1125    2007-11-19  Wakaba  <wakaba@suika.fam.cx>
1126    
1127            * HTML.pm.src (parse_byte_string): Detect charset
1128            by universalchardet if charset parameter is not specified.
1129    
1130            * Makefile (Charset-all, Charset-clean): New rules.
1131    
1132    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1133    
1134            * ContentChecker.pm (check_document): Check the existence
1135            of character encoding declaration and selection of encoding
1136            for HTML document.
1137    
1138    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1139    
1140            * ContentType.pm (get_sniffed_type): Return also the official
1141            type in list context.
1142    
1143    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1144    
1145            * ContentType.pm: Sniffing with leading white space
1146            ignoring (HTML5 revisions 1013 and 1016).
1147    
1148    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1149    
1150            * ContentType.pm: HTML5 revision 1013 changes, except for leading
1151            white spaces, are implemented.
1152    
1153    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1154    
1155            * HTML.pm.src (parse_char_string): Set |inner_encoding|
1156            attribute if possible.
1157    
1158    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1159    
1160            * HTML.pm.src (parse_byte_string): New method.
1161            (parse_char_string): New alias for |parse_string|.
1162            (main phase): Invoking "change the encoding" algorithm if desired.
1163    
1164            * HTML.pod: Updated.
1165    
1166    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1167    
1168            * HTML.pod (get_inner_html): Removed.
1169    
1170            * Makefile (HTML-all, HTML-clean): New.
1171    
1172    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1173    
1174            * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1175    
1176    2007-11-08  Wakaba  <wakaba@suika.fam.cx>
1177    
1178            * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1179    
1180    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1181    
1182            * Makefile: |CacheManifest.html| is added.
1183    
1184            * CacheManifest.pod: New file.
1185    
1186    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1187    
1188            * CacheManifest.pm: New module.
1189    
1190    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1191    
1192            * HTML.pm.src: Support for application cache selection algorithm
1193            callback.
1194    
1195    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1196    
1197            * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1198    
1199    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1200    
1201            * Makefile (clean): New rule.
1202    
1203            * NanoDOM.pm (public_id, system_id): New attributes.
1204    
1205    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1206    
1207            * Makefile (CSS-all, CSS-clean, clean): New rules.
1208    
1209    2007-10-14  Wakaba  <wakaba@suika.fam.cx>
1210    
1211            * ContentChecker.pm (check_document): Support for
1212            new |is_xml_root| flag.
1213            (check_element): Support for new |pluses| state.
1214            (_add_pluses): New method.
1215            (_remove_minuses): Support for new |minus| item.
1216    
1217    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1218    
1219            * ContentChecker.pm: Raise specific error for invalid
1220            root element.
1221    
1222    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1223    
1224            * ContentChecker.pm: Set level values for later uses.
1225    
1226    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1227    
1228            * ContentChecker.pm: Support for language tag validation.
1229    
1230    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1231    
1232            * LangTag.pm (check_rfc3066_language_tag): New method.
1233    
1234    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1235    
1236            * LangTag.pm: New module.
1237    
1238    2007-09-04  Wakaba  <wakaba@suika.fam.cx>
1239    
1240            * HTML.pm.src: Some error types were wrong.
1241    
1242    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1243    
1244            * CSS/: New directory.
1245    
1246    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1247    
1248            * ContentChecker.pm (_check_get_children): Support
1249            for |noscript| in |head|.
1250    
1251    2007-08-12  Wakaba  <wakaba@suika.fam.cx>
1252    
1253            * URI/: New directory.
1254    
1255    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1256    
1257            * HTML.pm.src: Tokenizer's states are now represented in
1258            number.
1259    
1260    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1261    
1262            * HTML.pm.src: |or|s for insertion modes are replaced
1263            by |&|s.
1264    
1265    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1266    
1267            * HTML.pm.src: Token types are now represented in number.
1268    
1269    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1270    
1271            * ContentType.pm (SEE ALSO): Updated.
1272    
1273            * HTML.pm.src: Insertion modes are now represented in number.
1274    
1275    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1276    
1277            * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1278            999) is implemented.
1279    
1280    2007-08-08  Wakaba  <wakaba@suika.fam.cx>
1281    
1282            * ContentType.pm: Sniffing for |<script| (HTML5 revision
1283            983) is implemented.
1284    
1285    2007-08-06  Wakaba  <wakaba@suika.fam.cx>
1286    
1287            * ContentChecker.pod: New documentation.
1288    
1289            * Makefile: A rule for |ContentChecker.html| is added.
1290    
1291            * ContentChecker.pm: A pod "LICENSE" section is added.
1292    
1293            * NanoDOM.pm ($VERSION): New variable.
1294    
1295    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1296    
1297            * H2H.pm: |b|, |i|, and |sub| are added to the
1298            list of allowed HTML elements.
1299    
1300    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1301    
1302            * H2H.pm: |samp| is added to the list of allowed
1303            HTML elements.
1304    
1305            * URIChecker.pm (check_iri): New.
1306            (check_iri_reference): Error type for IRI reference
1307            syntax error is changed.
1308    
1309    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1310    
1311            * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1312            is added.
1313            (check_document): Load appropriate module before validation.
1314    
1315    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1316    
1317            * ContentChecker/: New directory.
1318    
1319    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1320    
1321            * ContentChecker.pm: HTML |time| element is implemented.
1322    
1323            * HTMLTable.pm: Comments are updated as HTML5 is revised.
1324    
1325    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1326    
1327            * ContentChecker.pm (check_document): Return value
1328            even if no document element is found.
1329    
1330    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1331    
1332            * HTML.pm.src: |$in_body| is no longer a function.
1333    
1334    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1335    
1336            * HTML.pm.src: The |$in_body| code has been moved down.
1337    
1338    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1339    
1340            * HTML.pm.src: The "trailing end" insertion mode
1341            is split into "after html body" and "after html frameset"
1342            insertion modes.  Their codes are merged with "after body"
1343            and "after frameset" codes.  |$previous_insertion_mode|
1344            has been removed.  "after frameset" code is
1345            merged with "in frameset" code.
1346    
1347    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1348    
1349            * HTML.pm.src: The "before head" insertion mode is
1350            merged with the "in head" insertion mode.
1351    
1352    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1353    
1354            * HTML.pm.src: Cases in "in head" insertion mode are
1355            reorganized.
1356    
1357    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1358    
1359            * HTML.pm.src: Some cases in "in table" insertion mode
1360            are merged.
1361    
1362    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1363    
1364            * HTML.pm.src: The "in row" insertion mode is merged
1365            with "in table" insertion mode.
1366    
1367    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1368    
1369            * HTML.pm.src: The "in table" and "in table body" insertion
1370            modes are merged.
1371    
1372    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1373    
1374            * HTML.pm.src: There is no "in table head" or "in table foot"
1375            insertion mode!
1376    
1377    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1378    
1379            * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1380            now directly invoke the handler.
1381    
1382    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1383    
1384            * HTML.pm.src: Codes for "in cell" insertion mode
1385            is merged to the "in body" insertion mode code.
1386    
1387    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1388    
1389            * HTML.pm.src: Codes for "in body" and "in caption"
1390            insertion modes are merged.
1391    
1392    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1393    
1394            * HTML.pm.src: Two |!!!next-token|s were missing.
1395    
1396    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1397    
1398            * HTML.pm.src: Use numeric constant for |{content_mode}|
1399            instead of string constant for |{content_model_flag}|.
1400    
1401    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1402    
1403            * HTML.pm.src: Add the name of the attribute
1404            to the "duplicate attribute" error.
1405    
1406    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
1407    
1408            * ContentChecker.pm: Return the |class| node list.
1409    
1410    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
1411    
1412            * ContentChecker.pm: Return the |id| node list.
1413    
1414            * HTML.pm.src: A typo is fixed.
1415    
1416    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1417    
1418            * ContentChecker.pm: Drop wrong |level => 'error'| specification
1419            from "in HTML:xml:lang" error.  Character position
1420            is now the last part of the error type in the URI error
1421            description.  Report "unsupported" status for language
1422            tags, media queries, script codes, and style sheets.
1423    
1424    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1425    
1426            * ContentChecker.pm: Report error if |xml:lang|
1427            in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1428            in XML.
1429    
1430            * NanoDOM.pm (Attr.owner_document): New attribute.
1431    
1432    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1433    
1434            * HTML.pm.src: The character immediately following
1435            a bare |hcro| was discarded.  Fix handling of
1436            entity references in attribute values.
1437    
1438    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1439    
1440            * HTML.pm.src (main and trailin end phases): Token
1441            types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1442            factored out.  Error types |in html:#DOCTYPE|
1443            and |after html:#DOCTYPE| are merged
1444            into |DOCTYPE in the middle|.  |</frameset>|
1445            in fragment parsing mode changed the
1446            insertion mode.
1447    
1448  2007-07-16  Wakaba  <wakaba@suika.fam.cx>  2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1449    
1450          * HTML.pm.src: |$phase| has been removed; The |trailing end|          * HTML.pm.src: |$phase| has been removed; The |trailing end|

Legend:
Removed from v.1.64  
changed lines
  Added in v.1.309

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24