
Diff of /markup/html/whatpm/Whatpm/ChangeLog


revision 1.17 by wakaba, Sat May 19 14:29:09 2007 UTC revision 1.297 by wakaba, Sat Sep 13 10:49:21 2008 UTC
1    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
4            entity related tokenizer states in favor of new states
5            implementing the consume character reference algorithm.
6    
7    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
8    
9            * HTML.pm.src: "Consume a character reference" algorithm is
10            now implemented as a tokenizer's state, rather than
11            a method, with minimum changes (more changes will
12            be made, in due course).  "Bogus comment state"'s inner
13            loop gets removed.
14    
15    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
16    
17            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
18            into their own tokenizer states.
19    
20    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
21    
22            * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
23            is split into three states.
24    
25    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
26    
27            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
28            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
29            the tokenizer no longer has to push back next input
30            characters in those states.
31    
32    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
33    
34            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| is broken
35            into four states so that the tokenizer no longer has to push
36            back next input characters in that state.
37    
38    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
39    
40            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
41            which can be used to insert some wrapper between the character
42            stream handle and the tokenizer.  (It is currently not supported
43            for |set_inner_html| for |Element|s).
44    
45    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
46    
47            * HTML.pm.src: Ignore punctuation in charset names.
48    
49    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
50    
51            * ContentChecker.pm: Support for charset-layer error levels.
52    
53            * HTML.pm.src: Don't specify |text| argument for the
54            |chardecode:fallback| error, since it is not the encoding
55            being used alternatively.
56    
57    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
58    
59            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
60    
61    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
62    
63            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
64    
65    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
66    
67            * HTML.pm.src: Bug fix and sync with the spec with regard
68            to after after frameset insertion mode processing (HTML5
69            revision 1909).  Note that the implementation was wrong
70            per the old spec before the r1909 changes.
71    
72    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
73    
74            * HTMLTable.pm: scope=auto algorithm fix synced with the
75            spec (HTML5 revision 2093).
76            ($process_row): Algorithm step numbers synced with the
77            spec (HTML5 revision 2092).
78    
79    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
80    
81            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
82            revision 2094).
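
The difference between the Zs general category and the White_Space
property can be illustrated with a short sketch.  This is a Python
illustration only, not the code in HTMLTable.pm; the helper names
|is_zs| and |is_white_space| are hypothetical, and the code point list
is the White_Space set from the Unicode property list.

```python
import unicodedata

# Code points carrying the Unicode White_Space property.
# Hard-coded here for illustration (assumption: current Unicode data).
WHITE_SPACE = frozenset([
    *range(0x0009, 0x000E),   # TAB, LF, VT, FF, CR
    0x0020, 0x0085, 0x00A0, 0x1680,
    *range(0x2000, 0x200B),   # EN QUAD .. HAIR SPACE
    0x2028, 0x2029, 0x202F, 0x205F, 0x3000,
])


def is_zs(ch):
    """True if ch is in general category Zs (space separator)."""
    return unicodedata.category(ch) == "Zs"


def is_white_space(ch):
    """True if ch has the White_Space property."""
    return ord(ch) in WHITE_SPACE


if __name__ == "__main__":
    for ch in ["\t", "\n", " ", "\u3000"]:
        print("U+%04X Zs=%s White_Space=%s"
              % (ord(ch), is_zs(ch), is_white_space(ch)))
```

TAB and LF are control characters (category Cc), so a test based on
Zs alone would not treat them as white space.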
83    
84    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
85    
86            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
87    
88    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
89    
90            * HTML.pm.src: '"' and "'" at the end of attribute
91            name (after another attribute) now raise parse error (HTML5
92            revision 2123).  Empty unquoted attribute values are no
93            longer allowed (HTML5 revision 2122).
94    
95    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
96    
97            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
98            revision 2130).
99    
100    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
101    
102            * ContentChecker.pm: |xml:lang| attribute value must be the same
103            as |lang| attribute value for HTML elements (HTML5 revision 2062
104            and so on).
105    
106    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
107    
108            * ContentChecker.pm: Error level definition for |xml_id_error|
109            was missing.
110    
111            * URIChecker.pm: The end of the URL should be marked as the
112            error location for an empty path error.  The position
113            between the userinfo and the port components should be
114            marked as the error location for an empty host error.
115    
116    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
117    
118            * URIChecker.pm: Set parameters representing where in the
119            value the error occurs for errors.  Report unknown
120            address format error in warning level, since address
121            formats are rarely added.  Path segments starting with "/.."
122            were misinterpreted as a dot-segment.
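
The dot-segment fix above can be sketched as follows.  This is a
Python illustration, not the code in URIChecker.pm; both function
names are hypothetical, and the "buggy" variant is only a guess at the
kind of prefix test that would produce the misinterpretation.

```python
def is_dot_segment(segment):
    """A path segment is a dot-segment only if it is exactly '.' or '..'."""
    return segment in (".", "..")


def looks_like_dot_segment_buggy(segment):
    """Hypothetical prefix test, shown only for contrast."""
    return segment.startswith("..")


if __name__ == "__main__":
    for path in ["/a/../b", "/a/..b/c"]:
        segments = path.split("/")[1:]
        print(path, [s for s in segments if is_dot_segment(s)])
```

With the exact comparison, "/a/..b/c" contains no dot-segment, while
the prefix test would wrongly flag "..b".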
123    
124    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
125    
126            * URIChecker.pm (check_iri_reference): Requires
127            |Message::DOM::DOMImplementation|.
128    
129    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
130    
131            * IMTChecker.pm: Updated for the new error reporting architecture.
132    
133            * ContentChecker.pm: Error levels for IMTs are added.
134    
135    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
136    
137            * H2H.pm (_shift_token): Support for unquoted HTML attribute
138            values.
139    
140    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
141    
142            * CacheManifest.pm: Support for new style of error
143            reports.
144    
145            * HTML.pm.src: Set line=1, column=1 to the document node.
146    
147    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
148    
149            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
150            and URL checkers.  Support for more error levels for bogus
151            language tag and URL "standards".
152    
153            * LangTag.pm, URIChecker.pm: Support for new style error
154            level reporting.
155    
156    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
157    
158            * ContentChecker.pm: Support for RDF/XML error levels.
159    
160            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
161            specifying.  Error types are revised.
162    
163    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
164    
165            * ContentChecker.pm: All error reporting method calls are
166            renewed.
167    
168    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
169    
170            * HTML.pm.src: All error type names and "text" parameters
171            are revised.  Use new style for "level" specification.
172    
173            * mkhtmlparser.pl: Use new style for "level" specification.
174    
175    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
176    
177            * WebIDL.pm (parse_char_string): Simplified error
178            reporting process for broken ignored valuetype definition.
179            (Valuetype idl_text): Support for special "DOMString" name.
180    
181    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
182    
183            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
184            terminal of the ScopedName is "DOMString", so that whether
185            the last part of the scoped name is "DOMString" or "_DOMString"
186            can be determined later.  It is necessary to determine whether
187            a |typedef| definition should be ignored or not.
188            (parse_char_string): Unescape the identifier of
189            exception members.
190            ($resolve): Return undef for builtin types and sequence<T>
191            types (we might not have to do this, however...).
192            (check): Support checking for Exceptions, Valuetypes,
193            and Typedefs.
194            ($serialize_type): Support for "DOMString::::" syntax.
195            (Typedef idl_text): Output Type as "DOMString" if it
196            is really "DOMString" (i.e. its internal representation
197            is "::DOMString::").
198    
199    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
200    
201            * WebIDL.pm ($resolve): New code, based on resolve code
202            for constant types in the |check| method.
203            (check): Support for checking of attributes, operations, and
204            arguments.
205            (Attribute/Operation idl_text): Exception names in getraises,
206            setraises, and raises clauses are serialized by |$serialize_type|
207            code.
208    
209    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
210    
211            * WebIDL.pm ($integer): Order of alternatives is changed to match
212            hexadecimal numbers (the original pattern, taken from the spec,
213            did not work for hexadecimal numbers, because after the "0" prefix
214            the [0-7]* part matches (as an empty string) and therefore
215            the remaining "x..." part of a "0x..." integer literal
216            is not matched).
217            ($get_type): It now returns a string, not an array reference,
218            for regular types and |sequence| types (i.e. it in any case
219            returns a string).
220            ($get_next_token): The second item in the array that represents
221            an integer or float token is now a Perl number value, not the
222            original string representation of the number.
223            (check): Support for const value consistency checking.
224            No extended attribute is defined for constants.
225            (Node subclasses): Use simple strings rather than array references
226            for default data type values.
227            ($serialize_type): Type values are now simple strings.
228            (value): If the new attribute value is a false value, then
229            a FALSE value is set to the attribute.
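
The |$integer| ordering problem above can be reproduced with a small
sketch.  This is a Python illustration, not the Perl code in
WebIDL.pm; the two patterns are assumed reconstructions (the
"spec order" one is how the pattern is believed to have looked at the
time), and the names |SPEC_ORDER|, |HEX_FIRST|, and |leading_integer|
are hypothetical.  Python and Perl both try alternatives left to
right, so the effect is the same.

```python
import re

# Assumed original order: after "0", the octal alternative is tried first
# and succeeds with an empty match, so the hex alternative is never used.
SPEC_ORDER = re.compile(r"-?0(?:[0-7]*|[Xx][0-9A-Fa-f]+)|[1-9][0-9]*")

# Reordered: the hex alternative is tried before the octal one.
HEX_FIRST = re.compile(r"-?0(?:[Xx][0-9A-Fa-f]+|[0-7]*)|[1-9][0-9]*")


def leading_integer(pattern, text):
    """Return the integer literal matched at the start of text, or None."""
    m = pattern.match(text)
    return m.group(0) if m else None


if __name__ == "__main__":
    for name, pattern in [("spec order", SPEC_ORDER), ("hex first", HEX_FIRST)]:
        print(name, repr(leading_integer(pattern, "0x1F")))
```

With the assumed original order only the "0" of "0x1F" is consumed,
leaving "x1F" behind; with the hex alternative first the whole literal
is consumed, and octal literals such as "017" still match.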
230    
231    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
232    
233            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
234            in its stringified format ("scoped name" as defined in the
235            spec).  Note that future version of this module should not use
236            array references for type values and the |type_text| attribute
237            should be made obsolete.
238            (parse_char_string): Unescape attribute names.
239            (check): Support for checking of whether inherited interfaces
240            are actually defined or not.  Support for checking of whether
241            interface member identifiers are duplicated or not.
242            ($serialize_type): Scoped names are returned as is.  A future
243            version of this code should escape identifiers other than "DOMString",
244            otherwise the idl_text would be non-conforming.
245    
246    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
247    
248            * WebIDL.pm (parse_char_string): Set line/column numbers
249            to generated nodes.  Unescape identifiers.  Extended attributes
250            for Definition's were ignored.
251            (append_child): Set |parent_node| attribute.
252            (parent_node): New attribute.
253            (check): Support interface/exception members.  Support
254            extended attributes.  Support definition identifier uniqueness
255            constraint.
256            (qualified_name): New attribute.
257            (Interface/Exception idl_text): Extended attributes were
258            not prepended to the returned text.
259    
260    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
261    
262            * WebIDL.pm (parse_char_string): Set line/column numbers
263            to interface object experimentally.  s/shift/pop/g, shift
264            would make things wrong.  Support for interface forward
265            declarations was missing.  Broken interface declarations
266            with no block were not ignored entirely.
267            (Whatpm::WebIDL::Node): New abstract class.  This class
268            makes things easier.
269            (child_nodes): New attribute.  Unlike DOM's attribute with
270            the same name, this attribute returns a dead list of nodes for
271            simplicity.
272            (get_user_data, set_user_data): New methods.
273            (Module idl_text): A SPACE character should be inserted
274            before the |{| character.
275            (Interface idl_text): Support for interface forward declarations.
276            (is_forward_declaration): New attribute.
277    
278    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
279    
280            * WebIDL.pm (type_text): Better serializer.
281    
282    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
283    
284            * WebIDL.pm: Revise forward-compatible parsing so that
285            it now can handle broken extended attributes and the like.
286    
287    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
288    
289            * WebIDL.pm: Real support for extended attributes.
290            Support for extended attributes with arguments.
291    
292    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
293    
294            * WebIDL.pm: Support for |exception| syntax.
295            (Interface->idl_text): Tentative support for inheritances.
296    
297    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
298    
299            * WebIDL.pm: Hierarchical scoped name support was broken.
300            Support for raises, setraises, and getraises syntaxes.
301    
302    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
303    
304            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
305            proper support for types, extended attributes, and exceptions yet).
306            WebIDL parser, version 1 (no support for exceptions yet,
307            no proper support for extended attributes yet).
308    
309    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
310    
311            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
312    
313    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
314    
315            * WebIDL.pm: Support for valuetype and const.
316    
317    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
318            
319            * WebIDL.pm: New module.
320    
321    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
322    
323            * Makefile (Entities.html): URI changed.
324    
325    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
326    
327            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
328    
329    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
330    
331            * HTML.pm.src (_get_next_token): A parse error was missing.
332    
333    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
334    
335            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
336            spec (commented out). (HTML5 revision 1711).
337    
338    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
339    
340            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
341    
342            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
343            as UTF-16 (HTML5 revision 1701).
344    
345    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
346    
347            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
348            1692).
349    
350    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
351    
352            * HTML.pm.src: The secondary insertion mode used when switching
353            to foreign content is the "in body" insertion mode (HTML5 revision
354            1696).
355    
356    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
357    
358            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
359            revision 1697).
360    
361    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
362    
363            * HTML.pm.src: Support for end-of-file token in foreign content
364            insertion mode (HTML5 revision 1693).  Update SVG camelCase
365            attribute list (HTML5 revision 1700).  <textarea> closes
366            </select> (HTML5 revision 1699).  More start tags close in
367            foreign content insertion mode (HTML5 revision 1698).
368    
369    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
370    
371            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
372    
373    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
374    
375            * HTML.pm.src: More robust charset parameter detection (HTML5
376            revision 1674).
377    
378    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
379    
380            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
381            revision 1676).
382    
383    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
384    
385            * HTML.pm.src: Ignore language part of public identifiers for
386            quirks mode detection (HTML5 revision 1679).
387    
388    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
389    
390            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
391            revision 1685).
392    
393    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
394    
395            * HTML.pm.src: Support for EOF in new states for tags (HTML5
396            revision 1684).
397    
398    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
399    
400            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
401            work (HTML5 revision 1690).
402    
403    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
404    
405            * HTML.pm.src (_tree_construction_main): Change handling of
406            end tags in head insertion modes (HTML5 revision 1686).
407            (parse_char_string): Bug fix for non-utf8 character string handling.
408            (parse_char_stream): |ungetc| does not work well for this context.
409    
410    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
411    
412            * HTML.pm.src (parse_byte_string): Redefined to invoke
413            |parse_byte_stream|.
414            (parse_byte_stream): New method.
415    
416    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
417    
418            * HTML.pm.src (parse_byte_string): Fix the column number reported
419            by encoding layer error reporter.
420    
421    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
422    
423            * HTML.pm.src (parse_byte_string): Use streaming decoder
424            rather than converting the whole byte string and then parsing.
425            Propagate errors in character encoding layer.
426            (get_next_token): Precise error reporting for |bare stago| error.
427    
428    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
429    
430            * HTML.pm.src (parse_char_stream): New method.
431            (parse_char_string): This method is now defined as an invocation
432            of the |parse_char_stream| method.
433    
434    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
435    
436            * HTML.pm.src (parse_byte_string): Report various status
437            of the sniffing as info-level errors.  Support for new
438            decoding framework in parser resetting.
439            (new): Various default error levels were not set.
440    
441    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
442    
443            * HTML.pm.src (parse_byte_string): HTML5 encoding sniffing
444            algorithm, except for the actual sniffing, is implemented
445            with new framework with Message::Charset::Info.
446    
447    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
448    
449            * CacheManifest.pm (_parse): Drop fragment identifiers from
450            URIs in fallback section (HTML5 revision 1596).
451    
452    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
453    
454            * Makefile (Entities.html): URI has changed.
455    
456    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
457    
458            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
459            1553).
460    
461    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
462    
463            * ContentChecker.pm: Noted that those returned in |table| are
464            no longer table elements, but table objects returned
465            by Whatpm::HTMLTable.
466    
467            * HTMLTable.pm (form_table): Return table element node
468            as |$table->{element}|.
469            (assign_header): Support for the |headers=""| attribute.
470    
471    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
472    
473            * HTMLTable.pm (assign_header): New function; first version
474            with no support for headers="".
475            (form_table): Include table width and height to the returned
476            table object for convenience.  Indexing in column assignment was wrong.
477            Set whether a data cell is empty or not for convenience.
478    
479    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
480    
481            * HTMLTable.pm: More robust caption support (HTML5 revision 1393).
482    
483    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
484    
485            * HTMLTable.pm: How table model errors are detected is
486            changed (HTML5 revision 1387).
487    
488    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
489    
490            * HTMLTable.pm: The algorithm now moves |tfoot| elements
491            to the end of the table (HTML5 revision 1380).
492    
493    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
494    
495            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
496            of 1-based (HTML5 revision 1376).
497    
498    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
499    
500            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
501            and "Content-Encoding" no longer prevent sniffing (HTML5
502            revision 1288).
503    
504    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
505    
506            * ContentType.pm: Skip BOMs in feed or HTML algorithm (HTML5 revision
507            1282).
508    
509    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
510    
511            * ContentChecker.pm: Support for global attributes.
512            Status of XML specs are added.
513    
514    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
515    
516            * ContentChecker.pm (check_element): Support for |template=""|
517            and |ref=""| attribute (referent element type checking).
518    
519    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
520    
521            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
522            1500) is implemented (except for IDNA part and URI-scheme-specific
523            knowledge).  Line number counting was wrong for LF-only
524            documents.
525    
526    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
527    
528            * HTML.pm.src: Raise a parse error for any disallowed
529            character (HTML5 revision 1263).
530    
531    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
532    
533            * mkentitylist.pl: Support for new HTML5 entity table format (the
534            definition for |AElig;| was missing).
535    
536    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
537    
538            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
539            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
540    
541    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
542    
543            * HTML.pm.src: List of element names that close foreign content
544            insertion mode is added (HTML5 revisions 1412 and 1418).
545    
546    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
547    
548            * HTML.pm.src: Support for |mglyph| and |malignmark|
549            elements (HTML5 revision 1410).
550    
551    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
552    
553            * HTML.pm.src: Support for new long MathML entities (HTML5
554            revision 1406).
555    
556    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
557    
558            * HTML.pm.src: CDATA section support for MathML and SVG
559            elements (HTML5 revision 1404 and 1420).
560    
561    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
562    
563            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
564            elements (HTML5 revision 1404).  Unused !!!macro definitions
565            are removed.
566    
567    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
568    
569            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
570            are raised is changed (HTML5 revision 1404).
571    
572    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
573    
574            * HTML.pm.src: Category sets that are no longer used are removed.
575    
576    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
577    
578            * HTML.pm.src: The ->[1] property of stack entries is now
579            replaced by constants representing element category.
580    
581    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
582    
583            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
584            for error reporting.  (This is a preparation for using constant
585            value for ->[1].)
586    
587    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
588    
589            * RDFXML.pm: Typo fixed.
590    
591    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
592    
593            * ContentChecker.pm: |fact_level| is now treated
594            the same as |must_level|, i.e. level = |m|.
595            (check_element): Make list of URIs in the DOM.
596    
597    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
598    
599            * RDFXML.pm: Language accessor implemented.
600            Local (null-namespace) attribute support.
601    
602    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
603    
604            * RDFXML.pm: Factored out ID checking code.
605    
606    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
607    
608            * RDFXML.pm: TODO items noted.  Validation of ID
609            and URI attributes is implemented.  Warn if unknown
610            value is used in rdf:parseType="" attribute.
611    
612            * URIChecker.pm (check_rdf_uri_reference): New function.
613    
614    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
615    
616            * RDFXML.pm: bnodeid implemented.  Relative references
617            are now resolved.
618    
619    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
620    
621            * ContentChecker.pm: RDF reification implemented.
622    
623            * RDFXML.pm: undef vs false bug fixed.
624            Reification implemented.
625    
626    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
627    
628            * RDFXML.pm: s/id/ID/ for attribute name.
629            The |node| arguments are added for |ontriple| calls.
630            Too many "attribute not allowed" errors were raised.
631    
632            * ContentChecker.pm: Initial experimental support for rdf:RDF
633            element.
634    
635    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
636    
637            * RDFXML.pm: New module.
638    
639    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
640    
641            * HTML.pm.src (set_inner_html): Line/column number
642            code was still the old one.
643    
644    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
645    
646            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
647            errors.  Line/column markings for DOCTYPE, comment, and
648            character tokens are reintroduced; otherwise,
649            error location for "not HTML5" error and errors
650            for implied elements are not attached.
651    
652    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
653    
654            * HTML.pm.src: Set line/column numbers to attributes.
655    
656            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
657            (value): Setter implemented.
658    
659            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
660    
661    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
662    
663            * HTML.pm.src: Unused line/column markings are removed.
664    
665    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
666    
667            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
668            flag, which is no longer used.
669    
670    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
671    
672            * HTML.pm.src: Set line/column information to element nodes.
673    
674            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
675            and !!!insert-element-t): Set line/column information to
676            element nodes.
677    
678    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
679    
680            * HTML.pm.src (_get_next_token): The first "<" character
681            in "<?", "<>", or "</>" should be the error point.
682    
683    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
684    
685            * HTML.pm.src: Some more fixes on error position reporting.
686    
687    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
688    
689            * HTML.pm.src: Token-level precise error reporting.
690    
691    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
692    
693            * HTML.pm.src: Preparation for more precise error point
694            reporting.
695    
696    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
697    
698            * HTML.pm.src: Error type revised.
699    
700    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
701    
702            * HTML.pm.src: Similar code is merged together, again.
703    
704    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
705    
706            * HTML.pm.src: Similar code is merged together.
707    
708    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
709    
710            * mkhtmlparser.pl: Set "level" parameter to parse errors.
711    
712            * HTML.pm.src: Code refined.
713    
714    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
715    
716            * HTML.pm.src: |</body>| treatment has been changed (HTML5
717            revision 1348).  Note that I really don't know whether this makes
718            any difference in the black-box behavior of the parser.
719    
720    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
721    
722            * HTML.pm.src: New end-of-file token implementation (HTML5
723            revision 1348).
724    
725    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
726    
727            * HTML.pm.src: |applet| support (HTML5 revision 1347).
728    
729    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
730    
731            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
732    
733    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
734    
735            * HTML.pm.src: Support for |<input>| in the "in select" insertion
736            mode and support for the "in select in table" insertion mode (HTML5
737            revision 1342).
738    
739    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
740    
741            * HTML.pm.src: No foster parenting for <script> and <style>
742            in non-tainted <table>s (HTML5 revision 1336).
743    
744    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
745    
746            * HTML.pm.src: Ignore white space characters between <html>
747            and <head> (HTML5 revision 1332).
748    
749    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
750    
751            * HTML.pm.src: Treat <input type=hidden> as if it were a
752            white space (HTML5 revision 1331).
753    
754    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
755    
756            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
757            element (HTML5 revision 1330).
758    
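The leading-newline rule in the entry above can be sketched as follows.  This is a minimal Python illustration, not the Perl code in HTML.pm.src; the function name |initial_text| and the set |LF_SKIPPING_ELEMENTS| are hypothetical, and the set assumes that |pre| and |textarea| follow the same rule as |listing|.

```python
# Hypothetical helper: elements whose first character, if U+000A, is ignored.
LF_SKIPPING_ELEMENTS = {"pre", "listing", "textarea"}


def initial_text(element_name: str, text: str) -> str:
    """Return the text to insert right after the element's start tag.

    Only a single leading U+000A is dropped; later newlines are kept.
    """
    if element_name in LF_SKIPPING_ELEMENTS and text.startswith("\n"):
        return text[1:]
    return text
```

Only the first line feed is removed, so an author who wants a visible blank first line can write two newlines after the start tag.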
759    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
760    
761            * HTML.pm.src: <title> is always appended to the current
762            element (HTML5 revision 1328).
763    
764    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
765    
766            * HTML.pm.src: White space in tainted tables is moved
767            into foster parents (HTML5 revision 1326).
768    
769    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
770    
771            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
772            revision 1321).
773    
774    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
775    
776            * HTML.pm.src: |</h/n/>| case code rearranged to align with
777            the spec (HTML5 revision 1320).  Note that we finally complete
778            all of HTML5 revision 1320 changes.
779    
780    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
781    
782            * HTML.pm.src: |</form>| now works similarly to |</div>| for unclosed
783            tags (HTML5 revision 1320).
784    
785    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
786    
787            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
788    
789    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
790    
791            * HTML.pm.src: A "generate implied end tags" code (t409.1)
792            could not be reached so that it is now removed (HTML5 revision
793            1320).
794    
795    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
796    
797            * HTML.pm.src: Code for the case of |</div>| and so on is revised
798            to align with new spec text (HTML5 revision 1320).
799    
800    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
801    
802            * HTML.pm.src: Remove strange |if| condition; however, it should
803            have had no harm in theory.
804    
805    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
806    
807            * HTML.pm.src (_tree_construction_main): '</p>' in body
808            case is split from other end tags for the preparation
809            of implementing HTML revision 1320.
810    
811    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
812    
813            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
814            revision 1320).
815    
816    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
817    
818            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
819            and "div" (HTML5 revision 1318).  Add comments to where
820            |form| pointer association codes should be inserted (HTML5 revision
821            1319).
822    
823    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
824    
825            * HTML.pm.src: <html> treatment refined (HTML5 revision 1314).
826    
827    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
828    
829            * HTML.pm.src: Since the case t268 should never be reached (there
830            are no other token types), it is replaced by a |die| statement.
831    
832    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
833    
834            * HTML.pm.src: Typo fixed.
835    
836    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
837    
838            * HTML.pm.src (_tree_construction_initial): Some limited quirks
839            doctypes were not uppercased for comparison.
840    
841    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
842    
843            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
844            are added.
845    
846    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
847    
848            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
849            are set.  Cases that are unlikely to be reached are noted as such.
850    
851    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
852    
853            * HTML.pm.src: Checkpoints for debugging are added.
854    
855            * mkhtmlparser.pl: Support for |!!!cp| syntax.
856    
857    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
858    
859            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
860            for simplicity.
861    
862    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
863    
864            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
865            revision 1307).
866    
867    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
868    
869            * HTML.pm.src (_get_next_token): The places where the /incorrect/
870            flag is set are changed (HTML5 revision 1305).
871    
872    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
873    
874            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
875            1292).  Entities are not parsed in comment-like part in RCDATA
876            elements (HTML5 revision 1294).  Allow bare & at the end
877            of attribute value literals (HTML5 revision 1296).  More
878            quirks mode doctypes (HTML5 revision 1302).  Require spaces
879            between attributes and ban attribute names or unquoted
880            attribute values containing single or double quotes (HTML5
881            revision 1303).
882    
883    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
884    
885            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
886            and related errors unless it is an HTML document (though the spec
887            is unclear on whether it is applied to XHTML document).
888    
889            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
890            are added.
891    
892    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
893    
894            * HTML.pm.src: s/local_name/manakai_local_name/g.
895    
896    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
897    
898            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
899    
900            * HTML.pm.src: |charset| in |content| attribute is
901            case-insensitive (HTML5 revision 1270).
902    
903    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
904    
905            * ContentChecker.pm: New status constants are added.
906            ($ElementDefault): |status| added.
907            (check_element): Err for non-standard or deprecated elements.
908            (_attr_status_info): For non-standard or deprecated attributes.
909    
910    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
911    
912            * ContentChecker.pm (_attr_status_info): New internal method.
913    
914    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
915    
916            * ContentChecker.pm (check_element): Element standardized
917            status information is now dispatched.
918    
919    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
920    
921            * ContentChecker.pm (check_element): Fix |del|-and-significant
922            problem by adding some more arguments.
923    
924    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
925    
926            * ContentChecker.pm (check_element): Use context of
927            container-for-the-purpose-of-content-model element (not transparent
928            element) for |check_child_element| calling and significant
929            text flag marking.  This reintroduces the |<del>|-and-significant
930            problem.
931    
932    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
933    
934            * ContentChecker.pm (check_element): Make semi-transparent
935            elements ignored for the purpose of phase changes in
936            content model checking.
937    
938    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
939    
940            * ContentChecker.pm (check_element): In-element state
941            was not properly managed for transparent cases.
942    
943    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
944    
945            * ContentChecker.pm (check_element): Support for |video|
946            and |audio| as semi-transparent elements.
947    
948    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
949    
950            * ContentChecker.pm ($HTMLSemiTransparentElements): New.
951            (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
952            Support for |html:object| as a semi-transparent element.
953    
954    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
955    
956            * ContentChecker.pm (check_element): The way to traverse
957            the tree is entirely revised to make it easier to track
958            the state of ancestors/descendants.  As a result of this
959            revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
960            support for content model checking for HTML elements |figure|,
961            |object|, |video|, and |audio| and checking for XML elements (and
962            some XMLNS checkings) are dropped for now.  They will be
963            reimplemented in due course.
964    
965    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
966    
967            * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
968            token (HTML5 revision 1225).
969    
970    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
971    
972            * ContentChecker.pm ({unsupported_level}): New value.
973    
974            * HTML.pm.src: Save whether |meta| |content| attribute
975            contains character references or not.
976    
977    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
978    
979            * ContentChecker.pm (_get_children): (Incomplete) attempt to
980            implement significant content checking for contents
981            with |del| elements.
982    
983    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
984    
985            * ContentChecker.pm ($HTMLTransparentElements): More
986            elements are added.
987            (_get_children): HTML |object| elements are now semi-transparent.
988    
989            * NanoDOM.pm (manakai_html, manakai_head): New methods.
990    
991    2008-02-16  Wakaba  <wakaba@suika.fam.cx>
992    
993            * CacheManifest.pm: HTML5 revision 1211 implemented.
994    
995            * CacheManifest.pod: Updated.
996    
997    2008-02-10  Wakaba  <wakaba@suika.fam.cx>
998    
999            * ContentChecker.pm (check_document, check_element): Support
1000            for second argument ($onsubdoc).
1001            (_get_css_parser): Removed (now it is part of WDCC).
1002    
1003    2008-02-09  Wakaba  <wakaba@suika.fam.cx>
1004    
1005            * ContentChecker.pm (_get_css_parser): New.
1006    
1007    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1008    
1009            * ContentChecker.pm ($AnyChecker): Old way to add child elements
1010            for checking had been used.
1011    
1012    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1013    
1014            * ContentChecker.pm (check_element): New todo item type |descendant|.
1015    
1016    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1017    
1018            * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1019            specced in RFC 2046 and RFC 4288.
1020            (application/atom+xml): Definition added.
1021    
1022    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1023    
1024            * URIChecker.pm: Make RFC 3986 should-level errors
1025            warnings (rather than SHOULD-level errors).
1026    
1027    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1028    
1029            * NanoDOM.pm (get_user_data, set_user_data): New methods.
1030    
1031            * HTML.pm.src: A flag for character references in attribute
1032            values is added.  Set |manakai_has_reference| user data
1033            to |charset| attribute.
1034    
1035    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1036    
1037            * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1038            attributes.
1039    
1040            * ContentChecker.pm (check_document): Warn if charset requirements
1041            cannot be tested.
1042    
1043    2007-11-19  Wakaba  <wakaba@suika.fam.cx>
1044    
1045            * HTML.pm.src (parse_byte_string): Detect charset
1046            by universalchardet if charset parameter is not specified.
1047    
1048            * Makefile (Charset-all, Charset-clean): New rules.
1049    
1050    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1051    
1052            * ContentChecker.pm (check_document): Check the existence
1053            of character encoding declaration and selection of encoding
1054            for HTML document.
1055    
1056    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1057    
1058            * ContentType.pm (get_sniffed_type): Return also the official
1059            type in list context.
1060    
1061    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1062    
1063            * ContentType.pm: Sniffing with leading white space
1064            ignoring (HTML5 revisions 1013 and 1016).
1065    
1066    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1067    
1068            * ContentType.pm: HTML5 revision 1013 changes, except for leading
1069            white spaces, are implemented.
1070    
1071    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1072    
1073            * HTML.pm.src (parse_char_string): Set |inner_encoding|
1074            attribute if possible.
1075    
1076    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1077    
1078            * HTML.pm.src (parse_byte_string): New method.
1079            (parse_char_string): New alias for |parse_string|.
1080            (main phase): Invoking "change the encoding" algorithm if desired.
1081    
1082            * HTML.pod: Updated.
1083    
1084    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1085    
1086            * HTML.pod (get_inner_html): Removed.
1087    
1088            * Makefile (HTML-all, HTML-clean): New.
1089    
1090    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1091    
1092            * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1093    
1094    2007-11-08  Wakaba  <wakaba@suika.fam.cx>
1095    
1096            * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1097    
1098    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1099    
1100            * Makefile: |CacheManifest.html| is added.
1101    
1102            * CacheManifest.pod: New file.
1103    
1104    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1105    
1106            * CacheManifest.pm: New module.
1107    
1108    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1109    
1110            * HTML.pm.src: Support for application cache selection algorithm
1111            callback.
1112    
1113    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1114    
1115            * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1116    
1117    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1118    
1119            * Makefile (clean): New rule.
1120    
1121            * NanoDOM.pm (public_id, system_id): New attributes.
1122    
1123    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1124    
1125            * Makefile (CSS-all, CSS-clean, clean): New rules.
1126    
1127    2007-10-14  Wakaba  <wakaba@suika.fam.cx>
1128    
1129            * ContentChecker.pm (check_document): Support for
1130            new |is_xml_root| flag.
1131            (check_element): Support for new |pluses| state.
1132            (_add_pluses): New method.
1133            (_remove_minuses): Support for new |minus| item.
1134    
1135    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1136    
1137            * ContentChecker.pm: Raise specific error for invalid
1138            root element.
1139    
1140    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1141    
1142            * ContentChecker.pm: Set level values for later uses.
1143    
1144    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1145    
1146            * ContentChecker.pm: Support for language tag validation.
1147    
1148    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1149    
1150            * LangTag.pm (check_rfc3066_language_tag): New method.
1151    
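As an illustration of what a check like |check_rfc3066_language_tag| has to cover at minimum, the RFC 3066 grammar (a primary subtag of 1 to 8 ASCII letters, followed by any number of hyphen-separated subtags of 1 to 8 ASCII letters or digits) can be tested as below.  This is a Python sketch of the syntactic layer only; the name |is_rfc3066_syntax| is hypothetical, and whatever further checks the Perl method performs are not reproduced here.

```python
import re

# RFC 3066: Language-Tag = Primary-subtag *( "-" Subtag )
#           Primary-subtag = 1*8ALPHA
#           Subtag = 1*8(ALPHA / DIGIT)
_RFC3066_TAG = re.compile(r"[A-Za-z]{1,8}(?:-[A-Za-z0-9]{1,8})*")


def is_rfc3066_syntax(tag: str) -> bool:
    """Return True if |tag| matches the RFC 3066 Language-Tag grammar."""
    return _RFC3066_TAG.fullmatch(tag) is not None
```

A tag such as |en-US| passes, while |en_US| fails because the underscore is not a valid separator.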
1152    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1153    
1154            * LangTag.pm: New module.
1155    
1156    2007-09-04  Wakaba  <wakaba@suika.fam.cx>
1157    
1158            * HTML.pm.src: Some error types were wrong.
1159    
1160    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1161    
1162            * CSS/: New directory.
1163    
1164    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1165    
1166            * ContentChecker.pm (_check_get_children): Support
1167            for |noscript| in |head|.
1168    
1169    2007-08-12  Wakaba  <wakaba@suika.fam.cx>
1170    
1171            * URI/: New directory.
1172    
1173    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1174    
1175            * HTML.pm.src: Tokenizer's states are now represented in
1176            number.
1177    
1178    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1179    
1180            * HTML.pm.src: |or|s for insertion modes are replaced
1181            by |&|s.
1182    
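The idea behind replacing |or| chains with |&| tests, once insertion modes are numbers, can be sketched like this.  The sketch is in Python rather than Perl, and every constant name and bit value in it is hypothetical; the actual constants in HTML.pm.src may be laid out differently.

```python
# Hypothetical category bits shared by related insertion modes.
TABLE_IMS = 0b0001_0000
BODY_IMS = 0b0010_0000

# Hypothetical insertion modes: a category bit plus a distinguishing low part.
IN_TABLE_IM = TABLE_IMS | 0b0001
IN_ROW_IM = TABLE_IMS | 0b0010
IN_BODY_IM = BODY_IMS | 0b0001
IN_CELL_IM = BODY_IMS | 0b0010


def is_table_mode(mode: int) -> bool:
    """One bitwise AND replaces `mode == IN_TABLE_IM or mode == IN_ROW_IM`."""
    return bool(mode & TABLE_IMS)
```

With this layout a single mask test covers a whole family of modes, so adding a mode to a family does not require touching every comparison.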
1183    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1184    
1185            * HTML.pm.src: Token types are now represented in number.
1186    
1187    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1188    
1189            * ContentType.pm (SEE ALSO): Updated.
1190    
1191            * HTML.pm.src: Insertion modes are now represented in number.
1192    
1193    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1194    
1195            * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1196            999) is implemented.
1197    
1198    2007-08-08  Wakaba  <wakaba@suika.fam.cx>
1199    
1200            * ContentType.pm: Sniffing for |<script| (HTML5 revision
1201            983) is implemented.
1202    
1203    2007-08-06  Wakaba  <wakaba@suika.fam.cx>
1204    
1205            * ContentChecker.pod: New documentation.
1206    
1207            * Makefile: A rule for |ContentChecker.html| is added.
1208    
1209            * ContentChecker.pm: A pod "LICENSE" section is added.
1210    
1211            * NanoDOM.pm ($VERSION): New variable.
1212    
1213    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1214    
1215            * H2H.pm: |b|, |i|, and |sub| are added to the
1216            list of allowed HTML elements.
1217    
1218    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1219    
1220            * H2H.pm: |samp| is added to the list of allowed
1221            HTML elements.
1222    
1223            * URIChecker.pm (check_iri): New.
1224            (check_iri_reference): Error type for IRI reference
1225            syntax error is changed.
1226    
1227    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1228    
1229            * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1230            is added.
1231            (check_document): Load appropriate module before validation.
1232    
1233    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1234    
1235            * ContentChecker/: New directory.
1236    
1237    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1238    
1239            * ContentChecker.pm: HTML |time| element is implemented.
1240    
1241            * HTMLTable.pm: Comments are updated as HTML5 is revised.
1242    
1243    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1244    
1245            * ContentChecker.pm (check_document): Return value
1246            even if no document element is found.
1247    
1248    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1249    
1250            * HTML.pm.src: |$in_body| is no longer a function.
1251    
1252    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1253    
1254            * HTML.pm.src: The |$in_body| code has been moved down.
1255    
1256    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1257    
1258            * HTML.pm.src: The "trailing end" insertion mode
1259            is split into "after html body" and "after html frameset"
1260            insertion modes.  Their codes are merged with "after body"
1261            and "after frameset" codes.  |$previous_insertion_mode|
1262            has been removed.  "after frameset" code is
1263            merged with "in frameset" code.
1264    
1265    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1266    
1267            * HTML.pm.src: The "before head" insertion mode is
1268            merged with the "in head" insertion mode.
1269    
1270    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1271    
1272            * HTML.pm.src: Cases in "in head" insertion mode are
1273            reorganized.
1274    
1275    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1276    
1277            * HTML.pm.src: Some cases in "in table" insertion mode
1278            are merged.
1279    
1280    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1281    
1282            * HTML.pm.src: The "in row" insertion mode is merged
1283            with "in table" insertion mode.
1284    
1285    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1286    
1287            * HTML.pm.src: The "in table" and "in table body" insertion
1288            modes are merged.
1289    
1290    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1291    
1292            * HTML.pm.src: There is no "in table head" or "in table foot"
1293            insertion mode!
1294    
1295    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1296    
1297            * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1298            now directly invoke the handler.
1299    
1300    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1301    
1302            * HTML.pm.src: Code for "in cell" insertion mode
1303            is merged into the "in body" insertion mode code.
1304    
1305    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1306    
1307            * HTML.pm.src: Codes for "in body" and "in caption"
1308            insertion modes are merged.
1309    
1310    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1311    
1312            * HTML.pm.src: Two |!!!next-token|s were missing.
1313    
1314    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1315    
1316            * HTML.pm.src: Use numeric constant for |{content_mode}|
1317            instead of string constant for |{content_model_flag}|.
1318    
1319    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1320    
1321            * HTML.pm.src: Add the name of the attribute
1322            to the "duplicate attribute" error.
1323    
1324    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
1325    
1326            * ContentChecker.pm: Return the |class| node list.
1327    
1328    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
1329    
1330            * ContentChecker.pm: Return the |id| node list.
1331    
1332            * HTML.pm.src: A typo is fixed.
1333    
1334    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1335    
1336            * ContentChecker.pm: Drop wrong |level => 'error'| specification
1337            from "in HTML:xml:lang" error.  Character position
1338            is now the last part of the error type in the URI error
1339            description.  Report "unsupported" status for language
1340            tags, media queries, script codes, and style sheets.
1341    
1342    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1343    
1344            * ContentChecker.pm: Report error if |xml:lang|
1345            in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1346            in XML.
1347    
1348            * NanoDOM.pm (Attr.owner_document): New attribute.
1349    
1350    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1351    
1352            * HTML.pm.src: The character immediately following
1353            a bare |hcro| was discarded.  Fix handling of
1354            entity references in attribute values.
1355    
1356    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1357    
1358            * HTML.pm.src (main and trailing end phases): Token
1359            types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1360            factored out.  Error types |in html:#DOCTYPE|
1361            and |after html:#DOCTYPE| are merged
1362            into |DOCTYPE in the middle|.  |</frameset>|
1363            in fragment parsing mode changed the
1364            insertion mode.
1365    
1366    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1367    
1368            * HTML.pm.src: |$phase| has been removed; The |trailing end|
1369            phase is now an insertion mode.  Treatments for white
1370            space character tokens were incorrect for some
1371            insertion modes.  An old |meta| case was not removed.
1372    
1373    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1374    
1375            * HTML.pm.src: |meta| charset declaration extraction
1376            implemented (but changing the encoding is not yet implemented :-).
1377    
1378    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1379    
1380            * Charset/: New directory.
1381    
1382    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1383    
1384            * H2H.pm: New Perl module (created from
1385            manakai's H2H.dis).
1386    
1387    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1388    
1389            * XMLSerializer.pm: New Perl module (created from
1390            manakai's SimpleLS.dis).
1391    
1392    2007-07-07  Wakaba  <wakaba@suika.fam.cx>
1393    
1394            * HTML.pm.src: |<!---x-->| was not processed correctly.
1395    
1396    2007-07-01  Wakaba  <wakaba@suika.fam.cx>
1397    
1398            * HTML.pm.src: Report correct error message
1399            for |<body></div></body>|.
1400    
1401    2007-07-01  Wakaba  <wakaba@suika.fam.cx>
1402    
1403            * HTMLTable.pm: An error description was incorrect.
1404    
1405    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
1406    
1407            * ContentChecker.pm: Return |{term}| list.
1408    
1409    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
1410    
1411            * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1412            nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1413            and <title> outside of head).
1414    
1415    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
1416    
1417            * IMTChecker.pm: Report warning for unregistered
1418            and private types/subtypes.
1419    
1420            * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1421            URIChecker.pm, HTMLTable.pm: Error messages are now
1422            consistent; they are all listed in
1423            <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1424    
1425    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
1426    
1427            * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1428            now raises an error.  |<datalist>| is implemented.
1429            Attribute checkers for |<command>| and |<menu>| are
1430            added.  Support for |contextmenu| global attribute
1431            is added.
1432    
1433    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
1434    
1435            * HTML.pm.src (_reset_insertion_mode): Interpretation
1436            of Step 3 has been changed.
1437    
1438    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
1439    
1440            * HTML.pm.src: Late |<html>| parse error is implemented.
1441    
1442    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1443    
1444            * URIChecker.pm (check_iri_reference): A |decode| method name was
1445            incorrect.
1446    
1447            * ContentChecker.pm: Support for the |footer| element.
1448            Check URI syntax for space-separated URI attributes.
1449            Support for the |tabindex| attribute.  Support
1450            for |datetime| attribute.
1451    
1452    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1453    
1454            * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1455            character references).  HTML5 revision 1.146 (white space
1456            characters before root start tag).  HTML5 revision
1457            1.148 (named character references in attribute values).
1458            HTML5 revision 1.152 (<plaintext>.innerHTML get).
1459    
1460    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1461    
1462            * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1463            in <head>).
1464    
1465    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1466    
1467            * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1468            in body).
1469    
1470            * ContentChecker.pm: HTML5 revision 938 (scoped="").
1471    
1472    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1473    
1474            * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1475            or RCDATA in fragment parsing mode).  HTML5 revision
1476            924 (<!--> and <!--->).  HTML5 revision 926 (hn in hn).
1477    
1478    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1479    
1480            * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1481            for <pre> and <textarea>).
1482    
1483    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1484    
1485            * HTML.pm.src: HTML5 revision 920 (<isindex>).
1486    
1487    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1488    
1489            * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1490    
1491    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1492    
1493            * HTML.pm.src: HTML5 revision 916 (</body>).
1494            HTML5 revision 917 (conforming bare &).
1495    
1496    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1497    
1498            * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1499    
1500            * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1501            HTML5 revision 915 (<nobr>).
1502    
1503    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1504    
1505            * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1506            
1507            * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1508            New attributes.
1509    
1510    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1511    
1512            * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1513            in tags).
1514    
1515    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1516    
1517            * .cvsignore: |Entities.html| is added.
1518    
1519            * HTML.pm.src: |$entity_char| is removed and
1520            requires |Whatpm::_NamedEntityList| instead.
1521            HTML5 revision 898 (refc), except that lack of refc
1522            is parse error.
1523    
1524            * mkentitylist.pl: New script.
1525    
1526            * Makefile (all): |_NamedEntityList.pm| is added.
1527            (_NamedEntityList.pm, Entities.html): New rules.
1528    
1529    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1530    
1531            * HTML.pm.src: Parse errors immediately after U+000D
1532            were ignored and U+000D immediately following another
1533            U+000D was not converted to U+000A.
1534    
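The second half of the fix above concerns newline normalization: every U+000D must become U+000A, including a U+000D that directly follows another one, while a U+000D U+000A pair collapses to a single U+000A.  A minimal Python sketch of that behavior, under the assumption that normalization is done in one pass over the input (the function name |normalize_newlines| is hypothetical and this is not the Perl implementation):

```python
def normalize_newlines(text: str) -> str:
    """Convert CR and CR LF to LF, handling consecutive CRs one by one."""
    out = []
    i = 0
    while i < len(text):
        ch = text[i]
        if ch == "\r":
            out.append("\n")
            # Swallow the LF of a CR LF pair so it is not emitted twice.
            if i + 1 < len(text) and text[i + 1] == "\n":
                i += 1
        else:
            out.append(ch)
        i += 1
    return "".join(out)
```

Each U+000D is examined on its own, so the state left by one U+000D cannot hide the next one, which is the kind of case the entry describes.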
1535    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1536    
1537            * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1538            nodes before appended).  Parser was not ready for NULL
1539            parse error and escape flag.
1540    
1541            * NanoDOM.pm (adopt_node): New.
1542    
1543    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1544    
1545            * HTML.pm.src: HTML5 revision 886 (insane comment in
1546            CDATA and RCDATA).  Note that the current implementation
1547            simply repeats what the spec says, which may not be
1548            the best way to do it.
1549    
1550    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1551    
1552            * HTML.pm.src: HTML5 revision 884 (</form> doesn't close
1553            the form element if a descendant element without implied end tag is
1554            still open).
1555    
1556    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1557    
1558            * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1559            space characters non-conforming).
1560    
1561    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1562    
1563            * HTML.pm.src: An error message was incorrect.
1564            HTML5 revision 869 (C1 character references).
1565            
1566    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1567    
1568            * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1569            a |textarea| is removed).
1570    
1571    2007-06-05  Wakaba  <wakaba@suika.fam.cx>
1572    
1573            * NanoDOM.pm (get_attribute_node_ns): New method.
1574    
1575            * ContentChecker.pm: |script| |async| and |defer|
1576            no longer require |src|.  |async| MUST NOT be
1577            specified if |defer|.  (HTML5 revision 858).
1578    
1579    2007-05-30  Wakaba  <wakaba@suika.fam.cx>
1580    
1581            * HTML.pm.src: |<form><form>| went into an infinite loop.
1582    
1583    2007-05-27  Wakaba  <wakaba@suika.fam.cx>
1584    
1585            * ContentChecker.pm (html): Set |is_root| (allowed
1586            as a document element) flag on.
1587            (new): Removed.
1588            (check_document): New method.
1589    
1590    2007-05-27  Wakaba  <wakaba@suika.fam.cx>
1591    
1592            * ContentChecker.pm (thead, tfoot): Checker specifications
1593            were incorrect.
1594    
1595    2007-05-27  Wakaba  <wakaba@suika.fam.cx>
1596    
1597            * ContentChecker.pm ($HTMLURIAttrChecker): Include
1598            error position in the |type| option of the error.
1599    
1600            * HTMLTable.pm (form_table): The |$onerror| parameter
1601            is now optional.  Some bugs are fixed.
1602    
1603    2007-05-27  Wakaba  <wakaba@suika.fam.cx>
1604    
1605            * HTMLTable.pm: New module.
1606    
1607            * ContentChecker.pm (table): Invoke table model error checker.
1608    
1609            * NanoDOM.pm (first_child, get_attribute_ns): New.
1610    
1611    2007-05-26  Wakaba  <wakaba@suika.fam.cx>
1612    
1613            * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1614            (link/@rel, a/@rel, area/@rel): Use new checker.
1615    
1616            * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1617    
1618            * _LinkTypeList.pm: New file.
1619    
1620            * mklinktypelist.pl: New file.
1621    
1622            * .cvsignore: |RelExtensions.html| added.
1623    
1624            * NanoDOM.pm (child_nodes): Returns an empty array
1625            for non-child-containing node types.
1626            (text_content): New attribute.
1627    
1628    2007-05-26  Wakaba  <wakaba@suika.fam.cx>
1629    
1630            * IMTChecker.pm: New module.
1631    
1632            * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1633            to test parameter value validity.
1634    
1635            * HTML.pm.src ($style_start_tag): Attributes were
1636            discarded.
1637    
1638    2007-05-25  Wakaba  <wakaba@suika.fam.cx>
1639    
1640            * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1641    
1642    2007-05-25  Wakaba  <wakaba@suika.fam.cx>
1643    
1644            * URIChecker.pm: All recommendations from RFC 3986
1645            and RFC 3987 are listed (not all testable items are checked yet).
1646    
1647    2007-05-25  Wakaba  <wakaba@suika.fam.cx>
1648    
1649            * URIChecker.pm: New module.
1650    
1651    2007-05-20  Wakaba  <wakaba@suika.fam.cx>
1652    
1653            * ContentChecker.pm: Now most attributes are implemented
1654            or associated with some placeholder.
1655            ($ElementDefault): Warn unknown attributes
1656            for unknown elements as "attribute not supported".
1657            ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1658            ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1659            (|a| attribute checker): Reimplemented.
1660    
1661    2007-05-20  Wakaba  <wakaba@suika.fam.cx>
1662    
1663            * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1664            ($HTMLAttrChecker): Event handler content attributes
1665            are added.
1666            (link, embed): Required attribute is now checked.
1667            (embed): Unknown local attributes are no longer warned.
1668    
1669    2007-05-20  Wakaba  <wakaba@suika.fam.cx>
1670    
1671            * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1672            ($HTMLIMTAttrChecker): New checker.
1673            (link@rel, link@href, link@type, style@type,
1674            a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1675            object@data, object@type, source@src, source@type, area@alt,
1676            area@shape, area@coords, area@href,
1677            area@ping, area@rel, area@type, script@src,
1678            script@defer, script@async, script@type): Checkers added.
1679    
1680    2007-05-20  Wakaba  <wakaba@suika.fam.cx>
1681    
1682            * ContentChecker.pm: Descendant checking was incorrect.
1683    
1684    2007-05-19  Wakaba  <wakaba@suika.fam.cx>
1685    
1686            * ContentChecker.pm: Support |xml:*| and |xmlns:*|
