/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.16 by wakaba, Sat May 19 11:37:24 2007 UTC revision 1.305 by wakaba, Sun Sep 14 07:19:47 2008 UTC
# Line 1  Line 1 
1    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Make a "bare ero" error for unknown
4            entities point the "&" character.
5    
6    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
7    
8            * HTML.pm.src: It turns out that U+FFFD don't have to
9            be added to the list of excluded characters.
10    
11    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
12    
13            * HTML.pm.src ($char_onerror): Have character decoder's |line|
14            and |column| a higher priority than the one set by the
15            tokenizer's input handler.
16            ($self->{read_until}): Exclude U+FFFD (but this might
17            not be necessary, since now we do line/column fixup in
18            the character decode handle).
19    
20    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
21    
22            * HTML.pm.src: Use |{read_until}| where possible.
23    
24    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
25    
26            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
27            and |manakai_getc_until| to |manakai_read_until| to
28            reduce the number of string copies.
29    
30    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
31    
32            * HTML.pm.src (parse_char_string): Use newly created
33            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
34            standard feature to |open| a string as a filehandle,
35            since Perl's string filehandle seems not supporting |ungetc|
36            method correctly.
37            (parse_char_stream): Define |{getc_until}| method.
38            (DATA_STATE): Experimental support for |getc_until| feature.
39    
40    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
41    
42            * HTML.pm.src: Check points added to newly added branches.
43    
44    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
45    
46            * HTML.pm.src: Remove |{char}|, which is no longer used.
47            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
48            and replaced by |{prev_state}|.
49    
50            * mkhtmlparser.pl: Remove |{char}| feature.
51            Remove |!!!back-next-input-character;| macro.
52    
53    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
54    
55            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
56            entity related tokenizer states in favor of new states
57            implementing the consume character reference algorithm.
58    
59    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
60    
61            * HTML.pm.src: "Consume a character reference" algorithm is
62            now implemented as a tokenizer's state, rather than
63            a method, with minimum changes (more changes will
64            be made, in due course).  "Bogus comment state"'s inner
65            loop gets removed.
66    
67    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
68    
69            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
70            into their own tokenizer states.
71    
72    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
73    
74            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
75            is split into three states.
76    
77    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
78    
79            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
80            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
81            no longer does the tokenizer have to push back next input
82            characters in those states.
83    
84    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
85    
86            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
87            into four states so that no longer does the tokenizer have to push
88            back next input characters in that state.
89    
90    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
91    
92            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
93            which can be used to insert some wrapper between the character
94            stream handle and the tokenizer.  (It is currently not supported
95            for |set_inner_html| for |Element|s).
96    
97    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
98    
99            * HTML.pm.src: Ignore punctuations in charset names.
100    
101    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
102    
103            * ContentChecker.pm: Support for charset-layer error levels.
104    
105            * HTML.pm.src: Don't specify |text| argument for the
106            |chardecode:fallback| error, since it is not the encoding
107            being used alternatively.
108    
109    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
110    
111            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
112    
113    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
114    
115            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
116    
117    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
118    
119            * HTML.pm.src: Bug fix and sync with the spec with regard
120            to after after frameset insertion mode processing (HTML5
121            revision 1909).  Note that the implementation was wrong
122            per the old spec before the r1909 changes.
123    
124    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
125    
126            * HTMLTable.pm: scope=auto algorithm fix synced with the
127            spec (HTML5 revision 2093).
128            ($process_row): Algorithm step numbers synced with the
129            spec (HTML5 revision 2092).
130    
131    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
132    
133            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
134            revision 2094).
135    
136    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
137    
138            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
139    
140    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
141    
142            * HTML.pm.src: '"' and "'" at the end of attribute
143            name (after another attribute) now raise parse error (HTML5
144            revision 2123).  Empty unquoted attribute values are no
145            longer allowed (HTML5 revision 2122).
146    
147    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
148    
149            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
150            revision 2130).
151    
152    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
153    
154            * ContentChecker.pm: |xml:lang| attribute value must be same
155            as |lang| attribute value for HTML elements (HTML5 revision 2062
156            and so on).
157    
158    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
159    
160            * ContentChecker.pm: Error level definition for |xml_id_error|
161            was missing.
162    
163            * URIChecker.pm: The end of the URL should be marked as the
164            error location for an empty path error.  The position
165            between the userinfo and the port components should be
166            marked as the error location for an empty host error.
167    
168    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
169    
170            * URIChecker.pm: Set parameters representing where in the
171            value the error occurs for errors.  Report unknown
172            address format error in warning level, since address
173            formats are rarely added.  Path segments starting with "/.."
174            were misinterpreted as a dot-segment.
175    
176    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
177    
178            * URIChecker.pm (check_iri_reference): Requires
179            |Message::DOM::DOMImplementation|.
180    
181    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
182    
183            * IMTChecker.pm: Updated for the new error reporting architecture.
184    
185            * ContentChecker.pm: Error levels for IMTs are added.
186    
187    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
188    
189            * H2H.pm (_shift_token): Support for unquoted HTML attribute
190            values.
191    
192    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
193    
194            * CacheManifest.pm: Support for new style of error
195            reports.
196    
197            * HTML.pm.src: Set line=1, column=1 to the document node.
198    
199    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
200    
201            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
202            and URL checkers.  Support for more error levels for bogus
203            langauge tag and URL "standards".
204    
205            * LangTag.pm, URIChecker.pm: Support for new style error
206            level reporting.
207    
208    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
209    
210            * ContentChecker.pm: Support for RDF/XML error levels.
211    
212            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
213            specifying.  Error types are revised.
214    
215    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
216    
217            * ContentChecker.pm: All error reporting method calls are
218            renewed.
219    
220    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
221    
222            * HTML.pm.src: All error type names and "text" parameters
223            are revised.  Use new style for "level" specification.
224    
225            * mkhtmlparser.pl: Use new style for "level" specification.
226    
227    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
228    
229            * WebIDL.pm (parse_char_string): Simplified error
230            reporting process for broken ignored valuetype definition.
231            (Valuetype idl_text): Support for special "DOMString" name.
232    
233    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
234    
235            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
236            terminal of the ScopedName is "DOMString", such that whether
237            the last part of the scoped name is "DOMString" or "_DOMString"
238            later.  It is necessary to determine whether a |typedef|
239            definition should be ignored or not.
240            (parse_char_string): Unescape the identifier of
241            exception members.
242            ($resolve): Return undef for builtin types and sequence<T>
243            types (we might not have to do this, however...).
244            (check): Support checking for Exceptions, Valuetypes,
245            and Typedefs.
246            ($serialize_type): Support for "DOMString::::" syntax.
247            (Typedef idl_text): Output Type as "DOMString" if it
248            is really "DOMString" (i.e. its internal representation
249            is "::DOMString::").
250    
251    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
252    
253            * WebIDL.pm ($resolve): New code, based on resolve code
254            for constant types in the |check| method.
255            (check): Support for checking of attributes, operations, and
256            arguments.
257            (Attribute/Operation idl_text): Exception names in getraises,
258            setraises, and raises clauses is serizlied by |$serialize_type|
259            code.
260    
261    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
262    
263            * WebIDL.pm ($integer): Order of selections are changed to match
264            hexadecimal numbers (the original pattern, taken from the spec,
265            was not work for hexadecimal numbers, because the "0" prefix
266            matches to the [0-7]* part (as an empty string) and therefore
267            it does not match with remaining "x..." part of a "0x..." integer
268            literal.
269            ($get_type): It now returns a string, not an array reference,
270            for regular types and |sequence| types (i.e. it in any case
271            returns a string).
272            ($get_next_token): The second item in the array that represents
273            a integer or float token is now a Perl number value, not the
274            original string representation of the number.
275            (check): Support for const value consistency checking.
276            No extended attribute is defined for constants.
277            (Node subclasses): Use simple strings rather than array references
278            for default data type values.
279            ($serialize_type): Type values are now simple strings.
280            (value): If the new attribute value is a false value, then
281            a FALSE value is set to the attribute.
282    
283    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
284    
285            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
286            in its stringified format ("scoped name" as defined in the
287            spec).  Note that future version of this module should not use
288            array references for type values and the |type_text| attribute
289            should be made obsolete.
290            (parse_char_string): Unescape attribute names.
291            (check): Support for checking of whether inherited interfaces
292            are actually defined or not.  Support for checking of whether
293            interface member identifiers are duplicated or not.
294            ($serialize_type): Scoped names are returned as is.  A future
295            version of this code should escape identifiers other than "DOMString",
296            otherwise the idl_text would be non-conforming.
297    
298    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
299    
300            * WebIDL.pm (parse_char_string): Set line/column numbers
301            to generated nodes.  Unescape identifiers.  Extended attributes
302            for Definition's were ignored.
303            (append_child): Set |parent_node| attribute.
304            (parent_node): New attribute.
305            (check): Support interface/exception members.  Support
306            extended attributes.  Support definition identifier uniqueness
307            constraint.
308            (qualified_name): New attribute.
309            (Interface/Exception idl_text): Extended attributes were
310            not prepended to the returned text.
311    
312    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
313    
314            * WebIDL.pm (parse_char_string): Set line/column numbers
315            to interface object experimentally.  s/shift/pop/g, shift
316            would make things wrong.  Support for interface forward
317            declarations was missing.  Broken interface declarations
318            with no block were not ignored entirely.
319            (Whatpm::WebIDL::Node): New abstract class.  This class
320            makes things easier.
321            (child_nodes): New attribute.  Unlike DOM's attribute with
322            same name, this attribute returns a dead list of nodes for
323            simplicity.
324            (get_user_data, set_user_data): New methods.
325            (Module idl_text): A SPACE character should be inserted
326            before the |{| character.
327            (Interface idl_text): Support for interface forward declarations.
328            (is_forward_declaration): New attribute.
329    
330    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
331    
332            * WebIDL.pm (type_text): Better serializer.
333    
334    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
335    
336            * WebIDL.pm: Revise forward-compatible parsing so that
337            it now can handle broken extended attributes and as such.
338    
339    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
340    
341            * WebIDL.pm: Real support for extended attributes.
342            Support for extended attributes with arguments.
343    
344    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
345    
346            * WebIDL.pm: Support for |exception| syntax.
347            (Interface->idl_text): Tentative support for inheritances.
348    
349    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
350    
351            * WebIDL.pm: Hierarchical scoped name support was broken.
352            Support for raises, setraises, and getraises syntaxes.
353    
354    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
355    
356            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
357            proper support for types, extended attributes, and exceptions yet).
358            WebIDL parser, version 1 (no support for exceptions yet,
359            no proper support for extended attributes yet).
360    
361    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
362    
363            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
364    
365    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
366    
367            * WebIDL.pm: Support for valuetype and const.
368    
369    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
370            
371            * WebIDL.pm: New module.
372    
373    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
374    
375            * Makefile (Entities.html): URI changed.
376    
377    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
378    
379            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
380    
381    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
382    
383            * HTML.pm.src (_get_next_token): A parse error was missing.
384    
385    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
386    
387            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
388            spec (commented out). (HTML5 revision 1711).
389    
390    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
391    
392            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
393    
394            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
395            as UTF-16 (HTML5 revision 1701).
396    
397    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
398    
399            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
400            1692).
401    
402    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
403    
404            * HTML.pm.src: The secondary insertion mode used when switching
405            to foreign content is the "in body" insertion mode (HTML5 revision
406            1696).
407    
408    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
409    
410            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
411            revision 1697).
412    
413    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
414    
415            * HTML.pm.src: Support for end-of-file token in foreign content
416            insertion mode (HTML5 revision 1693).  Update SVG camelCase
417            attribute list (HTML5 revision 1700).  <textarea> closes
418            </select> (HTML5 revision 1699).  More start tags close in
419            foreign content insertion mode (HTML5 revision 1698).
420    
421    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
422    
423            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
424    
425    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
426    
427            * HTML.pm.src: More robust charset parameter detection (HTML5
428            revision 1674).
429    
430    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
431    
432            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
433            revision 1676).
434    
435    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
436    
437            * HTML.pm.src: Ignore language part of public identifiers for
438            quriks mode detection (HTML5 revision 1679).
439    
440    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
441    
442            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
443            revision 1685).
444    
445    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
446    
447            * HTML.pm.src: Support for EOF in new states for tags (HTML5
448            revision 1684).
449    
450    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
451    
452            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
453            work (HTML5 revision 1690).
454    
455    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
456    
457            * HTML.pm.src (_tree_construction_main): Change handling of
458            end tags in head insertion modes (HTML5 revision 1686).
459            (parse_char_string): Bug fix for non-utf8 character string handlings.
460            (parse_char_stream): |ungetc| does not work well for this context.
461    
462    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
463    
464            * HTML.pm.src (parse_byte_string): Redefined to invoke
465            |parse_byte_stream|.
466            (parse_byte_stream): New method.
467    
468    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
469    
470            * HTML.pm.src (parse_byte_string): Fix the column number reported
471            by encoding layer error reporter.
472    
473    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
474    
475            * HTML.pm.src (parse_byte_string): Use streaming decoder
476            rather than converting the whole byte string and then parsing.
477            Propagate errors in character encoding layer.
478            (get_next_token): Precise error reporting for |bare stago| error.
479    
480    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
481    
482            * HTML.pm.src (parse_char_stream): New method.
483            (parse_char_string): This method is now defined as an invocation
484            of the |parse_char_stream| method.
485    
486    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
487    
488            * HTML.pm.src (parse_byte_string): Report various status
489            of the sniffing as info-level errors.  Support for new
490            decoding framework in parser resestting.
491            (new): Various default error levels were not set.
492    
493    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
494    
495            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
496            algorithm, except for the actual sniffing, is implemented
497            with new framework with Message::Charset::Info.
498    
499    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
500    
501            * CacheManifest.pm (_parse): Drop fragment identifiers from
502            URIs in fallback section (HTML5 revision 1596).
503    
504    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
505    
506            * Makefile (Entities.html): URI has changed.
507    
508    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
509    
510            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
511            1553).
512    
513    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
514    
515            * ContentChecker.pm: Noted that those returned in |table| are
516            no longer table elements, but table objects returned
517            by Whatpm::HTMLTable.
518    
519            * HTMLTable.pm (form_table): Return table element node
520            as |$table->{element}|.
521            (assign_header): Support for the |headers=""| attribute.
522    
523    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
524    
525            * HTMLTable.pm (assign_header): New function; first version
526            with no support for headers="".
527            (form_table): Include table width and height to the returned
528            table object for covenience.  Indexing in column assignement was wrong.
529            Set whether a data cell is empty or not for convenience.
530    
531    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
532    
533            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
534    
535    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
536    
537            * HTMLTable.pm: How table model errors are detected is
538            changed (HTML5 revision 1387).
539    
540    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
541    
542            * HTMLTable.pm: The algorithm now moves |tfoot| elements
543            to the end of the table (HTML5 revision 1380).
544    
545    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
546    
547            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
548            of 1-based (HTML5 revision 1376).
549    
550    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
551    
552            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
553            and "Content-Encoding" no longer prevent sniffing (HTML5
554            revision 1288).
555    
556    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
557    
558            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
559            1282).
560    
561    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
562    
563            * ContentChecker.pm: Support for global attributes.
564            Status of XML specs are added.
565    
566    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
567    
568            * ContentChecker.pm (check_element): Support for |template=""|
569            and |ref=""| attribute (referent element type checking).
570    
571    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
572    
573            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
574            1500) is implemented (except for IDNA part and URI-scheme-specific
575            knowledge).  Line number counting was wrong for LF-only
576            documents.
577    
578    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
579    
580            * HTML.pm.src: Raise an parse error for any disallowed
581            character (HTML5 revision 1263).
582    
583    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
584    
585            * mkentitylist.pl: Support for new HTML5 entity table format (the
586            definition for |AElig;| was missing).
587    
588    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
589    
590            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
591            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
592    
593    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
594    
595            * HTML.pm.src: List of element names that close foreign content
596            insertion mode is added (HTML5 revisions 1412 and 1418).
597    
598    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
599    
600            * HTML.pm.src: Support for |mglyph| and |malignmark|
601            elements (HTML5 revision 1410).
602    
603    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
604    
605            * HTML.pm.src: Support for new long MathML entities (HTML5
606            revision 1406).
607    
608    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
609    
610            * HTML.pm.src: CDATA section support for MathML and SVG
611            elements (HTML5 revision 1404 and 1420).
612    
613    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
614    
615            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
616            elements (HTML5 revision 1404).  Unused !!!macro definitions
617            are removed.
618    
619    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
620    
621            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
622            are raised is changed (HTML5 revision 1404).
623    
624    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
625    
626            * HTML.pm.src: Category sets that are no longer used are removed.
627    
628    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
629    
630            * HTML.pm.src: The ->[1] property of stack entries are now
631            replaced by constants representing element category.
632    
633    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
634    
635            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
636            for error reporting.  (This is a preparation for using constant
637            value for ->[1].)
638    
639    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
640    
641            * RDFXML.pm: Typo fixed.
642    
643    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
644    
645            * ContentChecker.pm: |fact_level| is now treated
646            as same as |must_level|, i.e. level = |m|.
647            (check_element): Make list of URIs in the DOM.
648    
649    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
650    
651            * RDFXML.pm: Language accessor implemented.
652            Local (null-namespace) attribute support.
653    
654    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
655    
656            * RDFXML.pm: Factored out ID checking code.
657    
658    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
659    
660            * RDFXML.pm: TODO items noted.  Validation of ID
661            and URI attributes is implemented.  Warn if unknown
662            value is used in rdf:parseType="" attribute.
663    
664            * URIChecker.pm (check_rdf_uri_reference): New function.
665    
666    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
667    
668            * RDFXML.pm: bnodeid implemented.  Relative references
669            are now resolved.
670    
671    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
672    
673            * ContentChecker.pm: RDF reification implemented.
674    
675            * RDFXML.pm: undef vs false bug fixed.
676            Reification implemented.
677    
678    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
679    
680            * RDFXML.pm: s/id/ID/ for attribute name.
681            The |node| arguments are added for |ontriple| calls.
682            Too many "attribute not allowed" errors were raised.
683    
684            * ContentChecker.pm: Initial experimental support for rdf:RDF
685            element.
686    
687    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
688    
689            * RDFXML.pm: New module.
690    
691    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
692    
693            * HTML.pm.src (set_inner_html): Line/column number
694            code was old one yet.
695    
696    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
697    
698            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
699            errors.  Line/column markings for DOCTYPE, comment, and
700            character tokens are reintroduced; otherwise,
701            error location for "not HTML5" error and errors
702            for implied elements are not attached.
703    
704    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
705    
706            * HTML.pm.src: Set line/column numbers to attributes.
707    
708            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
709            (value): Setter implemented.
710    
711            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
712    
713    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
714    
715            * HTML.pm.src: Unused line/column markings are removed.
716    
717    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
718    
719            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
720            flag, which is no longer used.
721    
722    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
723    
724            * HTML.pm.src: Set line/column information to element nodes.
725    
726            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
727            and !!!insert-element-t): Set line/column information to
728            element nodes.
729    
730    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
731    
732            * HTML.pm.src (_get_next_token): The first "<" character
733            in "<?", "<>", or "</>" should be the error point.
734    
735    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
736    
737            * HTML.pm.src: Some more fixes on error position reporting.
738    
739    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
740    
741            * HTML.pm.src: Token-level precious error reporting.
742    
743    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
744    
745            * HTML.pm.src: Preparation for more precious error point
746            reporting.
747    
748    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
749    
750            * HTML.pm.src: Error type revised.
751    
752    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
753    
754            * HTML.pm.src: Similar codes are merged together, again.
755    
756    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
757    
758            * HTML.pm.src: Similar codes are merged together.
759    
760    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
761    
762            * mkhtmlparser.pl: Set "level" parameter to parse errors.
763    
764            * HTML.pm.src: Code refined.
765    
766    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
767    
768            * HTML.pm.src: |</body>| treatement has been changed (HTML5
769            revision 1348).  Note that I really don't know this makes
770            any difference in the black-box behavior of the parser.
771    
772    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
773    
774            * HTML.pm.src: New end-of-file token implementation (HTML5
775            revision 1348).
776    
777    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
778    
779            * HTML.pm.src: |applet| support (HTML5 revision 1347).
780    
781    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
782    
783            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
784    
785    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
786    
787            * HTML.pm.src: Support for |<input>| in the "in select" insertion
788            mode and support for the "in select in table" insertion mode (HTML5
789            revision 1342).
790    
791    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
792    
793            * HTML.pm.src: No foster parenting for <script> and <script>
794            in non-tainted <table>s (HTML5 revision 1336).
795    
796    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
797    
798            * HTML.pm.src: Ignore white space characters between <html>
799            and <head> (HTML5 revision 1332).
800    
801    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
802    
803            * HTML.pm.src: Treat <input type=hidden> as if it were a
804            white space (HTML5 revision 1331).
805    
806    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
807    
808            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
809            element (HTML5 revision 1330).
810    
811    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
812    
813            * HTML.pm.src: <title> is always appended to the current
814            element (HTML5 revision 1328).
815    
816    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
817    
818            * HTML.pm.src: White space in tainted tables are moved
819            into foster parents (HTML5 revision 1326).
820    
821    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
822    
823            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
824            revision 1321).
825    
826    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
827    
828            * HTML.pm.src: |</h/n/>| case code rearranged to align with
829            the spec (HTML5 revision 1320).  Note that we finally complete
830            all of HTML5 revision 1320 changes.
831    
832    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
833    
834            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
835            tags (HTML5 revision 1320).
836    
837    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
838    
839            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
840    
841    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
842    
843            * HTML.pm.src: A "generate implied end tags" code (t409.1)
844            could not be reached so that it is now removed (HTML5 revision
845            1320).
846    
847    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
848    
849            * HTML.pm.src: Code for the case of |</div>| and so on are revised
850            to align with new spec text (HTML5 revision 1320).
851    
852    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
853    
854            * HTML.pm.src: Remove strange |if| condition; however, it should
855            have had no harm in theory.
856    
857    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
858    
859            * HTML.pm.src (_tree_construction_main): '</p>' in body
860            case is split from other end tags for the preparation
861            of implementing HTML revision 1320.
862    
863    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
864    
865            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
866            revision 1320).
867    
868    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
869    
870            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
871            and "div" (HTML5 revision 1318).  Add comments to where
872            |form| pointer association codes should be inserted (HTML5 revision
873            1319).
874    
875    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
876    
877            * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
878    
879    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
880    
881            * HTML.pm.src: Since the case t268 should never be reached (no
882            other token type, there are), it is replaced by a |die| statement.
883    
884    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
885    
886            * HTML.pm.src: Typo fixed.
887    
888    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
889    
890            * HTML.pm.src (_tree_construction_initial): Some limited quirks
891            doctypes were not uppercased for comparison.
892    
893    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
894    
895            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
896            are added.
897    
898    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
899    
900            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
901            are set.  Cases that are unlikely reached are noted as so.
902    
903    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
904    
905            * HTML.pm.src: Checkpoints for debugging are added.
906    
907            * mkhtmlparser.pl: Support for |!!!cp| syntax.
908    
909    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
910    
911            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
912            for simplicity.
913    
914    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
915    
916            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
917            revision 1307).
918    
919    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
920    
921            * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
922            is set are changed (HTML5 revision 1305).
923    
924    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
925    
926            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
927            1292).  Entities are not parsed in comment-like part in RCDATA
928            elements (HTML5 revision 1294).  Allow bare & at the end
929            of attribute value literals (HTML5 revision 1296).  More
930            quirks mode doctypes (HTML5 revision 1302).  Requires spaces
931            between attributes and ban attribute names or unquoted
932            attribute values containing single or double quotes (HTML5
933            revision 1303).
934    
935    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
936    
937            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
938            and related errors unless it is an HTML document (though the spec
939            is unclear on whether it is applied to XHTML document).
940    
941            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
942            are added.
943    
944    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
945    
946            * HTML.pm.src: s/local_name/manakai_local_name/g.
947    
948    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
949    
950            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
951    
952            * HTML.pm.src: |charset| in |content| attribute is
953            case-insensitive (HTML5 revision 1270).
954    
955    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
956    
957            * ContentChecker.pm: New status constants are added.
958            ($ElementDefault): |status| added.
959            (check_element): Err for non-standard or deprecated elements.
960            (_attr_status_info): For non-standard or deprecated attributes.
961    
962    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
963    
964            * ContentChecker.pm (_attr_status_info): New internal method.
965    
966    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
967    
968            * ContentChecker.pm (check_element): Element standardized
969            status information is now dispatched.
970    
971    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
972    
973            * ContentChecker.pm (check_element): Fix |del|-and-significant
974            problem by adding some more arguments.
975    
976    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
977    
978            * ContentChecker.pm (check_element): Use context of
979            container-for-the-purpose-of-content-model element (not transparent
980            element) for |check_child_element| calling and significant
981            text flag marking.  This reintroduces |<del>|-and-significant
982            problem again.
983    
984    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
985    
986            * ContentChecker.pm (check_element): Make semi-transparent
987            elements ignored for the purpose of phase changes in
988            content model checking.
989    
990    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
991    
992            * ContentChecker.pm (check_element): In-element state
993            was not properly managed for transparent cases.
994    
995    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
996    
997            * ContentChecker.pm (check_element): Support for |video|
998            and |audio| as semi-transparent elements.
999    
1000    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1001    
1002            * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1003            (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1004            Support for |html:object| as a semi-transparent element.
1005    
1006    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1007    
1008            * ContentChecker.pm (check_element): The way to traverse
1009            the tree is entirely revised to make it easier to track
1010            the state of ancestors/descendants.  As a result of this
1011            revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1012            support for content model checking for HTML elements |figure|,
1013            |object|, |video|, and |audio| and checking for XML elements (and
1014            some XMLNS checkings) are dropped for now.  They will be
1015            reimplemented in due cource.
1016    
1017    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1018    
1019            * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1020            token (HTML5 revision 1225).
1021    
1022    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1023    
1024            * ContentChecker.pm ({unsupported_level}): New value.
1025    
1026            * HTML.pm.src: Save whether |meta| |content| attribute
1027            contains character references or not.
1028    
1029    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1030    
1031            * ContentChecker.pm (_get_children): (Incompleted) attempt to
1032            imlement significant content checking for contents
1033            with |del| elements.
1034    
1035    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1036    
1037            * ContenteChecker.pm ($HTMLTransparentElements): More
1038            elements are added.
1039            (_get_children): HTML |object| elements are now semi-transparent.
1040    
1041            * NanoDOM.pm (manakai_html, manakai_head): New methods.
1042    
1043    2008-02-16  Wakaba  <wakaba@suika.fam.cx>
1044    
1045            * CacheManifest.pm: HTML5 revision 1211 implemented.
1046    
1047            * CacheManifest.pod: Updated.
1048    
1049    2008-02-10  Wakaba  <wakaba@suika.fam.cx>
1050    
1051            * ContentChecker.pm (check_document, check_element): Support
1052            for second argument ($onsubdoc).
1053            (_get_css_parser): Removed (now it is part of WDCC).
1054    
1055    2008-02-09  Wakaba  <wakaba@suika.fam.cx>
1056    
1057            * ContentChecker.pm (_get_css_parser): New.
1058    
1059    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1060    
1061            * ContentChecker.pm ($AnyChecker): Old way to add child elements
1062            for checking had been used.
1063    
1064    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1065    
1066            * ContentChecker.pm (check_element): New todo item type |descendant|.
1067    
1068    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1069    
1070            * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1071            specced in RFC 2046 and RFC 4288.
1072            (application/atom+xml): Definition added.
1073    
1074    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1075    
1076            * URIChecker.pm: Make RFC 3986 should-level errors
1077            warnings (rather than SHOULD-level errors).
1078    
1079    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1080    
1081            * NanoDOM.pm (get_user_data, set_user_data): New methods.
1082    
1083            * HTML.pm.src: A flag for character references in attribute
1084            values are added.  Set |manakai_has_reference| user data
1085            to |charset| attribute.
1086    
1087    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1088    
1089            * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1090            attributes.
1091    
1092            * ContentChecker.pm (check_document): Warn if charset requirements
1093            cannot be tested.
1094    
1095    2007-11-19  Wakaba  <wakaba@suika.fam.cx>
1096    
1097            * HTML.pm.src (parse_byte_string): Detect charset
1098            by universalchardet if charset parameter is not specified.
1099    
1100            * Makefile (Charset-all, Charset-clean): New rules.
1101    
1102    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1103    
1104            * ContentChecker.pm (check_document): Check the existence
1105            of character encoding declaration and selection of encoding
1106            for HTML document.
1107    
1108    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1109    
1110            * ContentType.pm (get_sniffed_type): Return also the official
1111            type in list context.
1112    
1113    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1114    
1115            * ContentType.pm: Sniffing with leading white space
1116            ignoring (HTML5 revisions 1013 and 1016).
1117    
1118    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1119    
1120            * ContentType.pm: HTML5 revision 1013 changes, except for leading
1121            white spaces, are implemented.
1122    
1123    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1124    
1125            * HTML.pm.src (parse_char_string): Set |inner_encoding|
1126            attribute if possible.
1127    
1128    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1129    
1130            * HTML.pm.src (parse_byte_string): New method.
1131            (parse_char_string): New alias for |parse_string|.
1132            (main phase): Invoking "change the encoding" algorithm if desired.
1133    
1134            * HTML.pod: Updated.
1135    
1136    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1137    
1138            * HTML.pod (get_inner_html): Removed.
1139    
1140            * Makefile (HTML-all, HTML-clean): New.
1141    
1142    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1143    
1144            * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1145    
1146    2007-11-08  Wakaba  <wakaba@suika.fam.cx>
1147    
1148            * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1149    
1150    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1151    
1152            * Makefile: |CacheManifest.html| is added.
1153    
1154            * CacheManifest.pod: New file.
1155    
1156    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1157    
1158            * CacheManifest.pm: New module.
1159    
1160    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1161    
1162            * HTML.pm.src: Support for application cache selection algorithm
1163            callback.
1164    
1165    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1166    
1167            * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1168    
1169    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1170    
1171            * Makefile (clean): New rule.
1172    
1173            * NanoDOM.pm (public_id, system_id): New attributes.
1174    
1175    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1176    
1177            * Makefile (CSS-all, CSS-clean, clean): New rules.
1178    
1179    2007-10-14  Wakaba  <wakaba@suika.fam.cx>
1180    
1181            * ContentChecker.pm (check_document): Support for
1182            new |is_xml_root| flag.
1183            (check_element): Support for new |pluses| state.
1184            (_add_pluses): New method.
1185            (_remove_minuses): Support for new |minus| item.
1186    
1187    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1188    
1189            * ContentChecker.pm: Raise specific error for invalid
1190            root element.
1191    
1192    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1193    
1194            * ContentChecker.pm: Set level values for later uses.
1195    
1196    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1197    
1198            * ContentChecker.pm: Support for language tag validation.
1199    
1200    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1201    
1202            * LangTag.pm (check_rfc3066_language_tag): New method.
1203    
1204    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1205    
1206            * LangTag.pm: New module.
1207    
1208    2007-09-04  Wakaba  <wakaba@suika.fam.cx>
1209    
1210            * HTML.pm.src: Some error types were wrong.
1211    
1212    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1213    
1214            * CSS/: New directory.
1215    
1216    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1217    
1218            * ContentChecker.pm (_check_get_children): Support
1219            for |noscript| in |head|.
1220    
1221    2007-08-12  Wakaba  <wakaba@suika.fam.cx>
1222    
1223            * URI/: New directory.
1224    
1225    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1226    
1227            * HTML.pm.src: Tokenizer's states are now represented in
1228            number.
1229    
1230    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1231    
1232            * HTML.pm.src: |or|s for insertion modes are replaced
1233            by |&|s.
1234    
1235    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1236    
1237            * HTML.pm.src: Token types are now represented in number.
1238    
1239    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1240    
1241            * ContentType.pm (SEE ALSO): Updated.
1242    
1243            * HTML.pm.src: Insertion modes are now represented in number.
1244    
1245    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1246    
1247            * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1248            999) is implemented.
1249    
1250    2007-08-08  Wakaba  <wakaba@suika.fam.cx>
1251    
1252            * ContentType.pm: Sniffing for |<script| (HTML5 revision
1253            983) is implemented.
1254    
1255    2007-08-06  Wakaba  <wakaba@suika.fam.cx>
1256    
1257            * ContentChecker.pod: New documentation.
1258    
1259            * Makefile: A rule for |ContentChecker.html| is added.
1260    
1261            * ContentChecker.pm: A pod "LICENSE" section is added.
1262    
1263            * NanoDOM.pm ($VERSION): New variable.
1264    
1265    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1266    
1267            * H2H.pm: |b|, |i|, and |sub| are added to the
1268            list of allowed HTML elements.
1269    
1270    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1271    
1272            * H2H.pm: |samp| is added to the list of allowed
1273            HTML elements.
1274    
1275            * URIChecker.pm (check_iri): New.
1276            (check_iri_reference): Error type for IRI reference
1277            syntax error is changed.
1278    
1279    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1280    
1281            * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1282            is added.
1283            (check_document): Load appropriate module before validation.
1284    
1285    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1286    
1287            * ContentChecker/: New directory.
1288    
1289    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1290    
1291            * ContentChecker.pm: HTML |time| element is implemented.
1292    
1293            * HTMLTable.pm: Comments are updated as HTML5 is revised.
1294    
1295    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1296    
1297            * ContentChecker.pm (check_document): Return value
1298            even if no document element is found.
1299    
1300    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1301    
1302            * HTML.pm.src: |$in_body| is no longer a function.
1303    
1304    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1305    
1306            * HTML.pm.src: The |$in_body| code has been moved down.
1307    
1308    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1309    
1310            * HTML.pm.src: The "trailing end" insertion mode
1311            is split into "after html body" and "after html frameset"
1312            insertion modes.  Their codes are merged with "after body"
1313            and "after frameset" codes.  |$previous_insertion_mode|
1314            has been removed.  "after frameset" code is
1315            merged with "in frameset" code.
1316    
1317    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1318    
1319            * HTML.pm.src: The "before head" insertion mode is
1320            merged with the "in head" insertion mode.
1321    
1322    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1323    
1324            * HTML.pm.src: Cases in "in head" insertion mode are
1325            reorganized.
1326    
1327    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1328    
1329            * HTML.pm.src: Some cases in "in table" insertion mode
1330            are merged.
1331    
1332    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1333    
1334            * HTML.pm.src: The "in row" insertion mode is merged
1335            with "in table" insertion mode.
1336    
1337    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1338    
1339            * HTML.pm.src: The "in table" and "in table body" insertion
1340            modes are merged.
1341    
1342    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1343    
1344            * HTML.pm.src: There is no "in table head" or "in table foot"
1345            insertion mode!
1346    
1347    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1348    
1349            * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1350            now directly invoke the handler.
1351    
1352    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1353    
1354            * HTML.pm.src: Codes for "in cell" insertion mode
1355            is merged to the "in body" insertion mode code.
1356    
1357    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1358    
1359            * HTML.pm.src: Codes for "in body" and "in caption"
1360            insertion modes are merged.
1361    
1362    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1363    
1364            * HTML.pm.src: Two |!!!next-token|s were missing.
1365    
1366    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1367    
1368            * HTML.pm.src: Use numeric constant for |{content_mode}|
1369            instead of string constant for |{content_model_flag}|.
1370    
1371    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1372    
1373            * HTML.pm.src: Add the name of the attribute
1374            to the "duplicate attribute" error.
1375    
1376    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
1377    
1378            * ContentChecker.pm: Return the |class| node list.
1379    
1380    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
1381    
1382            * ContentChecker.pm: Return the |id| node list.
1383    
1384            * HTML.pm.src: A typo is fixed.
1385    
1386    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1387    
1388            * ContentChecker.pm: Drop wrong |level => 'error'| specification
1389            from "in HTML:xml:lang" error.  Character position
1390            is now the last part of the error type in the URI error
1391            description.  Report "unsupported" status for language
1392            tags, media queries, script codes, and style sheets.
1393    
1394    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1395    
1396            * ContentChecker.pm: Report error if |xml:lang|
1397            in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1398            in XML.
1399    
1400            * NanoDOM.pm (Attr.owner_document): New attribute.
1401    
1402    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1403    
1404            * HTML.pm.src: The character immediately following
1405            a bare |hcro| was discarded.  Fix handling of
1406            entity references in attribute values.
1407    
1408    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1409    
1410            * HTML.pm.src (main and trailin end phases): Token
1411            types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1412            factored out.  Error types |in html:#DOCTYPE|
1413            and |after html:#DOCTYPE| are merged
1414            into |DOCTYPE in the middle|.  |</frameset>|
1415            in fragment parsing mode changed the
1416            insertion mode.
1417    
1418    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1419    
1420            * HTML.pm.src: |$phase| has been removed; The |trailing end|
1421            phase is now an insertion mode.  Treatments for white
1422            space character tokens were incorrect for some
1423            insertion modes.  An old |meta| case was not removed.
1424    
1425    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1426    
1427            * HTML.pm.src: |meta| charset declaration extraction
1428            implemented (but changing the encoding is not yet:-).
1429    
1430    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1431    
1432            * Charset/: New directory.
1433    
1434    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1435    
1436            * H2H.pm: New Perl module (created from
1437            manakai's H2H.dis).
1438    
1439    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1440    
1441            * XMLSerializer.pm: New Perl module (created from
1442            manakai's SimpleLS.dis).
1443    
1444    2007-07-07  Wakaba  <wakaba@suika.fam.cx>
1445    
1446            * HTML.pm.src: |<!---x-->| was not processed correctly.
1447    
1448    2007-07-01  Wakaba  <wakaba@suika.fam.cx>
1449    
1450            * HTML.pm.src: Report correct error message
1451            for |<body></div></body>|.
1452    
1453    2007-07-01  Wakaba  <wakaba@suika.fam.cx>
1454    
1455            * HTMLTable.pm: An error description was incorrect.
1456    
1457    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
1458    
1459            * ContentChecker.pm: Return |{term}| list.
1460    
1461    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
1462    
1463            * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1464            nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1465            and <title> outside of head).
1466    
1467    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
1468    
1469            * IMTChecker.pm: Report warning for unregistered
1470            and private types/subtypes.
1471    
1472            * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1473            URIChecker.pm, HTMLTable.pm: Error messages are now
1474            consistent; they are all listed in
1475            <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1476    
1477    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
1478    
1479            * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1480            is now erred.  |<datalist>| is implemented.
1481            Attribute checker for |<command>| and |<menu>| are
1482            added.  Support for |contextmenu| global attribute
1483            is added.
1484    
1485    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
1486    
1487            * HTML.pm.src (_reset_insertion_mode): Interpretation
1488            of Step 3 has been changed.
1489    
1490    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
1491    
1492            * HTML.pm.src: Late |<html>| parse error is implemented.
1493    
1494    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1495    
1496            * URIChecker.pm (check_iri_reference): A |decode| method name was
1497            incorrect.
1498    
1499            * ContentChecker.pm: Support for the |footer| element.
1500            Check URI syntax for space-separated URI attributes.
1501            Support for the |tabindex| attribute.  Support
1502            for |datetime| attribute.
1503    
1504    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1505    
1506            * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1507            character references).  HTML5 revision 1.146 (white space
1508            characters before root start tag).  HTML5 revision
1509            1.148 (named character references in attribute values).
1510            HTML5 revision 1.152 (<plaintext>.innerHTML get).
1511    
1512    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1513    
1514            * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1515            in <head>).
1516    
1517    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1518    
1519            * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1520            in body).
1521    
1522            * ContentChecker.pm: HTML5 revision 938 (scoped="").
1523    
1524    2007-06-24  Wakaba  <wakaba@suika.fam.cx>
1525    
1526            * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1527            or RCDATA in fragment parsing mode).  HTML5 revision
1528            924 (<!--> and <!--->).  HTML5 revision 926 (hn in hn).
1529    
1530    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1531    
1532            * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1533            for <pre> and <textarea>).
1534    
1535    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1536    
1537            * HTML.pm.src: HTML5 revision 920 (<isindex>).
1538    
1539    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1540    
1541            * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1542    
1543    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1544    
1545            * HTML.pm.src: HTML5 revision 916 (</body>).
1546            HTML5 revision 917 (conforming bare &).
1547    
1548    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1549    
1550            * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1551    
1552            * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1553            HTML5 revision 915 (<nobr>).
1554    
1555    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1556    
1557            * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1558            
1559            * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1560            New attributes.
1561    
1562    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1563    
1564            * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1565            in tags).
1566    
1567    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1568    
1569            * .cvsignore: |Entities.html| is added.
1570    
1571            * HTML.pm.src: |$entity_char| is removed and
1572            requires |Whatpm::_NamedEntityList| instead.
1573            HTML5 revision 898 (refc), except that lack of refc
1574            is parse error.
1575    
1576            * mkentitylist.pl: New script.
1577    
1578            * Makefile (all): |_NamedEntityList.pm| is added.
1579            (_NamedEntityList.pm, Entities.html): New rules.
1580    
1581    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1582    
1583            * HTML.pm.src: Parse errors immediately after U+000D
1584            were ignored and U+000D immediately following another
1585            U+000D was not converted to U+000A.
1586    
1587    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1588    
1589            * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1590            nodes before appended).  Parser was not ready for NULL
1591            parse error and escape flag.
1592    
1593            * NanoDOM.pm (adopt_node): New.
1594    
1595    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1596    
1597            * HTML.pm.src: HTML5 revision 886 (insane comment in
1598            CDATA and RCDATA).  Note that current implementation
1599            is simply repeating what the spec says and it is maybe not
1600            a best way to do it.
1601    
1602    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1603    
1604            * HTML.pm.src: HTML5 revision 884 (</form> don't close
1605            the form element if a descendant element without implied end tag has
1606            still been open).
1607    
1608    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1609    
1610            * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1611            space characters non-conforming).
1612    
1613    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1614    
1615            * HTML.pm.src: An error message was incorrect.
1616            HTML5 revision 869 (C1 character references).
1617            
1618    2007-06-23  Wakaba  <wakaba@suika.fam.cx>
1619    
1620            * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1621            a |textarea| is removed).
1622    
1623    2007-06-05  Wakaba  <wakaba@suika.fam.cx>
1624    
1625            * NanoDOM.pm (get_attribute_node_ns): New method.
1626    
1627            * ContentChecker.pm: |script| |async| and |defer|
1628            no longer require |src|.  |async| MUST NOT be
1629            specified if |defer|.  (HTML5 revision 858).
1630    
1631    2007-05-30  Wakaba  <wakaba@suika.fam.cx>
1632    
1633            * HTML.pm.src: |<form><form>| went to inifinite loop.
1634    
1635    2007-05-27  Wakaba  <wakaba@suika.fam.cx>
1636    
1637            * ContentChecker.pm (html): Set |is_root| (allowed
1638            as a document element) flag on.
1639            (new): Removed.
1640            (check_document): New method.
1641    
1642    2007-05-27  Wakaba  <wakaba@suika.fam.cx>
1643    
1644            * ContentChecker.pm (thead, tfoot): Checker specifications
1645            were incorrect.
1646    
1647    2007-05-27  Wakaba  <wakaba@suika.fam.cx>
1648    
1649            * ContentChecker.pm ($HTMLURIAttrChecker): Include
1650            error position in the |type| option of the error.
1651    
1652            * HTMLTable.pm (form_table): The |$onerror| parameter
1653            is now optional.  Some bugs are fixed.
1654    
1655    2007-05-27  Wakaba  <wakaba@suika.fam.cx>
1656    
1657            * HTMLTable.pm: New module.
1658    
1659            * ContentChecker.pm (table): Invoke table model error checker.
1660    
1661            * NanoDOM.pm (first_child, get_attribute_ns): New.
1662    
1663    2007-05-26  Wakaba  <wakaba@suika.fam.cx>
1664    
1665            * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1666            (link/@rel, a/@rel, area/@rel): Use new checker.
1667    
1668            * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1669    
1670            * _LinkTypeList.pm: New file.
1671    
1672            * mklinktypelist.pl: New file.
1673    
1674            * .cvsignore: |RelExtensions.html| added.
1675    
1676            * NanoDOM.pm (child_nodes): Returns an empty array
1677            for non-child-containing node types.
1678            (text_content): New attribute.
1679    
1680    2007-05-26  Wakaba  <wakaba@suika.fam.cx>
1681    
1682            * IMTChecker.pm: New module.
1683    
1684            * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1685            to test parameter value validity.
1686    
1687            * HTML.pm.src ($style_start_tag): Attributes were
1688            discarded.
1689    
1690    2007-05-25  Wakaba  <wakaba@suika.fam.cx>
1691    
1692            * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1693    
1694    2007-05-25  Wakaba  <wakaba@suika.fam.cx>
1695    
1696            * URIChecker.pm: All recommendations from RFC 3986
1697            and RFC 3987 are listed (not all testable items are checked yet).
1698    
1699    2007-05-25  Wakaba  <wakaba@suika.fam.cx>
1700    
1701            * URIChecker.pm: New module.
1702    
1703    2007-05-20  Wakaba  <wakaba@suika.fam.cx>
1704    
1705            * ContentChecker.pm: Now most attributes are implemented
1706            or associated to some placeholder.
1707            ($ElementDefault): Warn unknown attributes
1708            for unknown elements as "attribute not supported".
1709            ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1710            ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1711            (|a| attribute checker): Reimplemented.
1712    
1713    2007-05-20  Wakaba  <wakaba@suika.fam.cx>
1714    
1715            * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1716            ($HTMLAttrChecker): Event handler content attributes
1717            are added.
1718            (link, embed): Required attribute is now checked.
1719            (embed): Unknown local attributes are no longer warned.
1720    
1721    2007-05-20  Wakaba  <wakaba@suika.fam.cx>
1722    
1723            * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1724            ($HTMLIMTAttrChecker): New checker.
1725            (link@rel, link@href, link@type, style@type,
1726            a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1727            object@data, object@type, source@src, source@type, area@alt,
1728            area@shape, area@coords, area@href,
1729            area@ping, area@rel, area@type, script@src,
1730            script@defer, script@async, script@type): Checkers added.
1731    
1732    2007-05-20  Wakaba  <wakaba@suika.fam.cx>
1733    
1734            * ContentChecker.pm: Descendant checking was incorrect.
1735    
1736    2007-05-19  Wakaba  <wakaba@suika.fam.cx>
1737    
1738            * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1739            attributes.  Report an error if |Element.prefix|
1740            is |xmlns|.
1741    
1742            * NanoDOM.pm (prefix): New attribute.
1743    
1744  2007-05-19  Wakaba  <wakaba@suika.fam.cx>  2007-05-19  Wakaba  <wakaba@suika.fam.cx>
1745    
1746          * HTML.pm.src: In |main| phase, |in body| insertion          * HTML.pm.src: In |main| phase, |in body| insertion

Legend:
Removed from v.1.16  
changed lines
  Added in v.1.305

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24