/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.61 by wakaba, Sun Jul 15 07:52:01 2007 UTC revision 1.303 by wakaba, Sun Sep 14 06:32:48 2008 UTC
# Line 1  Line 1 
1    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src ($char_onerror): Have character decoder's |line|
4            and |column| a higher priority than the one set by the
5            tokenizer's input handler.
6            ($self->{read_until}): Exclude U+FFFD (but this might
7            not be necessary, since now we do line/column fixup in
8            the character decode handle).
9    
10    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
11    
12            * HTML.pm.src: Use |{read_until}| where possible.
13    
14    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
15    
16            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
17            and |manakai_getc_until| to |manakai_read_until| to
18            reduce the number of string copies.
19    
20    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
21    
22            * HTML.pm.src (parse_char_string): Use newly created
23            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
24            standard feature to |open| a string as a filehandle,
25            since Perl's string filehandle seems not supporting |ungetc|
26            method correctly.
27            (parse_char_stream): Define |{getc_until}| method.
28            (DATA_STATE): Experimental support for |getc_until| feature.
29    
30    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
31    
32            * HTML.pm.src: Check points added to newly added branches.
33    
34    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
35    
36            * HTML.pm.src: Remove |{char}|, which is no longer used.
37            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
38            and replaced by |{prev_state}|.
39    
40            * mkhtmlparser.pl: Remove |{char}| feature.
41            Remove |!!!back-next-input-character;| macro.
42    
43    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
44    
45            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
46            entity related tokenizer states in favor of new states
47            implementing the consume character reference algorithm.
48    
49    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
50    
51            * HTML.pm.src: "Consume a character reference" algorithm is
52            now implemented as a tokenizer's state, rather than
53            a method, with minimum changes (more changes will
54            be made, in due course).  "Bogus comment state"'s inner
55            loop gets removed.
56    
57    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
58    
59            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
60            into their own tokenizer states.
61    
62    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
63    
64            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
65            is split into three states.
66    
67    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
68    
69            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
70            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
71            no longer does the tokenizer have to push back next input
72            characters in those states.
73    
74    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
75    
76            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
77            into four states so that no longer does the tokenizer have to push
78            back next input characters in that state.
79    
80    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
81    
82            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
83            which can be used to insert some wrapper between the character
84            stream handle and the tokenizer.  (It is currently not supported
85            for |set_inner_html| for |Element|s).
86    
87    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
88    
89            * HTML.pm.src: Ignore punctuations in charset names.
90    
91    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
92    
93            * ContentChecker.pm: Support for charset-layer error levels.
94    
95            * HTML.pm.src: Don't specify |text| argument for the
96            |chardecode:fallback| error, since it is not the encoding
97            being used alternatively.
98    
99    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
100    
101            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
102    
103    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
104    
105            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
106    
107    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
108    
109            * HTML.pm.src: Bug fix and sync with the spec with regard
110            to after after frameset insertion mode processing (HTML5
111            revision 1909).  Note that the implementation was wrong
112            per the old spec before the r1909 changes.
113    
114    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
115    
116            * HTMLTable.pm: scope=auto algorithm fix synced with the
117            spec (HTML5 revision 2093).
118            ($process_row): Algorithm step numbers synced with the
119            spec (HTML5 revision 2092).
120    
121    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
122    
123            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
124            revision 2094).
125    
126    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
127    
128            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
129    
130    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
131    
132            * HTML.pm.src: '"' and "'" at the end of attribute
133            name (after another attribute) now raise parse error (HTML5
134            revision 2123).  Empty unquoted attribute values are no
135            longer allowed (HTML5 revision 2122).
136    
137    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
138    
139            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
140            revision 2130).
141    
142    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
143    
144            * ContentChecker.pm: |xml:lang| attribute value must be same
145            as |lang| attribute value for HTML elements (HTML5 revision 2062
146            and so on).
147    
148    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
149    
150            * ContentChecker.pm: Error level definition for |xml_id_error|
151            was missing.
152    
153            * URIChecker.pm: The end of the URL should be marked as the
154            error location for an empty path error.  The position
155            between the userinfo and the port components should be
156            marked as the error location for an empty host error.
157    
158    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
159    
160            * URIChecker.pm: Set parameters representing where in the
161            value the error occurs for errors.  Report unknown
162            address format error in warning level, since address
163            formats are rarely added.  Path segments starting with "/.."
164            were misinterpreted as a dot-segment.
165    
166    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
167    
168            * URIChecker.pm (check_iri_reference): Requires
169            |Message::DOM::DOMImplementation|.
170    
171    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
172    
173            * IMTChecker.pm: Updated for the new error reporting architecture.
174    
175            * ContentChecker.pm: Error levels for IMTs are added.
176    
177    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
178    
179            * H2H.pm (_shift_token): Support for unquoted HTML attribute
180            values.
181    
182    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
183    
184            * CacheManifest.pm: Support for new style of error
185            reports.
186    
187            * HTML.pm.src: Set line=1, column=1 to the document node.
188    
189    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
190    
191            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
192            and URL checkers.  Support for more error levels for bogus
193            langauge tag and URL "standards".
194    
195            * LangTag.pm, URIChecker.pm: Support for new style error
196            level reporting.
197    
198    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
199    
200            * ContentChecker.pm: Support for RDF/XML error levels.
201    
202            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
203            specifying.  Error types are revised.
204    
205    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
206    
207            * ContentChecker.pm: All error reporting method calls are
208            renewed.
209    
210    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
211    
212            * HTML.pm.src: All error type names and "text" parameters
213            are revised.  Use new style for "level" specification.
214    
215            * mkhtmlparser.pl: Use new style for "level" specification.
216    
217    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
218    
219            * WebIDL.pm (parse_char_string): Simplified error
220            reporting process for broken ignored valuetype definition.
221            (Valuetype idl_text): Support for special "DOMString" name.
222    
223    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
224    
225            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
226            terminal of the ScopedName is "DOMString", such that whether
227            the last part of the scoped name is "DOMString" or "_DOMString"
228            later.  It is necessary to determine whether a |typedef|
229            definition should be ignored or not.
230            (parse_char_string): Unescape the identifier of
231            exception members.
232            ($resolve): Return undef for builtin types and sequence<T>
233            types (we might not have to do this, however...).
234            (check): Support checking for Exceptions, Valuetypes,
235            and Typedefs.
236            ($serialize_type): Support for "DOMString::::" syntax.
237            (Typedef idl_text): Output Type as "DOMString" if it
238            is really "DOMString" (i.e. its internal representation
239            is "::DOMString::").
240    
241    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
242    
243            * WebIDL.pm ($resolve): New code, based on resolve code
244            for constant types in the |check| method.
245            (check): Support for checking of attributes, operations, and
246            arguments.
247            (Attribute/Operation idl_text): Exception names in getraises,
248            setraises, and raises clauses is serizlied by |$serialize_type|
249            code.
250    
251    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
252    
253            * WebIDL.pm ($integer): Order of selections are changed to match
254            hexadecimal numbers (the original pattern, taken from the spec,
255            was not work for hexadecimal numbers, because the "0" prefix
256            matches to the [0-7]* part (as an empty string) and therefore
257            it does not match with remaining "x..." part of a "0x..." integer
258            literal.
259            ($get_type): It now returns a string, not an array reference,
260            for regular types and |sequence| types (i.e. it in any case
261            returns a string).
262            ($get_next_token): The second item in the array that represents
263            a integer or float token is now a Perl number value, not the
264            original string representation of the number.
265            (check): Support for const value consistency checking.
266            No extended attribute is defined for constants.
267            (Node subclasses): Use simple strings rather than array references
268            for default data type values.
269            ($serialize_type): Type values are now simple strings.
270            (value): If the new attribute value is a false value, then
271            a FALSE value is set to the attribute.
272    
273    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
274    
275            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
276            in its stringified format ("scoped name" as defined in the
277            spec).  Note that future version of this module should not use
278            array references for type values and the |type_text| attribute
279            should be made obsolete.
280            (parse_char_string): Unescape attribute names.
281            (check): Support for checking of whether inherited interfaces
282            are actually defined or not.  Support for checking of whether
283            interface member identifiers are duplicated or not.
284            ($serialize_type): Scoped names are returned as is.  A future
285            version of this code should escape identifiers other than "DOMString",
286            otherwise the idl_text would be non-conforming.
287    
288    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
289    
290            * WebIDL.pm (parse_char_string): Set line/column numbers
291            to generated nodes.  Unescape identifiers.  Extended attributes
292            for Definition's were ignored.
293            (append_child): Set |parent_node| attribute.
294            (parent_node): New attribute.
295            (check): Support interface/exception members.  Support
296            extended attributes.  Support definition identifier uniqueness
297            constraint.
298            (qualified_name): New attribute.
299            (Interface/Exception idl_text): Extended attributes were
300            not prepended to the returned text.
301    
302    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
303    
304            * WebIDL.pm (parse_char_string): Set line/column numbers
305            to interface object experimentally.  s/shift/pop/g, shift
306            would make things wrong.  Support for interface forward
307            declarations was missing.  Broken interface declarations
308            with no block were not ignored entirely.
309            (Whatpm::WebIDL::Node): New abstract class.  This class
310            makes things easier.
311            (child_nodes): New attribute.  Unlike DOM's attribute with
312            same name, this attribute returns a dead list of nodes for
313            simplicity.
314            (get_user_data, set_user_data): New methods.
315            (Module idl_text): A SPACE character should be inserted
316            before the |{| character.
317            (Interface idl_text): Support for interface forward declarations.
318            (is_forward_declaration): New attribute.
319    
320    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
321    
322            * WebIDL.pm (type_text): Better serializer.
323    
324    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
325    
326            * WebIDL.pm: Revise forward-compatible parsing so that
327            it now can handle broken extended attributes and as such.
328    
329    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
330    
331            * WebIDL.pm: Real support for extended attributes.
332            Support for extended attributes with arguments.
333    
334    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
335    
336            * WebIDL.pm: Support for |exception| syntax.
337            (Interface->idl_text): Tentative support for inheritances.
338    
339    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
340    
341            * WebIDL.pm: Hierarchical scoped name support was broken.
342            Support for raises, setraises, and getraises syntaxes.
343    
344    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
345    
346            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
347            proper support for types, extended attributes, and exceptions yet).
348            WebIDL parser, version 1 (no support for exceptions yet,
349            no proper support for extended attributes yet).
350    
351    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
352    
353            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
354    
355    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
356    
357            * WebIDL.pm: Support for valuetype and const.
358    
359    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
360            
361            * WebIDL.pm: New module.
362    
363    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
364    
365            * Makefile (Entities.html): URI changed.
366    
367    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
368    
369            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
370    
371    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
372    
373            * HTML.pm.src (_get_next_token): A parse error was missing.
374    
375    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
376    
377            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
378            spec (commented out). (HTML5 revision 1711).
379    
380    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
381    
382            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
383    
384            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
385            as UTF-16 (HTML5 revision 1701).
386    
387    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
388    
389            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
390            1692).
391    
392    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
393    
394            * HTML.pm.src: The secondary insertion mode used when switching
395            to foreign content is the "in body" insertion mode (HTML5 revision
396            1696).
397    
398    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
399    
400            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
401            revision 1697).
402    
403    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
404    
405            * HTML.pm.src: Support for end-of-file token in foreign content
406            insertion mode (HTML5 revision 1693).  Update SVG camelCase
407            attribute list (HTML5 revision 1700).  <textarea> closes
408            </select> (HTML5 revision 1699).  More start tags close in
409            foreign content insertion mode (HTML5 revision 1698).
410    
411    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
412    
413            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
414    
415    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
416    
417            * HTML.pm.src: More robust charset parameter detection (HTML5
418            revision 1674).
419    
420    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
421    
422            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
423            revision 1676).
424    
425    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
426    
427            * HTML.pm.src: Ignore language part of public identifiers for
428            quriks mode detection (HTML5 revision 1679).
429    
430    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
431    
432            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
433            revision 1685).
434    
435    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
436    
437            * HTML.pm.src: Support for EOF in new states for tags (HTML5
438            revision 1684).
439    
440    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
441    
442            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
443            work (HTML5 revision 1690).
444    
445    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
446    
447            * HTML.pm.src (_tree_construction_main): Change handling of
448            end tags in head insertion modes (HTML5 revision 1686).
449            (parse_char_string): Bug fix for non-utf8 character string handlings.
450            (parse_char_stream): |ungetc| does not work well for this context.
451    
452    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
453    
454            * HTML.pm.src (parse_byte_string): Redefined to invoke
455            |parse_byte_stream|.
456            (parse_byte_stream): New method.
457    
458    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
459    
460            * HTML.pm.src (parse_byte_string): Fix the column number reported
461            by encoding layer error reporter.
462    
463    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
464    
465            * HTML.pm.src (parse_byte_string): Use streaming decoder
466            rather than converting the whole byte string and then parsing.
467            Propagate errors in character encoding layer.
468            (get_next_token): Precise error reporting for |bare stago| error.
469    
470    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
471    
472            * HTML.pm.src (parse_char_stream): New method.
473            (parse_char_string): This method is now defined as an invocation
474            of the |parse_char_stream| method.
475    
476    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
477    
478            * HTML.pm.src (parse_byte_string): Report various status
479            of the sniffing as info-level errors.  Support for new
480            decoding framework in parser resestting.
481            (new): Various default error levels were not set.
482    
483    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
484    
485            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
486            algorithm, except for the actual sniffing, is implemented
487            with new framework with Message::Charset::Info.
488    
489    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
490    
491            * CacheManifest.pm (_parse): Drop fragment identifiers from
492            URIs in fallback section (HTML5 revision 1596).
493    
494    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
495    
496            * Makefile (Entities.html): URI has changed.
497    
498    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
499    
500            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
501            1553).
502    
503    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
504    
505            * ContentChecker.pm: Noted that those returned in |table| are
506            no longer table elements, but table objects returned
507            by Whatpm::HTMLTable.
508    
509            * HTMLTable.pm (form_table): Return table element node
510            as |$table->{element}|.
511            (assign_header): Support for the |headers=""| attribute.
512    
513    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
514    
515            * HTMLTable.pm (assign_header): New function; first version
516            with no support for headers="".
517            (form_table): Include table width and height to the returned
518            table object for covenience.  Indexing in column assignement was wrong.
519            Set whether a data cell is empty or not for convenience.
520    
521    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
522    
523            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
524    
525    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
526    
527            * HTMLTable.pm: How table model errors are detected is
528            changed (HTML5 revision 1387).
529    
530    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
531    
532            * HTMLTable.pm: The algorithm now moves |tfoot| elements
533            to the end of the table (HTML5 revision 1380).
534    
535    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
536    
537            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
538            of 1-based (HTML5 revision 1376).
539    
540    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
541    
542            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
543            and "Content-Encoding" no longer prevent sniffing (HTML5
544            revision 1288).
545    
546    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
547    
548            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
549            1282).
550    
551    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
552    
553            * ContentChecker.pm: Support for global attributes.
554            Status of XML specs are added.
555    
556    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
557    
558            * ContentChecker.pm (check_element): Support for |template=""|
559            and |ref=""| attribute (referent element type checking).
560    
561    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
562    
563            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
564            1500) is implemented (except for IDNA part and URI-scheme-specific
565            knowledge).  Line number counting was wrong for LF-only
566            documents.
567    
568    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
569    
570            * HTML.pm.src: Raise an parse error for any disallowed
571            character (HTML5 revision 1263).
572    
573    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
574    
575            * mkentitylist.pl: Support for new HTML5 entity table format (the
576            definition for |AElig;| was missing).
577    
578    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
579    
580            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
581            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
582    
583    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
584    
585            * HTML.pm.src: List of element names that close foreign content
586            insertion mode is added (HTML5 revisions 1412 and 1418).
587    
588    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
589    
590            * HTML.pm.src: Support for |mglyph| and |malignmark|
591            elements (HTML5 revision 1410).
592    
593    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
594    
595            * HTML.pm.src: Support for new long MathML entities (HTML5
596            revision 1406).
597    
598    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
599    
600            * HTML.pm.src: CDATA section support for MathML and SVG
601            elements (HTML5 revision 1404 and 1420).
602    
603    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
604    
605            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
606            elements (HTML5 revision 1404).  Unused !!!macro definitions
607            are removed.
608    
609    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
610    
611            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
612            are raised is changed (HTML5 revision 1404).
613    
614    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
615    
616            * HTML.pm.src: Category sets that are no longer used are removed.
617    
618    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
619    
620            * HTML.pm.src: The ->[1] property of stack entries are now
621            replaced by constants representing element category.
622    
623    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
624    
625            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
626            for error reporting.  (This is a preparation for using constant
627            value for ->[1].)
628    
629    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
630    
631            * RDFXML.pm: Typo fixed.
632    
633    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
634    
635            * ContentChecker.pm: |fact_level| is now treated
636            as same as |must_level|, i.e. level = |m|.
637            (check_element): Make list of URIs in the DOM.
638    
639    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
640    
641            * RDFXML.pm: Language accessor implemented.
642            Local (null-namespace) attribute support.
643    
644    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
645    
646            * RDFXML.pm: Factored out ID checking code.
647    
648    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
649    
650            * RDFXML.pm: TODO items noted.  Validation of ID
651            and URI attributes is implemented.  Warn if unknown
652            value is used in rdf:parseType="" attribute.
653    
654            * URIChecker.pm (check_rdf_uri_reference): New function.
655    
656    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
657    
658            * RDFXML.pm: bnodeid implemented.  Relative references
659            are now resolved.
660    
661    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
662    
663            * ContentChecker.pm: RDF reification implemented.
664    
665            * RDFXML.pm: undef vs false bug fixed.
666            Reification implemented.
667    
668    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
669    
670            * RDFXML.pm: s/id/ID/ for attribute name.
671            The |node| arguments are added for |ontriple| calls.
672            Too many "attribute not allowed" errors were raised.
673    
674            * ContentChecker.pm: Initial experimental support for rdf:RDF
675            element.
676    
677    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
678    
679            * RDFXML.pm: New module.
680    
681    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
682    
683            * HTML.pm.src (set_inner_html): Line/column number
684            code was old one yet.
685    
686    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
687    
688            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
689            errors.  Line/column markings for DOCTYPE, comment, and
690            character tokens are reintroduced; otherwise,
691            error location for "not HTML5" error and errors
692            for implied elements are not attached.
693    
694    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
695    
696            * HTML.pm.src: Set line/column numbers to attributes.
697    
698            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
699            (value): Setter implemented.
700    
701            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
702    
703    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
704    
705            * HTML.pm.src: Unused line/column markings are removed.
706    
707    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
708    
709            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
710            flag, which is no longer used.
711    
712    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
713    
714            * HTML.pm.src: Set line/column information to element nodes.
715    
716            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
717            and !!!insert-element-t): Set line/column information to
718            element nodes.
719    
720    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
721    
722            * HTML.pm.src (_get_next_token): The first "<" character
723            in "<?", "<>", or "</>" should be the error point.
724    
725    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
726    
727            * HTML.pm.src: Some more fixes on error position reporting.
728    
729    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
730    
731            * HTML.pm.src: Token-level precious error reporting.
732    
733    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
734    
735            * HTML.pm.src: Preparation for more precious error point
736            reporting.
737    
738    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
739    
740            * HTML.pm.src: Error type revised.
741    
742    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
743    
744            * HTML.pm.src: Similar codes are merged together, again.
745    
746    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
747    
748            * HTML.pm.src: Similar codes are merged together.
749    
750    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
751    
752            * mkhtmlparser.pl: Set "level" parameter to parse errors.
753    
754            * HTML.pm.src: Code refined.
755    
756    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
757    
758            * HTML.pm.src: |</body>| treatement has been changed (HTML5
759            revision 1348).  Note that I really don't know this makes
760            any difference in the black-box behavior of the parser.
761    
762    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
763    
764            * HTML.pm.src: New end-of-file token implementation (HTML5
765            revision 1348).
766    
767    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
768    
769            * HTML.pm.src: |applet| support (HTML5 revision 1347).
770    
771    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
772    
773            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
774    
775    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
776    
777            * HTML.pm.src: Support for |<input>| in the "in select" insertion
778            mode and support for the "in select in table" insertion mode (HTML5
779            revision 1342).
780    
781    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
782    
783            * HTML.pm.src: No foster parenting for <script> and <script>
784            in non-tainted <table>s (HTML5 revision 1336).
785    
786    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
787    
788            * HTML.pm.src: Ignore white space characters between <html>
789            and <head> (HTML5 revision 1332).
790    
791    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
792    
793            * HTML.pm.src: Treat <input type=hidden> as if it were a
794            white space (HTML5 revision 1331).
795    
796    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
797    
798            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
799            element (HTML5 revision 1330).
800    
801    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
802    
803            * HTML.pm.src: <title> is always appended to the current
804            element (HTML5 revision 1328).
805    
806    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
807    
808            * HTML.pm.src: White space in tainted tables are moved
809            into foster parents (HTML5 revision 1326).
810    
811    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
812    
813            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
814            revision 1321).
815    
816    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
817    
818            * HTML.pm.src: |</h/n/>| case code rearranged to align with
819            the spec (HTML5 revision 1320).  Note that we finally complete
820            all of HTML5 revision 1320 changes.
821    
822    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
823    
824            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
825            tags (HTML5 revision 1320).
826    
827    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
828    
829            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
830    
831    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
832    
833            * HTML.pm.src: A "generate implied end tags" code (t409.1)
834            could not be reached so that it is now removed (HTML5 revision
835            1320).
836    
837    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
838    
839            * HTML.pm.src: Code for the case of |</div>| and so on are revised
840            to align with new spec text (HTML5 revision 1320).
841    
842    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
843    
844            * HTML.pm.src: Remove strange |if| condition; however, it should
845            have had no harm in theory.
846    
847    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
848    
849            * HTML.pm.src (_tree_construction_main): '</p>' in body
850            case is split from other end tags for the preparation
851            of implementing HTML revision 1320.
852    
853    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
854    
855            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
856            revision 1320).
857    
858    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
859    
860            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
861            and "div" (HTML5 revision 1318).  Add comments to where
862            |form| pointer association codes should be inserted (HTML5 revision
863            1319).
864    
865    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
866    
867            * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
868    
869    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
870    
871            * HTML.pm.src: Since the case t268 should never be reached (no
872            other token type, there are), it is replaced by a |die| statement.
873    
874    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
875    
876            * HTML.pm.src: Typo fixed.
877    
878    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
879    
880            * HTML.pm.src (_tree_construction_initial): Some limited quirks
881            doctypes were not uppercased for comparison.
882    
883    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
884    
885            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
886            are added.
887    
888    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
889    
890            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
891            are set.  Cases that are unlikely reached are noted as so.
892    
893    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
894    
895            * HTML.pm.src: Checkpoints for debugging are added.
896    
897            * mkhtmlparser.pl: Support for |!!!cp| syntax.
898    
899    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
900    
901            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
902            for simplicity.
903    
904    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
905    
906            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
907            revision 1307).
908    
909    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
910    
911            * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
912            is set are changed (HTML5 revision 1305).
913    
914    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
915    
916            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
917            1292).  Entities are not parsed in comment-like part in RCDATA
918            elements (HTML5 revision 1294).  Allow bare & at the end
919            of attribute value literals (HTML5 revision 1296).  More
920            quirks mode doctypes (HTML5 revision 1302).  Requires spaces
921            between attributes and ban attribute names or unquoted
922            attribute values containing single or double quotes (HTML5
923            revision 1303).
924    
925    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
926    
927            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
928            and related errors unless it is an HTML document (though the spec
929            is unclear on whether it is applied to XHTML document).
930    
931            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
932            are added.
933    
934    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
935    
936            * HTML.pm.src: s/local_name/manakai_local_name/g.
937    
938    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
939    
940            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
941    
942            * HTML.pm.src: |charset| in |content| attribute is
943            case-insensitive (HTML5 revision 1270).
944    
945    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
946    
947            * ContentChecker.pm: New status constants are added.
948            ($ElementDefault): |status| added.
949            (check_element): Err for non-standard or deprecated elements.
950            (_attr_status_info): For non-standard or deprecated attributes.
951    
952    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
953    
954            * ContentChecker.pm (_attr_status_info): New internal method.
955    
956    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
957    
958            * ContentChecker.pm (check_element): Element standardized
959            status information is now dispatched.
960    
961    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
962    
963            * ContentChecker.pm (check_element): Fix |del|-and-significant
964            problem by adding some more arguments.
965    
966    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
967    
968            * ContentChecker.pm (check_element): Use context of
969            container-for-the-purpose-of-content-model element (not transparent
970            element) for |check_child_element| calling and significant
971            text flag marking.  This reintroduces |<del>|-and-significant
972            problem again.
973    
974    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
975    
976            * ContentChecker.pm (check_element): Make semi-transparent
977            elements ignored for the purpose of phase changes in
978            content model checking.
979    
980    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
981    
982            * ContentChecker.pm (check_element): In-element state
983            was not properly managed for transparent cases.
984    
985    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
986    
987            * ContentChecker.pm (check_element): Support for |video|
988            and |audio| as semi-transparent elements.
989    
990    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
991    
992            * ContentChecker.pm ($HTMLSemiTransparentElements): New.
993            (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
994            Support for |html:object| as a semi-transparent element.
995    
996    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
997    
998            * ContentChecker.pm (check_element): The way to traverse
999            the tree is entirely revised to make it easier to track
1000            the state of ancestors/descendants.  As a result of this
1001            revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1002            support for content model checking for HTML elements |figure|,
1003            |object|, |video|, and |audio| and checking for XML elements (and
1004            some XMLNS checkings) are dropped for now.  They will be
1005            reimplemented in due cource.
1006    
1007    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1008    
1009            * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1010            token (HTML5 revision 1225).
1011    
1012    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1013    
1014            * ContentChecker.pm ({unsupported_level}): New value.
1015    
1016            * HTML.pm.src: Save whether |meta| |content| attribute
1017            contains character references or not.
1018    
1019    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1020    
1021            * ContentChecker.pm (_get_children): (Incompleted) attempt to
1022            imlement significant content checking for contents
1023            with |del| elements.
1024    
1025    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1026    
1027            * ContenteChecker.pm ($HTMLTransparentElements): More
1028            elements are added.
1029            (_get_children): HTML |object| elements are now semi-transparent.
1030    
1031            * NanoDOM.pm (manakai_html, manakai_head): New methods.
1032    
1033    2008-02-16  Wakaba  <wakaba@suika.fam.cx>
1034    
1035            * CacheManifest.pm: HTML5 revision 1211 implemented.
1036    
1037            * CacheManifest.pod: Updated.
1038    
1039    2008-02-10  Wakaba  <wakaba@suika.fam.cx>
1040    
1041            * ContentChecker.pm (check_document, check_element): Support
1042            for second argument ($onsubdoc).
1043            (_get_css_parser): Removed (now it is part of WDCC).
1044    
1045    2008-02-09  Wakaba  <wakaba@suika.fam.cx>
1046    
1047            * ContentChecker.pm (_get_css_parser): New.
1048    
1049    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1050    
1051            * ContentChecker.pm ($AnyChecker): Old way to add child elements
1052            for checking had been used.
1053    
1054    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1055    
1056            * ContentChecker.pm (check_element): New todo item type |descendant|.
1057    
1058    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1059    
1060            * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1061            specced in RFC 2046 and RFC 4288.
1062            (application/atom+xml): Definition added.
1063    
1064    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1065    
1066            * URIChecker.pm: Make RFC 3986 should-level errors
1067            warnings (rather than SHOULD-level errors).
1068    
1069    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1070    
1071            * NanoDOM.pm (get_user_data, set_user_data): New methods.
1072    
1073            * HTML.pm.src: A flag for character references in attribute
1074            values are added.  Set |manakai_has_reference| user data
1075            to |charset| attribute.
1076    
1077    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1078    
1079            * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1080            attributes.
1081    
1082            * ContentChecker.pm (check_document): Warn if charset requirements
1083            cannot be tested.
1084    
1085    2007-11-19  Wakaba  <wakaba@suika.fam.cx>
1086    
1087            * HTML.pm.src (parse_byte_string): Detect charset
1088            by universalchardet if charset parameter is not specified.
1089    
1090            * Makefile (Charset-all, Charset-clean): New rules.
1091    
1092    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1093    
1094            * ContentChecker.pm (check_document): Check the existence
1095            of character encoding declaration and selection of encoding
1096            for HTML document.
1097    
1098    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1099    
1100            * ContentType.pm (get_sniffed_type): Return also the official
1101            type in list context.
1102    
1103    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1104    
1105            * ContentType.pm: Sniffing with leading white space
1106            ignoring (HTML5 revisions 1013 and 1016).
1107    
1108    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1109    
1110            * ContentType.pm: HTML5 revision 1013 changes, except for leading
1111            white spaces, are implemented.
1112    
1113    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1114    
1115            * HTML.pm.src (parse_char_string): Set |inner_encoding|
1116            attribute if possible.
1117    
1118    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1119    
1120            * HTML.pm.src (parse_byte_string): New method.
1121            (parse_char_string): New alias for |parse_string|.
1122            (main phase): Invoking "change the encoding" algorithm if desired.
1123    
1124            * HTML.pod: Updated.
1125    
1126    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1127    
1128            * HTML.pod (get_inner_html): Removed.
1129    
1130            * Makefile (HTML-all, HTML-clean): New.
1131    
1132    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1133    
1134            * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1135    
1136    2007-11-08  Wakaba  <wakaba@suika.fam.cx>
1137    
1138            * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1139    
1140    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1141    
1142            * Makefile: |CacheManifest.html| is added.
1143    
1144            * CacheManifest.pod: New file.
1145    
1146    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1147    
1148            * CacheManifest.pm: New module.
1149    
1150    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1151    
1152            * HTML.pm.src: Support for application cache selection algorithm
1153            callback.
1154    
1155    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1156    
1157            * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1158    
1159    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1160    
1161            * Makefile (clean): New rule.
1162    
1163            * NanoDOM.pm (public_id, system_id): New attributes.
1164    
1165    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1166    
1167            * Makefile (CSS-all, CSS-clean, clean): New rules.
1168    
1169    2007-10-14  Wakaba  <wakaba@suika.fam.cx>
1170    
1171            * ContentChecker.pm (check_document): Support for
1172            new |is_xml_root| flag.
1173            (check_element): Support for new |pluses| state.
1174            (_add_pluses): New method.
1175            (_remove_minuses): Support for new |minus| item.
1176    
1177    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1178    
1179            * ContentChecker.pm: Raise specific error for invalid
1180            root element.
1181    
1182    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1183    
1184            * ContentChecker.pm: Set level values for later uses.
1185    
1186    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1187    
1188            * ContentChecker.pm: Support for language tag validation.
1189    
1190    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1191    
1192            * LangTag.pm (check_rfc3066_language_tag): New method.
1193    
1194    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1195    
1196            * LangTag.pm: New module.
1197    
1198    2007-09-04  Wakaba  <wakaba@suika.fam.cx>
1199    
1200            * HTML.pm.src: Some error types were wrong.
1201    
1202    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1203    
1204            * CSS/: New directory.
1205    
1206    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1207    
1208            * ContentChecker.pm (_check_get_children): Support
1209            for |noscript| in |head|.
1210    
1211    2007-08-12  Wakaba  <wakaba@suika.fam.cx>
1212    
1213            * URI/: New directory.
1214    
1215    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1216    
1217            * HTML.pm.src: Tokenizer's states are now represented in
1218            number.
1219    
1220    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1221    
1222            * HTML.pm.src: |or|s for insertion modes are replaced
1223            by |&|s.
1224    
1225    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1226    
1227            * HTML.pm.src: Token types are now represented in number.
1228    
1229    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1230    
1231            * ContentType.pm (SEE ALSO): Updated.
1232    
1233            * HTML.pm.src: Insertion modes are now represented in number.
1234    
1235    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1236    
1237            * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1238            999) is implemented.
1239    
1240    2007-08-08  Wakaba  <wakaba@suika.fam.cx>
1241    
1242            * ContentType.pm: Sniffing for |<script| (HTML5 revision
1243            983) is implemented.
1244    
1245    2007-08-06  Wakaba  <wakaba@suika.fam.cx>
1246    
1247            * ContentChecker.pod: New documentation.
1248    
1249            * Makefile: A rule for |ContentChecker.html| is added.
1250    
1251            * ContentChecker.pm: A pod "LICENSE" section is added.
1252    
1253            * NanoDOM.pm ($VERSION): New variable.
1254    
1255    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1256    
1257            * H2H.pm: |b|, |i|, and |sub| are added to the
1258            list of allowed HTML elements.
1259    
1260    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1261    
1262            * H2H.pm: |samp| is added to the list of allowed
1263            HTML elements.
1264    
1265            * URIChecker.pm (check_iri): New.
1266            (check_iri_reference): Error type for IRI reference
1267            syntax error is changed.
1268    
1269    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1270    
1271            * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1272            is added.
1273            (check_document): Load appropriate module before validation.
1274    
1275    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1276    
1277            * ContentChecker/: New directory.
1278    
1279    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1280    
1281            * ContentChecker.pm: HTML |time| element is implemented.
1282    
1283            * HTMLTable.pm: Comments are updated as HTML5 is revised.
1284    
1285    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1286    
1287            * ContentChecker.pm (check_document): Return value
1288            even if no document element is found.
1289    
1290    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1291    
1292            * HTML.pm.src: |$in_body| is no longer a function.
1293    
1294    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1295    
1296            * HTML.pm.src: The |$in_body| code has been moved down.
1297    
1298    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1299    
1300            * HTML.pm.src: The "trailing end" insertion mode
1301            is split into "after html body" and "after html frameset"
1302            insertion modes.  Their codes are merged with "after body"
1303            and "after frameset" codes.  |$previous_insertion_mode|
1304            has been removed.  "after frameset" code is
1305            merged with "in frameset" code.
1306    
1307    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1308    
1309            * HTML.pm.src: The "before head" insertion mode is
1310            merged with the "in head" insertion mode.
1311    
1312    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1313    
1314            * HTML.pm.src: Cases in "in head" insertion mode are
1315            reorganized.
1316    
1317    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1318    
1319            * HTML.pm.src: Some cases in "in table" insertion mode
1320            are merged.
1321    
1322    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1323    
1324            * HTML.pm.src: The "in row" insertion mode is merged
1325            with "in table" insertion mode.
1326    
1327    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1328    
1329            * HTML.pm.src: The "in table" and "in table body" insertion
1330            modes are merged.
1331    
1332    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1333    
1334            * HTML.pm.src: There is no "in table head" or "in table foot"
1335            insertion mode!
1336    
1337    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1338    
1339            * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1340            now directly invoke the handler.
1341    
1342    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1343    
1344            * HTML.pm.src: Codes for "in cell" insertion mode
1345            is merged to the "in body" insertion mode code.
1346    
1347    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1348    
1349            * HTML.pm.src: Codes for "in body" and "in caption"
1350            insertion modes are merged.
1351    
1352    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1353    
1354            * HTML.pm.src: Two |!!!next-token|s were missing.
1355    
1356    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1357    
1358            * HTML.pm.src: Use numeric constant for |{content_mode}|
1359            instead of string constant for |{content_model_flag}|.
1360    
1361    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1362    
1363            * HTML.pm.src: Add the name of the attribute
1364            to the "duplicate attribute" error.
1365    
1366    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
1367    
1368            * ContentChecker.pm: Return the |class| node list.
1369    
1370    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
1371    
1372            * ContentChecker.pm: Return the |id| node list.
1373    
1374            * HTML.pm.src: A typo is fixed.
1375    
1376    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1377    
1378            * ContentChecker.pm: Drop wrong |level => 'error'| specification
1379            from "in HTML:xml:lang" error.  Character position
1380            is now the last part of the error type in the URI error
1381            description.  Report "unsupported" status for language
1382            tags, media queries, script codes, and style sheets.
1383    
1384    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1385    
1386            * ContentChecker.pm: Report error if |xml:lang|
1387            in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1388            in XML.
1389    
1390            * NanoDOM.pm (Attr.owner_document): New attribute.
1391    
1392    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1393    
1394            * HTML.pm.src: The character immediately following
1395            a bare |hcro| was discarded.  Fix handling of
1396            entity references in attribute values.
1397    
1398    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1399    
1400            * HTML.pm.src (main and trailin end phases): Token
1401            types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1402            factored out.  Error types |in html:#DOCTYPE|
1403            and |after html:#DOCTYPE| are merged
1404            into |DOCTYPE in the middle|.  |</frameset>|
1405            in fragment parsing mode changed the
1406            insertion mode.
1407    
1408    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1409    
1410            * HTML.pm.src: |$phase| has been removed; The |trailing end|
1411            phase is now an insertion mode.  Treatments for white
1412            space character tokens were incorrect for some
1413            insertion modes.  An old |meta| case was not removed.
1414    
1415    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
1416    
1417            * HTML.pm.src: |meta| charset declaration extraction
1418            implemented (but changing the encoding is not yet:-).
1419    
1420    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1421    
1422            * Charset/: New directory.
1423    
1424  2007-07-15  Wakaba  <wakaba@suika.fam.cx>  2007-07-15  Wakaba  <wakaba@suika.fam.cx>
1425    
1426          * H2H.pm: New Perl module (created from          * H2H.pm: New Perl module (created from

Legend:
Removed from v.1.61  
changed lines
  Added in v.1.303

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24