/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.295 by wakaba, Sat Sep 13 08:21:35 2008 UTC revision 1.387 by wakaba, Sun Aug 23 02:35:32 2009 UTC
# Line 1  Line 1 
1    2009-08-23  Wakaba  <wakaba@suika.fam.cx>
2    
3            * ContentChecker.pm: Allow <input type=hidden> in context where
4            interactive content is disallowed (HTML5 revision 2392).
5    
6    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
7    
8            * HTML.pm.src: Dropped support for "XSLT-compat" and added
9            "about:legacy-compat" (HTML5 revision 2725).
10    
11    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
12    
13            * HTML.pm.src: "<!DOCTYPE>" is now explicitly supported in the
14            spec (HTML5 revision 2700).
15    
16    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
17    
18            * HTML.pm.src: Added support for <script> element in the "in
19            select" and "in select in table" insertion modes (HTML5 revision
20            2500).
21    
22    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
23    
24            * HTML.pm.src: An <hn> start tag in an <hn> element should close
25            the earlier element (HTML5 revision 2501).
26    
27    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
28    
29            * HTML.pm.src: The </br> end tags before the <body> start tag
30            should equally survive as the first br element child of the body
31            element (HTML5 revision 2586).
32    
33    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
34    
35            * HTML.pm.src: <font> in SVG land is handled as an SVG element as
36            long as it did not have HTML attributes (HTML5 revision 2590).
37    
38    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
39    
40            * HTML.pm.src (_tree_construction_main): Improved error reporting
41            on "select" start tag in "in select" and "in select in table"
42            parsing modes (c.f. HTML5 revision 3604).
43    
44    2009-07-26  Wakaba  <wakaba@suika.fam.cx>
45    
46            * mkhtmlclass.pl: Made xmlns:xlink with wrong namespace URL
47            invalid (HTML5 revision 2701).
48    
49    2009-07-26  Wakaba  <wakaba@suika.fam.cx>
50    
51            * HTML.pm.src: </rt>, </rp>, </option>, </optgroup> become
52            optional before </body> (HTML5 revision 2806).
53    
54    2009-07-25  Wakaba  <wakaba@suika.fam.cx>
55    
56            * HTML.pm.src: Marked where SVG |script| execution should be
57            implemented (HTML5 revision 2904).
58    
59    2009-07-25  Wakaba  <wakaba@suika.fam.cx>
60    
61            * HTML.pm.src (set_inner_html): The fragment parsing algorithm
62            inherits the document quirksness flag of the main document (HTML5
63            revision 2935).
64    
65    2009-07-05  Wakaba  <wakaba@suika.fam.cx>
66    
67            * HTML.pm.src (_tree_construction_main): Put |table| in |p| in
68            quirks mode (The Hixie Quirk, HTML5 revision 3132).
69    
70    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
71    
72            * HTML.pm.src: <kegten> breaks out of <select> (HTML5 revision
73            3163).
74    
75    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
76    
77            * HTML.pm.src: Don't change encoding to UTF-8 if the input
78            encoding is UTF-16 (HTML5 revision 3204).
79    
80    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
81    
82            * HTML.pm.src (parse_byte_stream): Don't change to another
83            encoding if the input is UTF-16 (HTML5 revision 3205).
84    
85    2009-06-28  Wakaba  <wakaba@suika.fam.cx>
86    
87            * HTML.pm.src: |isindex| should no longer generate |p| element
88            node (HTML5 revision 3236).
89    
90    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
91    
92            * ContentChecker.pm: Introduced new flag |has_autofocus|.  Added
93            support for new |id_type| value of |any|, which means any kind of
94            element is allowed as a referenced element.
95    
96    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
97    
98            * ContentChecker.pm: Introduced |check_attrs2| method for the ease
99            of defining codes for checking required attribute and as such.
100    
101    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
102    
103            * IMTChecker.pm: Added more definitions for subtypes.
104    
105    2008-12-11  Wakaba  <wakaba@suika.fam.cx>
106    
107            * URIChecker.pm: Some of |pos_end| values were wrong.
108    
109    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
110    
111            * ContentChecker.pm (check_element): Added support for "no
112            referenced datalist" error.
113    
114    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
115    
116            * URIChecker.pm: Bug fix: It did not work unless
117            Message::DOM::DOMImplementation has been |require|d.
118    
119    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
120    
121            * NanoDOM.pm (document_uri): New attribute.
122    
123            * ContentChecker.pm: Don't use methods not implemented by NanoDOM.
124    
125    2008-11-07  Wakaba  <wakaba@suika.fam.cx>
126    
127            * NanoDOM.pm (text_content): Don't create a Text node if the new
128            value is empty.
129    
130    2008-11-06  Wakaba  <wakaba@suika.fam.cx>
131    
132            * SWML/: New directory.
133    
134    2008-10-20  Wakaba  <wakaba@suika.fam.cx>
135    
136            * NanoDOM.pm (specified, all_declarations_processed,
137            manakai_attribute_type): New attributes.
138    
139    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
140    
141            * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
142            array.
143    
144    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
145    
146            * NanoDOM.pm (notation_name): New attribute.
147    
148    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
149    
150            * NanoDOM.pm (public_id, system_id): New attributes.a
151    
152    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
153    
154            * NanoDOM.pm (text_content): Moved to Node from Element.  Setter
155            implemented.
156            (allowed_tokens, default_type, declared_type): Implemented.
157    
158    2008-10-17  Wakaba  <wakaba@suika.fam.cx>
159    
160            * NanoDOM.pm (node_name): New attribute.
161            (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
162            constants.
163            (create_element_type_definition_node, create_attribute_definition,
164            create_notation, create_general_entity,
165            get_element_type_definition_node,
166            set_element_type_definition_node, get_general_entity_node,
167            set_general_entity_node, get_notation_node, set_notation_node,
168            get_attribute_definition_node, set_attribute_definition_node): New
169            methods.
170            (element_types, entities, notations, attribute_definitions): New
171            attributes.
172            (DocumentType): Support for child nodes, entities, notations, and
173            element types.
174            (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
175            New classes.
176    
177            * Dumper.pm: Support for general entities, notations, element type
178            definitions, and attribute definitions.
179    
180    2008-10-15  Wakaba  <wakaba@suika.fam.cx>
181    
182            * NanoDOM.pm (create_processing_instruction): New method.
183            (xml_version, xml_encoding, xml_standalone): New attributes.
184            (ProcessingInstruction): New class.
185    
186    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
187    
188            * HTML.pm.src: Handling of end tags in the foreign content
189            insertion mode was partially wrong, because of wrong bit
190            operations.
191    
192    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
193    
194            * NanoDOM.pm (dom_config): New attribute (do nothing), for
195            Whatpm::XML::Parser support.
196    
197    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
198    
199            * Makefile: New rule to make HTML/Tokenizer.pm is added.
200    
201            * HTML.pm.src: Tokenizer part moved to another file.
202    
203    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
204    
205            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
206    
207    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
208    
209            * HTML.pm.src: Element category constants redefined.
210            
211    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
212    
213            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
214            construction stage synced with the spec (HTML5 revisions 2139 and
215            2302).
216    
217    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
218    
219            * ContentChecker.pm: New error level "html5_fact" added, which
220            should be tentatively used until all of requirements are properly
221            specced as RFC 2119 "MUST" in HTML5.
222    
223    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
224    
225            * ContentChecker.pod: Note on internal flags is added.
226    
227    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
228    
229            * HTML.pm.src: An AAA bug fixed.
230    
231    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
232    
233            * HTML.pm.src: If another node is inserted by the parser, don't
234            reuse existing Text node to append a character (HTML5 revision
235            2124).
236    
237    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
238    
239            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
240            revisions 1731 and 2128).
241    
242    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
243    
244            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
245            Support for end tags of camelCase SVG elements were broken.  A
246            wrong error type text fixed.
247    
248    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
249    
250            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
251    
252    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
253    
254            * HTML.pm.src: Support for new definition of |param| and |source|
255            start tag parsing (HTML5 revision 1731).
256    
257    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
258    
259            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
260    
261    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
262    
263            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
264            revisions 1731 and 1831).
265    
266    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
267    
268            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
269            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
270            revision 1731).
271    
272    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
273    
274            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
275            revision 1731).  End tags of |option| and |optgroup| elements are
276            now optional (HTML5 revision 1731).
277    
278    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
279    
280            * HTML.pm.src: New "special" elements added to the list (HTML5
281            revision 1778).  "strile" -> "strike".
282    
283    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
284    
285            * ContentType.pm (get_sniffed_type): Support for the "better"
286            content sniffing (HTML5 revision 1927).  In a case the official
287            type was not returned when the method is invoked in the list
288            context.
289    
290    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
291    
292            * HTML.pm.src: Character references for non-space C0 characters,
293            including U+000B VT, DEL character, noncharacter code points, are
294            now converted to the U+FFFD character (cf. HTML5 revision 2138).
295    
296    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
297    
298            * ContentChecker.pm: |form=""| check support added.
299    
300    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
301    
302            * ContentChecker.pm: |contextmenu| validness is now checked using
303            |id| and |id_type| properties, and |menu| property is removed.
304    
305    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
306    
307            * ContentChecker.pm: Prepare for |form| |name| attribute's
308            duplication checking.
309    
310    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
311    
312            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
313            unsupported charset="" parameter value (HTML5 revision 2131).
314    
315    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
316    
317            * HTML.pm.src: Reminding places where U+000B is allowed as a space
318            character is fixed (cf. HTML5 revision 1738).
319    
320            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
321            space characters (HTML5 revision 1738).
322    
323    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
324    
325            * HTML.pm.src: The "anything else" case for the "after after body"
326            insertion mode was not updated to swtich to the "in body"
327            insertion mode.  U+000B is no longer a space character for the
328            purpose of tree construction phase (HTML5 revision 1738).
329    
330    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
331    
332            * HTML.pm.src: U+000B is no longer a space character (HTML5
333            revision 1738).
334    
335    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
336    
337            * ContentType.pm: 0x0B is no longer a space character (HTML5
338            revision 1738).
339    
340            * HTML.pm.src: U+000B is no longer a space character for the
341            algorithm for extracting an encoding from a Content-Type (HTML5
342            revision 1738).
343    
344    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
345    
346            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
347    
348    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
349    
350            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
351    
352            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
353            matching.
354    
355    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
356    
357            * LangTag.pm: Warn for private use language subtags.  Error level
358            typos fixed.  Support for Suppress-Script field.
359    
360            * mklangreg.pl: Support for dumping of nested structure.
361    
362    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
363    
364            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
365            recommended case as per RFC 4646.
366    
367    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
368    
369            * LangTag.pm (check_rfc4646_langtag): New method.
370    
371    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
372    
373            * mklangreg.pl: New script.
374    
375            * Makefile: Updated for creation of the module for language subtag
376            registry.
377            
378    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
379    
380            * Makefile: WebIDL.html added.
381    
382            * WebIDL.pod: New documentation.
383    
384    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
385    
386            * WebIDL.pm: Checker's error types are redefined.
387    
388    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
389    
390            * WebIDL.pm: Parser's error types are redefined.  Some forward
391            compatible parsing bugs are fixed.  Some unreachable codes are
392            commented out.
393    
394    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
395    
396            * WebIDL.pm: Support for the reminding extended attributes are
397            added.  It does not satisfy the definition that a forward
398            interface declaration has an extended attribute.  It seems that
399            unless explicitly allowed multiple extended attributes with the
400            same name is not allowed, though it is not explicitly mentioned in
401            the spec.
402    
403    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
404    
405            * WebIDL.pm: Unescapes extended attribute names and extended
406            attribute identifiers.  Preserve whether an extended attribute has
407            an argument list of not.  Support for extended attributes:
408            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
409            NameGetter, NameSetter, and Null.
410            (has_argument_list): New attribute.
411            (idl_text): Stringifies argument lists, if any, even if it is
412            empty.
413    
414    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
415    
416            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
417            |{s_kwd}| in DATA_STATE as default.
418    
419    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
420    
421            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
422            by |{s_kwd}| in DATA_STATE.
423    
424    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
425    
426            * HTML.pm.src: Shorten keys.
427    
428    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
429    
430            * HTML.pm.src: Remove checking for control character, surrogate
431            pair, or noncharacter code points and non-Unicode code
432            points (they should be handled by Whatpm::Charset::UnicodeChecker).
433            (parse_char_stream): Support for the |$get_wrapper| argument and
434            character stream error handlers.
435    
436    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
437    
438            * ContentChecker.pm: Don't call |loda_ns_module|
439            for null-namespace elements/attributes.
440    
441            * HTML.pm.src: Fact out $disallowed_control_chars
442            as a hash.
443    
444    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
445    
446            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
447            and |{next_char}| initializations are moved to initialization
448            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
449            with |parse_char_stream|.
450    
451    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
452    
453            * HTML.pm.src (parse_char_stream): Make |set_next_char|
454            invoke |manakai_read_until|, not only |read|, where
455            possible, to decrease the number of |read| method calls.
456    
457            * mkhtmlparser.pl: Related changes to the aforementioned
458            modification.
459    
460    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
461    
462            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
463            would report character error from now.
464    
465    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
466    
467            * HTML.pm.src: White-space-leaded non-white-space character
468            tokens in "before head insertion mode" was not
469            correctly handled.
470            (set_inner_html): Reimplemented using CharString decodehandle
471            class.  Support for $get_wrapper argument.  Support
472            for |{read_until}| feature.
473    
474    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
475    
476            * HTML.pm.src: Make a "bare ero" error for unknown
477            entities point the "&" character.
478    
479    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
480    
481            * HTML.pm.src: It turns out that U+FFFD don't have to
482            be added to the list of excluded characters.
483    
484    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
485    
486            * HTML.pm.src ($char_onerror): Have character decoder's |line|
487            and |column| a higher priority than the one set by the
488            tokenizer's input handler.
489            ($self->{read_until}): Exclude U+FFFD (but this might
490            not be necessary, since now we do line/column fixup in
491            the character decode handle).
492    
493    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
494    
495            * HTML.pm.src: Use |{read_until}| where possible.
496    
497    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
498    
499            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
500            and |manakai_getc_until| to |manakai_read_until| to
501            reduce the number of string copies.
502    
503    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
504    
505            * HTML.pm.src (parse_char_string): Use newly created
506            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
507            standard feature to |open| a string as a filehandle,
508            since Perl's string filehandle seems not supporting |ungetc|
509            method correctly.
510            (parse_char_stream): Define |{getc_until}| method.
511            (DATA_STATE): Experimental support for |getc_until| feature.
512    
513    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
514    
515            * HTML.pm.src: Check points added to newly added branches.
516    
517    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
518    
519            * HTML.pm.src: Remove |{char}|, which is no longer used.
520            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
521            and replaced by |{prev_state}|.
522    
523            * mkhtmlparser.pl: Remove |{char}| feature.
524            Remove |!!!back-next-input-character;| macro.
525    
526    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
527    
528            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
529            entity related tokenizer states in favor of new states
530            implementing the consume character reference algorithm.
531    
532    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
533    
534            * HTML.pm.src: "Consume a character reference" algorithm is
535            now implemented as a tokenizer's state, rather than
536            a method, with minimum changes (more changes will
537            be made, in due course).  "Bogus comment state"'s inner
538            loop gets removed.
539    
540  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>
541    
542          * HTML.pm: Make |PUBLIC| and |SYSTEM| keyword tokenizing          * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
543          into their own tokenizer states.          into their own tokenizer states.
544    
545  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>

Legend:
Removed from v.1.295  
changed lines
  Added in v.1.387

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24