/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.294 by wakaba, Sat Sep 13 07:51:32 2008 UTC revision 1.383 by wakaba, Sun Aug 16 05:57:24 2009 UTC
# Line 1  Line 1 
1    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: An <hn> start tag in an <hn> element should close
4            the earlier element (HTML5 revision 2501).
5    
6    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
7    
8            * HTML.pm.src: The </br> end tags before the <body> start tag
9            should equally survive as the first br element child of the body
10            element (HTML5 revision 2586).
11    
12    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
13    
14            * HTML.pm.src: <font> in SVG land is handled as an SVG element as
15            long as it did not have HTML attributes (HTML5 revision 2590).
16    
17    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
18    
19            * HTML.pm.src (_tree_construction_main): Improved error reporting
20            on "select" start tag in "in select" and "in select in table"
21            parsing modes (c.f. HTML5 revision 3604).
22    
23    2009-07-26  Wakaba  <wakaba@suika.fam.cx>
24    
25            * mkhtmlclass.pl: Made xmlns:xlink with wrong namespace URL
26            invalid (HTML5 revision 2701).
27    
28    2009-07-26  Wakaba  <wakaba@suika.fam.cx>
29    
30            * HTML.pm.src: </rt>, </rp>, </option>, </optgroup> become
31            optional before </body> (HTML5 revision 2806).
32    
33    2009-07-25  Wakaba  <wakaba@suika.fam.cx>
34    
35            * HTML.pm.src: Marked where SVG |script| execution should be
36            implemented (HTML5 revision 2904).
37    
38    2009-07-25  Wakaba  <wakaba@suika.fam.cx>
39    
40            * HTML.pm.src (set_inner_html): The fragment parsing algorithm
41            inherits the document quirksness flag of the main document (HTML5
42            revision 2935).
43    
44    2009-07-05  Wakaba  <wakaba@suika.fam.cx>
45    
46            * HTML.pm.src (_tree_construction_main): Put |table| in |p| in
47            quirks mode (The Hixie Quirk, HTML5 revision 3132).
48    
49    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
50    
51            * HTML.pm.src: <kegten> breaks out of <select> (HTML5 revision
52            3163).
53    
54    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
55    
56            * HTML.pm.src: Don't change encoding to UTF-8 if the input
57            encoding is UTF-16 (HTML5 revision 3204).
58    
59    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
60    
61            * HTML.pm.src (parse_byte_stream): Don't change to another
62            encoding if the input is UTF-16 (HTML5 revision 3205).
63    
64    2009-06-28  Wakaba  <wakaba@suika.fam.cx>
65    
66            * HTML.pm.src: |isindex| should no longer generate |p| element
67            node (HTML5 revision 3236).
68    
69    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
70    
71            * ContentChecker.pm: Introduced new flag |has_autofocus|.  Added
72            support for new |id_type| value of |any|, which means any kind of
73            element is allowed as a referenced element.
74    
75    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
76    
77            * ContentChecker.pm: Introduced |check_attrs2| method for the ease
78            of defining codes for checking required attribute and as such.
79    
80    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
81    
82            * IMTChecker.pm: Added more definitions for subtypes.
83    
84    2008-12-11  Wakaba  <wakaba@suika.fam.cx>
85    
86            * URIChecker.pm: Some of |pos_end| values were wrong.
87    
88    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
89    
90            * ContentChecker.pm (check_element): Added support for "no
91            referenced datalist" error.
92    
93    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
94    
95            * URIChecker.pm: Bug fix: It did not work unless
96            Message::DOM::DOMImplementation has been |require|d.
97    
98    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
99    
100            * NanoDOM.pm (document_uri): New attribute.
101    
102            * ContentChecker.pm: Don't use methods not implemented by NanoDOM.
103    
104    2008-11-07  Wakaba  <wakaba@suika.fam.cx>
105    
106            * NanoDOM.pm (text_content): Don't create a Text node if the new
107            value is empty.
108    
109    2008-11-06  Wakaba  <wakaba@suika.fam.cx>
110    
111            * SWML/: New directory.
112    
113    2008-10-20  Wakaba  <wakaba@suika.fam.cx>
114    
115            * NanoDOM.pm (specified, all_declarations_processed,
116            manakai_attribute_type): New attributes.
117    
118    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
119    
120            * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
121            array.
122    
123    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
124    
125            * NanoDOM.pm (notation_name): New attribute.
126    
127    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
128    
129            * NanoDOM.pm (public_id, system_id): New attributes.a
130    
131    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
132    
133            * NanoDOM.pm (text_content): Moved to Node from Element.  Setter
134            implemented.
135            (allowed_tokens, default_type, declared_type): Implemented.
136    
137    2008-10-17  Wakaba  <wakaba@suika.fam.cx>
138    
139            * NanoDOM.pm (node_name): New attribute.
140            (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
141            constants.
142            (create_element_type_definition_node, create_attribute_definition,
143            create_notation, create_general_entity,
144            get_element_type_definition_node,
145            set_element_type_definition_node, get_general_entity_node,
146            set_general_entity_node, get_notation_node, set_notation_node,
147            get_attribute_definition_node, set_attribute_definition_node): New
148            methods.
149            (element_types, entities, notations, attribute_definitions): New
150            attributes.
151            (DocumentType): Support for child nodes, entities, notations, and
152            element types.
153            (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
154            New classes.
155    
156            * Dumper.pm: Support for general entities, notations, element type
157            definitions, and attribute definitions.
158    
159    2008-10-15  Wakaba  <wakaba@suika.fam.cx>
160    
161            * NanoDOM.pm (create_processing_instruction): New method.
162            (xml_version, xml_encoding, xml_standalone): New attributes.
163            (ProcessingInstruction): New class.
164    
165    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
166    
167            * HTML.pm.src: Handling of end tags in the foreign content
168            insertion mode was partially wrong, because of wrong bit
169            operations.
170    
171    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
172    
173            * NanoDOM.pm (dom_config): New attribute (do nothing), for
174            Whatpm::XML::Parser support.
175    
176    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
177    
178            * Makefile: New rule to make HTML/Tokenizer.pm is added.
179    
180            * HTML.pm.src: Tokenizer part moved to another file.
181    
182    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
183    
184            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
185    
186    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
187    
188            * HTML.pm.src: Element category constants redefined.
189            
190    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
191    
192            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
193            construction stage synced with the spec (HTML5 revisions 2139 and
194            2302).
195    
196    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
197    
198            * ContentChecker.pm: New error level "html5_fact" added, which
199            should be tentatively used until all of requirements are properly
200            specced as RFC 2119 "MUST" in HTML5.
201    
202    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
203    
204            * ContentChecker.pod: Note on internal flags is added.
205    
206    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
207    
208            * HTML.pm.src: An AAA bug fixed.
209    
210    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
211    
212            * HTML.pm.src: If another node is inserted by the parser, don't
213            reuse existing Text node to append a character (HTML5 revision
214            2124).
215    
216    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
217    
218            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
219            revisions 1731 and 2128).
220    
221    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
222    
223            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
224            Support for end tags of camelCase SVG elements were broken.  A
225            wrong error type text fixed.
226    
227    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
228    
229            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
230    
231    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
232    
233            * HTML.pm.src: Support for new definition of |param| and |source|
234            start tag parsing (HTML5 revision 1731).
235    
236    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
237    
238            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
239    
240    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
241    
242            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
243            revisions 1731 and 1831).
244    
245    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
246    
247            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
248            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
249            revision 1731).
250    
251    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
252    
253            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
254            revision 1731).  End tags of |option| and |optgroup| elements are
255            now optional (HTML5 revision 1731).
256    
257    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
258    
259            * HTML.pm.src: New "special" elements added to the list (HTML5
260            revision 1778).  "strile" -> "strike".
261    
262    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
263    
264            * ContentType.pm (get_sniffed_type): Support for the "better"
265            content sniffing (HTML5 revision 1927).  In a case the official
266            type was not returned when the method is invoked in the list
267            context.
268    
269    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
270    
271            * HTML.pm.src: Character references for non-space C0 characters,
272            including U+000B VT, DEL character, noncharacter code points, are
273            now converted to the U+FFFD character (cf. HTML5 revision 2138).
274    
275    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
276    
277            * ContentChecker.pm: |form=""| check support added.
278    
279    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
280    
281            * ContentChecker.pm: |contextmenu| validness is now checked using
282            |id| and |id_type| properties, and |menu| property is removed.
283    
284    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
285    
286            * ContentChecker.pm: Prepare for |form| |name| attribute's
287            duplication checking.
288    
289    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
290    
291            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
292            unsupported charset="" parameter value (HTML5 revision 2131).
293    
294    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
295    
296            * HTML.pm.src: Reminding places where U+000B is allowed as a space
297            character is fixed (cf. HTML5 revision 1738).
298    
299            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
300            space characters (HTML5 revision 1738).
301    
302    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
303    
304            * HTML.pm.src: The "anything else" case for the "after after body"
305            insertion mode was not updated to swtich to the "in body"
306            insertion mode.  U+000B is no longer a space character for the
307            purpose of tree construction phase (HTML5 revision 1738).
308    
309    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
310    
311            * HTML.pm.src: U+000B is no longer a space character (HTML5
312            revision 1738).
313    
314    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
315    
316            * ContentType.pm: 0x0B is no longer a space character (HTML5
317            revision 1738).
318    
319            * HTML.pm.src: U+000B is no longer a space character for the
320            algorithm for extracting an encoding from a Content-Type (HTML5
321            revision 1738).
322    
323    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
324    
325            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
326    
327    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
328    
329            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
330    
331            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
332            matching.
333    
334    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
335    
336            * LangTag.pm: Warn for private use language subtags.  Error level
337            typos fixed.  Support for Suppress-Script field.
338    
339            * mklangreg.pl: Support for dumping of nested structure.
340    
341    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
342    
343            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
344            recommended case as per RFC 4646.
345    
346    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
347    
348            * LangTag.pm (check_rfc4646_langtag): New method.
349    
350    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
351    
352            * mklangreg.pl: New script.
353    
354            * Makefile: Updated for creation of the module for language subtag
355            registry.
356            
357    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
358    
359            * Makefile: WebIDL.html added.
360    
361            * WebIDL.pod: New documentation.
362    
363    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
364    
365            * WebIDL.pm: Checker's error types are redefined.
366    
367    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
368    
369            * WebIDL.pm: Parser's error types are redefined.  Some forward
370            compatible parsing bugs are fixed.  Some unreachable codes are
371            commented out.
372    
373    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
374    
375            * WebIDL.pm: Support for the reminding extended attributes are
376            added.  It does not satisfy the definition that a forward
377            interface declaration has an extended attribute.  It seems that
378            unless explicitly allowed multiple extended attributes with the
379            same name is not allowed, though it is not explicitly mentioned in
380            the spec.
381    
382    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
383    
384            * WebIDL.pm: Unescapes extended attribute names and extended
385            attribute identifiers.  Preserve whether an extended attribute has
386            an argument list of not.  Support for extended attributes:
387            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
388            NameGetter, NameSetter, and Null.
389            (has_argument_list): New attribute.
390            (idl_text): Stringifies argument lists, if any, even if it is
391            empty.
392    
393    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
394    
395            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
396            |{s_kwd}| in DATA_STATE as default.
397    
398    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
399    
400            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
401            by |{s_kwd}| in DATA_STATE.
402    
403    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
404    
405            * HTML.pm.src: Shorten keys.
406    
407    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
408    
409            * HTML.pm.src: Remove checking for control character, surrogate
410            pair, or noncharacter code points and non-Unicode code
411            points (they should be handled by Whatpm::Charset::UnicodeChecker).
412            (parse_char_stream): Support for the |$get_wrapper| argument and
413            character stream error handlers.
414    
415    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
416    
417            * ContentChecker.pm: Don't call |loda_ns_module|
418            for null-namespace elements/attributes.
419    
420            * HTML.pm.src: Fact out $disallowed_control_chars
421            as a hash.
422    
423    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
424    
425            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
426            and |{next_char}| initializations are moved to initialization
427            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
428            with |parse_char_stream|.
429    
430    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
431    
432            * HTML.pm.src (parse_char_stream): Make |set_next_char|
433            invoke |manakai_read_until|, not only |read|, where
434            possible, to decrease the number of |read| method calls.
435    
436            * mkhtmlparser.pl: Related changes to the aforementioned
437            modification.
438    
439    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
440    
441            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
442            would report character error from now.
443    
444    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
445    
446            * HTML.pm.src: White-space-leaded non-white-space character
447            tokens in "before head insertion mode" was not
448            correctly handled.
449            (set_inner_html): Reimplemented using CharString decodehandle
450            class.  Support for $get_wrapper argument.  Support
451            for |{read_until}| feature.
452    
453    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
454    
455            * HTML.pm.src: Make a "bare ero" error for unknown
456            entities point the "&" character.
457    
458    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
459    
460            * HTML.pm.src: It turns out that U+FFFD don't have to
461            be added to the list of excluded characters.
462    
463    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
464    
465            * HTML.pm.src ($char_onerror): Have character decoder's |line|
466            and |column| a higher priority than the one set by the
467            tokenizer's input handler.
468            ($self->{read_until}): Exclude U+FFFD (but this might
469            not be necessary, since now we do line/column fixup in
470            the character decode handle).
471    
472    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
473    
474            * HTML.pm.src: Use |{read_until}| where possible.
475    
476    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
477    
478            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
479            and |manakai_getc_until| to |manakai_read_until| to
480            reduce the number of string copies.
481    
482    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
483    
484            * HTML.pm.src (parse_char_string): Use newly created
485            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
486            standard feature to |open| a string as a filehandle,
487            since Perl's string filehandle seems not supporting |ungetc|
488            method correctly.
489            (parse_char_stream): Define |{getc_until}| method.
490            (DATA_STATE): Experimental support for |getc_until| feature.
491    
492    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
493    
494            * HTML.pm.src: Check points added to newly added branches.
495    
496    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
497    
498            * HTML.pm.src: Remove |{char}|, which is no longer used.
499            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
500            and replaced by |{prev_state}|.
501    
502            * mkhtmlparser.pl: Remove |{char}| feature.
503            Remove |!!!back-next-input-character;| macro.
504    
505    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
506    
507            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
508            entity related tokenizer states in favor of new states
509            implementing the consume character reference algorithm.
510    
511    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
512    
513            * HTML.pm.src: "Consume a character reference" algorithm is
514            now implemented as a tokenizer's state, rather than
515            a method, with minimum changes (more changes will
516            be made, in due course).  "Bogus comment state"'s inner
517            loop gets removed.
518    
519    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
520    
521            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
522            into their own tokenizer states.
523    
524  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>
525    
526          * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|          * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|

Legend:
Removed from v.1.294  
changed lines
  Added in v.1.383

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24