/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.284 by wakaba, Sat Aug 30 14:37:46 2008 UTC revision 1.380 by wakaba, Sun Aug 16 03:41:52 2009 UTC
# Line 1  Line 1 
1    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src (_tree_construction_main): Improved error reporting
4            on "select" start tag in "in select" and "in select in table"
5            parsing modes (c.f. HTML5 revision 3604).
6    
7    2009-07-26  Wakaba  <wakaba@suika.fam.cx>
8    
9            * mkhtmlclass.pl: Made xmlns:xlink with wrong namespace URL
10            invalid (HTML5 revision 2701).
11    
12    2009-07-26  Wakaba  <wakaba@suika.fam.cx>
13    
14            * HTML.pm.src: </rt>, </rp>, </option>, </optgroup> become
15            optional before </body> (HTML5 revision 2806).
16    
17    2009-07-25  Wakaba  <wakaba@suika.fam.cx>
18    
19            * HTML.pm.src: Marked where SVG |script| execution should be
20            implemented (HTML5 revision 2904).
21    
22    2009-07-25  Wakaba  <wakaba@suika.fam.cx>
23    
24            * HTML.pm.src (set_inner_html): The fragment parsing algorithm
25            inherits the document quirksness flag of the main document (HTML5
26            revision 2935).
27    
28    2009-07-05  Wakaba  <wakaba@suika.fam.cx>
29    
30            * HTML.pm.src (_tree_construction_main): Put |table| in |p| in
31            quirks mode (The Hixie Quirk, HTML5 revision 3132).
32    
33    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
34    
35            * HTML.pm.src: <kegten> breaks out of <select> (HTML5 revision
36            3163).
37    
38    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
39    
40            * HTML.pm.src: Don't change encoding to UTF-8 if the input
41            encoding is UTF-16 (HTML5 revision 3204).
42    
43    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
44    
45            * HTML.pm.src (parse_byte_stream): Don't change to another
46            encoding if the input is UTF-16 (HTML5 revision 3205).
47    
48    2009-06-28  Wakaba  <wakaba@suika.fam.cx>
49    
50            * HTML.pm.src: |isindex| should no longer generate |p| element
51            node (HTML5 revision 3236).
52    
53    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
54    
55            * ContentChecker.pm: Introduced new flag |has_autofocus|.  Added
56            support for new |id_type| value of |any|, which means any kind of
57            element is allowed as a referenced element.
58    
59    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
60    
61            * ContentChecker.pm: Introduced |check_attrs2| method for the ease
62            of defining codes for checking required attribute and as such.
63    
64    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
65    
66            * IMTChecker.pm: Added more definitions for subtypes.
67    
68    2008-12-11  Wakaba  <wakaba@suika.fam.cx>
69    
70            * URIChecker.pm: Some of |pos_end| values were wrong.
71    
72    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
73    
74            * ContentChecker.pm (check_element): Added support for "no
75            referenced datalist" error.
76    
77    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
78    
79            * URIChecker.pm: Bug fix: It did not work unless
80            Message::DOM::DOMImplementation has been |require|d.
81    
82    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
83    
84            * NanoDOM.pm (document_uri): New attribute.
85    
86            * ContentChecker.pm: Don't use methods not implemented by NanoDOM.
87    
88    2008-11-07  Wakaba  <wakaba@suika.fam.cx>
89    
90            * NanoDOM.pm (text_content): Don't create a Text node if the new
91            value is empty.
92    
93    2008-11-06  Wakaba  <wakaba@suika.fam.cx>
94    
95            * SWML/: New directory.
96    
97    2008-10-20  Wakaba  <wakaba@suika.fam.cx>
98    
99            * NanoDOM.pm (specified, all_declarations_processed,
100            manakai_attribute_type): New attributes.
101    
102    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
103    
104            * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
105            array.
106    
107    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
108    
109            * NanoDOM.pm (notation_name): New attribute.
110    
111    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
112    
113            * NanoDOM.pm (public_id, system_id): New attributes.a
114    
115    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
116    
117            * NanoDOM.pm (text_content): Moved to Node from Element.  Setter
118            implemented.
119            (allowed_tokens, default_type, declared_type): Implemented.
120    
121    2008-10-17  Wakaba  <wakaba@suika.fam.cx>
122    
123            * NanoDOM.pm (node_name): New attribute.
124            (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
125            constants.
126            (create_element_type_definition_node, create_attribute_definition,
127            create_notation, create_general_entity,
128            get_element_type_definition_node,
129            set_element_type_definition_node, get_general_entity_node,
130            set_general_entity_node, get_notation_node, set_notation_node,
131            get_attribute_definition_node, set_attribute_definition_node): New
132            methods.
133            (element_types, entities, notations, attribute_definitions): New
134            attributes.
135            (DocumentType): Support for child nodes, entities, notations, and
136            element types.
137            (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
138            New classes.
139    
140            * Dumper.pm: Support for general entities, notations, element type
141            definitions, and attribute definitions.
142    
143    2008-10-15  Wakaba  <wakaba@suika.fam.cx>
144    
145            * NanoDOM.pm (create_processing_instruction): New method.
146            (xml_version, xml_encoding, xml_standalone): New attributes.
147            (ProcessingInstruction): New class.
148    
149    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
150    
151            * HTML.pm.src: Handling of end tags in the foreign content
152            insertion mode was partially wrong, because of wrong bit
153            operations.
154    
155    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
156    
157            * NanoDOM.pm (dom_config): New attribute (do nothing), for
158            Whatpm::XML::Parser support.
159    
160    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
161    
162            * Makefile: New rule to make HTML/Tokenizer.pm is added.
163    
164            * HTML.pm.src: Tokenizer part moved to another file.
165    
166    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
167    
168            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
169    
170    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
171    
172            * HTML.pm.src: Element category constants redefined.
173            
174    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
175    
176            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
177            construction stage synced with the spec (HTML5 revisions 2139 and
178            2302).
179    
180    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
181    
182            * ContentChecker.pm: New error level "html5_fact" added, which
183            should be tentatively used until all of requirements are properly
184            specced as RFC 2119 "MUST" in HTML5.
185    
186    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
187    
188            * ContentChecker.pod: Note on internal flags is added.
189    
190    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
191    
192            * HTML.pm.src: An AAA bug fixed.
193    
194    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
195    
196            * HTML.pm.src: If another node is inserted by the parser, don't
197            reuse existing Text node to append a character (HTML5 revision
198            2124).
199    
200    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
201    
202            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
203            revisions 1731 and 2128).
204    
205    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
206    
207            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
208            Support for end tags of camelCase SVG elements were broken.  A
209            wrong error type text fixed.
210    
211    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
212    
213            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
214    
215    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
216    
217            * HTML.pm.src: Support for new definition of |param| and |source|
218            start tag parsing (HTML5 revision 1731).
219    
220    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
221    
222            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
223    
224    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
225    
226            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
227            revisions 1731 and 1831).
228    
229    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
230    
231            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
232            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
233            revision 1731).
234    
235    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
236    
237            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
238            revision 1731).  End tags of |option| and |optgroup| elements are
239            now optional (HTML5 revision 1731).
240    
241    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
242    
243            * HTML.pm.src: New "special" elements added to the list (HTML5
244            revision 1778).  "strile" -> "strike".
245    
246    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
247    
248            * ContentType.pm (get_sniffed_type): Support for the "better"
249            content sniffing (HTML5 revision 1927).  In a case the official
250            type was not returned when the method is invoked in the list
251            context.
252    
253    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
254    
255            * HTML.pm.src: Character references for non-space C0 characters,
256            including U+000B VT, DEL character, noncharacter code points, are
257            now converted to the U+FFFD character (cf. HTML5 revision 2138).
258    
259    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
260    
261            * ContentChecker.pm: |form=""| check support added.
262    
263    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
264    
265            * ContentChecker.pm: |contextmenu| validness is now checked using
266            |id| and |id_type| properties, and |menu| property is removed.
267    
268    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
269    
270            * ContentChecker.pm: Prepare for |form| |name| attribute's
271            duplication checking.
272    
273    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
274    
275            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
276            unsupported charset="" parameter value (HTML5 revision 2131).
277    
278    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
279    
280            * HTML.pm.src: Reminding places where U+000B is allowed as a space
281            character is fixed (cf. HTML5 revision 1738).
282    
283            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
284            space characters (HTML5 revision 1738).
285    
286    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
287    
288            * HTML.pm.src: The "anything else" case for the "after after body"
289            insertion mode was not updated to swtich to the "in body"
290            insertion mode.  U+000B is no longer a space character for the
291            purpose of tree construction phase (HTML5 revision 1738).
292    
293    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
294    
295            * HTML.pm.src: U+000B is no longer a space character (HTML5
296            revision 1738).
297    
298    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
299    
300            * ContentType.pm: 0x0B is no longer a space character (HTML5
301            revision 1738).
302    
303            * HTML.pm.src: U+000B is no longer a space character for the
304            algorithm for extracting an encoding from a Content-Type (HTML5
305            revision 1738).
306    
307    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
308    
309            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
310    
311    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
312    
313            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
314    
315            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
316            matching.
317    
318    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
319    
320            * LangTag.pm: Warn for private use language subtags.  Error level
321            typos fixed.  Support for Suppress-Script field.
322    
323            * mklangreg.pl: Support for dumping of nested structure.
324    
325    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
326    
327            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
328            recommended case as per RFC 4646.
329    
330    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
331    
332            * LangTag.pm (check_rfc4646_langtag): New method.
333    
334    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
335    
336            * mklangreg.pl: New script.
337    
338            * Makefile: Updated for creation of the module for language subtag
339            registry.
340            
341    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
342    
343            * Makefile: WebIDL.html added.
344    
345            * WebIDL.pod: New documentation.
346    
347    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
348    
349            * WebIDL.pm: Checker's error types are redefined.
350    
351    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
352    
353            * WebIDL.pm: Parser's error types are redefined.  Some forward
354            compatible parsing bugs are fixed.  Some unreachable codes are
355            commented out.
356    
357    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
358    
359            * WebIDL.pm: Support for the reminding extended attributes are
360            added.  It does not satisfy the definition that a forward
361            interface declaration has an extended attribute.  It seems that
362            unless explicitly allowed multiple extended attributes with the
363            same name is not allowed, though it is not explicitly mentioned in
364            the spec.
365    
366    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
367    
368            * WebIDL.pm: Unescapes extended attribute names and extended
369            attribute identifiers.  Preserve whether an extended attribute has
370            an argument list of not.  Support for extended attributes:
371            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
372            NameGetter, NameSetter, and Null.
373            (has_argument_list): New attribute.
374            (idl_text): Stringifies argument lists, if any, even if it is
375            empty.
376    
377    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
378    
379            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
380            |{s_kwd}| in DATA_STATE as default.
381    
382    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
383    
384            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
385            by |{s_kwd}| in DATA_STATE.
386    
387    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
388    
389            * HTML.pm.src: Shorten keys.
390    
391    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
392    
393            * HTML.pm.src: Remove checking for control character, surrogate
394            pair, or noncharacter code points and non-Unicode code
395            points (they should be handled by Whatpm::Charset::UnicodeChecker).
396            (parse_char_stream): Support for the |$get_wrapper| argument and
397            character stream error handlers.
398    
399    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
400    
401            * ContentChecker.pm: Don't call |loda_ns_module|
402            for null-namespace elements/attributes.
403    
404            * HTML.pm.src: Fact out $disallowed_control_chars
405            as a hash.
406    
407    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
408    
409            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
410            and |{next_char}| initializations are moved to initialization
411            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
412            with |parse_char_stream|.
413    
414    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
415    
416            * HTML.pm.src (parse_char_stream): Make |set_next_char|
417            invoke |manakai_read_until|, not only |read|, where
418            possible, to decrease the number of |read| method calls.
419    
420            * mkhtmlparser.pl: Related changes to the aforementioned
421            modification.
422    
423    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
424    
425            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
426            would report character error from now.
427    
428    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
429    
430            * HTML.pm.src: White-space-leaded non-white-space character
431            tokens in "before head insertion mode" was not
432            correctly handled.
433            (set_inner_html): Reimplemented using CharString decodehandle
434            class.  Support for $get_wrapper argument.  Support
435            for |{read_until}| feature.
436    
437    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
438    
439            * HTML.pm.src: Make a "bare ero" error for unknown
440            entities point the "&" character.
441    
442    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
443    
444            * HTML.pm.src: It turns out that U+FFFD don't have to
445            be added to the list of excluded characters.
446    
447    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
448    
449            * HTML.pm.src ($char_onerror): Have character decoder's |line|
450            and |column| a higher priority than the one set by the
451            tokenizer's input handler.
452            ($self->{read_until}): Exclude U+FFFD (but this might
453            not be necessary, since now we do line/column fixup in
454            the character decode handle).
455    
456    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
457    
458            * HTML.pm.src: Use |{read_until}| where possible.
459    
460    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
461    
462            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
463            and |manakai_getc_until| to |manakai_read_until| to
464            reduce the number of string copies.
465    
466    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
467    
468            * HTML.pm.src (parse_char_string): Use newly created
469            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
470            standard feature to |open| a string as a filehandle,
471            since Perl's string filehandle seems not supporting |ungetc|
472            method correctly.
473            (parse_char_stream): Define |{getc_until}| method.
474            (DATA_STATE): Experimental support for |getc_until| feature.
475    
476    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
477    
478            * HTML.pm.src: Check points added to newly added branches.
479    
480    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
481    
482            * HTML.pm.src: Remove |{char}|, which is no longer used.
483            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
484            and replaced by |{prev_state}|.
485    
486            * mkhtmlparser.pl: Remove |{char}| feature.
487            Remove |!!!back-next-input-character;| macro.
488    
489    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
490    
491            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
492            entity related tokenizer states in favor of new states
493            implementing the consume character reference algorithm.
494    
495    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
496    
497            * HTML.pm.src: "Consume a character reference" algorithm is
498            now implemented as a tokenizer's state, rather than
499            a method, with minimum changes (more changes will
500            be made, in due course).  "Bogus comment state"'s inner
501            loop gets removed.
502    
503    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
504    
505            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
506            into their own tokenizer states.
507    
508    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
509    
510            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
511            is split into three states.
512    
513    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
514    
515            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
516            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
517            no longer does the tokenizer have to push back next input
518            characters in those states.
519    
520    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
521    
522            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
523            into four states so that no longer does the tokenizer have to push
524            back next input characters in that state.
525    
526    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
527    
528            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
529            which can be used to insert some wrapper between the character
530            stream handle and the tokenizer.  (It is currently not supported
531            for |set_inner_html| for |Element|s).
532    
533    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
534    
535            * HTML.pm.src: Ignore punctuations in charset names.
536    
537    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
538    
539            * ContentChecker.pm: Support for charset-layer error levels.
540    
541            * HTML.pm.src: Don't specify |text| argument for the
542            |chardecode:fallback| error, since it is not the encoding
543            being used alternatively.
544    
545    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
546    
547            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
548    
549    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
550    
551            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
552    
553    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
554    
555            * HTML.pm.src: Bug fix and sync with the spec with regard
556            to after after frameset insertion mode processing (HTML5
557            revision 1909).  Note that the implementation was wrong
558            per the old spec before the r1909 changes.
559    
560    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
561    
562            * HTMLTable.pm: scope=auto algorithm fix synced with the
563            spec (HTML5 revision 2093).
564            ($process_row): Algorithm step numbers synced with the
565            spec (HTML5 revision 2092).
566    
567  2008-08-30  Wakaba  <wakaba@suika.fam.cx>  2008-08-30  Wakaba  <wakaba@suika.fam.cx>
568    
569          * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5          * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5

Legend:
Removed from v.1.284  
changed lines
  Added in v.1.380

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24