Diff of /markup/html/whatpm/Whatpm/ChangeLog

Changes from revision 1.225 (wakaba, Mon May 5 06:57:07 2008 UTC) to revision 1.398 (wakaba, Sun Sep 6 09:53:29 2009 UTC)
1    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Rewrote end tag handling in |head| area (cf. HTML5
4            revision 2730, but it was entirely broken, maybe I missed some
5            spec changes before rev.2730).
6    
7    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
8    
9            * HTML.pm.src: Added the |hgroup| element (HTML5 revision 3039 and
10            HTML5 revision 3040).
11    
12    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
13    
14            * HTML.pm.src: Added |xmp| to the list of Special elements (HTML5
15            revision 3689).  It should make no difference since the |xmp|
16            element has the PCDATA content and cannot be the non-bottommost
17            element in the stack of open elements.
18    
19    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
20    
21            * HTML.pm.src: Set the |confidence| flag to |irrelevant| when the
22            fragment parsing algorithm is invoked (HTML5 revision 2699).
23    
24    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
25    
26            * HTML.pm.src: Simplified foster parenting (HTML5 revision 2732).
27    
28    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
29    
30            * HTML.pm.src: No longer foster-parents |style|, |script|,
31            and |input type=hidden| elements (HTML5 revision 2988).
32    
33    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
34    
35            * HTML.pm.src: Dropped the |eventsource| element (HTML5 revision
36            2870).
37    
38    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
39    
40            * HTML.pm.src (_tree_construction_main): Treat |keygen| as a void
41            element (HTML5 revision 2960).
42    
43    2009-09-05  Wakaba  <wakaba@suika.fam.cx>
44    
45            * HTML.pm.src: Moved the "in table text" insertion mode code up
46            such that it does also work as specced in HTML5 in the remaining
47            cases (HTML5 revision 3382).
48    
49    2009-09-05  Wakaba  <wakaba@suika.fam.cx>
50    
51            * HTML.pm.src: Implementing the "in table text" insertion mode,
52            first half (HTML5 revision 3382).
53    
54    2009-08-30  Wakaba  <wakaba@suika.fam.cx>
55    
56            * ContentChecker.pm: Allow |xml:lang| in HTML document if the
57            owner element is not an HTML element (HTML5 revision 3697).
58    
59    2009-08-23  Wakaba  <wakaba@suika.fam.cx>
60    
61            * ContentChecker.pm: Allow <input type=hidden> in context where
62            interactive content is disallowed (HTML5 revision 2392).
63    
64    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
65    
66            * HTML.pm.src: Dropped support for "XSLT-compat" and added
67            "about:legacy-compat" (HTML5 revision 2725).
68    
69    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
70    
71            * HTML.pm.src: "<!DOCTYPE>" is now explicitly supported in the
72            spec (HTML5 revision 2700).
73    
74    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
75    
76            * HTML.pm.src: Added support for <script> element in the "in
77            select" and "in select in table" insertion modes (HTML5 revision
78            2500).
79    
80    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
81    
82            * HTML.pm.src: An <hn> start tag in an <hn> element should close
83            the earlier element (HTML5 revision 2501).
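
The rule in the entry above can be sketched on a plain list standing in for the stack of open elements. This is an illustration only, not the code in HTML.pm.src: the function name, the error text, and the choice to inspect only the current node are assumptions.

```python
HEADINGS = {"h1", "h2", "h3", "h4", "h5", "h6"}


def start_heading(open_elements, tag):
    """Process an <hn> start tag against a list used as the stack of open elements.

    Returns the list of parse errors raised (hypothetical error strings).
    """
    errors = []
    # Assumption: only the current node (top of the stack) is inspected.
    if open_elements and open_elements[-1] in HEADINGS:
        errors.append(f"<{tag}> start tag inside <{open_elements[-1]}>")
        open_elements.pop()  # close the earlier heading
    open_elements.append(tag)
    return errors
```

For example, calling `start_heading(["html", "body", "h1"], "h2")` leaves the list ending in `h2` instead of nesting `h2` inside `h1`.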
84    
85    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
86    
87            * HTML.pm.src: The </br> end tags before the <body> start tag
88            should equally survive as the first br element child of the body
89            element (HTML5 revision 2586).
90    
91    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
92    
93            * HTML.pm.src: <font> in SVG land is handled as an SVG element as
94            long as it does not have HTML attributes (HTML5 revision 2590).
95    
96    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
97    
98            * HTML.pm.src (_tree_construction_main): Improved error reporting
99            on "select" start tag in "in select" and "in select in table"
100            parsing modes (cf. HTML5 revision 3604).
101    
102    2009-07-26  Wakaba  <wakaba@suika.fam.cx>
103    
104            * mkhtmlclass.pl: Made xmlns:xlink with wrong namespace URL
105            invalid (HTML5 revision 2701).
106    
107    2009-07-26  Wakaba  <wakaba@suika.fam.cx>
108    
109            * HTML.pm.src: </rt>, </rp>, </option>, </optgroup> become
110            optional before </body> (HTML5 revision 2806).
111    
112    2009-07-25  Wakaba  <wakaba@suika.fam.cx>
113    
114            * HTML.pm.src: Marked where SVG |script| execution should be
115            implemented (HTML5 revision 2904).
116    
117    2009-07-25  Wakaba  <wakaba@suika.fam.cx>
118    
119            * HTML.pm.src (set_inner_html): The fragment parsing algorithm
120            inherits the document quirksness flag of the main document (HTML5
121            revision 2935).
122    
123    2009-07-05  Wakaba  <wakaba@suika.fam.cx>
124    
125            * HTML.pm.src (_tree_construction_main): Put |table| in |p| in
126            quirks mode (The Hixie Quirk, HTML5 revision 3132).
127    
128    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
129    
130            * HTML.pm.src: <keygen> breaks out of <select> (HTML5 revision
131            3163).
132    
133    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
134    
135            * HTML.pm.src: Don't change encoding to UTF-8 if the input
136            encoding is UTF-16 (HTML5 revision 3204).
137    
138    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
139    
140            * HTML.pm.src (parse_byte_stream): Don't change to another
141            encoding if the input is UTF-16 (HTML5 revision 3205).
142    
143    2009-06-28  Wakaba  <wakaba@suika.fam.cx>
144    
145            * HTML.pm.src: |isindex| should no longer generate |p| element
146            node (HTML5 revision 3236).
147    
148    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
149    
150            * ContentChecker.pm: Introduced new flag |has_autofocus|.  Added
151            support for new |id_type| value of |any|, which means any kind of
152            element is allowed as a referenced element.
153    
154    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
155    
156            * ContentChecker.pm: Introduced |check_attrs2| method to ease
157            defining code that checks required attributes and the like.
158    
159    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
160    
161            * IMTChecker.pm: Added more definitions for subtypes.
162    
163    2008-12-11  Wakaba  <wakaba@suika.fam.cx>
164    
165            * URIChecker.pm: Some of |pos_end| values were wrong.
166    
167    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
168    
169            * ContentChecker.pm (check_element): Added support for "no
170            referenced datalist" error.
171    
172    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
173    
174            * URIChecker.pm: Bug fix: It did not work unless
175            Message::DOM::DOMImplementation has been |require|d.
176    
177    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
178    
179            * NanoDOM.pm (document_uri): New attribute.
180    
181            * ContentChecker.pm: Don't use methods not implemented by NanoDOM.
182    
183    2008-11-07  Wakaba  <wakaba@suika.fam.cx>
184    
185            * NanoDOM.pm (text_content): Don't create a Text node if the new
186            value is empty.
187    
188    2008-11-06  Wakaba  <wakaba@suika.fam.cx>
189    
190            * SWML/: New directory.
191    
192    2008-10-20  Wakaba  <wakaba@suika.fam.cx>
193    
194            * NanoDOM.pm (specified, all_declarations_processed,
195            manakai_attribute_type): New attributes.
196    
197    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
198    
199            * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
200            array.
201    
202    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
203    
204            * NanoDOM.pm (notation_name): New attribute.
205    
206    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
207    
208            * NanoDOM.pm (public_id, system_id): New attributes.
209    
210    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
211    
212            * NanoDOM.pm (text_content): Moved to Node from Element.  Setter
213            implemented.
214            (allowed_tokens, default_type, declared_type): Implemented.
215    
216    2008-10-17  Wakaba  <wakaba@suika.fam.cx>
217    
218            * NanoDOM.pm (node_name): New attribute.
219            (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
220            constants.
221            (create_element_type_definition_node, create_attribute_definition,
222            create_notation, create_general_entity,
223            get_element_type_definition_node,
224            set_element_type_definition_node, get_general_entity_node,
225            set_general_entity_node, get_notation_node, set_notation_node,
226            get_attribute_definition_node, set_attribute_definition_node): New
227            methods.
228            (element_types, entities, notations, attribute_definitions): New
229            attributes.
230            (DocumentType): Support for child nodes, entities, notations, and
231            element types.
232            (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
233            New classes.
234    
235            * Dumper.pm: Support for general entities, notations, element type
236            definitions, and attribute definitions.
237    
238    2008-10-15  Wakaba  <wakaba@suika.fam.cx>
239    
240            * NanoDOM.pm (create_processing_instruction): New method.
241            (xml_version, xml_encoding, xml_standalone): New attributes.
242            (ProcessingInstruction): New class.
243    
244    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
245    
246            * HTML.pm.src: Handling of end tags in the foreign content
247            insertion mode was partially wrong, because of wrong bit
248            operations.
249    
250    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
251    
252            * NanoDOM.pm (dom_config): New attribute (does nothing), for
253            Whatpm::XML::Parser support.
254    
255    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
256    
257            * Makefile: New rule to make HTML/Tokenizer.pm is added.
258    
259            * HTML.pm.src: Tokenizer part moved to another file.
260    
261    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
262    
263            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
264    
265    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
266    
267            * HTML.pm.src: Element category constants redefined.
268            
269    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
270    
271            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
272            construction stage synced with the spec (HTML5 revisions 2139 and
273            2302).
274    
275    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
276    
277            * ContentChecker.pm: New error level "html5_fact" added, which
278            should be tentatively used until all of requirements are properly
279            specced as RFC 2119 "MUST" in HTML5.
280    
281    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
282    
283            * ContentChecker.pod: Note on internal flags is added.
284    
285    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
286    
287            * HTML.pm.src: An AAA bug fixed.
288    
289    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
290    
291            * HTML.pm.src: If another node is inserted by the parser, don't
292            reuse existing Text node to append a character (HTML5 revision
293            2124).
294    
295    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
296    
297            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
298            revisions 1731 and 2128).
299    
300    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
301    
302            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
303            Support for end tags of camelCase SVG elements was broken.  A
304            wrong error type text fixed.
305    
306    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
307    
308            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
309    
310    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
311    
312            * HTML.pm.src: Support for new definition of |param| and |source|
313            start tag parsing (HTML5 revision 1731).
314    
315    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
316    
317            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
318    
319    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
320    
321            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
322            revisions 1731 and 1831).
323    
324    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
325    
326            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
327            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
328            revision 1731).
329    
330    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
331    
332            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
333            revision 1731).  End tags of |option| and |optgroup| elements are
334            now optional (HTML5 revision 1731).
335    
336    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
337    
338            * HTML.pm.src: New "special" elements added to the list (HTML5
339            revision 1778).  "strile" -> "strike".
340    
341    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
342    
343            * ContentType.pm (get_sniffed_type): Support for the "better"
344            content sniffing (HTML5 revision 1927).  In one case the official
345            type was not returned when the method was invoked in list
346            context.
347    
348    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
349    
350            * HTML.pm.src: Character references for non-space C0 characters,
351            including U+000B VT, DEL character, noncharacter code points, are
352            now converted to the U+FFFD character (cf. HTML5 revision 2138).
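
A minimal sketch of the mapping described in the entry above, assuming that the four C0 controls U+0009, U+000A, U+000C and U+000D are the "space" ones that pass through unchanged. The function names are hypothetical, and surrogates and out-of-range values are deliberately not handled here.

```python
REPLACEMENT = 0xFFFD


def is_noncharacter(cp):
    """Unicode noncharacters: U+FDD0..U+FDEF and every code point ending in FFFE or FFFF."""
    return 0xFDD0 <= cp <= 0xFDEF or (cp & 0xFFFE) == 0xFFFE


def resolve_char_ref(cp):
    """Return the code point emitted for the numeric character reference cp."""
    space_c0 = {0x09, 0x0A, 0x0C, 0x0D}  # assumption: C0 controls treated as spaces
    if cp <= 0x1F and cp not in space_c0:
        return REPLACEMENT  # non-space C0 control, including U+000B VT
    if cp == 0x7F or is_noncharacter(cp):
        return REPLACEMENT  # DEL or a noncharacter
    return cp
```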
353    
354    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
355    
356            * ContentChecker.pm: |form=""| check support added.
357    
358    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
359    
360            * ContentChecker.pm: |contextmenu| validity is now checked using
361            |id| and |id_type| properties, and |menu| property is removed.
362    
363    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
364    
365            * ContentChecker.pm: Prepare for |form| |name| attribute's
366            duplication checking.
367    
368    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
369    
370            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
371            unsupported charset="" parameter value (HTML5 revision 2131).
372    
373    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
374    
375            * HTML.pm.src: Remaining places where U+000B was allowed as a space
376            character are fixed (cf. HTML5 revision 1738).
377    
378            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
379            space characters (HTML5 revision 1738).
380    
381    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
382    
383            * HTML.pm.src: The "anything else" case for the "after after body"
384            insertion mode was not updated to switch to the "in body"
385            insertion mode.  U+000B is no longer a space character for the
386            purpose of tree construction phase (HTML5 revision 1738).
387    
388    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
389    
390            * HTML.pm.src: U+000B is no longer a space character (HTML5
391            revision 1738).
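
To illustrate what dropping U+000B means in practice, here is a small sketch that assumes the remaining space characters are U+0009, U+000A, U+000C, U+000D and U+0020. The helper name is hypothetical. Generic whitespace tests (Python's here) still count VT as white space, so an explicit set is needed.

```python
# Assumed set of space characters once U+000B is excluded.
HTML_SPACE = "\t\n\f\r "


def strip_html_space(value):
    """Strip leading and trailing space characters, but not U+000B."""
    return value.strip(HTML_SPACE)


print("\x0b".isspace())                       # Python's own notion still includes VT
print(repr(strip_html_space("\x0b abc \n")))  # the leading VT is kept
```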
392    
393    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
394    
395            * ContentType.pm: 0x0B is no longer a space character (HTML5
396            revision 1738).
397    
398            * HTML.pm.src: U+000B is no longer a space character for the
399            algorithm for extracting an encoding from a Content-Type (HTML5
400            revision 1738).
401    
402    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
403    
404            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
405    
406    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
407    
408            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
409    
410            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
411            matching.
412    
413    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
414    
415            * LangTag.pm: Warn for private use language subtags.  Error level
416            typos fixed.  Support for Suppress-Script field.
417    
418            * mklangreg.pl: Support for dumping of nested structure.
419    
420    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
421    
422            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
423            recommended case as per RFC 4646.
424    
425    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
426    
427            * LangTag.pm (check_rfc4646_langtag): New method.
428    
429    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
430    
431            * mklangreg.pl: New script.
432    
433            * Makefile: Updated for creation of the module for language subtag
434            registry.
435            
436    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
437    
438            * Makefile: WebIDL.html added.
439    
440            * WebIDL.pod: New documentation.
441    
442    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
443    
444            * WebIDL.pm: Checker's error types are redefined.
445    
446    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
447    
448            * WebIDL.pm: Parser's error types are redefined.  Some forward
449            compatible parsing bugs are fixed.  Some unreachable code is
450            commented out.
451    
452    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
453    
454            * WebIDL.pm: Support for the remaining extended attributes is
455            added.  It does not satisfy the definition that a forward
456            interface declaration has an extended attribute.  It seems that,
457            unless explicitly allowed, multiple extended attributes with the
458            same name are not allowed, though it is not explicitly mentioned in
459            the spec.
460    
461    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
462    
463            * WebIDL.pm: Unescapes extended attribute names and extended
464            attribute identifiers.  Preserve whether an extended attribute has
465            an argument list or not.  Support for extended attributes:
466            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
467            NameGetter, NameSetter, and Null.
468            (has_argument_list): New attribute.
469            (idl_text): Stringifies argument lists, if any, even if they are
470            empty.
471    
472    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
473    
474            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
475            |{s_kwd}| in DATA_STATE as default.
476    
477    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
478    
479            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
480            by |{s_kwd}| in DATA_STATE.
481    
482    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
483    
484            * HTML.pm.src: Shorten keys.
485    
486    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
487    
488            * HTML.pm.src: Remove checking for control character, surrogate
489            pair, or noncharacter code points and non-Unicode code
490            points (they should be handled by Whatpm::Charset::UnicodeChecker).
491            (parse_char_stream): Support for the |$get_wrapper| argument and
492            character stream error handlers.
493    
494    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
495    
496            * ContentChecker.pm: Don't call |load_ns_module|
497            for null-namespace elements/attributes.
498    
499            * HTML.pm.src: Factor out $disallowed_control_chars
500            as a hash.
501    
502    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
503    
504            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
505            and |{next_char}| initializations are moved to initialization
506            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
507            with |parse_char_stream|.
508    
509    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
510    
511            * HTML.pm.src (parse_char_stream): Make |set_next_char|
512            invoke |manakai_read_until|, not only |read|, where
513            possible, to decrease the number of |read| method calls.
514    
515            * mkhtmlparser.pl: Related changes to the aforementioned
516            modification.
517    
518    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
519    
520            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
521            now reports character errors.
522    
523    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
524    
525            * HTML.pm.src: Non-white-space character tokens with leading
526            white space in the "before head" insertion mode were not
527            handled correctly.
528            (set_inner_html): Reimplemented using CharString decodehandle
529            class.  Support for $get_wrapper argument.  Support
530            for |{read_until}| feature.
531    
532    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
533    
534            * HTML.pm.src: Make a "bare ero" error for unknown
535            entities point to the "&" character.
536    
537    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
538    
539            * HTML.pm.src: It turns out that U+FFFD doesn't have to
540            be added to the list of excluded characters.
541    
542    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
543    
544            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
545            and |column| a higher priority than those set by the
546            tokenizer's input handler.
547            ($self->{read_until}): Exclude U+FFFD (but this might
548            not be necessary, since now we do line/column fixup in
549            the character decode handle).
550    
551    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
552    
553            * HTML.pm.src: Use |{read_until}| where possible.
554    
555    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
556    
557            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
558            and |manakai_getc_until| to |manakai_read_until| to
559            reduce the number of string copies.
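
The idea behind a read-until style interface can be sketched as follows: one call returns a whole run of uninteresting characters instead of one character per call. The class and the `calls` counter are hypothetical and only borrow the method names from the entry above; this is not the Perl implementation.

```python
import re


class CharStream:
    """Toy character stream over an in-memory string."""

    def __init__(self, text):
        self._text = text
        self._pos = 0
        self.calls = 0  # number of read-style calls made so far

    def read(self, count=1):
        """Return up to count characters and advance."""
        self.calls += 1
        chunk = self._text[self._pos:self._pos + count]
        self._pos += len(chunk)
        return chunk

    def read_until(self, stop):
        """Return all characters before the next match of the compiled pattern stop."""
        self.calls += 1
        match = stop.search(self._text, self._pos)
        end = match.start() if match else len(self._text)
        chunk = self._text[self._pos:end]
        self._pos = end
        return chunk
```

With `CharStream("hello <b>")`, a single `read_until(re.compile("[<&]"))` returns the six characters before `<`, where a per-character loop would need six `read` calls.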
560    
561    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
562    
563            * HTML.pm.src (parse_char_string): Use newly created
564            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
565            standard feature to |open| a string as a filehandle,
566            since Perl's string filehandle does not seem to support the |ungetc|
567            method correctly.
568            (parse_char_stream): Define |{getc_until}| method.
569            (DATA_STATE): Experimental support for |getc_until| feature.
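
For readers unfamiliar with why push-back matters to a tokenizer, here is a sketch of a string-backed handle with `getc` and `ungetc`. It is a hypothetical stand-in, not the interface of |Whatpm::Charset::DecodeHandle::CharString|. Returning None at end of input and passing a character (rather than a code point) to `ungetc` are assumptions.

```python
class CharStringHandle:
    """String-backed character handle with one-or-more characters of push-back."""

    def __init__(self, text):
        self._text = text
        self._pos = 0
        self._pushed = []  # characters returned via ungetc, most recent last

    def getc(self):
        """Return the next character, or None at end of input."""
        if self._pushed:
            return self._pushed.pop()
        if self._pos < len(self._text):
            ch = self._text[self._pos]
            self._pos += 1
            return ch
        return None

    def ungetc(self, ch):
        """Push ch back so that the next getc returns it."""
        self._pushed.append(ch)
```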
570    
571    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
572    
573            * HTML.pm.src: Check points added to newly added branches.
574    
575    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
576    
577            * HTML.pm.src: Remove |{char}|, which is no longer used.
578            Remove |{entity_in_attr}| and |{last_attribute_value_state}|,
579            replacing them with |{prev_state}|.
580    
581            * mkhtmlparser.pl: Remove |{char}| feature.
582            Remove |!!!back-next-input-character;| macro.
583    
584    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
585    
586            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
587            entity related tokenizer states in favor of new states
588            implementing the consume character reference algorithm.
589    
590    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
591    
592            * HTML.pm.src: "Consume a character reference" algorithm is
593            now implemented as a tokenizer's state, rather than
594            a method, with minimum changes (more changes will
595            be made, in due course).  "Bogus comment state"'s inner
596            loop gets removed.
597    
598    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
599    
600            * HTML.pm.src: Move |PUBLIC| and |SYSTEM| keyword tokenizing
601            into their own tokenizer states.
602    
603    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
604    
605            * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
606            is split into three states.
607    
608    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
609    
610            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
611            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
612            no longer does the tokenizer have to push back next input
613            characters in those states.
614    
615    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
616    
617            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
618            into four states so that no longer does the tokenizer have to push
619            back next input characters in that state.
620    
621    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
622    
623            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
624            which can be used to insert some wrapper between the character
625            stream handle and the tokenizer.  (It is currently not supported
626            for |set_inner_html| for |Element|s).
627    
628    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
629    
630            * HTML.pm.src: Ignore punctuation in charset names.
631    
632    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
633    
634            * ContentChecker.pm: Support for charset-layer error levels.
635    
636            * HTML.pm.src: Don't specify |text| argument for the
637            |chardecode:fallback| error, since it is not the encoding
638            being used alternatively.
639    
640    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
641    
642            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
643    
644    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
645    
646            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
647    
648    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
649    
650            * HTML.pm.src: Bug fix and sync with the spec with regard
651            to after after frameset insertion mode processing (HTML5
652            revision 1909).  Note that the implementation was wrong
653            per the old spec before the r1909 changes.
654    
655    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
656    
657            * HTMLTable.pm: scope=auto algorithm fix synced with the
658            spec (HTML5 revision 2093).
659            ($process_row): Algorithm step numbers synced with the
660            spec (HTML5 revision 2092).
661    
662    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
663    
664            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
665            revision 2094).
666    
667    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
668    
669            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
670    
671    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
672    
673            * HTML.pm.src: '"' and "'" at the end of attribute
674            name (after another attribute) now raise parse error (HTML5
675            revision 2123).  Empty unquoted attribute values are no
676            longer allowed (HTML5 revision 2122).
677    
678    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
679    
680            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
681            revision 2130).
682    
683    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
684    
685            * ContentChecker.pm: |xml:lang| attribute value must be same
686            as |lang| attribute value for HTML elements (HTML5 revision 2062
687            and so on).
688    
689    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
690    
691            * ContentChecker.pm: Error level definition for |xml_id_error|
692            was missing.
693    
694            * URIChecker.pm: The end of the URL should be marked as the
695            error location for an empty path error.  The position
696            between the userinfo and the port components should be
697            marked as the error location for an empty host error.
698    
699    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
700    
701            * URIChecker.pm: Set parameters representing where in the
702            value the error occurs for errors.  Report unknown
703            address format error in warning level, since address
704            formats are rarely added.  Path segments starting with "/.."
705            were misinterpreted as a dot-segment.
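
The dot-segment fix in the entry above comes down to comparing whole path segments rather than prefixes. The sketch below shows the difference. Both function names are hypothetical, and the "buggy" variant is only a guess at the kind of test that would misfire, not the module's former code.

```python
def has_dot_segment(path):
    """True if any "/"-separated segment is exactly "." or ".."."""
    return any(segment in (".", "..") for segment in path.split("/"))


def has_dot_segment_prefix_test(path):
    """Prefix-style test (assumed form of the bug): any segment starting with ".."."""
    return "/.." in path
```

A path such as `/a/..b` has a segment named `..b`, which is an ordinary segment: the exact comparison accepts it, while the prefix test reports a dot-segment.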
706    
707    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
708    
709            * URIChecker.pm (check_iri_reference): Requires
710            |Message::DOM::DOMImplementation|.
711    
712    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
713    
714            * IMTChecker.pm: Updated for the new error reporting architecture.
715    
716            * ContentChecker.pm: Error levels for IMTs are added.
717    
718    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
719    
720            * H2H.pm (_shift_token): Support for unquoted HTML attribute
721            values.
722    
723    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
724    
725            * CacheManifest.pm: Support for new style of error
726            reports.
727    
728            * HTML.pm.src: Set line=1, column=1 to the document node.
729    
730    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
731    
732            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
733            and URL checkers.  Support for more error levels for bogus
734            language tag and URL "standards".
735    
736            * LangTag.pm, URIChecker.pm: Support for new style error
737            level reporting.
738    
739    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
740    
741            * ContentChecker.pm: Support for RDF/XML error levels.
742    
743            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
744            specifying.  Error types are revised.
745    
746    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
747    
748            * ContentChecker.pm: All error reporting method calls are
749            renewed.
750    
751    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
752    
753            * HTML.pm.src: All error type names and "text" parameters
754            are revised.  Use new style for "level" specification.
755    
756            * mkhtmlparser.pl: Use new style for "level" specification.
757    
758    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
759    
760            * WebIDL.pm (parse_char_string): Simplified error
761            reporting process for broken ignored valuetype definition.
762            (Valuetype idl_text): Support for special "DOMString" name.
763    
764    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
765    
766            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
767            terminal of the ScopedName is "DOMString", so that whether
768            the last part of the scoped name is "DOMString" or "_DOMString"
769            can be determined later.  It is necessary to determine whether
770            a |typedef| definition should be ignored or not.
771            (parse_char_string): Unescape the identifier of
772            exception members.
773            ($resolve): Return undef for builtin types and sequence<T>
774            types (we might not have to do this, however...).
775            (check): Support checking for Exceptions, Valuetypes,
776            and Typedefs.
777            ($serialize_type): Support for "DOMString::::" syntax.
778            (Typedef idl_text): Output Type as "DOMString" if it
779            is really "DOMString" (i.e. its internal representation
780            is "::DOMString::").
781    
782    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
783    
784            * WebIDL.pm ($resolve): New code, based on resolve code
785            for constant types in the |check| method.
786            (check): Support for checking of attributes, operations, and
787            arguments.
788            (Attribute/Operation idl_text): Exception names in getraises,
789            setraises, and raises clauses are serialized by the
790            |$serialize_type| code.
791    
792    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
793    
794            * WebIDL.pm ($integer): The order of the alternatives is changed so
795            that hexadecimal numbers match (the original pattern, taken from the
796            spec, did not work for hexadecimal numbers, because after the "0"
797            prefix the [0-7]* part matches an empty string and the remaining
798            "x..." part of a "0x..." integer literal is therefore left
799            unmatched).
800            ($get_type): It now returns a string, not an array reference,
801            for regular types and |sequence| types (i.e. it in any case
802            returns a string).
803            ($get_next_token): The second item in the array that represents
804            an integer or float token is now a Perl number value, not the
805            original string representation of the number.
806            (check): Support for const value consistency checking.
807            No extended attribute is defined for constants.
808            (Node subclasses): Use simple strings rather than array references
809            for default data type values.
810            ($serialize_type): Type values are now simple strings.
811            (value): If the new attribute value is a false value, then
812            a FALSE value is set to the attribute.
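
The alternation-order problem described for |$integer| above can be reproduced in any regular expression engine that tries alternatives left to right.  The following Python sketch is illustrative only: both patterns are assumptions that approximate the spec-derived pattern and the reordered one, and neither is the actual Perl code in WebIDL.pm.

```python
import re

# Assumed shape of the spec-derived pattern: the octal branch comes first.
SPEC_ORDER = re.compile(r"-?0(?:[0-7]*|[Xx][0-9A-Fa-f]+)|-?[1-9][0-9]*")

# Reordered pattern: the hexadecimal branch is tried before the octal one.
HEX_FIRST = re.compile(r"-?0(?:[Xx][0-9A-Fa-f]+|[0-7]*)|-?[1-9][0-9]*")


def first_integer_token(pattern, text):
    """Return the integer literal matched at the start of text, or None."""
    match = pattern.match(text)
    return match.group(0) if match else None


# With the octal branch first, [0-7]* matches the empty string after "0",
# so only "0" is consumed and "x1F" is left behind.
print(first_integer_token(SPEC_ORDER, "0x1F"))  # 0
# With the hexadecimal branch first, the whole literal is consumed.
print(first_integer_token(HEX_FIRST, "0x1F"))   # 0x1F
```

Octal and decimal literals such as "017" and "42" match identically under both orderings, so only hexadecimal input is affected by the reordering.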
813    
814    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
815    
816            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
817            in their stringified format ("scoped name" as defined in the
818            spec).  Note that future version of this module should not use
819            array references for type values and the |type_text| attribute
820            should be made obsolete.
821            (parse_char_string): Unescape attribute names.
822            (check): Support for checking of whether inherited interfaces
823            are actually defined or not.  Support for checking of whether
824            interface member identifiers are duplicated or not.
825            ($serialize_type): Scoped names are returned as is.  A future
826            version of this code should escape identifiers other than "DOMString",
827            otherwise the idl_text would be non-conforming.
828    
829    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
830    
831            * WebIDL.pm (parse_char_string): Set line/column numbers
832            to generated nodes.  Unescape identifiers.  Extended attributes
833            for Definition's were ignored.
834            (append_child): Set |parent_node| attribute.
835            (parent_node): New attribute.
836            (check): Support interface/exception members.  Support
837            extended attributes.  Support definition identifier uniqueness
838            constraint.
839            (qualified_name): New attribute.
840            (Interface/Exception idl_text): Extended attributes were
841            not prepended to the returned text.
842    
843    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
844    
845            * WebIDL.pm (parse_char_string): Set line/column numbers
846            to interface object experimentally.  s/shift/pop/g, shift
847            would make things wrong.  Support for interface forward
848            declarations was missing.  Broken interface declarations
849            with no block were not ignored entirely.
850            (Whatpm::WebIDL::Node): New abstract class.  This class
851            makes things easier.
852            (child_nodes): New attribute.  Unlike DOM's attribute with
853            the same name, this attribute returns a dead list of nodes for
854            simplicity.
855            (get_user_data, set_user_data): New methods.
856            (Module idl_text): A SPACE character should be inserted
857            before the |{| character.
858            (Interface idl_text): Support for interface forward declarations.
859            (is_forward_declaration): New attribute.
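
The difference between a "dead" list and DOM's live |childNodes| mentioned above can be sketched as follows.  This is a minimal Python illustration under assumed names (the |Node| class and its methods here are hypothetical stand-ins, not the Whatpm::WebIDL::Node implementation): the attribute returns a snapshot that later insertions do not affect.

```python
class Node:
    """Hypothetical node type; only child handling is modelled."""

    def __init__(self):
        self._children = []

    def append_child(self, child):
        self._children.append(child)
        return child

    @property
    def child_nodes(self):
        # Return a copy (a "dead" list): callers get a snapshot that
        # does not change when children are appended afterwards.
        return list(self._children)


parent = Node()
parent.append_child(Node())
snapshot = parent.child_nodes
parent.append_child(Node())
print(len(snapshot), len(parent.child_nodes))  # 1 2
```

Returning a copy keeps the implementation simple, at the cost that callers must read the attribute again to see later changes.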
860    
861    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
862    
863            * WebIDL.pm (type_text): Better serializer.
864    
865    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
866    
867            * WebIDL.pm: Revised forward-compatible parsing so that
868            it can now handle broken extended attributes and the like.
869    
870    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
871    
872            * WebIDL.pm: Real support for extended attributes.
873            Support for extended attributes with arguments.
874    
875    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
876    
877            * WebIDL.pm: Support for |exception| syntax.
878            (Interface->idl_text): Tentative support for inheritances.
879    
880    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
881    
882            * WebIDL.pm: Hierarchical scoped name support was broken.
883            Support for raises, setraises, and getraises syntaxes.
884    
885    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
886    
887            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
888            proper support for types, extended attributes, and exceptions yet).
889            WebIDL parser, version 1 (no support for exceptions yet,
890            no proper support for extended attributes yet).
891    
892    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
893    
894            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
895    
896    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
897    
898            * WebIDL.pm: Support for valuetype and const.
899    
900    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
901            
902            * WebIDL.pm: New module.
903    
904    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
905    
906            * Makefile (Entities.html): URI changed.
907    
908    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
909    
910            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
911    
912    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
913    
914            * HTML.pm.src (_get_next_token): A parse error was missing.
915    
916    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
917    
918            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
919            spec (commented out). (HTML5 revision 1711).
920    
921    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
922    
923            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
924    
925            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
926            as UTF-16 (HTML5 revision 1701).
927    
928    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
929    
930            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
931            1692).
932    
933    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
934    
935            * HTML.pm.src: The secondary insertion mode used when switching
936            to foreign content is the "in body" insertion mode (HTML5 revision
937            1696).
938    
939    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
940    
941            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
942            revision 1697).
943    
944    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
945    
946            * HTML.pm.src: Support for end-of-file token in foreign content
947            insertion mode (HTML5 revision 1693).  Update SVG camelCase
948            attribute list (HTML5 revision 1700).  <textarea> closes
949            </select> (HTML5 revision 1699).  More start tags close in
950            foreign content insertion mode (HTML5 revision 1698).
951    
952    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
953    
954            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
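
The effect of not treating ";" as part of the charset name can be sketched as below.  This Python function is a simplified assumption loosely modelled on the idea of extracting a charset from a Content-Type-like string; it is not the HTML.pm.src code, and its whitespace handling (Python's \s) is broader than what the HTML5 spec defines.

```python
import re


def extract_charset(content):
    """Return the charset named in a Content-Type-like string, or None."""
    found = re.search(r"charset\s*=\s*", content, re.IGNORECASE)
    if not found:
        return None
    rest = content[found.end():]
    if rest[:1] in ('"', "'"):
        # Quoted value: take everything up to the matching quote.
        quote = rest[0]
        end = rest.find(quote, 1)
        return rest[1:end] if end != -1 else None
    # Unquoted value: the name ends at whitespace or ";".
    name = re.match(r"[^\s;]+", rest)
    return name.group(0) if name else None


print(extract_charset("text/html; charset=utf-8; foo=bar"))  # utf-8
```

Without the ";" terminator, the unquoted branch would return "utf-8;" for the input above, which is not a usable encoding label.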
955    
956    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
957    
958            * HTML.pm.src: More robust charset parameter detection (HTML5
959            revision 1674).
960    
961    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
962    
963            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
964            revision 1676).
965    
966    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
967    
968            * HTML.pm.src: Ignore language part of public identifiers for
969            quirks mode detection (HTML5 revision 1679).
970    
971    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
972    
973            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
974            revision 1685).
975    
976    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
977    
978            * HTML.pm.src: Support for EOF in new states for tags (HTML5
979            revision 1684).
980    
981    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
982    
983            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
984            work (HTML5 revision 1690).
985    
986    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
987    
988            * HTML.pm.src (_tree_construction_main): Change handling of
989            end tags in head insertion modes (HTML5 revision 1686).
990            (parse_char_string): Bug fix for non-utf8 character string handling.
991            (parse_char_stream): |ungetc| does not work well for this context.
992    
993    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
994    
995            * HTML.pm.src (parse_byte_string): Redefined to invoke
996            |parse_byte_stream|.
997            (parse_byte_stream): New method.
998    
999    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
1000    
1001            * HTML.pm.src (parse_byte_string): Fix the column number reported
1002            by encoding layer error reporter.
1003    
1004    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
1005    
1006            * HTML.pm.src (parse_byte_string): Use streaming decoder
1007            rather than converting the whole byte string and then parsing.
1008            Propagate errors in character encoding layer.
1009            (get_next_token): Precise error reporting for |bare stago| error.
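
The switch from decoding the whole byte string up front to a streaming decoder can be illustrated with Python's incremental decoders.  This is a sketch of the general technique only; the function name and the choice of "replace" error handling are assumptions and do not reflect how HTML.pm.src reports encoding-layer errors.

```python
import codecs


def decode_stream(chunks, encoding="utf-8"):
    """Yield decoded text chunk by chunk without buffering all input."""
    decoder = codecs.getincrementaldecoder(encoding)(errors="replace")
    for chunk in chunks:
        text = decoder.decode(chunk)
        if text:
            yield text
    # Flush: an incomplete trailing sequence becomes U+FFFD here.
    tail = decoder.decode(b"", final=True)
    if tail:
        yield tail


# A multi-byte character split across two chunks is decoded correctly.
print("".join(decode_stream([b"\xe3\x81", b"\x82a"])))
```

Because the decoder keeps the undecoded bytes between calls, the parser can consume characters as soon as they are available instead of waiting for the entire input.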
1010    
1011    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
1012    
1013            * HTML.pm.src (parse_char_stream): New method.
1014            (parse_char_string): This method is now defined as an invocation
1015            of the |parse_char_stream| method.
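
Defining the string entry point as an invocation of the stream entry point, as described above, can be sketched like this.  The |ToyParser| class is a hypothetical stand-in that merely collects characters; only the wrapping pattern is the point, not the parsing.

```python
import io


class ToyParser:
    """Hypothetical stand-in for the parser; it only collects characters."""

    def parse_char_stream(self, stream):
        # Pull characters one at a time, as a tokenizer would.
        chars = []
        while True:
            ch = stream.read(1)
            if ch == "":
                break
            chars.append(ch)
        return chars

    def parse_char_string(self, text):
        # The string entry point only wraps the string in a stream.
        return self.parse_char_stream(io.StringIO(text))


print(ToyParser().parse_char_string("<p>"))  # ['<', 'p', '>']
```

With this arrangement there is a single parsing code path, and the string method cannot drift out of sync with the stream method.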
1016    
1017    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
1018    
1019            * HTML.pm.src (parse_byte_string): Report various status
1020            of the sniffing as info-level errors.  Support for new
1021            decoding framework in parser resetting.
1022            (new): Various default error levels were not set.
1023    
1024    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
1025    
1026            * HTML.pm.src (parse_byte_string): HTML5 encoding sniffing
1027            algorithm, except for the actual sniffing, is implemented
1028            with the new framework with Message::Charset::Info.
1029    
1030    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
1031    
1032            * CacheManifest.pm (_parse): Drop fragment identifiers from
1033            URIs in fallback section (HTML5 revision 1596).
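
Dropping the fragment identifier from a URI, as done for the fallback section above, can be sketched with the Python standard library.  The helper name is an assumption for illustration; CacheManifest.pm itself is Perl and may do this differently.

```python
from urllib.parse import urldefrag


def strip_fragment(uri):
    """Return uri without its "#fragment" part, if any."""
    return urldefrag(uri).url


print(strip_fragment("http://example.com/fallback.html#top"))
```

URIs without a fragment identifier pass through unchanged.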
1034    
1035    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
1036    
1037            * Makefile (Entities.html): URI has changed.
1038    
1039    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
1040    
1041            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
1042            1553).
1043    
1044    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
1045    
1046            * ContentChecker.pm: Noted that those returned in |table| are
1047            no longer table elements, but table objects returned
1048            by Whatpm::HTMLTable.
1049    
1050            * HTMLTable.pm (form_table): Return table element node
1051            as |$table->{element}|.
1052            (assign_header): Support for the |headers=""| attribute.
1053    
1054    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
1055    
1056            * HTMLTable.pm (assign_header): New function; first version
1057            with no support for headers="".
1058            (form_table): Include table width and height to the returned
1059            table object for convenience.  Indexing in column assignment was wrong.
1060            Set whether a data cell is empty or not for convenience.
1061    
1062    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1063    
1064            * HTMLTable.pm: More robust caption support (HTML5 revision 1393).
1065    
1066    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1067    
1068            * HTMLTable.pm: How table model errors are detected is
1069            changed (HTML5 revision 1387).
1070    
1071  2008-05-05  Wakaba  <wakaba@suika.fam.cx>  2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1072    
1073          * HTMLTable.pm: The algorithm now moves |tfoot| elements          * HTMLTable.pm: The algorithm now moves |tfoot| elements
