/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.113 by wakaba, Sun Nov 4 04:15:06 2007 UTC revision 1.392 by wakaba, Sun Sep 6 01:30:08 2009 UTC
# Line 1  Line 1 
1    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Dropped the |eventsource| element (HTML5 revision
4            2870).
5    
6    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
7    
8            * HTML.pm.src (_tree_construction_main): Treate |keygen| as a void
9            element (HTML5 revision 2960).
10    
11    2009-09-05  Wakaba  <wakaba@suika.fam.cx>
12    
13            * HTML.pm.src: Moved the "in table text" insertion mode code up
14            such that it does also work as specced in HTML5 in the remaining
15            cases (HTML5 revision 3382).
16    
17    2009-09-05  Wakaba  <wakaba@suika.fam.cx>
18    
19            * HTML.pm.src: Implementing the "in table text" insertion mode,
20            first half (HTML5 revision 3382).
21    
22    2009-08-30  Wakaba  <wakaba@suika.fam.cx>
23    
24            * ContentChecker.pm: Allow |xml:lang| in HTML document if the
25            owner element is not an HTML element (HTML5 revision 3697).
26    
27    2009-08-23  Wakaba  <wakaba@suika.fam.cx>
28    
29            * ContentChecker.pm: Allow <input type=hidden> in context where
30            interactive content is disallowed (HTML5 revision 2392).
31    
32    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
33    
34            * HTML.pm.src: Dropped support for "XSLT-compat" and added
35            "about:legacy-compat" (HTML5 revision 2725).
36    
37    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
38    
39            * HTML.pm.src: "<!DOCTYPE>" is now explicitly supported in the
40            spec (HTML5 revision 2700).
41    
42    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
43    
44            * HTML.pm.src: Added support for <script> element in the "in
45            select" and "in select in table" insertion modes (HTML5 revision
46            2500).
47    
48    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
49    
50            * HTML.pm.src: An <hn> start tag in an <hn> element should close
51            the earlier element (HTML5 revision 2501).
52    
53    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
54    
55            * HTML.pm.src: The </br> end tags before the <body> start tag
56            should equally survive as the first br element child of the body
57            element (HTML5 revision 2586).
58    
59    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
60    
61            * HTML.pm.src: <font> in SVG land is handled as an SVG element as
62            long as it did not have HTML attributes (HTML5 revision 2590).
63    
64    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
65    
66            * HTML.pm.src (_tree_construction_main): Improved error reporting
67            on "select" start tag in "in select" and "in select in table"
68            parsing modes (c.f. HTML5 revision 3604).
69    
70    2009-07-26  Wakaba  <wakaba@suika.fam.cx>
71    
72            * mkhtmlclass.pl: Made xmlns:xlink with wrong namespace URL
73            invalid (HTML5 revision 2701).
74    
75    2009-07-26  Wakaba  <wakaba@suika.fam.cx>
76    
77            * HTML.pm.src: </rt>, </rp>, </option>, </optgroup> become
78            optional before </body> (HTML5 revision 2806).
79    
80    2009-07-25  Wakaba  <wakaba@suika.fam.cx>
81    
82            * HTML.pm.src: Marked where SVG |script| execution should be
83            implemented (HTML5 revision 2904).
84    
85    2009-07-25  Wakaba  <wakaba@suika.fam.cx>
86    
87            * HTML.pm.src (set_inner_html): The fragment parsing algorithm
88            inherits the document quirksness flag of the main document (HTML5
89            revision 2935).
90    
91    2009-07-05  Wakaba  <wakaba@suika.fam.cx>
92    
93            * HTML.pm.src (_tree_construction_main): Put |table| in |p| in
94            quirks mode (The Hixie Quirk, HTML5 revision 3132).
95    
96    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
97    
98            * HTML.pm.src: <kegten> breaks out of <select> (HTML5 revision
99            3163).
100    
101    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
102    
103            * HTML.pm.src: Don't change encoding to UTF-8 if the input
104            encoding is UTF-16 (HTML5 revision 3204).
105    
106    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
107    
108            * HTML.pm.src (parse_byte_stream): Don't change to another
109            encoding if the input is UTF-16 (HTML5 revision 3205).
110    
111    2009-06-28  Wakaba  <wakaba@suika.fam.cx>
112    
113            * HTML.pm.src: |isindex| should no longer generate |p| element
114            node (HTML5 revision 3236).
115    
116    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
117    
118            * ContentChecker.pm: Introduced new flag |has_autofocus|.  Added
119            support for new |id_type| value of |any|, which means any kind of
120            element is allowed as a referenced element.
121    
122    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
123    
124            * ContentChecker.pm: Introduced |check_attrs2| method for the ease
125            of defining codes for checking required attribute and as such.
126    
127    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
128    
129            * IMTChecker.pm: Added more definitions for subtypes.
130    
131    2008-12-11  Wakaba  <wakaba@suika.fam.cx>
132    
133            * URIChecker.pm: Some of |pos_end| values were wrong.
134    
135    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
136    
137            * ContentChecker.pm (check_element): Added support for "no
138            referenced datalist" error.
139    
140    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
141    
142            * URIChecker.pm: Bug fix: It did not work unless
143            Message::DOM::DOMImplementation has been |require|d.
144    
145    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
146    
147            * NanoDOM.pm (document_uri): New attribute.
148    
149            * ContentChecker.pm: Don't use methods not implemented by NanoDOM.
150    
151    2008-11-07  Wakaba  <wakaba@suika.fam.cx>
152    
153            * NanoDOM.pm (text_content): Don't create a Text node if the new
154            value is empty.
155    
156    2008-11-06  Wakaba  <wakaba@suika.fam.cx>
157    
158            * SWML/: New directory.
159    
160    2008-10-20  Wakaba  <wakaba@suika.fam.cx>
161    
162            * NanoDOM.pm (specified, all_declarations_processed,
163            manakai_attribute_type): New attributes.
164    
165    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
166    
167            * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
168            array.
169    
170    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
171    
172            * NanoDOM.pm (notation_name): New attribute.
173    
174    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
175    
176            * NanoDOM.pm (public_id, system_id): New attributes.a
177    
178    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
179    
180            * NanoDOM.pm (text_content): Moved to Node from Element.  Setter
181            implemented.
182            (allowed_tokens, default_type, declared_type): Implemented.
183    
184    2008-10-17  Wakaba  <wakaba@suika.fam.cx>
185    
186            * NanoDOM.pm (node_name): New attribute.
187            (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
188            constants.
189            (create_element_type_definition_node, create_attribute_definition,
190            create_notation, create_general_entity,
191            get_element_type_definition_node,
192            set_element_type_definition_node, get_general_entity_node,
193            set_general_entity_node, get_notation_node, set_notation_node,
194            get_attribute_definition_node, set_attribute_definition_node): New
195            methods.
196            (element_types, entities, notations, attribute_definitions): New
197            attributes.
198            (DocumentType): Support for child nodes, entities, notations, and
199            element types.
200            (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
201            New classes.
202    
203            * Dumper.pm: Support for general entities, notations, element type
204            definitions, and attribute definitions.
205    
206    2008-10-15  Wakaba  <wakaba@suika.fam.cx>
207    
208            * NanoDOM.pm (create_processing_instruction): New method.
209            (xml_version, xml_encoding, xml_standalone): New attributes.
210            (ProcessingInstruction): New class.
211    
212    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
213    
214            * HTML.pm.src: Handling of end tags in the foreign content
215            insertion mode was partially wrong, because of wrong bit
216            operations.
217    
218    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
219    
220            * NanoDOM.pm (dom_config): New attribute (do nothing), for
221            Whatpm::XML::Parser support.
222    
223    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
224    
225            * Makefile: New rule to make HTML/Tokenizer.pm is added.
226    
227            * HTML.pm.src: Tokenizer part moved to another file.
228    
229    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
230    
231            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
232    
233    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
234    
235            * HTML.pm.src: Element category constants redefined.
236            
237    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
238    
239            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
240            construction stage synced with the spec (HTML5 revisions 2139 and
241            2302).
242    
243    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
244    
245            * ContentChecker.pm: New error level "html5_fact" added, which
246            should be tentatively used until all of requirements are properly
247            specced as RFC 2119 "MUST" in HTML5.
248    
249    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
250    
251            * ContentChecker.pod: Note on internal flags is added.
252    
253    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
254    
255            * HTML.pm.src: An AAA bug fixed.
256    
257    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
258    
259            * HTML.pm.src: If another node is inserted by the parser, don't
260            reuse existing Text node to append a character (HTML5 revision
261            2124).
262    
263    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
264    
265            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
266            revisions 1731 and 2128).
267    
268    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
269    
270            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
271            Support for end tags of camelCase SVG elements were broken.  A
272            wrong error type text fixed.
273    
274    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
275    
276            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
277    
278    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
279    
280            * HTML.pm.src: Support for new definition of |param| and |source|
281            start tag parsing (HTML5 revision 1731).
282    
283    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
284    
285            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
286    
287    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
288    
289            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
290            revisions 1731 and 1831).
291    
292    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
293    
294            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
295            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
296            revision 1731).
297    
298    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
299    
300            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
301            revision 1731).  End tags of |option| and |optgroup| elements are
302            now optional (HTML5 revision 1731).
303    
304    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
305    
306            * HTML.pm.src: New "special" elements added to the list (HTML5
307            revision 1778).  "strile" -> "strike".
308    
309    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
310    
311            * ContentType.pm (get_sniffed_type): Support for the "better"
312            content sniffing (HTML5 revision 1927).  In a case the official
313            type was not returned when the method is invoked in the list
314            context.
315    
316    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
317    
318            * HTML.pm.src: Character references for non-space C0 characters,
319            including U+000B VT, DEL character, noncharacter code points, are
320            now converted to the U+FFFD character (cf. HTML5 revision 2138).
321    
322    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
323    
324            * ContentChecker.pm: |form=""| check support added.
325    
326    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
327    
328            * ContentChecker.pm: |contextmenu| validness is now checked using
329            |id| and |id_type| properties, and |menu| property is removed.
330    
331    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
332    
333            * ContentChecker.pm: Prepare for |form| |name| attribute's
334            duplication checking.
335    
336    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
337    
338            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
339            unsupported charset="" parameter value (HTML5 revision 2131).
340    
341    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
342    
343            * HTML.pm.src: Reminding places where U+000B is allowed as a space
344            character is fixed (cf. HTML5 revision 1738).
345    
346            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
347            space characters (HTML5 revision 1738).
348    
349    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
350    
351            * HTML.pm.src: The "anything else" case for the "after after body"
352            insertion mode was not updated to swtich to the "in body"
353            insertion mode.  U+000B is no longer a space character for the
354            purpose of tree construction phase (HTML5 revision 1738).
355    
356    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
357    
358            * HTML.pm.src: U+000B is no longer a space character (HTML5
359            revision 1738).
360    
361    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
362    
363            * ContentType.pm: 0x0B is no longer a space character (HTML5
364            revision 1738).
365    
366            * HTML.pm.src: U+000B is no longer a space character for the
367            algorithm for extracting an encoding from a Content-Type (HTML5
368            revision 1738).
369    
370    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
371    
372            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
373    
374    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
375    
376            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
377    
378            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
379            matching.
380    
381    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
382    
383            * LangTag.pm: Warn for private use language subtags.  Error level
384            typos fixed.  Support for Suppress-Script field.
385    
386            * mklangreg.pl: Support for dumping of nested structure.
387    
388    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
389    
390            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
391            recommended case as per RFC 4646.
392    
393    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
394    
395            * LangTag.pm (check_rfc4646_langtag): New method.
396    
397    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
398    
399            * mklangreg.pl: New script.
400    
401            * Makefile: Updated for creation of the module for language subtag
402            registry.
403            
404    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
405    
406            * Makefile: WebIDL.html added.
407    
408            * WebIDL.pod: New documentation.
409    
410    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
411    
412            * WebIDL.pm: Checker's error types are redefined.
413    
414    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
415    
416            * WebIDL.pm: Parser's error types are redefined.  Some forward
417            compatible parsing bugs are fixed.  Some unreachable codes are
418            commented out.
419    
420    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
421    
422            * WebIDL.pm: Support for the reminding extended attributes are
423            added.  It does not satisfy the definition that a forward
424            interface declaration has an extended attribute.  It seems that
425            unless explicitly allowed multiple extended attributes with the
426            same name is not allowed, though it is not explicitly mentioned in
427            the spec.
428    
429    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
430    
431            * WebIDL.pm: Unescapes extended attribute names and extended
432            attribute identifiers.  Preserve whether an extended attribute has
433            an argument list of not.  Support for extended attributes:
434            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
435            NameGetter, NameSetter, and Null.
436            (has_argument_list): New attribute.
437            (idl_text): Stringifies argument lists, if any, even if it is
438            empty.
439    
440    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
441    
442            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
443            |{s_kwd}| in DATA_STATE as default.
444    
445    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
446    
447            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
448            by |{s_kwd}| in DATA_STATE.
449    
450    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
451    
452            * HTML.pm.src: Shorten keys.
453    
454    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
455    
456            * HTML.pm.src: Remove checking for control character, surrogate
457            pair, or noncharacter code points and non-Unicode code
458            points (they should be handled by Whatpm::Charset::UnicodeChecker).
459            (parse_char_stream): Support for the |$get_wrapper| argument and
460            character stream error handlers.
461    
462    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
463    
464            * ContentChecker.pm: Don't call |loda_ns_module|
465            for null-namespace elements/attributes.
466    
467            * HTML.pm.src: Fact out $disallowed_control_chars
468            as a hash.
469    
470    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
471    
472            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
473            and |{next_char}| initializations are moved to initialization
474            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
475            with |parse_char_stream|.
476    
477    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
478    
479            * HTML.pm.src (parse_char_stream): Make |set_next_char|
480            invoke |manakai_read_until|, not only |read|, where
481            possible, to decrease the number of |read| method calls.
482    
483            * mkhtmlparser.pl: Related changes to the aforementioned
484            modification.
485    
486    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
487    
488            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
489            would report character error from now.
490    
491    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
492    
493            * HTML.pm.src: White-space-leaded non-white-space character
494            tokens in "before head insertion mode" was not
495            correctly handled.
496            (set_inner_html): Reimplemented using CharString decodehandle
497            class.  Support for $get_wrapper argument.  Support
498            for |{read_until}| feature.
499    
500    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
501    
502            * HTML.pm.src: Make a "bare ero" error for unknown
503            entities point the "&" character.
504    
505    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
506    
507            * HTML.pm.src: It turns out that U+FFFD don't have to
508            be added to the list of excluded characters.
509    
510    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
511    
512            * HTML.pm.src ($char_onerror): Have character decoder's |line|
513            and |column| a higher priority than the one set by the
514            tokenizer's input handler.
515            ($self->{read_until}): Exclude U+FFFD (but this might
516            not be necessary, since now we do line/column fixup in
517            the character decode handle).
518    
519    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
520    
521            * HTML.pm.src: Use |{read_until}| where possible.
522    
523    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
524    
525            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
526            and |manakai_getc_until| to |manakai_read_until| to
527            reduce the number of string copies.
528    
529    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
530    
531            * HTML.pm.src (parse_char_string): Use newly created
532            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
533            standard feature to |open| a string as a filehandle,
534            since Perl's string filehandle seems not supporting |ungetc|
535            method correctly.
536            (parse_char_stream): Define |{getc_until}| method.
537            (DATA_STATE): Experimental support for |getc_until| feature.
538    
539    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
540    
541            * HTML.pm.src: Check points added to newly added branches.
542    
543    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
544    
545            * HTML.pm.src: Remove |{char}|, which is no longer used.
546            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
547            and replaced by |{prev_state}|.
548    
549            * mkhtmlparser.pl: Remove |{char}| feature.
550            Remove |!!!back-next-input-character;| macro.
551    
552    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
553    
554            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
555            entity related tokenizer states in favor of new states
556            implementing the consume character reference algorithm.
557    
558    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
559    
560            * HTML.pm.src: "Consume a character reference" algorithm is
561            now implemented as a tokenizer's state, rather than
562            a method, with minimum changes (more changes will
563            be made, in due course).  "Bogus comment state"'s inner
564            loop gets removed.
565    
566    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
567    
568            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
569            into their own tokenizer states.
570    
571    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
572    
573            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
574            is split into three states.
575    
576    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
577    
578            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
579            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
580            no longer does the tokenizer have to push back next input
581            characters in those states.
582    
583    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
584    
585            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
586            into four states so that no longer does the tokenizer have to push
587            back next input characters in that state.
588    
589    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
590    
591            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
592            which can be used to insert some wrapper between the character
593            stream handle and the tokenizer.  (It is currently not supported
594            for |set_inner_html| for |Element|s).
595    
596    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
597    
598            * HTML.pm.src: Ignore punctuations in charset names.
599    
600    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
601    
602            * ContentChecker.pm: Support for charset-layer error levels.
603    
604            * HTML.pm.src: Don't specify |text| argument for the
605            |chardecode:fallback| error, since it is not the encoding
606            being used alternatively.
607    
608    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
609    
610            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
611    
612    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
613    
614            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
615    
616    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
617    
618            * HTML.pm.src: Bug fix and sync with the spec with regard
619            to after after frameset insertion mode processing (HTML5
620            revision 1909).  Note that the implementation was wrong
621            per the old spec before the r1909 changes.
622    
623    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
624    
625            * HTMLTable.pm: scope=auto algorithm fix synced with the
626            spec (HTML5 revision 2093).
627            ($process_row): Algorithm step numbers synced with the
628            spec (HTML5 revision 2092).
629    
630    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
631    
632            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
633            revision 2094).
634    
635    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
636    
637            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
638    
639    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
640    
641            * HTML.pm.src: '"' and "'" at the end of attribute
642            name (after another attribute) now raise parse error (HTML5
643            revision 2123).  Empty unquoted attribute values are no
644            longer allowed (HTML5 revision 2122).
645    
646    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
647    
648            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
649            revision 2130).
650    
651    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
652    
653            * ContentChecker.pm: |xml:lang| attribute value must be same
654            as |lang| attribute value for HTML elements (HTML5 revision 2062
655            and so on).
656    
657    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
658    
659            * ContentChecker.pm: Error level definition for |xml_id_error|
660            was missing.
661    
662            * URIChecker.pm: The end of the URL should be marked as the
663            error location for an empty path error.  The position
664            between the userinfo and the port components should be
665            marked as the error location for an empty host error.
666    
667    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
668    
669            * URIChecker.pm: Set parameters representing where in the
670            value the error occurs for errors.  Report unknown
671            address format error in warning level, since address
672            formats are rarely added.  Path segments starting with "/.."
673            were misinterpreted as a dot-segment.
674    
675    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
676    
677            * URIChecker.pm (check_iri_reference): Requires
678            |Message::DOM::DOMImplementation|.
679    
680    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
681    
682            * IMTChecker.pm: Updated for the new error reporting architecture.
683    
684            * ContentChecker.pm: Error levels for IMTs are added.
685    
686    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
687    
688            * H2H.pm (_shift_token): Support for unquoted HTML attribute
689            values.
690    
691    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
692    
693            * CacheManifest.pm: Support for new style of error
694            reports.
695    
696            * HTML.pm.src: Set line=1, column=1 to the document node.
697    
698    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
699    
700            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
701            and URL checkers.  Support for more error levels for bogus
702            langauge tag and URL "standards".
703    
704            * LangTag.pm, URIChecker.pm: Support for new style error
705            level reporting.
706    
707    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
708    
709            * ContentChecker.pm: Support for RDF/XML error levels.
710    
711            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
712            specifying.  Error types are revised.
713    
714    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
715    
716            * ContentChecker.pm: All error reporting method calls are
717            renewed.
718    
719    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
720    
721            * HTML.pm.src: All error type names and "text" parameters
722            are revised.  Use new style for "level" specification.
723    
724            * mkhtmlparser.pl: Use new style for "level" specification.
725    
726    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
727    
728            * WebIDL.pm (parse_char_string): Simplified error
729            reporting process for broken ignored valuetype definition.
730            (Valuetype idl_text): Support for special "DOMString" name.
731    
732    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
733    
734            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
735            terminal of the ScopedName is "DOMString", such that whether
736            the last part of the scoped name is "DOMString" or "_DOMString"
737            later.  It is necessary to determine whether a |typedef|
738            definition should be ignored or not.
739            (parse_char_string): Unescape the identifier of
740            exception members.
741            ($resolve): Return undef for builtin types and sequence<T>
742            types (we might not have to do this, however...).
743            (check): Support checking for Exceptions, Valuetypes,
744            and Typedefs.
745            ($serialize_type): Support for "DOMString::::" syntax.
746            (Typedef idl_text): Output Type as "DOMString" if it
747            is really "DOMString" (i.e. its internal representation
748            is "::DOMString::").
749    
750    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
751    
752            * WebIDL.pm ($resolve): New code, based on resolve code
753            for constant types in the |check| method.
754            (check): Support for checking of attributes, operations, and
755            arguments.
756            (Attribute/Operation idl_text): Exception names in getraises,
757            setraises, and raises clauses is serizlied by |$serialize_type|
758            code.
759    
760    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
761    
762            * WebIDL.pm ($integer): Order of selections are changed to match
763            hexadecimal numbers (the original pattern, taken from the spec,
764            was not work for hexadecimal numbers, because the "0" prefix
765            matches to the [0-7]* part (as an empty string) and therefore
766            it does not match with remaining "x..." part of a "0x..." integer
767            literal.
768            ($get_type): It now returns a string, not an array reference,
769            for regular types and |sequence| types (i.e. it in any case
770            returns a string).
771            ($get_next_token): The second item in the array that represents
772            a integer or float token is now a Perl number value, not the
773            original string representation of the number.
774            (check): Support for const value consistency checking.
775            No extended attribute is defined for constants.
776            (Node subclasses): Use simple strings rather than array references
777            for default data type values.
778            ($serialize_type): Type values are now simple strings.
779            (value): If the new attribute value is a false value, then
780            a FALSE value is set to the attribute.
781    
782    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
783    
784            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
785            in its stringified format ("scoped name" as defined in the
786            spec).  Note that future version of this module should not use
787            array references for type values and the |type_text| attribute
788            should be made obsolete.
789            (parse_char_string): Unescape attribute names.
790            (check): Support for checking of whether inherited interfaces
791            are actually defined or not.  Support for checking of whether
792            interface member identifiers are duplicated or not.
793            ($serialize_type): Scoped names are returned as is.  A future
794            version of this code should escape identifiers other than "DOMString",
795            otherwise the idl_text would be non-conforming.
796    
797    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
798    
799            * WebIDL.pm (parse_char_string): Set line/column numbers
800            to generated nodes.  Unescape identifiers.  Extended attributes
801            for Definition's were ignored.
802            (append_child): Set |parent_node| attribute.
803            (parent_node): New attribute.
804            (check): Support interface/exception members.  Support
805            extended attributes.  Support definition identifier uniqueness
806            constraint.
807            (qualified_name): New attribute.
808            (Interface/Exception idl_text): Extended attributes were
809            not prepended to the returned text.
810    
811    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
812    
813            * WebIDL.pm (parse_char_string): Set line/column numbers
814            to interface object experimentally.  s/shift/pop/g, shift
815            would make things wrong.  Support for interface forward
816            declarations was missing.  Broken interface declarations
817            with no block were not ignored entirely.
818            (Whatpm::WebIDL::Node): New abstract class.  This class
819            makes things easier.
820            (child_nodes): New attribute.  Unlike DOM's attribute with
821            same name, this attribute returns a dead list of nodes for
822            simplicity.
823            (get_user_data, set_user_data): New methods.
824            (Module idl_text): A SPACE character should be inserted
825            before the |{| character.
826            (Interface idl_text): Support for interface forward declarations.
827            (is_forward_declaration): New attribute.
828    
829    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
830    
831            * WebIDL.pm (type_text): Better serializer.
832    
833    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
834    
835            * WebIDL.pm: Revise forward-compatible parsing so that
836            it now can handle broken extended attributes and as such.
837    
838    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
839    
840            * WebIDL.pm: Real support for extended attributes.
841            Support for extended attributes with arguments.
842    
843    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
844    
845            * WebIDL.pm: Support for |exception| syntax.
846            (Interface->idl_text): Tentative support for inheritances.
847    
848    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
849    
850            * WebIDL.pm: Hierarchical scoped name support was broken.
851            Support for raises, setraises, and getraises syntaxes.
852    
853    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
854    
855            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
856            proper support for types, extended attributes, and exceptions yet).
857            WebIDL parser, version 1 (no support for exceptions yet,
858            no proper support for extended attributes yet).
859    
860    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
861    
862            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
863    
864    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
865    
866            * WebIDL.pm: Support for valuetype and const.
867    
868    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
869            
870            * WebIDL.pm: New module.
871    
872    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
873    
874            * Makefile (Entities.html): URI changed.
875    
876    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
877    
878            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
879    
880    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
881    
882            * HTML.pm.src (_get_next_token): A parse error was missing.
883    
884    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
885    
886            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
887            spec (commented out). (HTML5 revision 1711).
888    
889    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
890    
891            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
892    
893            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
894            as UTF-16 (HTML5 revision 1701).
895    
896    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
897    
898            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
899            1692).
900    
901    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
902    
903            * HTML.pm.src: The secondary insertion mode used when switching
904            to foreign content is the "in body" insertion mode (HTML5 revision
905            1696).
906    
907    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
908    
909            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
910            revision 1697).
911    
912    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
913    
914            * HTML.pm.src: Support for end-of-file token in foreign content
915            insertion mode (HTML5 revision 1693).  Update SVG camelCase
916            attribute list (HTML5 revision 1700).  <textarea> closes
917            </select> (HTML5 revision 1699).  More start tags close in
918            foreign content insertion mode (HTML5 revision 1698).
919    
920    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
921    
922            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
923    
924    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
925    
926            * HTML.pm.src: More robust charset parameter detection (HTML5
927            revision 1674).
928    
929    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
930    
931            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
932            revision 1676).
933    
934    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
935    
936            * HTML.pm.src: Ignore language part of public identifiers for
937            quriks mode detection (HTML5 revision 1679).
938    
939    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
940    
941            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
942            revision 1685).
943    
944    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
945    
946            * HTML.pm.src: Support for EOF in new states for tags (HTML5
947            revision 1684).
948    
949    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
950    
951            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
952            work (HTML5 revision 1690).
953    
954    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
955    
956            * HTML.pm.src (_tree_construction_main): Change handling of
957            end tags in head insertion modes (HTML5 revision 1686).
958            (parse_char_string): Bug fix for non-utf8 character string handlings.
959            (parse_char_stream): |ungetc| does not work well for this context.
960    
961    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
962    
963            * HTML.pm.src (parse_byte_string): Redefined to invoke
964            |parse_byte_stream|.
965            (parse_byte_stream): New method.
966    
967    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
968    
969            * HTML.pm.src (parse_byte_string): Fix the column number reported
970            by encoding layer error reporter.
971    
972    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
973    
974            * HTML.pm.src (parse_byte_string): Use streaming decoder
975            rather than converting the whole byte string and then parsing.
976            Propagate errors in character encoding layer.
977            (get_next_token): Precise error reporting for |bare stago| error.
978    
979    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
980    
981            * HTML.pm.src (parse_char_stream): New method.
982            (parse_char_string): This method is now defined as an invocation
983            of the |parse_char_stream| method.
984    
985    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
986    
987            * HTML.pm.src (parse_byte_string): Report various status
988            of the sniffing as info-level errors.  Support for new
989            decoding framework in parser resestting.
990            (new): Various default error levels were not set.
991    
992    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
993    
994            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
995            algorithm, except for the actual sniffing, is implemented
996            with new framework with Message::Charset::Info.
997    
998    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
999    
1000            * CacheManifest.pm (_parse): Drop fragment identifiers from
1001            URIs in fallback section (HTML5 revision 1596).
1002    
1003    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
1004    
1005            * Makefile (Entities.html): URI has changed.
1006    
1007    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
1008    
1009            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
1010            1553).
1011    
1012    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
1013    
1014            * ContentChecker.pm: Noted that those returned in |table| are
1015            no longer table elements, but table objects returned
1016            by Whatpm::HTMLTable.
1017    
1018            * HTMLTable.pm (form_table): Return table element node
1019            as |$table->{element}|.
1020            (assign_header): Support for the |headers=""| attribute.
1021    
1022    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
1023    
1024            * HTMLTable.pm (assign_header): New function; first version
1025            with no support for headers="".
1026            (form_table): Include table width and height to the returned
1027            table object for covenience.  Indexing in column assignement was wrong.
1028            Set whether a data cell is empty or not for convenience.
1029    
1030    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1031    
1032            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
1033    
1034    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1035    
1036            * HTMLTable.pm: How table model errors are detected is
1037            changed (HTML5 revision 1387).
1038    
1039    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1040    
1041            * HTMLTable.pm: The algorithm now moves |tfoot| elements
1042            to the end of the table (HTML5 revision 1380).
1043    
1044    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1045    
1046            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
1047            of 1-based (HTML5 revision 1376).
1048    
1049    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1050    
1051            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
1052            and "Content-Encoding" no longer prevent sniffing (HTML5
1053            revision 1288).
1054    
1055    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1056    
1057            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
1058            1282).
1059    
1060    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
1061    
1062            * ContentChecker.pm: Support for global attributes.
1063            Status of XML specs are added.
1064    
1065    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
1066    
1067            * ContentChecker.pm (check_element): Support for |template=""|
1068            and |ref=""| attribute (referent element type checking).
1069    
1070    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
1071    
1072            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
1073            1500) is implemented (except for IDNA part and URI-scheme-specific
1074            knowledge).  Line number counting was wrong for LF-only
1075            documents.
1076    
1077    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1078    
1079            * HTML.pm.src: Raise an parse error for any disallowed
1080            character (HTML5 revision 1263).
1081    
1082    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1083    
1084            * mkentitylist.pl: Support for new HTML5 entity table format (the
1085            definition for |AElig;| was missing).
1086    
1087    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1088    
1089            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
1090            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
1091    
1092    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1093    
1094            * HTML.pm.src: List of element names that close foreign content
1095            insertion mode is added (HTML5 revisions 1412 and 1418).
1096    
1097    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1098    
1099            * HTML.pm.src: Support for |mglyph| and |malignmark|
1100            elements (HTML5 revision 1410).
1101    
1102    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1103    
1104            * HTML.pm.src: Support for new long MathML entities (HTML5
1105            revision 1406).
1106    
1107    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1108    
1109            * HTML.pm.src: CDATA section support for MathML and SVG
1110            elements (HTML5 revision 1404 and 1420).
1111    
1112    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
1113    
1114            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
1115            elements (HTML5 revision 1404).  Unused !!!macro definitions
1116            are removed.
1117    
1118    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
1119    
1120            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
1121            are raised is changed (HTML5 revision 1404).
1122    
1123    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
1124    
1125            * HTML.pm.src: Category sets that are no longer used are removed.
1126    
1127    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
1128    
1129            * HTML.pm.src: The ->[1] property of stack entries are now
1130            replaced by constants representing element category.
1131    
1132    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
1133    
1134            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
1135            for error reporting.  (This is a preparation for using constant
1136            value for ->[1].)
1137    
1138    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
1139    
1140            * RDFXML.pm: Typo fixed.
1141    
1142    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
1143    
1144            * ContentChecker.pm: |fact_level| is now treated
1145            as same as |must_level|, i.e. level = |m|.
1146            (check_element): Make list of URIs in the DOM.
1147    
1148    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1149    
1150            * RDFXML.pm: Language accessor implemented.
1151            Local (null-namespace) attribute support.
1152    
1153    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1154    
1155            * RDFXML.pm: Factored out ID checking code.
1156    
1157    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1158    
1159            * RDFXML.pm: TODO items noted.  Validation of ID
1160            and URI attributes is implemented.  Warn if unknown
1161            value is used in rdf:parseType="" attribute.
1162    
1163            * URIChecker.pm (check_rdf_uri_reference): New function.
1164    
1165    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1166    
1167            * RDFXML.pm: bnodeid implemented.  Relative references
1168            are now resolved.
1169    
1170    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1171    
1172            * ContentChecker.pm: RDF reification implemented.
1173    
1174            * RDFXML.pm: undef vs false bug fixed.
1175            Reification implemented.
1176    
1177    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1178    
1179            * RDFXML.pm: s/id/ID/ for attribute name.
1180            The |node| arguments are added for |ontriple| calls.
1181            Too many "attribute not allowed" errors were raised.
1182    
1183            * ContentChecker.pm: Initial experimental support for rdf:RDF
1184            element.
1185    
1186    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1187    
1188            * RDFXML.pm: New module.
1189    
1190    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1191    
1192            * HTML.pm.src (set_inner_html): Line/column number
1193            code was old one yet.
1194    
1195    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1196    
1197            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
1198            errors.  Line/column markings for DOCTYPE, comment, and
1199            character tokens are reintroduced; otherwise,
1200            error location for "not HTML5" error and errors
1201            for implied elements are not attached.
1202    
1203    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1204    
1205            * HTML.pm.src: Set line/column numbers to attributes.
1206    
1207            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
1208            (value): Setter implemented.
1209    
1210            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
1211    
1212    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1213    
1214            * HTML.pm.src: Unused line/column markings are removed.
1215    
1216    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1217    
1218            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
1219            flag, which is no longer used.
1220    
1221    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
1222    
1223            * HTML.pm.src: Set line/column information to element nodes.
1224    
1225            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
1226            and !!!insert-element-t): Set line/column information to
1227            element nodes.
1228    
1229    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
1230    
1231            * HTML.pm.src (_get_next_token): The first "<" character
1232            in "<?", "<>", or "</>" should be the error point.
1233    
1234    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1235    
1236            * HTML.pm.src: Some more fixes on error position reporting.
1237    
1238    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1239    
1240            * HTML.pm.src: Token-level precious error reporting.
1241    
1242    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1243    
1244            * HTML.pm.src: Preparation for more precious error point
1245            reporting.
1246    
1247    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1248    
1249            * HTML.pm.src: Error type revised.
1250    
1251    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1252    
1253            * HTML.pm.src: Similar codes are merged together, again.
1254    
1255    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1256    
1257            * HTML.pm.src: Similar codes are merged together.
1258    
1259    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
1260    
1261            * mkhtmlparser.pl: Set "level" parameter to parse errors.
1262    
1263            * HTML.pm.src: Code refined.
1264    
1265    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1266    
1267            * HTML.pm.src: |</body>| treatement has been changed (HTML5
1268            revision 1348).  Note that I really don't know this makes
1269            any difference in the black-box behavior of the parser.
1270    
1271    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1272    
1273            * HTML.pm.src: New end-of-file token implementation (HTML5
1274            revision 1348).
1275    
1276    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1277    
1278            * HTML.pm.src: |applet| support (HTML5 revision 1347).
1279    
1280    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1281    
1282            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1283    
1284    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1285    
1286            * HTML.pm.src: Support for |<input>| in the "in select" insertion
1287            mode and support for the "in select in table" insertion mode (HTML5
1288            revision 1342).
1289    
1290    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1291    
1292            * HTML.pm.src: No foster parenting for <script> and <script>
1293            in non-tainted <table>s (HTML5 revision 1336).
1294    
1295    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1296    
1297            * HTML.pm.src: Ignore white space characters between <html>
1298            and <head> (HTML5 revision 1332).
1299    
1300    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1301    
1302            * HTML.pm.src: Treat <input type=hidden> as if it were a
1303            white space (HTML5 revision 1331).
1304    
1305    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1306    
1307            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1308            element (HTML5 revision 1330).
1309    
1310    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1311    
1312            * HTML.pm.src: <title> is always appended to the current
1313            element (HTML5 revision 1328).
1314    
1315    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1316    
1317            * HTML.pm.src: White space in tainted tables are moved
1318            into foster parents (HTML5 revision 1326).
1319    
1320    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1321    
1322            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1323            revision 1321).
1324    
1325    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1326    
1327            * HTML.pm.src: |</h/n/>| case code rearranged to align with
1328            the spec (HTML5 revision 1320).  Note that we finally complete
1329            all of HTML5 revision 1320 changes.
1330    
1331    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1332    
1333            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1334            tags (HTML5 revision 1320).
1335    
1336    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1337    
1338            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1339    
1340    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1341    
1342            * HTML.pm.src: A "generate implied end tags" code (t409.1)
1343            could not be reached so that it is now removed (HTML5 revision
1344            1320).
1345    
1346    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1347    
1348            * HTML.pm.src: Code for the case of |</div>| and so on are revised
1349            to align with new spec text (HTML5 revision 1320).
1350    
1351    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1352    
1353            * HTML.pm.src: Remove strange |if| condition; however, it should
1354            have had no harm in theory.
1355    
1356    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1357    
1358            * HTML.pm.src (_tree_construction_main): '</p>' in body
1359            case is split from other end tags for the preparation
1360            of implementing HTML revision 1320.
1361    
1362    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1363    
1364            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1365            revision 1320).
1366    
1367    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1368    
1369            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1370            and "div" (HTML5 revision 1318).  Add comments to where
1371            |form| pointer association codes should be inserted (HTML5 revision
1372            1319).
1373    
1374    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
1375    
1376            * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1377    
1378    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1379    
1380            * HTML.pm.src: Since the case t268 should never be reached (no
1381            other token type, there are), it is replaced by a |die| statement.
1382    
1383    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1384    
1385            * HTML.pm.src: Typo fixed.
1386    
1387    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
1388    
1389            * HTML.pm.src (_tree_construction_initial): Some limited quirks
1390            doctypes were not uppercased for comparison.
1391    
1392    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1393    
1394            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1395            are added.
1396    
1397    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1398    
1399            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1400            are set.  Cases that are unlikely reached are noted as so.
1401    
1402    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1403    
1404            * HTML.pm.src: Checkpoints for debugging are added.
1405    
1406            * mkhtmlparser.pl: Support for |!!!cp| syntax.
1407    
1408    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1409    
1410            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1411            for simplicity.
1412    
1413    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1414    
1415            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1416            revision 1307).
1417    
1418    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1419    
1420            * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1421            is set are changed (HTML5 revision 1305).
1422    
1423    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1424    
1425            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1426            1292).  Entities are not parsed in comment-like part in RCDATA
1427            elements (HTML5 revision 1294).  Allow bare & at the end
1428            of attribute value literals (HTML5 revision 1296).  More
1429            quirks mode doctypes (HTML5 revision 1302).  Requires spaces
1430            between attributes and ban attribute names or unquoted
1431            attribute values containing single or double quotes (HTML5
1432            revision 1303).
1433    
1434    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1435    
1436            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
1437            and related errors unless it is an HTML document (though the spec
1438            is unclear on whether it is applied to XHTML document).
1439    
1440            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1441            are added.
1442    
1443    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1444    
1445            * HTML.pm.src: s/local_name/manakai_local_name/g.
1446    
1447    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
1448    
1449            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1450    
1451            * HTML.pm.src: |charset| in |content| attribute is
1452            case-insensitive (HTML5 revision 1270).
1453    
1454    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
1455    
1456            * ContentChecker.pm: New status constants are added.
1457            ($ElementDefault): |status| added.
1458            (check_element): Err for non-standard or deprecated elements.
1459            (_attr_status_info): For non-standard or deprecated attributes.
1460    
1461    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1462    
1463            * ContentChecker.pm (_attr_status_info): New internal method.
1464    
1465    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1466    
1467            * ContentChecker.pm (check_element): Element standardized
1468            status information is now dispatched.
1469    
1470    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1471    
1472            * ContentChecker.pm (check_element): Fix |del|-and-significant
1473            problem by adding some more arguments.
1474    
1475    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1476    
1477            * ContentChecker.pm (check_element): Use context of
1478            container-for-the-purpose-of-content-model element (not transparent
1479            element) for |check_child_element| calling and significant
1480            text flag marking.  This reintroduces |<del>|-and-significant
1481            problem again.
1482    
1483    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1484    
1485            * ContentChecker.pm (check_element): Make semi-transparent
1486            elements ignored for the purpose of phase changes in
1487            content model checking.
1488    
1489    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1490    
1491            * ContentChecker.pm (check_element): In-element state
1492            was not properly managed for transparent cases.
1493    
1494    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1495    
1496            * ContentChecker.pm (check_element): Support for |video|
1497            and |audio| as semi-transparent elements.
1498    
1499    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1500    
1501            * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1502            (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1503            Support for |html:object| as a semi-transparent element.
1504    
1505    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1506    
1507            * ContentChecker.pm (check_element): The way to traverse
1508            the tree is entirely revised to make it easier to track
1509            the state of ancestors/descendants.  As a result of this
1510            revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1511            support for content model checking for HTML elements |figure|,
1512            |object|, |video|, and |audio| and checking for XML elements (and
1513            some XMLNS checkings) are dropped for now.  They will be
1514            reimplemented in due cource.
1515    
1516    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1517    
1518            * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1519            token (HTML5 revision 1225).
1520    
1521    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1522    
1523            * ContentChecker.pm ({unsupported_level}): New value.
1524    
1525            * HTML.pm.src: Save whether |meta| |content| attribute
1526            contains character references or not.
1527    
1528    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1529    
1530            * ContentChecker.pm (_get_children): (Incompleted) attempt to
1531            imlement significant content checking for contents
1532            with |del| elements.
1533    
1534    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
1535    
1536            * ContenteChecker.pm ($HTMLTransparentElements): More
1537            elements are added.
1538            (_get_children): HTML |object| elements are now semi-transparent.
1539    
1540            * NanoDOM.pm (manakai_html, manakai_head): New methods.
1541    
1542    2008-02-16  Wakaba  <wakaba@suika.fam.cx>
1543    
1544            * CacheManifest.pm: HTML5 revision 1211 implemented.
1545    
1546            * CacheManifest.pod: Updated.
1547    
1548    2008-02-10  Wakaba  <wakaba@suika.fam.cx>
1549    
1550            * ContentChecker.pm (check_document, check_element): Support
1551            for second argument ($onsubdoc).
1552            (_get_css_parser): Removed (now it is part of WDCC).
1553    
1554    2008-02-09  Wakaba  <wakaba@suika.fam.cx>
1555    
1556            * ContentChecker.pm (_get_css_parser): New.
1557    
1558    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1559    
1560            * ContentChecker.pm ($AnyChecker): Old way to add child elements
1561            for checking had been used.
1562    
1563    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
1564    
1565            * ContentChecker.pm (check_element): New todo item type |descendant|.
1566    
1567    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1568    
1569            * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1570            specced in RFC 2046 and RFC 4288.
1571            (application/atom+xml): Definition added.
1572    
1573    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1574    
1575            * URIChecker.pm: Make RFC 3986 should-level errors
1576            warnings (rather than SHOULD-level errors).
1577    
1578    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1579    
1580            * NanoDOM.pm (get_user_data, set_user_data): New methods.
1581    
1582            * HTML.pm.src: A flag for character references in attribute
1583            values are added.  Set |manakai_has_reference| user data
1584            to |charset| attribute.
1585    
1586    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
1587    
1588            * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1589            attributes.
1590    
1591            * ContentChecker.pm (check_document): Warn if charset requirements
1592            cannot be tested.
1593    
1594    2007-11-19  Wakaba  <wakaba@suika.fam.cx>
1595    
1596            * HTML.pm.src (parse_byte_string): Detect charset
1597            by universalchardet if charset parameter is not specified.
1598    
1599            * Makefile (Charset-all, Charset-clean): New rules.
1600    
1601    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1602    
1603            * ContentChecker.pm (check_document): Check the existence
1604            of character encoding declaration and selection of encoding
1605            for HTML document.
1606    
1607    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1608    
1609            * ContentType.pm (get_sniffed_type): Return also the official
1610            type in list context.
1611    
1612    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1613    
1614            * ContentType.pm: Sniffing with leading white space
1615            ignoring (HTML5 revisions 1013 and 1016).
1616    
1617    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
1618    
1619            * ContentType.pm: HTML5 revision 1013 changes, except for leading
1620            white spaces, are implemented.
1621    
1622    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1623    
1624            * HTML.pm.src (parse_char_string): Set |inner_encoding|
1625            attribute if possible.
1626    
1627    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1628    
1629            * HTML.pm.src (parse_byte_string): New method.
1630            (parse_char_string): New alias for |parse_string|.
1631            (main phase): Invoking "change the encoding" algorithm if desired.
1632    
1633            * HTML.pod: Updated.
1634    
1635    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1636    
1637            * HTML.pod (get_inner_html): Removed.
1638    
1639            * Makefile (HTML-all, HTML-clean): New.
1640    
1641    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1642    
1643            * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1644    
1645    2007-11-08  Wakaba  <wakaba@suika.fam.cx>
1646    
1647            * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1648    
1649    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1650    
1651            * Makefile: |CacheManifest.html| is added.
1652    
1653            * CacheManifest.pod: New file.
1654    
1655    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1656    
1657            * CacheManifest.pm: New module.
1658    
1659  2007-11-04  Wakaba  <wakaba@suika.fam.cx>  2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1660    
1661          * HTML.pm.src: Support for application cache selection algorithm          * HTML.pm.src: Support for application cache selection algorithm

Legend:
Removed from v.1.113  
changed lines
  Added in v.1.392

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24