/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.141 by wakaba, Sat Feb 23 15:24:49 2008 UTC revision 1.400 by wakaba, Sun Sep 6 10:21:13 2009 UTC
# Line 1  Line 1 
1    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: |<hr/>| was incorrectly marked as an error.
4    
5    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
6    
7            * HTML.pm.src: Changed the handling of </html> end tag of "in
8            body" insertion mode according to the latest spec definition.
9    
10    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
11    
12            * HTML.pm.src: Rewrote end tag handling in |head| area (cf. HTML5
13            revision 2730, but it was entirely broken, maybe I missed some
14            spec changes before rev.2730).
15    
16    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
17    
18            * HTML.pm.src: Added the |hgroup| element (HTML5 revision 3039 and
19            HTML5 revision 3040).
20    
21    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
22    
23            * HTML.pm.src: Added |xmp| to the list of Special elements (HTML5
24            revision 3689).  It should make no difference since the |xmp|
25            element has the PCDATA content and cannot be the non-bottommost
26            element in the stack of open elements.
27    
28    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
29    
30            * HTML.pm.src: Set the |confidence| flag to |irrlevant| when the
31            fragment parsing algorithm is invoked (HTML5 revision 2699).
32    
33    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
34    
35            * HTML.pm.src: Simplified foster parenting (HTML5 revision 2732).
36    
37    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
38    
39            * HTML.pm.src: No longer does foster parenting |style|, |script|,
40            and |input type=hidden| elements (HTML5 revision 2988).
41    
42    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
43    
44            * HTML.pm.src: Dropped the |eventsource| element (HTML5 revision
45            2870).
46    
47    2009-09-06  Wakaba  <wakaba@suika.fam.cx>
48    
49            * HTML.pm.src (_tree_construction_main): Treate |keygen| as a void
50            element (HTML5 revision 2960).
51    
52    2009-09-05  Wakaba  <wakaba@suika.fam.cx>
53    
54            * HTML.pm.src: Moved the "in table text" insertion mode code up
55            such that it does also work as specced in HTML5 in the remaining
56            cases (HTML5 revision 3382).
57    
58    2009-09-05  Wakaba  <wakaba@suika.fam.cx>
59    
60            * HTML.pm.src: Implementing the "in table text" insertion mode,
61            first half (HTML5 revision 3382).
62    
63    2009-08-30  Wakaba  <wakaba@suika.fam.cx>
64    
65            * ContentChecker.pm: Allow |xml:lang| in HTML document if the
66            owner element is not an HTML element (HTML5 revision 3697).
67    
68    2009-08-23  Wakaba  <wakaba@suika.fam.cx>
69    
70            * ContentChecker.pm: Allow <input type=hidden> in context where
71            interactive content is disallowed (HTML5 revision 2392).
72    
73    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
74    
75            * HTML.pm.src: Dropped support for "XSLT-compat" and added
76            "about:legacy-compat" (HTML5 revision 2725).
77    
78    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
79    
80            * HTML.pm.src: "<!DOCTYPE>" is now explicitly supported in the
81            spec (HTML5 revision 2700).
82    
83    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
84    
85            * HTML.pm.src: Added support for <script> element in the "in
86            select" and "in select in table" insertion modes (HTML5 revision
87            2500).
88    
89    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
90    
91            * HTML.pm.src: An <hn> start tag in an <hn> element should close
92            the earlier element (HTML5 revision 2501).
93    
94    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
95    
96            * HTML.pm.src: The </br> end tags before the <body> start tag
97            should equally survive as the first br element child of the body
98            element (HTML5 revision 2586).
99    
100    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
101    
102            * HTML.pm.src: <font> in SVG land is handled as an SVG element as
103            long as it did not have HTML attributes (HTML5 revision 2590).
104    
105    2009-08-16  Wakaba  <wakaba@suika.fam.cx>
106    
107            * HTML.pm.src (_tree_construction_main): Improved error reporting
108            on "select" start tag in "in select" and "in select in table"
109            parsing modes (c.f. HTML5 revision 3604).
110    
111    2009-07-26  Wakaba  <wakaba@suika.fam.cx>
112    
113            * mkhtmlclass.pl: Made xmlns:xlink with wrong namespace URL
114            invalid (HTML5 revision 2701).
115    
116    2009-07-26  Wakaba  <wakaba@suika.fam.cx>
117    
118            * HTML.pm.src: </rt>, </rp>, </option>, </optgroup> become
119            optional before </body> (HTML5 revision 2806).
120    
121    2009-07-25  Wakaba  <wakaba@suika.fam.cx>
122    
123            * HTML.pm.src: Marked where SVG |script| execution should be
124            implemented (HTML5 revision 2904).
125    
126    2009-07-25  Wakaba  <wakaba@suika.fam.cx>
127    
128            * HTML.pm.src (set_inner_html): The fragment parsing algorithm
129            inherits the document quirksness flag of the main document (HTML5
130            revision 2935).
131    
132    2009-07-05  Wakaba  <wakaba@suika.fam.cx>
133    
134            * HTML.pm.src (_tree_construction_main): Put |table| in |p| in
135            quirks mode (The Hixie Quirk, HTML5 revision 3132).
136    
137    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
138    
139            * HTML.pm.src: <kegten> breaks out of <select> (HTML5 revision
140            3163).
141    
142    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
143    
144            * HTML.pm.src: Don't change encoding to UTF-8 if the input
145            encoding is UTF-16 (HTML5 revision 3204).
146    
147    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
148    
149            * HTML.pm.src (parse_byte_stream): Don't change to another
150            encoding if the input is UTF-16 (HTML5 revision 3205).
151    
152    2009-06-28  Wakaba  <wakaba@suika.fam.cx>
153    
154            * HTML.pm.src: |isindex| should no longer generate |p| element
155            node (HTML5 revision 3236).
156    
157    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
158    
159            * ContentChecker.pm: Introduced new flag |has_autofocus|.  Added
160            support for new |id_type| value of |any|, which means any kind of
161            element is allowed as a referenced element.
162    
163    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
164    
165            * ContentChecker.pm: Introduced |check_attrs2| method for the ease
166            of defining codes for checking required attribute and as such.
167    
168    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
169    
170            * IMTChecker.pm: Added more definitions for subtypes.
171    
172    2008-12-11  Wakaba  <wakaba@suika.fam.cx>
173    
174            * URIChecker.pm: Some of |pos_end| values were wrong.
175    
176    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
177    
178            * ContentChecker.pm (check_element): Added support for "no
179            referenced datalist" error.
180    
181    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
182    
183            * URIChecker.pm: Bug fix: It did not work unless
184            Message::DOM::DOMImplementation has been |require|d.
185    
186    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
187    
188            * NanoDOM.pm (document_uri): New attribute.
189    
190            * ContentChecker.pm: Don't use methods not implemented by NanoDOM.
191    
192    2008-11-07  Wakaba  <wakaba@suika.fam.cx>
193    
194            * NanoDOM.pm (text_content): Don't create a Text node if the new
195            value is empty.
196    
197    2008-11-06  Wakaba  <wakaba@suika.fam.cx>
198    
199            * SWML/: New directory.
200    
201    2008-10-20  Wakaba  <wakaba@suika.fam.cx>
202    
203            * NanoDOM.pm (specified, all_declarations_processed,
204            manakai_attribute_type): New attributes.
205    
206    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
207    
208            * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
209            array.
210    
211    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
212    
213            * NanoDOM.pm (notation_name): New attribute.
214    
215    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
216    
217            * NanoDOM.pm (public_id, system_id): New attributes.a
218    
219    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
220    
221            * NanoDOM.pm (text_content): Moved to Node from Element.  Setter
222            implemented.
223            (allowed_tokens, default_type, declared_type): Implemented.
224    
225    2008-10-17  Wakaba  <wakaba@suika.fam.cx>
226    
227            * NanoDOM.pm (node_name): New attribute.
228            (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
229            constants.
230            (create_element_type_definition_node, create_attribute_definition,
231            create_notation, create_general_entity,
232            get_element_type_definition_node,
233            set_element_type_definition_node, get_general_entity_node,
234            set_general_entity_node, get_notation_node, set_notation_node,
235            get_attribute_definition_node, set_attribute_definition_node): New
236            methods.
237            (element_types, entities, notations, attribute_definitions): New
238            attributes.
239            (DocumentType): Support for child nodes, entities, notations, and
240            element types.
241            (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
242            New classes.
243    
244            * Dumper.pm: Support for general entities, notations, element type
245            definitions, and attribute definitions.
246    
247    2008-10-15  Wakaba  <wakaba@suika.fam.cx>
248    
249            * NanoDOM.pm (create_processing_instruction): New method.
250            (xml_version, xml_encoding, xml_standalone): New attributes.
251            (ProcessingInstruction): New class.
252    
253    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
254    
255            * HTML.pm.src: Handling of end tags in the foreign content
256            insertion mode was partially wrong, because of wrong bit
257            operations.
258    
259    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
260    
261            * NanoDOM.pm (dom_config): New attribute (do nothing), for
262            Whatpm::XML::Parser support.
263    
264    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
265    
266            * Makefile: New rule to make HTML/Tokenizer.pm is added.
267    
268            * HTML.pm.src: Tokenizer part moved to another file.
269    
270    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
271    
272            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
273    
274    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
275    
276            * HTML.pm.src: Element category constants redefined.
277            
278    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
279    
280            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
281            construction stage synced with the spec (HTML5 revisions 2139 and
282            2302).
283    
284    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
285    
286            * ContentChecker.pm: New error level "html5_fact" added, which
287            should be tentatively used until all of requirements are properly
288            specced as RFC 2119 "MUST" in HTML5.
289    
290    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
291    
292            * ContentChecker.pod: Note on internal flags is added.
293    
294    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
295    
296            * HTML.pm.src: An AAA bug fixed.
297    
298    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
299    
300            * HTML.pm.src: If another node is inserted by the parser, don't
301            reuse existing Text node to append a character (HTML5 revision
302            2124).
303    
304    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
305    
306            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
307            revisions 1731 and 2128).
308    
309    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
310    
311            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
312            Support for end tags of camelCase SVG elements were broken.  A
313            wrong error type text fixed.
314    
315    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
316    
317            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
318    
319    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
320    
321            * HTML.pm.src: Support for new definition of |param| and |source|
322            start tag parsing (HTML5 revision 1731).
323    
324    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
325    
326            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
327    
328    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
329    
330            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
331            revisions 1731 and 1831).
332    
333    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
334    
335            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
336            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
337            revision 1731).
338    
339    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
340    
341            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
342            revision 1731).  End tags of |option| and |optgroup| elements are
343            now optional (HTML5 revision 1731).
344    
345    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
346    
347            * HTML.pm.src: New "special" elements added to the list (HTML5
348            revision 1778).  "strile" -> "strike".
349    
350    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
351    
352            * ContentType.pm (get_sniffed_type): Support for the "better"
353            content sniffing (HTML5 revision 1927).  In a case the official
354            type was not returned when the method is invoked in the list
355            context.
356    
357    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
358    
359            * HTML.pm.src: Character references for non-space C0 characters,
360            including U+000B VT, DEL character, noncharacter code points, are
361            now converted to the U+FFFD character (cf. HTML5 revision 2138).
362    
363    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
364    
365            * ContentChecker.pm: |form=""| check support added.
366    
367    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
368    
369            * ContentChecker.pm: |contextmenu| validness is now checked using
370            |id| and |id_type| properties, and |menu| property is removed.
371    
372    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
373    
374            * ContentChecker.pm: Prepare for |form| |name| attribute's
375            duplication checking.
376    
377    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
378    
379            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
380            unsupported charset="" parameter value (HTML5 revision 2131).
381    
382    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
383    
384            * HTML.pm.src: Reminding places where U+000B is allowed as a space
385            character is fixed (cf. HTML5 revision 1738).
386    
387            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
388            space characters (HTML5 revision 1738).
389    
390    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
391    
392            * HTML.pm.src: The "anything else" case for the "after after body"
393            insertion mode was not updated to swtich to the "in body"
394            insertion mode.  U+000B is no longer a space character for the
395            purpose of tree construction phase (HTML5 revision 1738).
396    
397    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
398    
399            * HTML.pm.src: U+000B is no longer a space character (HTML5
400            revision 1738).
401    
402    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
403    
404            * ContentType.pm: 0x0B is no longer a space character (HTML5
405            revision 1738).
406    
407            * HTML.pm.src: U+000B is no longer a space character for the
408            algorithm for extracting an encoding from a Content-Type (HTML5
409            revision 1738).
410    
411    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
412    
413            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
414    
415    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
416    
417            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
418    
419            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
420            matching.
421    
422    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
423    
424            * LangTag.pm: Warn for private use language subtags.  Error level
425            typos fixed.  Support for Suppress-Script field.
426    
427            * mklangreg.pl: Support for dumping of nested structure.
428    
429    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
430    
431            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
432            recommended case as per RFC 4646.
433    
434    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
435    
436            * LangTag.pm (check_rfc4646_langtag): New method.
437    
438    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
439    
440            * mklangreg.pl: New script.
441    
442            * Makefile: Updated for creation of the module for language subtag
443            registry.
444            
445    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
446    
447            * Makefile: WebIDL.html added.
448    
449            * WebIDL.pod: New documentation.
450    
451    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
452    
453            * WebIDL.pm: Checker's error types are redefined.
454    
455    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
456    
457            * WebIDL.pm: Parser's error types are redefined.  Some forward
458            compatible parsing bugs are fixed.  Some unreachable codes are
459            commented out.
460    
461    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
462    
463            * WebIDL.pm: Support for the reminding extended attributes are
464            added.  It does not satisfy the definition that a forward
465            interface declaration has an extended attribute.  It seems that
466            unless explicitly allowed multiple extended attributes with the
467            same name is not allowed, though it is not explicitly mentioned in
468            the spec.
469    
470    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
471    
472            * WebIDL.pm: Unescapes extended attribute names and extended
473            attribute identifiers.  Preserve whether an extended attribute has
474            an argument list of not.  Support for extended attributes:
475            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
476            NameGetter, NameSetter, and Null.
477            (has_argument_list): New attribute.
478            (idl_text): Stringifies argument lists, if any, even if it is
479            empty.
480    
481    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
482    
483            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
484            |{s_kwd}| in DATA_STATE as default.
485    
486    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
487    
488            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
489            by |{s_kwd}| in DATA_STATE.
490    
491    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
492    
493            * HTML.pm.src: Shorten keys.
494    
495    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
496    
497            * HTML.pm.src: Remove checking for control character, surrogate
498            pair, or noncharacter code points and non-Unicode code
499            points (they should be handled by Whatpm::Charset::UnicodeChecker).
500            (parse_char_stream): Support for the |$get_wrapper| argument and
501            character stream error handlers.
502    
503    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
504    
505            * ContentChecker.pm: Don't call |loda_ns_module|
506            for null-namespace elements/attributes.
507    
508            * HTML.pm.src: Fact out $disallowed_control_chars
509            as a hash.
510    
511    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
512    
513            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
514            and |{next_char}| initializations are moved to initialization
515            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
516            with |parse_char_stream|.
517    
518    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
519    
520            * HTML.pm.src (parse_char_stream): Make |set_next_char|
521            invoke |manakai_read_until|, not only |read|, where
522            possible, to decrease the number of |read| method calls.
523    
524            * mkhtmlparser.pl: Related changes to the aforementioned
525            modification.
526    
527    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
528    
529            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
530            would report character error from now.
531    
532    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
533    
534            * HTML.pm.src: White-space-leaded non-white-space character
535            tokens in "before head insertion mode" was not
536            correctly handled.
537            (set_inner_html): Reimplemented using CharString decodehandle
538            class.  Support for $get_wrapper argument.  Support
539            for |{read_until}| feature.
540    
541    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
542    
543            * HTML.pm.src: Make a "bare ero" error for unknown
544            entities point the "&" character.
545    
546    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
547    
548            * HTML.pm.src: It turns out that U+FFFD don't have to
549            be added to the list of excluded characters.
550    
551    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
552    
553            * HTML.pm.src ($char_onerror): Have character decoder's |line|
554            and |column| a higher priority than the one set by the
555            tokenizer's input handler.
556            ($self->{read_until}): Exclude U+FFFD (but this might
557            not be necessary, since now we do line/column fixup in
558            the character decode handle).
559    
560    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
561    
562            * HTML.pm.src: Use |{read_until}| where possible.
563    
564    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
565    
566            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
567            and |manakai_getc_until| to |manakai_read_until| to
568            reduce the number of string copies.
569    
570    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
571    
572            * HTML.pm.src (parse_char_string): Use newly created
573            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
574            standard feature to |open| a string as a filehandle,
575            since Perl's string filehandle seems not supporting |ungetc|
576            method correctly.
577            (parse_char_stream): Define |{getc_until}| method.
578            (DATA_STATE): Experimental support for |getc_until| feature.
579    
580    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
581    
582            * HTML.pm.src: Check points added to newly added branches.
583    
584    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
585    
586            * HTML.pm.src: Remove |{char}|, which is no longer used.
587            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
588            and replaced by |{prev_state}|.
589    
590            * mkhtmlparser.pl: Remove |{char}| feature.
591            Remove |!!!back-next-input-character;| macro.
592    
593    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
594    
595            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
596            entity related tokenizer states in favor of new states
597            implementing the consume character reference algorithm.
598    
599    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
600    
601            * HTML.pm.src: "Consume a character reference" algorithm is
602            now implemented as a tokenizer's state, rather than
603            a method, with minimum changes (more changes will
604            be made, in due course).  "Bogus comment state"'s inner
605            loop gets removed.
606    
607    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
608    
609            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
610            into their own tokenizer states.
611    
612    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
613    
614            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
615            is split into three states.
616    
617    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
618    
619            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
620            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
621            no longer does the tokenizer have to push back next input
622            characters in those states.
623    
624    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
625    
626            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
627            into four states so that no longer does the tokenizer have to push
628            back next input characters in that state.
629    
630    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
631    
632            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
633            which can be used to insert some wrapper between the character
634            stream handle and the tokenizer.  (It is currently not supported
635            for |set_inner_html| for |Element|s).
636    
637    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
638    
639            * HTML.pm.src: Ignore punctuations in charset names.
640    
641    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
642    
643            * ContentChecker.pm: Support for charset-layer error levels.
644    
645            * HTML.pm.src: Don't specify |text| argument for the
646            |chardecode:fallback| error, since it is not the encoding
647            being used alternatively.
648    
649    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
650    
651            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
652    
653    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
654    
655            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
656    
657    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
658    
659            * HTML.pm.src: Bug fix and sync with the spec with regard
660            to after after frameset insertion mode processing (HTML5
661            revision 1909).  Note that the implementation was wrong
662            per the old spec before the r1909 changes.
663    
664    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
665    
666            * HTMLTable.pm: scope=auto algorithm fix synced with the
667            spec (HTML5 revision 2093).
668            ($process_row): Algorithm step numbers synced with the
669            spec (HTML5 revision 2092).
670    
671    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
672    
673            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
674            revision 2094).
675    
676    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
677    
678            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
679    
680    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
681    
682            * HTML.pm.src: '"' and "'" at the end of attribute
683            name (after another attribute) now raise parse error (HTML5
684            revision 2123).  Empty unquoted attribute values are no
685            longer allowed (HTML5 revision 2122).
686    
687    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
688    
689            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
690            revision 2130).
691    
692    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
693    
694            * ContentChecker.pm: |xml:lang| attribute value must be same
695            as |lang| attribute value for HTML elements (HTML5 revision 2062
696            and so on).
697    
698    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
699    
700            * ContentChecker.pm: Error level definition for |xml_id_error|
701            was missing.
702    
703            * URIChecker.pm: The end of the URL should be marked as the
704            error location for an empty path error.  The position
705            between the userinfo and the port components should be
706            marked as the error location for an empty host error.
707    
708    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
709    
710            * URIChecker.pm: Set parameters representing where in the
711            value the error occurs for errors.  Report unknown
712            address format error in warning level, since address
713            formats are rarely added.  Path segments starting with "/.."
714            were misinterpreted as a dot-segment.
715    
716    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
717    
718            * URIChecker.pm (check_iri_reference): Requires
719            |Message::DOM::DOMImplementation|.
720    
721    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
722    
723            * IMTChecker.pm: Updated for the new error reporting architecture.
724    
725            * ContentChecker.pm: Error levels for IMTs are added.
726    
727    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
728    
729            * H2H.pm (_shift_token): Support for unquoted HTML attribute
730            values.
731    
732    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
733    
734            * CacheManifest.pm: Support for new style of error
735            reports.
736    
737            * HTML.pm.src: Set line=1, column=1 to the document node.
738    
739    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
740    
741            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
742            and URL checkers.  Support for more error levels for bogus
743            langauge tag and URL "standards".
744    
745            * LangTag.pm, URIChecker.pm: Support for new style error
746            level reporting.
747    
748    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
749    
750            * ContentChecker.pm: Support for RDF/XML error levels.
751    
752            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
753            specifying.  Error types are revised.
754    
755    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
756    
757            * ContentChecker.pm: All error reporting method calls are
758            renewed.
759    
760    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
761    
762            * HTML.pm.src: All error type names and "text" parameters
763            are revised.  Use new style for "level" specification.
764    
765            * mkhtmlparser.pl: Use new style for "level" specification.
766    
767    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
768    
769            * WebIDL.pm (parse_char_string): Simplified error
770            reporting process for broken ignored valuetype definition.
771            (Valuetype idl_text): Support for special "DOMString" name.
772    
773    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
774    
775            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
776            terminal of the ScopedName is "DOMString", such that whether
777            the last part of the scoped name is "DOMString" or "_DOMString"
778            later.  It is necessary to determine whether a |typedef|
779            definition should be ignored or not.
780            (parse_char_string): Unescape the identifier of
781            exception members.
782            ($resolve): Return undef for builtin types and sequence<T>
783            types (we might not have to do this, however...).
784            (check): Support checking for Exceptions, Valuetypes,
785            and Typedefs.
786            ($serialize_type): Support for "DOMString::::" syntax.
787            (Typedef idl_text): Output Type as "DOMString" if it
788            is really "DOMString" (i.e. its internal representation
789            is "::DOMString::").
790    
791    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
792    
793            * WebIDL.pm ($resolve): New code, based on resolve code
794            for constant types in the |check| method.
795            (check): Support for checking of attributes, operations, and
796            arguments.
797            (Attribute/Operation idl_text): Exception names in getraises,
798            setraises, and raises clauses is serizlied by |$serialize_type|
799            code.
800    
801    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
802    
803            * WebIDL.pm ($integer): Order of selections are changed to match
804            hexadecimal numbers (the original pattern, taken from the spec,
805            was not work for hexadecimal numbers, because the "0" prefix
806            matches to the [0-7]* part (as an empty string) and therefore
807            it does not match with remaining "x..." part of a "0x..." integer
808            literal.
809            ($get_type): It now returns a string, not an array reference,
810            for regular types and |sequence| types (i.e. it in any case
811            returns a string).
812            ($get_next_token): The second item in the array that represents
813            a integer or float token is now a Perl number value, not the
814            original string representation of the number.
815            (check): Support for const value consistency checking.
816            No extended attribute is defined for constants.
817            (Node subclasses): Use simple strings rather than array references
818            for default data type values.
819            ($serialize_type): Type values are now simple strings.
820            (value): If the new attribute value is a false value, then
821            a FALSE value is set to the attribute.
822    
823    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
824    
825            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
826            in its stringified format ("scoped name" as defined in the
827            spec).  Note that future version of this module should not use
828            array references for type values and the |type_text| attribute
829            should be made obsolete.
830            (parse_char_string): Unescape attribute names.
831            (check): Support for checking of whether inherited interfaces
832            are actually defined or not.  Support for checking of whether
833            interface member identifiers are duplicated or not.
834            ($serialize_type): Scoped names are returned as is.  A future
835            version of this code should escape identifiers other than "DOMString",
836            otherwise the idl_text would be non-conforming.
837    
838    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
839    
840            * WebIDL.pm (parse_char_string): Set line/column numbers
841            to generated nodes.  Unescape identifiers.  Extended attributes
842            for Definition's were ignored.
843            (append_child): Set |parent_node| attribute.
844            (parent_node): New attribute.
845            (check): Support interface/exception members.  Support
846            extended attributes.  Support definition identifier uniqueness
847            constraint.
848            (qualified_name): New attribute.
849            (Interface/Exception idl_text): Extended attributes were
850            not prepended to the returned text.
851    
852    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
853    
854            * WebIDL.pm (parse_char_string): Set line/column numbers
855            to interface object experimentally.  s/shift/pop/g, shift
856            would make things wrong.  Support for interface forward
857            declarations was missing.  Broken interface declarations
858            with no block were not ignored entirely.
859            (Whatpm::WebIDL::Node): New abstract class.  This class
860            makes things easier.
861            (child_nodes): New attribute.  Unlike DOM's attribute with
862            same name, this attribute returns a dead list of nodes for
863            simplicity.
864            (get_user_data, set_user_data): New methods.
865            (Module idl_text): A SPACE character should be inserted
866            before the |{| character.
867            (Interface idl_text): Support for interface forward declarations.
868            (is_forward_declaration): New attribute.
869    
870    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
871    
872            * WebIDL.pm (type_text): Better serializer.
873    
874    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
875    
876            * WebIDL.pm: Revise forward-compatible parsing so that
877            it now can handle broken extended attributes and as such.
878    
879    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
880    
881            * WebIDL.pm: Real support for extended attributes.
882            Support for extended attributes with arguments.
883    
884    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
885    
886            * WebIDL.pm: Support for |exception| syntax.
887            (Interface->idl_text): Tentative support for inheritances.
888    
889    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
890    
891            * WebIDL.pm: Hierarchical scoped name support was broken.
892            Support for raises, setraises, and getraises syntaxes.
893    
894    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
895    
896            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
897            proper support for types, extended attributes, and exceptions yet).
898            WebIDL parser, version 1 (no support for exceptions yet,
899            no proper support for extended attributes yet).
900    
901    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
902    
903            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
904    
905    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
906    
907            * WebIDL.pm: Support for valuetype and const.
908    
909    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
910            
911            * WebIDL.pm: New module.
912    
913    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
914    
915            * Makefile (Entities.html): URI changed.
916    
917    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
918    
919            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
920    
921    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
922    
923            * HTML.pm.src (_get_next_token): A parse error was missing.
924    
925    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
926    
927            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
928            spec (commented out). (HTML5 revision 1711).
929    
930    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
931    
932            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
933    
934            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
935            as UTF-16 (HTML5 revision 1701).
936    
937    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
938    
939            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
940            1692).
941    
942    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
943    
944            * HTML.pm.src: The secondary insertion mode used when switching
945            to foreign content is the "in body" insertion mode (HTML5 revision
946            1696).
947    
948    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
949    
950            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
951            revision 1697).
952    
953    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
954    
955            * HTML.pm.src: Support for end-of-file token in foreign content
956            insertion mode (HTML5 revision 1693).  Update SVG camelCase
957            attribute list (HTML5 revision 1700).  <textarea> closes
958            </select> (HTML5 revision 1699).  More start tags close in
959            foreign content insertion mode (HTML5 revision 1698).
960    
961    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
962    
963            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
964    
965    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
966    
967            * HTML.pm.src: More robust charset parameter detection (HTML5
968            revision 1674).
969    
970    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
971    
972            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
973            revision 1676).
974    
975    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
976    
977            * HTML.pm.src: Ignore language part of public identifiers for
978            quriks mode detection (HTML5 revision 1679).
979    
980    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
981    
982            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
983            revision 1685).
984    
985    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
986    
987            * HTML.pm.src: Support for EOF in new states for tags (HTML5
988            revision 1684).
989    
990    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
991    
992            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
993            work (HTML5 revision 1690).
994    
995    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
996    
997            * HTML.pm.src (_tree_construction_main): Change handling of
998            end tags in head insertion modes (HTML5 revision 1686).
999            (parse_char_string): Bug fix for non-utf8 character string handlings.
1000            (parse_char_stream): |ungetc| does not work well for this context.
1001    
1002    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
1003    
1004            * HTML.pm.src (parse_byte_string): Redefined to invoke
1005            |parse_byte_stream|.
1006            (parse_byte_stream): New method.
1007    
1008    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
1009    
1010            * HTML.pm.src (parse_byte_string): Fix the column number reported
1011            by encoding layer error reporter.
1012    
1013    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
1014    
1015            * HTML.pm.src (parse_byte_string): Use streaming decoder
1016            rather than converting the whole byte string and then parsing.
1017            Propagate errors in character encoding layer.
1018            (get_next_token): Precise error reporting for |bare stago| error.
1019    
1020    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
1021    
1022            * HTML.pm.src (parse_char_stream): New method.
1023            (parse_char_string): This method is now defined as an invocation
1024            of the |parse_char_stream| method.
1025    
1026    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
1027    
1028            * HTML.pm.src (parse_byte_string): Report various status
1029            of the sniffing as info-level errors.  Support for new
1030            decoding framework in parser resestting.
1031            (new): Various default error levels were not set.
1032    
1033    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
1034    
1035            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
1036            algorithm, except for the actual sniffing, is implemented
1037            with new framework with Message::Charset::Info.
1038    
1039    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
1040    
1041            * CacheManifest.pm (_parse): Drop fragment identifiers from
1042            URIs in fallback section (HTML5 revision 1596).
1043    
1044    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
1045    
1046            * Makefile (Entities.html): URI has changed.
1047    
1048    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
1049    
1050            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
1051            1553).
1052    
1053    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
1054    
1055            * ContentChecker.pm: Noted that those returned in |table| are
1056            no longer table elements, but table objects returned
1057            by Whatpm::HTMLTable.
1058    
1059            * HTMLTable.pm (form_table): Return table element node
1060            as |$table->{element}|.
1061            (assign_header): Support for the |headers=""| attribute.
1062    
1063    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
1064    
1065            * HTMLTable.pm (assign_header): New function; first version
1066            with no support for headers="".
1067            (form_table): Include table width and height to the returned
1068            table object for covenience.  Indexing in column assignement was wrong.
1069            Set whether a data cell is empty or not for convenience.
1070    
1071    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1072    
1073            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
1074    
1075    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1076    
1077            * HTMLTable.pm: How table model errors are detected is
1078            changed (HTML5 revision 1387).
1079    
1080    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1081    
1082            * HTMLTable.pm: The algorithm now moves |tfoot| elements
1083            to the end of the table (HTML5 revision 1380).
1084    
1085    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1086    
1087            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
1088            of 1-based (HTML5 revision 1376).
1089    
1090    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1091    
1092            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
1093            and "Content-Encoding" no longer prevent sniffing (HTML5
1094            revision 1288).
1095    
1096    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
1097    
1098            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
1099            1282).
1100    
1101    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
1102    
1103            * ContentChecker.pm: Support for global attributes.
1104            Status of XML specs are added.
1105    
1106    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
1107    
1108            * ContentChecker.pm (check_element): Support for |template=""|
1109            and |ref=""| attribute (referent element type checking).
1110    
1111    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
1112    
1113            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
1114            1500) is implemented (except for IDNA part and URI-scheme-specific
1115            knowledge).  Line number counting was wrong for LF-only
1116            documents.
1117    
1118    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1119    
1120            * HTML.pm.src: Raise an parse error for any disallowed
1121            character (HTML5 revision 1263).
1122    
1123    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1124    
1125            * mkentitylist.pl: Support for new HTML5 entity table format (the
1126            definition for |AElig;| was missing).
1127    
1128    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1129    
1130            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
1131            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
1132    
1133    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1134    
1135            * HTML.pm.src: List of element names that close foreign content
1136            insertion mode is added (HTML5 revisions 1412 and 1418).
1137    
1138    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1139    
1140            * HTML.pm.src: Support for |mglyph| and |malignmark|
1141            elements (HTML5 revision 1410).
1142    
1143    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1144    
1145            * HTML.pm.src: Support for new long MathML entities (HTML5
1146            revision 1406).
1147    
1148    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1149    
1150            * HTML.pm.src: CDATA section support for MathML and SVG
1151            elements (HTML5 revision 1404 and 1420).
1152    
1153    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
1154    
1155            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
1156            elements (HTML5 revision 1404).  Unused !!!macro definitions
1157            are removed.
1158    
1159    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
1160    
1161            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
1162            are raised is changed (HTML5 revision 1404).
1163    
1164    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
1165    
1166            * HTML.pm.src: Category sets that are no longer used are removed.
1167    
1168    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
1169    
1170            * HTML.pm.src: The ->[1] property of stack entries are now
1171            replaced by constants representing element category.
1172    
1173    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
1174    
1175            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
1176            for error reporting.  (This is a preparation for using constant
1177            value for ->[1].)
1178    
1179    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
1180    
1181            * RDFXML.pm: Typo fixed.
1182    
1183    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
1184    
1185            * ContentChecker.pm: |fact_level| is now treated
1186            as same as |must_level|, i.e. level = |m|.
1187            (check_element): Make list of URIs in the DOM.
1188    
1189    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1190    
1191            * RDFXML.pm: Language accessor implemented.
1192            Local (null-namespace) attribute support.
1193    
1194    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1195    
1196            * RDFXML.pm: Factored out ID checking code.
1197    
1198    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1199    
1200            * RDFXML.pm: TODO items noted.  Validation of ID
1201            and URI attributes is implemented.  Warn if unknown
1202            value is used in rdf:parseType="" attribute.
1203    
1204            * URIChecker.pm (check_rdf_uri_reference): New function.
1205    
1206    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1207    
1208            * RDFXML.pm: bnodeid implemented.  Relative references
1209            are now resolved.
1210    
1211    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1212    
1213            * ContentChecker.pm: RDF reification implemented.
1214    
1215            * RDFXML.pm: undef vs false bug fixed.
1216            Reification implemented.
1217    
1218    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1219    
1220            * RDFXML.pm: s/id/ID/ for attribute name.
1221            The |node| arguments are added for |ontriple| calls.
1222            Too many "attribute not allowed" errors were raised.
1223    
1224            * ContentChecker.pm: Initial experimental support for rdf:RDF
1225            element.
1226    
1227    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1228    
1229            * RDFXML.pm: New module.
1230    
1231    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1232    
1233            * HTML.pm.src (set_inner_html): Line/column number
1234            code was old one yet.
1235    
1236    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1237    
1238            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
1239            errors.  Line/column markings for DOCTYPE, comment, and
1240            character tokens are reintroduced; otherwise,
1241            error location for "not HTML5" error and errors
1242            for implied elements are not attached.
1243    
1244    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1245    
1246            * HTML.pm.src: Set line/column numbers to attributes.
1247    
1248            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
1249            (value): Setter implemented.
1250    
1251            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
1252    
1253    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1254    
1255            * HTML.pm.src: Unused line/column markings are removed.
1256    
1257    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1258    
1259            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
1260            flag, which is no longer used.
1261    
1262    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
1263    
1264            * HTML.pm.src: Set line/column information to element nodes.
1265    
1266            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
1267            and !!!insert-element-t): Set line/column information to
1268            element nodes.
1269    
1270    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
1271    
1272            * HTML.pm.src (_get_next_token): The first "<" character
1273            in "<?", "<>", or "</>" should be the error point.
1274    
1275    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1276    
1277            * HTML.pm.src: Some more fixes on error position reporting.
1278    
1279    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1280    
1281            * HTML.pm.src: Token-level precious error reporting.
1282    
1283    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1284    
1285            * HTML.pm.src: Preparation for more precious error point
1286            reporting.
1287    
1288    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1289    
1290            * HTML.pm.src: Error type revised.
1291    
1292    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1293    
1294            * HTML.pm.src: Similar codes are merged together, again.
1295    
1296    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1297    
1298            * HTML.pm.src: Similar codes are merged together.
1299    
1300    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
1301    
1302            * mkhtmlparser.pl: Set "level" parameter to parse errors.
1303    
1304            * HTML.pm.src: Code refined.
1305    
1306    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1307    
1308            * HTML.pm.src: |</body>| treatement has been changed (HTML5
1309            revision 1348).  Note that I really don't know this makes
1310            any difference in the black-box behavior of the parser.
1311    
1312    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1313    
1314            * HTML.pm.src: New end-of-file token implementation (HTML5
1315            revision 1348).
1316    
1317    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1318    
1319            * HTML.pm.src: |applet| support (HTML5 revision 1347).
1320    
1321    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1322    
1323            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1324    
1325    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1326    
1327            * HTML.pm.src: Support for |<input>| in the "in select" insertion
1328            mode and support for the "in select in table" insertion mode (HTML5
1329            revision 1342).
1330    
1331    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1332    
1333            * HTML.pm.src: No foster parenting for <script> and <script>
1334            in non-tainted <table>s (HTML5 revision 1336).
1335    
1336    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1337    
1338            * HTML.pm.src: Ignore white space characters between <html>
1339            and <head> (HTML5 revision 1332).
1340    
1341    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1342    
1343            * HTML.pm.src: Treat <input type=hidden> as if it were a
1344            white space (HTML5 revision 1331).
1345    
1346    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1347    
1348            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1349            element (HTML5 revision 1330).
1350    
1351    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1352    
1353            * HTML.pm.src: <title> is always appended to the current
1354            element (HTML5 revision 1328).
1355    
1356    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1357    
1358            * HTML.pm.src: White space in tainted tables are moved
1359            into foster parents (HTML5 revision 1326).
1360    
1361    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1362    
1363            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1364            revision 1321).
1365    
1366    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1367    
1368            * HTML.pm.src: |</h/n/>| case code rearranged to align with
1369            the spec (HTML5 revision 1320).  Note that we finally complete
1370            all of HTML5 revision 1320 changes.
1371    
1372    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1373    
1374            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1375            tags (HTML5 revision 1320).
1376    
1377    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1378    
1379            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1380    
1381    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1382    
1383            * HTML.pm.src: A "generate implied end tags" code (t409.1)
1384            could not be reached so that it is now removed (HTML5 revision
1385            1320).
1386    
1387    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1388    
1389            * HTML.pm.src: Code for the case of |</div>| and so on are revised
1390            to align with new spec text (HTML5 revision 1320).
1391    
1392    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1393    
1394            * HTML.pm.src: Remove strange |if| condition; however, it should
1395            have had no harm in theory.
1396    
1397    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1398    
1399            * HTML.pm.src (_tree_construction_main): '</p>' in body
1400            case is split from other end tags for the preparation
1401            of implementing HTML revision 1320.
1402    
1403    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1404    
1405            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1406            revision 1320).
1407    
1408    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1409    
1410            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1411            and "div" (HTML5 revision 1318).  Add comments to where
1412            |form| pointer association codes should be inserted (HTML5 revision
1413            1319).
1414    
1415    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
1416    
1417            * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1418    
1419    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1420    
1421            * HTML.pm.src: Since the case t268 should never be reached (no
1422            other token type, there are), it is replaced by a |die| statement.
1423    
1424    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1425    
1426            * HTML.pm.src: Typo fixed.
1427    
1428    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
1429    
1430            * HTML.pm.src (_tree_construction_initial): Some limited quirks
1431            doctypes were not uppercased for comparison.
1432    
1433    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1434    
1435            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1436            are added.
1437    
1438    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1439    
1440            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1441            are set.  Cases that are unlikely reached are noted as so.
1442    
1443    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1444    
1445            * HTML.pm.src: Checkpoints for debugging are added.
1446    
1447            * mkhtmlparser.pl: Support for |!!!cp| syntax.
1448    
1449    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1450    
1451            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1452            for simplicity.
1453    
1454    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1455    
1456            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1457            revision 1307).
1458    
1459    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1460    
1461            * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1462            is set are changed (HTML5 revision 1305).
1463    
1464    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1465    
1466            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1467            1292).  Entities are not parsed in comment-like part in RCDATA
1468            elements (HTML5 revision 1294).  Allow bare & at the end
1469            of attribute value literals (HTML5 revision 1296).  More
1470            quirks mode doctypes (HTML5 revision 1302).  Requires spaces
1471            between attributes and ban attribute names or unquoted
1472            attribute values containing single or double quotes (HTML5
1473            revision 1303).
1474    
1475    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1476    
1477            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
1478            and related errors unless it is an HTML document (though the spec
1479            is unclear on whether it is applied to XHTML document).
1480    
1481            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1482            are added.
1483    
1484    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1485    
1486            * HTML.pm.src: s/local_name/manakai_local_name/g.
1487    
1488    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
1489    
1490            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1491    
1492            * HTML.pm.src: |charset| in |content| attribute is
1493            case-insensitive (HTML5 revision 1270).
1494    
1495    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
1496    
1497            * ContentChecker.pm: New status constants are added.
1498            ($ElementDefault): |status| added.
1499            (check_element): Err for non-standard or deprecated elements.
1500            (_attr_status_info): For non-standard or deprecated attributes.
1501    
1502    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1503    
1504            * ContentChecker.pm (_attr_status_info): New internal method.
1505    
1506    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1507    
1508            * ContentChecker.pm (check_element): Element standardized
1509            status information is now dispatched.
1510    
1511    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1512    
1513            * ContentChecker.pm (check_element): Fix |del|-and-significant
1514            problem by adding some more arguments.
1515    
1516    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1517    
1518            * ContentChecker.pm (check_element): Use context of
1519            container-for-the-purpose-of-content-model element (not transparent
1520            element) for |check_child_element| calling and significant
1521            text flag marking.  This reintroduces |<del>|-and-significant
1522            problem again.
1523    
1524    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1525    
1526            * ContentChecker.pm (check_element): Make semi-transparent
1527            elements ignored for the purpose of phase changes in
1528            content model checking.
1529    
1530  2008-02-23  Wakaba  <wakaba@suika.fam.cx>  2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1531    
1532          * ContentChecker.pm (check_element): In-element state          * ContentChecker.pm (check_element): In-element state

Legend:
Removed from v.1.141  
changed lines
  Added in v.1.400

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24