/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.178 by wakaba, Sun Mar 9 05:12:04 2008 UTC revision 1.372 by wakaba, Thu Jul 2 22:08:36 2009 UTC
# Line 1  Line 1 
1    2009-07-03  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src (parse_byte_stream): Don't change to another
4            encoding if the input is UTF-16 (HTML5 revision 3205).
5    
6    2009-06-28  Wakaba  <wakaba@suika.fam.cx>
7    
8            * HTML.pm.src: |isindex| should no longer generate |p| element
9            node (HTML5 revision 3236).
10    
11    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
12    
13            * ContentChecker.pm: Introduced new flag |has_autofocus|.  Added
14            support for new |id_type| value of |any|, which means any kind of
15            element is allowed as a referenced element.
16    
17    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
18    
19            * ContentChecker.pm: Introduced |check_attrs2| method for the ease
20            of defining codes for checking required attribute and as such.
21    
22    2008-12-12  Wakaba  <wakaba@suika.fam.cx>
23    
24            * IMTChecker.pm: Added more definitions for subtypes.
25    
26    2008-12-11  Wakaba  <wakaba@suika.fam.cx>
27    
28            * URIChecker.pm: Some of |pos_end| values were wrong.
29    
30    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
31    
32            * ContentChecker.pm (check_element): Added support for "no
33            referenced datalist" error.
34    
35    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
36    
37            * URIChecker.pm: Bug fix: It did not work unless
38            Message::DOM::DOMImplementation has been |require|d.
39    
40    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
41    
42            * NanoDOM.pm (document_uri): New attribute.
43    
44            * ContentChecker.pm: Don't use methods not implemented by NanoDOM.
45    
46    2008-11-07  Wakaba  <wakaba@suika.fam.cx>
47    
48            * NanoDOM.pm (text_content): Don't create a Text node if the new
49            value is empty.
50    
51    2008-11-06  Wakaba  <wakaba@suika.fam.cx>
52    
53            * SWML/: New directory.
54    
55    2008-10-20  Wakaba  <wakaba@suika.fam.cx>
56    
57            * NanoDOM.pm (specified, all_declarations_processed,
58            manakai_attribute_type): New attributes.
59    
60    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
61    
62            * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
63            array.
64    
65    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
66    
67            * NanoDOM.pm (notation_name): New attribute.
68    
69    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
70    
71            * NanoDOM.pm (public_id, system_id): New attributes.a
72    
73    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
74    
75            * NanoDOM.pm (text_content): Moved to Node from Element.  Setter
76            implemented.
77            (allowed_tokens, default_type, declared_type): Implemented.
78    
79    2008-10-17  Wakaba  <wakaba@suika.fam.cx>
80    
81            * NanoDOM.pm (node_name): New attribute.
82            (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
83            constants.
84            (create_element_type_definition_node, create_attribute_definition,
85            create_notation, create_general_entity,
86            get_element_type_definition_node,
87            set_element_type_definition_node, get_general_entity_node,
88            set_general_entity_node, get_notation_node, set_notation_node,
89            get_attribute_definition_node, set_attribute_definition_node): New
90            methods.
91            (element_types, entities, notations, attribute_definitions): New
92            attributes.
93            (DocumentType): Support for child nodes, entities, notations, and
94            element types.
95            (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
96            New classes.
97    
98            * Dumper.pm: Support for general entities, notations, element type
99            definitions, and attribute definitions.
100    
101    2008-10-15  Wakaba  <wakaba@suika.fam.cx>
102    
103            * NanoDOM.pm (create_processing_instruction): New method.
104            (xml_version, xml_encoding, xml_standalone): New attributes.
105            (ProcessingInstruction): New class.
106    
107    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
108    
109            * HTML.pm.src: Handling of end tags in the foreign content
110            insertion mode was partially wrong, because of wrong bit
111            operations.
112    
113    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
114    
115            * NanoDOM.pm (dom_config): New attribute (do nothing), for
116            Whatpm::XML::Parser support.
117    
118    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
119    
120            * Makefile: New rule to make HTML/Tokenizer.pm is added.
121    
122            * HTML.pm.src: Tokenizer part moved to another file.
123    
124    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
125    
126            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
127    
128    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
129    
130            * HTML.pm.src: Element category constants redefined.
131            
132    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
133    
134            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
135            construction stage synced with the spec (HTML5 revisions 2139 and
136            2302).
137    
138    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
139    
140            * ContentChecker.pm: New error level "html5_fact" added, which
141            should be tentatively used until all of requirements are properly
142            specced as RFC 2119 "MUST" in HTML5.
143    
144    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
145    
146            * ContentChecker.pod: Note on internal flags is added.
147    
148    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
149    
150            * HTML.pm.src: An AAA bug fixed.
151    
152    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
153    
154            * HTML.pm.src: If another node is inserted by the parser, don't
155            reuse existing Text node to append a character (HTML5 revision
156            2124).
157    
158    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
159    
160            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
161            revisions 1731 and 2128).
162    
163    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
164    
165            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
166            Support for end tags of camelCase SVG elements were broken.  A
167            wrong error type text fixed.
168    
169    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
170    
171            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
172    
173    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
174    
175            * HTML.pm.src: Support for new definition of |param| and |source|
176            start tag parsing (HTML5 revision 1731).
177    
178    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
179    
180            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
181    
182    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
183    
184            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
185            revisions 1731 and 1831).
186    
187    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
188    
189            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
190            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
191            revision 1731).
192    
193    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
194    
195            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
196            revision 1731).  End tags of |option| and |optgroup| elements are
197            now optional (HTML5 revision 1731).
198    
199    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
200    
201            * HTML.pm.src: New "special" elements added to the list (HTML5
202            revision 1778).  "strile" -> "strike".
203    
204    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
205    
206            * ContentType.pm (get_sniffed_type): Support for the "better"
207            content sniffing (HTML5 revision 1927).  In a case the official
208            type was not returned when the method is invoked in the list
209            context.
210    
211    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
212    
213            * HTML.pm.src: Character references for non-space C0 characters,
214            including U+000B VT, DEL character, noncharacter code points, are
215            now converted to the U+FFFD character (cf. HTML5 revision 2138).
216    
217    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
218    
219            * ContentChecker.pm: |form=""| check support added.
220    
221    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
222    
223            * ContentChecker.pm: |contextmenu| validness is now checked using
224            |id| and |id_type| properties, and |menu| property is removed.
225    
226    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
227    
228            * ContentChecker.pm: Prepare for |form| |name| attribute's
229            duplication checking.
230    
231    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
232    
233            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
234            unsupported charset="" parameter value (HTML5 revision 2131).
235    
236    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
237    
238            * HTML.pm.src: Reminding places where U+000B is allowed as a space
239            character is fixed (cf. HTML5 revision 1738).
240    
241            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
242            space characters (HTML5 revision 1738).
243    
244    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
245    
246            * HTML.pm.src: The "anything else" case for the "after after body"
247            insertion mode was not updated to swtich to the "in body"
248            insertion mode.  U+000B is no longer a space character for the
249            purpose of tree construction phase (HTML5 revision 1738).
250    
251    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
252    
253            * HTML.pm.src: U+000B is no longer a space character (HTML5
254            revision 1738).
255    
256    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
257    
258            * ContentType.pm: 0x0B is no longer a space character (HTML5
259            revision 1738).
260    
261            * HTML.pm.src: U+000B is no longer a space character for the
262            algorithm for extracting an encoding from a Content-Type (HTML5
263            revision 1738).
264    
265    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
266    
267            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
268    
269    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
270    
271            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
272    
273            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
274            matching.
275    
276    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
277    
278            * LangTag.pm: Warn for private use language subtags.  Error level
279            typos fixed.  Support for Suppress-Script field.
280    
281            * mklangreg.pl: Support for dumping of nested structure.
282    
283    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
284    
285            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
286            recommended case as per RFC 4646.
287    
288    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
289    
290            * LangTag.pm (check_rfc4646_langtag): New method.
291    
292    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
293    
294            * mklangreg.pl: New script.
295    
296            * Makefile: Updated for creation of the module for language subtag
297            registry.
298            
299    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
300    
301            * Makefile: WebIDL.html added.
302    
303            * WebIDL.pod: New documentation.
304    
305    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
306    
307            * WebIDL.pm: Checker's error types are redefined.
308    
309    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
310    
311            * WebIDL.pm: Parser's error types are redefined.  Some forward
312            compatible parsing bugs are fixed.  Some unreachable codes are
313            commented out.
314    
315    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
316    
317            * WebIDL.pm: Support for the reminding extended attributes are
318            added.  It does not satisfy the definition that a forward
319            interface declaration has an extended attribute.  It seems that
320            unless explicitly allowed multiple extended attributes with the
321            same name is not allowed, though it is not explicitly mentioned in
322            the spec.
323    
324    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
325    
326            * WebIDL.pm: Unescapes extended attribute names and extended
327            attribute identifiers.  Preserve whether an extended attribute has
328            an argument list of not.  Support for extended attributes:
329            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
330            NameGetter, NameSetter, and Null.
331            (has_argument_list): New attribute.
332            (idl_text): Stringifies argument lists, if any, even if it is
333            empty.
334    
335    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
336    
337            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
338            |{s_kwd}| in DATA_STATE as default.
339    
340    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
341    
342            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
343            by |{s_kwd}| in DATA_STATE.
344    
345    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
346    
347            * HTML.pm.src: Shorten keys.
348    
349    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
350    
351            * HTML.pm.src: Remove checking for control character, surrogate
352            pair, or noncharacter code points and non-Unicode code
353            points (they should be handled by Whatpm::Charset::UnicodeChecker).
354            (parse_char_stream): Support for the |$get_wrapper| argument and
355            character stream error handlers.
356    
357    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
358    
359            * ContentChecker.pm: Don't call |loda_ns_module|
360            for null-namespace elements/attributes.
361    
362            * HTML.pm.src: Fact out $disallowed_control_chars
363            as a hash.
364    
365    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
366    
367            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
368            and |{next_char}| initializations are moved to initialization
369            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
370            with |parse_char_stream|.
371    
372    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
373    
374            * HTML.pm.src (parse_char_stream): Make |set_next_char|
375            invoke |manakai_read_until|, not only |read|, where
376            possible, to decrease the number of |read| method calls.
377    
378            * mkhtmlparser.pl: Related changes to the aforementioned
379            modification.
380    
381    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
382    
383            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
384            would report character error from now.
385    
386    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
387    
388            * HTML.pm.src: White-space-leaded non-white-space character
389            tokens in "before head insertion mode" was not
390            correctly handled.
391            (set_inner_html): Reimplemented using CharString decodehandle
392            class.  Support for $get_wrapper argument.  Support
393            for |{read_until}| feature.
394    
395    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
396    
397            * HTML.pm.src: Make a "bare ero" error for unknown
398            entities point the "&" character.
399    
400    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
401    
402            * HTML.pm.src: It turns out that U+FFFD don't have to
403            be added to the list of excluded characters.
404    
405    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
406    
407            * HTML.pm.src ($char_onerror): Have character decoder's |line|
408            and |column| a higher priority than the one set by the
409            tokenizer's input handler.
410            ($self->{read_until}): Exclude U+FFFD (but this might
411            not be necessary, since now we do line/column fixup in
412            the character decode handle).
413    
414    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
415    
416            * HTML.pm.src: Use |{read_until}| where possible.
417    
418    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
419    
420            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
421            and |manakai_getc_until| to |manakai_read_until| to
422            reduce the number of string copies.
423    
424    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
425    
426            * HTML.pm.src (parse_char_string): Use newly created
427            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
428            standard feature to |open| a string as a filehandle,
429            since Perl's string filehandle seems not supporting |ungetc|
430            method correctly.
431            (parse_char_stream): Define |{getc_until}| method.
432            (DATA_STATE): Experimental support for |getc_until| feature.
433    
434    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
435    
436            * HTML.pm.src: Check points added to newly added branches.
437    
438    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
439    
440            * HTML.pm.src: Remove |{char}|, which is no longer used.
441            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
442            and replaced by |{prev_state}|.
443    
444            * mkhtmlparser.pl: Remove |{char}| feature.
445            Remove |!!!back-next-input-character;| macro.
446    
447    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
448    
449            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
450            entity related tokenizer states in favor of new states
451            implementing the consume character reference algorithm.
452    
453    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
454    
455            * HTML.pm.src: "Consume a character reference" algorithm is
456            now implemented as a tokenizer's state, rather than
457            a method, with minimum changes (more changes will
458            be made, in due course).  "Bogus comment state"'s inner
459            loop gets removed.
460    
461    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
462    
463            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
464            into their own tokenizer states.
465    
466    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
467    
468            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
469            is split into three states.
470    
471    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
472    
473            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
474            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
475            no longer does the tokenizer have to push back next input
476            characters in those states.
477    
478    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
479    
480            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
481            into four states so that no longer does the tokenizer have to push
482            back next input characters in that state.
483    
484    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
485    
486            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
487            which can be used to insert some wrapper between the character
488            stream handle and the tokenizer.  (It is currently not supported
489            for |set_inner_html| for |Element|s).
490    
491    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
492    
493            * HTML.pm.src: Ignore punctuations in charset names.
494    
495    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
496    
497            * ContentChecker.pm: Support for charset-layer error levels.
498    
499            * HTML.pm.src: Don't specify |text| argument for the
500            |chardecode:fallback| error, since it is not the encoding
501            being used alternatively.
502    
503    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
504    
505            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
506    
507    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
508    
509            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
510    
511    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
512    
513            * HTML.pm.src: Bug fix and sync with the spec with regard
514            to after after frameset insertion mode processing (HTML5
515            revision 1909).  Note that the implementation was wrong
516            per the old spec before the r1909 changes.
517    
518    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
519    
520            * HTMLTable.pm: scope=auto algorithm fix synced with the
521            spec (HTML5 revision 2093).
522            ($process_row): Algorithm step numbers synced with the
523            spec (HTML5 revision 2092).
524    
525    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
526    
527            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
528            revision 2094).
529    
530    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
531    
532            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
533    
534    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
535    
536            * HTML.pm.src: '"' and "'" at the end of attribute
537            name (after another attribute) now raise parse error (HTML5
538            revision 2123).  Empty unquoted attribute values are no
539            longer allowed (HTML5 revision 2122).
540    
541    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
542    
543            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
544            revision 2130).
545    
546    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
547    
548            * ContentChecker.pm: |xml:lang| attribute value must be same
549            as |lang| attribute value for HTML elements (HTML5 revision 2062
550            and so on).
551    
552    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
553    
554            * ContentChecker.pm: Error level definition for |xml_id_error|
555            was missing.
556    
557            * URIChecker.pm: The end of the URL should be marked as the
558            error location for an empty path error.  The position
559            between the userinfo and the port components should be
560            marked as the error location for an empty host error.
561    
562    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
563    
564            * URIChecker.pm: Set parameters representing where in the
565            value the error occurs for errors.  Report unknown
566            address format error in warning level, since address
567            formats are rarely added.  Path segments starting with "/.."
568            were misinterpreted as a dot-segment.
569    
570    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
571    
572            * URIChecker.pm (check_iri_reference): Requires
573            |Message::DOM::DOMImplementation|.
574    
575    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
576    
577            * IMTChecker.pm: Updated for the new error reporting architecture.
578    
579            * ContentChecker.pm: Error levels for IMTs are added.
580    
581    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
582    
583            * H2H.pm (_shift_token): Support for unquoted HTML attribute
584            values.
585    
586    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
587    
588            * CacheManifest.pm: Support for new style of error
589            reports.
590    
591            * HTML.pm.src: Set line=1, column=1 to the document node.
592    
593    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
594    
595            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
596            and URL checkers.  Support for more error levels for bogus
597            langauge tag and URL "standards".
598    
599            * LangTag.pm, URIChecker.pm: Support for new style error
600            level reporting.
601    
602    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
603    
604            * ContentChecker.pm: Support for RDF/XML error levels.
605    
606            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
607            specifying.  Error types are revised.
608    
609    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
610    
611            * ContentChecker.pm: All error reporting method calls are
612            renewed.
613    
614    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
615    
616            * HTML.pm.src: All error type names and "text" parameters
617            are revised.  Use new style for "level" specification.
618    
619            * mkhtmlparser.pl: Use new style for "level" specification.
620    
621    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
622    
623            * WebIDL.pm (parse_char_string): Simplified error
624            reporting process for broken ignored valuetype definition.
625            (Valuetype idl_text): Support for special "DOMString" name.
626    
627    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
628    
629            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
630            terminal of the ScopedName is "DOMString", such that whether
631            the last part of the scoped name is "DOMString" or "_DOMString"
632            later.  It is necessary to determine whether a |typedef|
633            definition should be ignored or not.
634            (parse_char_string): Unescape the identifier of
635            exception members.
636            ($resolve): Return undef for builtin types and sequence<T>
637            types (we might not have to do this, however...).
638            (check): Support checking for Exceptions, Valuetypes,
639            and Typedefs.
640            ($serialize_type): Support for "DOMString::::" syntax.
641            (Typedef idl_text): Output Type as "DOMString" if it
642            is really "DOMString" (i.e. its internal representation
643            is "::DOMString::").
644    
645    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
646    
647            * WebIDL.pm ($resolve): New code, based on resolve code
648            for constant types in the |check| method.
649            (check): Support for checking of attributes, operations, and
650            arguments.
651            (Attribute/Operation idl_text): Exception names in getraises,
652            setraises, and raises clauses is serizlied by |$serialize_type|
653            code.
654    
655    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
656    
657            * WebIDL.pm ($integer): Order of selections are changed to match
658            hexadecimal numbers (the original pattern, taken from the spec,
659            was not work for hexadecimal numbers, because the "0" prefix
660            matches to the [0-7]* part (as an empty string) and therefore
661            it does not match with remaining "x..." part of a "0x..." integer
662            literal.
663            ($get_type): It now returns a string, not an array reference,
664            for regular types and |sequence| types (i.e. it in any case
665            returns a string).
666            ($get_next_token): The second item in the array that represents
667            a integer or float token is now a Perl number value, not the
668            original string representation of the number.
669            (check): Support for const value consistency checking.
670            No extended attribute is defined for constants.
671            (Node subclasses): Use simple strings rather than array references
672            for default data type values.
673            ($serialize_type): Type values are now simple strings.
674            (value): If the new attribute value is a false value, then
675            a FALSE value is set to the attribute.
676    
677    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
678    
679            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
680            in its stringified format ("scoped name" as defined in the
681            spec).  Note that future version of this module should not use
682            array references for type values and the |type_text| attribute
683            should be made obsolete.
684            (parse_char_string): Unescape attribute names.
685            (check): Support for checking of whether inherited interfaces
686            are actually defined or not.  Support for checking of whether
687            interface member identifiers are duplicated or not.
688            ($serialize_type): Scoped names are returned as is.  A future
689            version of this code should escape identifiers other than "DOMString",
690            otherwise the idl_text would be non-conforming.
691    
692    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
693    
694            * WebIDL.pm (parse_char_string): Set line/column numbers
695            to generated nodes.  Unescape identifiers.  Extended attributes
696            for Definition's were ignored.
697            (append_child): Set |parent_node| attribute.
698            (parent_node): New attribute.
699            (check): Support interface/exception members.  Support
700            extended attributes.  Support definition identifier uniqueness
701            constraint.
702            (qualified_name): New attribute.
703            (Interface/Exception idl_text): Extended attributes were
704            not prepended to the returned text.
705    
706    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
707    
708            * WebIDL.pm (parse_char_string): Set line/column numbers
709            to interface object experimentally.  s/shift/pop/g, shift
710            would make things wrong.  Support for interface forward
711            declarations was missing.  Broken interface declarations
712            with no block were not ignored entirely.
713            (Whatpm::WebIDL::Node): New abstract class.  This class
714            makes things easier.
715            (child_nodes): New attribute.  Unlike DOM's attribute with
716            same name, this attribute returns a dead list of nodes for
717            simplicity.
718            (get_user_data, set_user_data): New methods.
719            (Module idl_text): A SPACE character should be inserted
720            before the |{| character.
721            (Interface idl_text): Support for interface forward declarations.
722            (is_forward_declaration): New attribute.
723    
724    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
725    
726            * WebIDL.pm (type_text): Better serializer.
727    
728    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
729    
730            * WebIDL.pm: Revise forward-compatible parsing so that
731            it now can handle broken extended attributes and as such.
732    
733    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
734    
735            * WebIDL.pm: Real support for extended attributes.
736            Support for extended attributes with arguments.
737    
738    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
739    
740            * WebIDL.pm: Support for |exception| syntax.
741            (Interface->idl_text): Tentative support for inheritances.
742    
743    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
744    
745            * WebIDL.pm: Hierarchical scoped name support was broken.
746            Support for raises, setraises, and getraises syntaxes.
747    
748    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
749    
750            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
751            proper support for types, extended attributes, and exceptions yet).
752            WebIDL parser, version 1 (no support for exceptions yet,
753            no proper support for extended attributes yet).
754    
755    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
756    
757            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
758    
759    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
760    
761            * WebIDL.pm: Support for valuetype and const.
762    
763    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
764            
765            * WebIDL.pm: New module.
766    
767    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
768    
769            * Makefile (Entities.html): URI changed.
770    
771    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
772    
773            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
774    
775    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
776    
777            * HTML.pm.src (_get_next_token): A parse error was missing.
778    
779    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
780    
781            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
782            spec (commented out). (HTML5 revision 1711).
783    
784    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
785    
786            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
787    
788            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
789            as UTF-16 (HTML5 revision 1701).
790    
791    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
792    
793            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
794            1692).
795    
796    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
797    
798            * HTML.pm.src: The secondary insertion mode used when switching
799            to foreign content is the "in body" insertion mode (HTML5 revision
800            1696).
801    
802    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
803    
804            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
805            revision 1697).
806    
807    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
808    
809            * HTML.pm.src: Support for end-of-file token in foreign content
810            insertion mode (HTML5 revision 1693).  Update SVG camelCase
811            attribute list (HTML5 revision 1700).  <textarea> closes
812            </select> (HTML5 revision 1699).  More start tags close in
813            foreign content insertion mode (HTML5 revision 1698).
814    
815    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
816    
817            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
818    
819    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
820    
821            * HTML.pm.src: More robust charset parameter detection (HTML5
822            revision 1674).
823    
824    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
825    
826            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
827            revision 1676).
828    
829    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
830    
831            * HTML.pm.src: Ignore language part of public identifiers for
832            quriks mode detection (HTML5 revision 1679).
833    
834    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
835    
836            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
837            revision 1685).
838    
839    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
840    
841            * HTML.pm.src: Support for EOF in new states for tags (HTML5
842            revision 1684).
843    
844    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
845    
846            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
847            work (HTML5 revision 1690).
848    
849    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
850    
851            * HTML.pm.src (_tree_construction_main): Change handling of
852            end tags in head insertion modes (HTML5 revision 1686).
853            (parse_char_string): Bug fix for non-utf8 character string handlings.
854            (parse_char_stream): |ungetc| does not work well for this context.
855    
856    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
857    
858            * HTML.pm.src (parse_byte_string): Redefined to invoke
859            |parse_byte_stream|.
860            (parse_byte_stream): New method.
861    
862    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
863    
864            * HTML.pm.src (parse_byte_string): Fix the column number reported
865            by encoding layer error reporter.
866    
867    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
868    
869            * HTML.pm.src (parse_byte_string): Use streaming decoder
870            rather than converting the whole byte string and then parsing.
871            Propagate errors in character encoding layer.
872            (get_next_token): Precise error reporting for |bare stago| error.
873    
874    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
875    
876            * HTML.pm.src (parse_char_stream): New method.
877            (parse_char_string): This method is now defined as an invocation
878            of the |parse_char_stream| method.
879    
880    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
881    
882            * HTML.pm.src (parse_byte_string): Report various status
883            of the sniffing as info-level errors.  Support for new
884            decoding framework in parser resestting.
885            (new): Various default error levels were not set.
886    
887    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
888    
889            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
890            algorithm, except for the actual sniffing, is implemented
891            with new framework with Message::Charset::Info.
892    
893    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
894    
895            * CacheManifest.pm (_parse): Drop fragment identifiers from
896            URIs in fallback section (HTML5 revision 1596).
897    
898    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
899    
900            * Makefile (Entities.html): URI has changed.
901    
902    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
903    
904            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
905            1553).
906    
907    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
908    
909            * ContentChecker.pm: Noted that those returned in |table| are
910            no longer table elements, but table objects returned
911            by Whatpm::HTMLTable.
912    
913            * HTMLTable.pm (form_table): Return table element node
914            as |$table->{element}|.
915            (assign_header): Support for the |headers=""| attribute.
916    
917    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
918    
919            * HTMLTable.pm (assign_header): New function; first version
920            with no support for headers="".
921            (form_table): Include table width and height to the returned
922            table object for covenience.  Indexing in column assignement was wrong.
923            Set whether a data cell is empty or not for convenience.
924    
925    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
926    
927            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
928    
929    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
930    
931            * HTMLTable.pm: How table model errors are detected is
932            changed (HTML5 revision 1387).
933    
934    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
935    
936            * HTMLTable.pm: The algorithm now moves |tfoot| elements
937            to the end of the table (HTML5 revision 1380).
938    
939    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
940    
941            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
942            of 1-based (HTML5 revision 1376).
943    
944    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
945    
946            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
947            and "Content-Encoding" no longer prevent sniffing (HTML5
948            revision 1288).
949    
950    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
951    
952            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
953            1282).
954    
955    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
956    
957            * ContentChecker.pm: Support for global attributes.
958            Status of XML specs are added.
959    
960    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
961    
962            * ContentChecker.pm (check_element): Support for |template=""|
963            and |ref=""| attribute (referent element type checking).
964    
965    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
966    
967            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
968            1500) is implemented (except for IDNA part and URI-scheme-specific
969            knowledge).  Line number counting was wrong for LF-only
970            documents.
971    
972    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
973    
974            * HTML.pm.src: Raise an parse error for any disallowed
975            character (HTML5 revision 1263).
976    
977    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
978    
979            * mkentitylist.pl: Support for new HTML5 entity table format (the
980            definition for |AElig;| was missing).
981    
982    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
983    
984            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
985            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
986    
987    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
988    
989            * HTML.pm.src: List of element names that close foreign content
990            insertion mode is added (HTML5 revisions 1412 and 1418).
991    
992    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
993    
994            * HTML.pm.src: Support for |mglyph| and |malignmark|
995            elements (HTML5 revision 1410).
996    
997    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
998    
999            * HTML.pm.src: Support for new long MathML entities (HTML5
1000            revision 1406).
1001    
1002    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
1003    
1004            * HTML.pm.src: CDATA section support for MathML and SVG
1005            elements (HTML5 revision 1404 and 1420).
1006    
1007    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
1008    
1009            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
1010            elements (HTML5 revision 1404).  Unused !!!macro definitions
1011            are removed.
1012    
1013    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
1014    
1015            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
1016            are raised is changed (HTML5 revision 1404).
1017    
1018    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
1019    
1020            * HTML.pm.src: Category sets that are no longer used are removed.
1021    
1022    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
1023    
1024            * HTML.pm.src: The ->[1] property of stack entries are now
1025            replaced by constants representing element category.
1026    
1027    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
1028    
1029            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
1030            for error reporting.  (This is a preparation for using constant
1031            value for ->[1].)
1032    
1033    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
1034    
1035            * RDFXML.pm: Typo fixed.
1036    
1037    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
1038    
1039            * ContentChecker.pm: |fact_level| is now treated
1040            as same as |must_level|, i.e. level = |m|.
1041            (check_element): Make list of URIs in the DOM.
1042    
1043    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1044    
1045            * RDFXML.pm: Language accessor implemented.
1046            Local (null-namespace) attribute support.
1047    
1048    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1049    
1050            * RDFXML.pm: Factored out ID checking code.
1051    
1052    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1053    
1054            * RDFXML.pm: TODO items noted.  Validation of ID
1055            and URI attributes is implemented.  Warn if unknown
1056            value is used in rdf:parseType="" attribute.
1057    
1058            * URIChecker.pm (check_rdf_uri_reference): New function.
1059    
1060    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1061    
1062            * RDFXML.pm: bnodeid implemented.  Relative references
1063            are now resolved.
1064    
1065    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1066    
1067            * ContentChecker.pm: RDF reification implemented.
1068    
1069            * RDFXML.pm: undef vs false bug fixed.
1070            Reification implemented.
1071    
1072    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1073    
1074            * RDFXML.pm: s/id/ID/ for attribute name.
1075            The |node| arguments are added for |ontriple| calls.
1076            Too many "attribute not allowed" errors were raised.
1077    
1078            * ContentChecker.pm: Initial experimental support for rdf:RDF
1079            element.
1080    
1081    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
1082    
1083            * RDFXML.pm: New module.
1084    
1085    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1086    
1087            * HTML.pm.src (set_inner_html): Line/column number
1088            code was old one yet.
1089    
1090    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1091    
1092            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
1093            errors.  Line/column markings for DOCTYPE, comment, and
1094            character tokens are reintroduced; otherwise,
1095            error location for "not HTML5" error and errors
1096            for implied elements are not attached.
1097    
1098    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1099    
1100            * HTML.pm.src: Set line/column numbers to attributes.
1101    
1102            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
1103            (value): Setter implemented.
1104    
1105            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
1106    
1107    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1108    
1109            * HTML.pm.src: Unused line/column markings are removed.
1110    
1111    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1112    
1113            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
1114            flag, which is no longer used.
1115    
1116    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
1117    
1118            * HTML.pm.src: Set line/column information to element nodes.
1119    
1120            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
1121            and !!!insert-element-t): Set line/column information to
1122            element nodes.
1123    
1124    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
1125    
1126            * HTML.pm.src (_get_next_token): The first "<" character
1127            in "<?", "<>", or "</>" should be the error point.
1128    
1129    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1130    
1131            * HTML.pm.src: Some more fixes on error position reporting.
1132    
1133    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1134    
1135            * HTML.pm.src: Token-level precious error reporting.
1136    
1137    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1138    
1139            * HTML.pm.src: Preparation for more precious error point
1140            reporting.
1141    
1142    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1143    
1144            * HTML.pm.src: Error type revised.
1145    
1146    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1147    
1148            * HTML.pm.src: Similar codes are merged together, again.
1149    
1150    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1151    
1152            * HTML.pm.src: Similar codes are merged together.
1153    
1154    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
1155    
1156            * mkhtmlparser.pl: Set "level" parameter to parse errors.
1157    
1158            * HTML.pm.src: Code refined.
1159    
1160    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1161    
1162            * HTML.pm.src: |</body>| treatement has been changed (HTML5
1163            revision 1348).  Note that I really don't know this makes
1164            any difference in the black-box behavior of the parser.
1165    
1166    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1167    
1168            * HTML.pm.src: New end-of-file token implementation (HTML5
1169            revision 1348).
1170    
1171    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1172    
1173            * HTML.pm.src: |applet| support (HTML5 revision 1347).
1174    
1175    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1176    
1177            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1178    
1179  2008-03-09  Wakaba  <wakaba@suika.fam.cx>  2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1180    
1181          * HTML.pm.src: Support for |<input>| in the "in select" insertion          * HTML.pm.src: Support for |<input>| in the "in select" insertion

Legend:
Removed from v.1.178  
changed lines
  Added in v.1.372

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24