/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.161 by wakaba, Thu Mar 6 15:23:14 2008 UTC revision 1.349 by wakaba, Mon Oct 13 06:18:30 2008 UTC
# Line 1  Line 1 
1    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
4            construction stage synced with the spec (HTML5 revisions 2139 and
5            2302).
6    
7    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
8    
9            * ContentChecker.pm: New error level "html5_fact" added, which
10            should be tentatively used until all of requirements are properly
11            specced as RFC 2119 "MUST" in HTML5.
12    
13    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
14    
15            * ContentChecker.pod: Note on internal flags is added.
16    
17    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
18    
19            * HTML.pm.src: An AAA bug fixed.
20    
21    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
22    
23            * HTML.pm.src: If another node is inserted by the parser, don't
24            reuse existing Text node to append a character (HTML5 revision
25            2124).
26    
27    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
28    
29            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
30            revisions 1731 and 2128).
31    
32    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
33    
34            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
35            Support for end tags of camelCase SVG elements were broken.  A
36            wrong error type text fixed.
37    
38    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
39    
40            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
41    
42    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
43    
44            * HTML.pm.src: Support for new definition of |param| and |source|
45            start tag parsing (HTML5 revision 1731).
46    
47    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
48    
49            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
50    
51    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
52    
53            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
54            revisions 1731 and 1831).
55    
56    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
57    
58            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
59            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
60            revision 1731).
61    
62    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
63    
64            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
65            revision 1731).  End tags of |option| and |optgroup| elements are
66            now optional (HTML5 revision 1731).
67    
68    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
69    
70            * HTML.pm.src: New "special" elements added to the list (HTML5
71            revision 1778).  "strile" -> "strike".
72    
73    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
74    
75            * ContentType.pm (get_sniffed_type): Support for the "better"
76            content sniffing (HTML5 revision 1927).  In a case the official
77            type was not returned when the method is invoked in the list
78            context.
79    
80    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
81    
82            * HTML.pm.src: Character references for non-space C0 characters,
83            including U+000B VT, DEL character, noncharacter code points, are
84            now converted to the U+FFFD character (cf. HTML5 revision 2138).
85    
86    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
87    
88            * ContentChecker.pm: |form=""| check support added.
89    
90    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
91    
92            * ContentChecker.pm: |contextmenu| validness is now checked using
93            |id| and |id_type| properties, and |menu| property is removed.
94    
95    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
96    
97            * ContentChecker.pm: Prepare for |form| |name| attribute's
98            duplication checking.
99    
100    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
101    
102            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
103            unsupported charset="" parameter value (HTML5 revision 2131).
104    
105    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
106    
107            * HTML.pm.src: Reminding places where U+000B is allowed as a space
108            character is fixed (cf. HTML5 revision 1738).
109    
110            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
111            space characters (HTML5 revision 1738).
112    
113    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
114    
115            * HTML.pm.src: The "anything else" case for the "after after body"
116            insertion mode was not updated to swtich to the "in body"
117            insertion mode.  U+000B is no longer a space character for the
118            purpose of tree construction phase (HTML5 revision 1738).
119    
120    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
121    
122            * HTML.pm.src: U+000B is no longer a space character (HTML5
123            revision 1738).
124    
125    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
126    
127            * ContentType.pm: 0x0B is no longer a space character (HTML5
128            revision 1738).
129    
130            * HTML.pm.src: U+000B is no longer a space character for the
131            algorithm for extracting an encoding from a Content-Type (HTML5
132            revision 1738).
133    
134    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
135    
136            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
137    
138    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
139    
140            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
141    
142            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
143            matching.
144    
145    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
146    
147            * LangTag.pm: Warn for private use language subtags.  Error level
148            typos fixed.  Support for Suppress-Script field.
149    
150            * mklangreg.pl: Support for dumping of nested structure.
151    
152    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
153    
154            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
155            recommended case as per RFC 4646.
156    
157    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
158    
159            * LangTag.pm (check_rfc4646_langtag): New method.
160    
161    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
162    
163            * mklangreg.pl: New script.
164    
165            * Makefile: Updated for creation of the module for language subtag
166            registry.
167            
168    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
169    
170            * Makefile: WebIDL.html added.
171    
172            * WebIDL.pod: New documentation.
173    
174    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
175    
176            * WebIDL.pm: Checker's error types are redefined.
177    
178    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
179    
180            * WebIDL.pm: Parser's error types are redefined.  Some forward
181            compatible parsing bugs are fixed.  Some unreachable codes are
182            commented out.
183    
184    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
185    
186            * WebIDL.pm: Support for the reminding extended attributes are
187            added.  It does not satisfy the definition that a forward
188            interface declaration has an extended attribute.  It seems that
189            unless explicitly allowed multiple extended attributes with the
190            same name is not allowed, though it is not explicitly mentioned in
191            the spec.
192    
193    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
194    
195            * WebIDL.pm: Unescapes extended attribute names and extended
196            attribute identifiers.  Preserve whether an extended attribute has
197            an argument list of not.  Support for extended attributes:
198            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
199            NameGetter, NameSetter, and Null.
200            (has_argument_list): New attribute.
201            (idl_text): Stringifies argument lists, if any, even if it is
202            empty.
203    
204    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
205    
206            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
207            |{s_kwd}| in DATA_STATE as default.
208    
209    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
210    
211            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
212            by |{s_kwd}| in DATA_STATE.
213    
214    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
215    
216            * HTML.pm.src: Shorten keys.
217    
218    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
219    
220            * HTML.pm.src: Remove checking for control character, surrogate
221            pair, or noncharacter code points and non-Unicode code
222            points (they should be handled by Whatpm::Charset::UnicodeChecker).
223            (parse_char_stream): Support for the |$get_wrapper| argument and
224            character stream error handlers.
225    
226    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
227    
228            * ContentChecker.pm: Don't call |loda_ns_module|
229            for null-namespace elements/attributes.
230    
231            * HTML.pm.src: Fact out $disallowed_control_chars
232            as a hash.
233    
234    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
235    
236            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
237            and |{next_char}| initializations are moved to initialization
238            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
239            with |parse_char_stream|.
240    
241    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
242    
243            * HTML.pm.src (parse_char_stream): Make |set_next_char|
244            invoke |manakai_read_until|, not only |read|, where
245            possible, to decrease the number of |read| method calls.
246    
247            * mkhtmlparser.pl: Related changes to the aforementioned
248            modification.
249    
250    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
251    
252            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
253            would report character error from now.
254    
255    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
256    
257            * HTML.pm.src: White-space-leaded non-white-space character
258            tokens in "before head insertion mode" was not
259            correctly handled.
260            (set_inner_html): Reimplemented using CharString decodehandle
261            class.  Support for $get_wrapper argument.  Support
262            for |{read_until}| feature.
263    
264    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
265    
266            * HTML.pm.src: Make a "bare ero" error for unknown
267            entities point the "&" character.
268    
269    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
270    
271            * HTML.pm.src: It turns out that U+FFFD don't have to
272            be added to the list of excluded characters.
273    
274    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
275    
276            * HTML.pm.src ($char_onerror): Have character decoder's |line|
277            and |column| a higher priority than the one set by the
278            tokenizer's input handler.
279            ($self->{read_until}): Exclude U+FFFD (but this might
280            not be necessary, since now we do line/column fixup in
281            the character decode handle).
282    
283    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
284    
285            * HTML.pm.src: Use |{read_until}| where possible.
286    
287    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
288    
289            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
290            and |manakai_getc_until| to |manakai_read_until| to
291            reduce the number of string copies.
292    
293    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
294    
295            * HTML.pm.src (parse_char_string): Use newly created
296            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
297            standard feature to |open| a string as a filehandle,
298            since Perl's string filehandle seems not supporting |ungetc|
299            method correctly.
300            (parse_char_stream): Define |{getc_until}| method.
301            (DATA_STATE): Experimental support for |getc_until| feature.
302    
303    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
304    
305            * HTML.pm.src: Check points added to newly added branches.
306    
307    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
308    
309            * HTML.pm.src: Remove |{char}|, which is no longer used.
310            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
311            and replaced by |{prev_state}|.
312    
313            * mkhtmlparser.pl: Remove |{char}| feature.
314            Remove |!!!back-next-input-character;| macro.
315    
316    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
317    
318            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
319            entity related tokenizer states in favor of new states
320            implementing the consume character reference algorithm.
321    
322    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
323    
324            * HTML.pm.src: "Consume a character reference" algorithm is
325            now implemented as a tokenizer's state, rather than
326            a method, with minimum changes (more changes will
327            be made, in due course).  "Bogus comment state"'s inner
328            loop gets removed.
329    
330    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
331    
332            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
333            into their own tokenizer states.
334    
335    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
336    
337            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
338            is split into three states.
339    
340    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
341    
342            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
343            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
344            no longer does the tokenizer have to push back next input
345            characters in those states.
346    
347    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
348    
349            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
350            into four states so that no longer does the tokenizer have to push
351            back next input characters in that state.
352    
353    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
354    
355            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
356            which can be used to insert some wrapper between the character
357            stream handle and the tokenizer.  (It is currently not supported
358            for |set_inner_html| for |Element|s).
359    
360    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
361    
362            * HTML.pm.src: Ignore punctuations in charset names.
363    
364    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
365    
366            * ContentChecker.pm: Support for charset-layer error levels.
367    
368            * HTML.pm.src: Don't specify |text| argument for the
369            |chardecode:fallback| error, since it is not the encoding
370            being used alternatively.
371    
372    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
373    
374            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
375    
376    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
377    
378            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
379    
380    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
381    
382            * HTML.pm.src: Bug fix and sync with the spec with regard
383            to after after frameset insertion mode processing (HTML5
384            revision 1909).  Note that the implementation was wrong
385            per the old spec before the r1909 changes.
386    
387    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
388    
389            * HTMLTable.pm: scope=auto algorithm fix synced with the
390            spec (HTML5 revision 2093).
391            ($process_row): Algorithm step numbers synced with the
392            spec (HTML5 revision 2092).
393    
394    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
395    
396            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
397            revision 2094).
398    
399    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
400    
401            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
402    
403    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
404    
405            * HTML.pm.src: '"' and "'" at the end of attribute
406            name (after another attribute) now raise parse error (HTML5
407            revision 2123).  Empty unquoted attribute values are no
408            longer allowed (HTML5 revision 2122).
409    
410    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
411    
412            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
413            revision 2130).
414    
415    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
416    
417            * ContentChecker.pm: |xml:lang| attribute value must be same
418            as |lang| attribute value for HTML elements (HTML5 revision 2062
419            and so on).
420    
421    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
422    
423            * ContentChecker.pm: Error level definition for |xml_id_error|
424            was missing.
425    
426            * URIChecker.pm: The end of the URL should be marked as the
427            error location for an empty path error.  The position
428            between the userinfo and the port components should be
429            marked as the error location for an empty host error.
430    
431    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
432    
433            * URIChecker.pm: Set parameters representing where in the
434            value the error occurs for errors.  Report unknown
435            address format error in warning level, since address
436            formats are rarely added.  Path segments starting with "/.."
437            were misinterpreted as a dot-segment.
438    
439    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
440    
441            * URIChecker.pm (check_iri_reference): Requires
442            |Message::DOM::DOMImplementation|.
443    
444    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
445    
446            * IMTChecker.pm: Updated for the new error reporting architecture.
447    
448            * ContentChecker.pm: Error levels for IMTs are added.
449    
450    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
451    
452            * H2H.pm (_shift_token): Support for unquoted HTML attribute
453            values.
454    
455    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
456    
457            * CacheManifest.pm: Support for new style of error
458            reports.
459    
460            * HTML.pm.src: Set line=1, column=1 to the document node.
461    
462    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
463    
464            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
465            and URL checkers.  Support for more error levels for bogus
466            langauge tag and URL "standards".
467    
468            * LangTag.pm, URIChecker.pm: Support for new style error
469            level reporting.
470    
471    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
472    
473            * ContentChecker.pm: Support for RDF/XML error levels.
474    
475            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
476            specifying.  Error types are revised.
477    
478    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
479    
480            * ContentChecker.pm: All error reporting method calls are
481            renewed.
482    
483    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
484    
485            * HTML.pm.src: All error type names and "text" parameters
486            are revised.  Use new style for "level" specification.
487    
488            * mkhtmlparser.pl: Use new style for "level" specification.
489    
490    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
491    
492            * WebIDL.pm (parse_char_string): Simplified error
493            reporting process for broken ignored valuetype definition.
494            (Valuetype idl_text): Support for special "DOMString" name.
495    
496    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
497    
498            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
499            terminal of the ScopedName is "DOMString", such that whether
500            the last part of the scoped name is "DOMString" or "_DOMString"
501            later.  It is necessary to determine whether a |typedef|
502            definition should be ignored or not.
503            (parse_char_string): Unescape the identifier of
504            exception members.
505            ($resolve): Return undef for builtin types and sequence<T>
506            types (we might not have to do this, however...).
507            (check): Support checking for Exceptions, Valuetypes,
508            and Typedefs.
509            ($serialize_type): Support for "DOMString::::" syntax.
510            (Typedef idl_text): Output Type as "DOMString" if it
511            is really "DOMString" (i.e. its internal representation
512            is "::DOMString::").
513    
514    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
515    
516            * WebIDL.pm ($resolve): New code, based on resolve code
517            for constant types in the |check| method.
518            (check): Support for checking of attributes, operations, and
519            arguments.
520            (Attribute/Operation idl_text): Exception names in getraises,
521            setraises, and raises clauses is serizlied by |$serialize_type|
522            code.
523    
524    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
525    
526            * WebIDL.pm ($integer): Order of selections are changed to match
527            hexadecimal numbers (the original pattern, taken from the spec,
528            was not work for hexadecimal numbers, because the "0" prefix
529            matches to the [0-7]* part (as an empty string) and therefore
530            it does not match with remaining "x..." part of a "0x..." integer
531            literal.
532            ($get_type): It now returns a string, not an array reference,
533            for regular types and |sequence| types (i.e. it in any case
534            returns a string).
535            ($get_next_token): The second item in the array that represents
536            a integer or float token is now a Perl number value, not the
537            original string representation of the number.
538            (check): Support for const value consistency checking.
539            No extended attribute is defined for constants.
540            (Node subclasses): Use simple strings rather than array references
541            for default data type values.
542            ($serialize_type): Type values are now simple strings.
543            (value): If the new attribute value is a false value, then
544            a FALSE value is set to the attribute.
545    
546    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
547    
548            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
549            in its stringified format ("scoped name" as defined in the
550            spec).  Note that future version of this module should not use
551            array references for type values and the |type_text| attribute
552            should be made obsolete.
553            (parse_char_string): Unescape attribute names.
554            (check): Support for checking of whether inherited interfaces
555            are actually defined or not.  Support for checking of whether
556            interface member identifiers are duplicated or not.
557            ($serialize_type): Scoped names are returned as is.  A future
558            version of this code should escape identifiers other than "DOMString",
559            otherwise the idl_text would be non-conforming.
560    
561    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
562    
563            * WebIDL.pm (parse_char_string): Set line/column numbers
564            to generated nodes.  Unescape identifiers.  Extended attributes
565            for Definition's were ignored.
566            (append_child): Set |parent_node| attribute.
567            (parent_node): New attribute.
568            (check): Support interface/exception members.  Support
569            extended attributes.  Support definition identifier uniqueness
570            constraint.
571            (qualified_name): New attribute.
572            (Interface/Exception idl_text): Extended attributes were
573            not prepended to the returned text.
574    
575    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
576    
577            * WebIDL.pm (parse_char_string): Set line/column numbers
578            to interface object experimentally.  s/shift/pop/g, shift
579            would make things wrong.  Support for interface forward
580            declarations was missing.  Broken interface declarations
581            with no block were not ignored entirely.
582            (Whatpm::WebIDL::Node): New abstract class.  This class
583            makes things easier.
584            (child_nodes): New attribute.  Unlike DOM's attribute with
585            same name, this attribute returns a dead list of nodes for
586            simplicity.
587            (get_user_data, set_user_data): New methods.
588            (Module idl_text): A SPACE character should be inserted
589            before the |{| character.
590            (Interface idl_text): Support for interface forward declarations.
591            (is_forward_declaration): New attribute.
592    
593    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
594    
595            * WebIDL.pm (type_text): Better serializer.
596    
597    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
598    
599            * WebIDL.pm: Revise forward-compatible parsing so that
600            it now can handle broken extended attributes and as such.
601    
602    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
603    
604            * WebIDL.pm: Real support for extended attributes.
605            Support for extended attributes with arguments.
606    
607    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
608    
609            * WebIDL.pm: Support for |exception| syntax.
610            (Interface->idl_text): Tentative support for inheritances.
611    
612    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
613    
614            * WebIDL.pm: Hierarchical scoped name support was broken.
615            Support for raises, setraises, and getraises syntaxes.
616    
617    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
618    
619            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
620            proper support for types, extended attributes, and exceptions yet).
621            WebIDL parser, version 1 (no support for exceptions yet,
622            no proper support for extended attributes yet).
623    
624    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
625    
626            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
627    
628    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
629    
630            * WebIDL.pm: Support for valuetype and const.
631    
632    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
633            
634            * WebIDL.pm: New module.
635    
636    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
637    
638            * Makefile (Entities.html): URI changed.
639    
640    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
641    
642            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
643    
644    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
645    
646            * HTML.pm.src (_get_next_token): A parse error was missing.
647    
648    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
649    
650            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
651            spec (commented out). (HTML5 revision 1711).
652    
653    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
654    
655            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
656    
657            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
658            as UTF-16 (HTML5 revision 1701).
659    
660    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
661    
662            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
663            1692).
664    
665    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
666    
667            * HTML.pm.src: The secondary insertion mode used when switching
668            to foreign content is the "in body" insertion mode (HTML5 revision
669            1696).
670    
671    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
672    
673            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
674            revision 1697).
675    
676    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
677    
678            * HTML.pm.src: Support for end-of-file token in foreign content
679            insertion mode (HTML5 revision 1693).  Update SVG camelCase
680            attribute list (HTML5 revision 1700).  <textarea> closes
681            </select> (HTML5 revision 1699).  More start tags close in
682            foreign content insertion mode (HTML5 revision 1698).
683    
684    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
685    
686            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
687    
688    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
689    
690            * HTML.pm.src: More robust charset parameter detection (HTML5
691            revision 1674).
692    
693    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
694    
695            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
696            revision 1676).
697    
698    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
699    
700            * HTML.pm.src: Ignore language part of public identifiers for
701            quriks mode detection (HTML5 revision 1679).
702    
703    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
704    
705            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
706            revision 1685).
707    
708    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
709    
710            * HTML.pm.src: Support for EOF in new states for tags (HTML5
711            revision 1684).
712    
713    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
714    
715            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
716            work (HTML5 revision 1690).
717    
718    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
719    
720            * HTML.pm.src (_tree_construction_main): Change handling of
721            end tags in head insertion modes (HTML5 revision 1686).
722            (parse_char_string): Bug fix for non-utf8 character string handlings.
723            (parse_char_stream): |ungetc| does not work well for this context.
724    
725    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
726    
727            * HTML.pm.src (parse_byte_string): Redefined to invoke
728            |parse_byte_stream|.
729            (parse_byte_stream): New method.
730    
731    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
732    
733            * HTML.pm.src (parse_byte_string): Fix the column number reported
734            by encoding layer error reporter.
735    
736    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
737    
738            * HTML.pm.src (parse_byte_string): Use streaming decoder
739            rather than converting the whole byte string and then parsing.
740            Propagate errors in character encoding layer.
741            (get_next_token): Precise error reporting for |bare stago| error.
742    
743    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
744    
745            * HTML.pm.src (parse_char_stream): New method.
746            (parse_char_string): This method is now defined as an invocation
747            of the |parse_char_stream| method.
748    
749    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
750    
751            * HTML.pm.src (parse_byte_string): Report various status
752            of the sniffing as info-level errors.  Support for new
753            decoding framework in parser resestting.
754            (new): Various default error levels were not set.
755    
756    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
757    
758            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
759            algorithm, except for the actual sniffing, is implemented
760            with new framework with Message::Charset::Info.
761    
762    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
763    
764            * CacheManifest.pm (_parse): Drop fragment identifiers from
765            URIs in fallback section (HTML5 revision 1596).
766    
767    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
768    
769            * Makefile (Entities.html): URI has changed.
770    
771    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
772    
773            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
774            1553).
775    
776    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
777    
778            * ContentChecker.pm: Noted that those returned in |table| are
779            no longer table elements, but table objects returned
780            by Whatpm::HTMLTable.
781    
782            * HTMLTable.pm (form_table): Return table element node
783            as |$table->{element}|.
784            (assign_header): Support for the |headers=""| attribute.
785    
786    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
787    
788            * HTMLTable.pm (assign_header): New function; first version
789            with no support for headers="".
790            (form_table): Include table width and height to the returned
791            table object for covenience.  Indexing in column assignement was wrong.
792            Set whether a data cell is empty or not for convenience.
793    
794    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
795    
796            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
797    
798    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
799    
800            * HTMLTable.pm: How table model errors are detected is
801            changed (HTML5 revision 1387).
802    
803    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
804    
805            * HTMLTable.pm: The algorithm now moves |tfoot| elements
806            to the end of the table (HTML5 revision 1380).
807    
808    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
809    
810            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
811            of 1-based (HTML5 revision 1376).
812    
813    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
814    
815            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
816            and "Content-Encoding" no longer prevent sniffing (HTML5
817            revision 1288).
818    
819    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
820    
821            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
822            1282).
823    
824    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
825    
826            * ContentChecker.pm: Support for global attributes.
827            Status of XML specs are added.
828    
829    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
830    
831            * ContentChecker.pm (check_element): Support for |template=""|
832            and |ref=""| attribute (referent element type checking).
833    
834    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
835    
836            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
837            1500) is implemented (except for IDNA part and URI-scheme-specific
838            knowledge).  Line number counting was wrong for LF-only
839            documents.
840    
841    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
842    
843            * HTML.pm.src: Raise an parse error for any disallowed
844            character (HTML5 revision 1263).
845    
846    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
847    
848            * mkentitylist.pl: Support for new HTML5 entity table format (the
849            definition for |AElig;| was missing).
850    
851    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
852    
853            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
854            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
855    
856    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
857    
858            * HTML.pm.src: List of element names that close foreign content
859            insertion mode is added (HTML5 revisions 1412 and 1418).
860    
861    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
862    
863            * HTML.pm.src: Support for |mglyph| and |malignmark|
864            elements (HTML5 revision 1410).
865    
866    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
867    
868            * HTML.pm.src: Support for new long MathML entities (HTML5
869            revision 1406).
870    
871    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
872    
873            * HTML.pm.src: CDATA section support for MathML and SVG
874            elements (HTML5 revision 1404 and 1420).
875    
876    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
877    
878            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
879            elements (HTML5 revision 1404).  Unused !!!macro definitions
880            are removed.
881    
882    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
883    
884            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
885            are raised is changed (HTML5 revision 1404).
886    
887    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
888    
889            * HTML.pm.src: Category sets that are no longer used are removed.
890    
891    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
892    
893            * HTML.pm.src: The ->[1] property of stack entries are now
894            replaced by constants representing element category.
895    
896    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
897    
898            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
899            for error reporting.  (This is a preparation for using constant
900            value for ->[1].)
901    
902    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
903    
904            * RDFXML.pm: Typo fixed.
905    
906    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
907    
908            * ContentChecker.pm: |fact_level| is now treated
909            as same as |must_level|, i.e. level = |m|.
910            (check_element): Make list of URIs in the DOM.
911    
912    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
913    
914            * RDFXML.pm: Language accessor implemented.
915            Local (null-namespace) attribute support.
916    
917    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
918    
919            * RDFXML.pm: Factored out ID checking code.
920    
921    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
922    
923            * RDFXML.pm: TODO items noted.  Validation of ID
924            and URI attributes is implemented.  Warn if unknown
925            value is used in rdf:parseType="" attribute.
926    
927            * URIChecker.pm (check_rdf_uri_reference): New function.
928    
929    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
930    
931            * RDFXML.pm: bnodeid implemented.  Relative references
932            are now resolved.
933    
934    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
935    
936            * ContentChecker.pm: RDF reification implemented.
937    
938            * RDFXML.pm: undef vs false bug fixed.
939            Reification implemented.
940    
941    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
942    
943            * RDFXML.pm: s/id/ID/ for attribute name.
944            The |node| arguments are added for |ontriple| calls.
945            Too many "attribute not allowed" errors were raised.
946    
947            * ContentChecker.pm: Initial experimental support for rdf:RDF
948            element.
949    
950    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
951    
952            * RDFXML.pm: New module.
953    
954    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
955    
956            * HTML.pm.src (set_inner_html): Line/column number
957            code was old one yet.
958    
959    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
960    
961            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
962            errors.  Line/column markings for DOCTYPE, comment, and
963            character tokens are reintroduced; otherwise,
964            error location for "not HTML5" error and errors
965            for implied elements are not attached.
966    
967    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
968    
969            * HTML.pm.src: Set line/column numbers to attributes.
970    
971            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
972            (value): Setter implemented.
973    
974            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
975    
976    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
977    
978            * HTML.pm.src: Unused line/column markings are removed.
979    
980    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
981    
982            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
983            flag, which is no longer used.
984    
985    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
986    
987            * HTML.pm.src: Set line/column information to element nodes.
988    
989            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
990            and !!!insert-element-t): Set line/column information to
991            element nodes.
992    
993    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
994    
995            * HTML.pm.src (_get_next_token): The first "<" character
996            in "<?", "<>", or "</>" should be the error point.
997    
998    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
999    
1000            * HTML.pm.src: Some more fixes on error position reporting.
1001    
1002    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1003    
1004            * HTML.pm.src: Token-level precious error reporting.
1005    
1006    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1007    
1008            * HTML.pm.src: Preparation for more precious error point
1009            reporting.
1010    
1011    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1012    
1013            * HTML.pm.src: Error type revised.
1014    
1015    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1016    
1017            * HTML.pm.src: Similar codes are merged together, again.
1018    
1019    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1020    
1021            * HTML.pm.src: Similar codes are merged together.
1022    
1023    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
1024    
1025            * mkhtmlparser.pl: Set "level" parameter to parse errors.
1026    
1027            * HTML.pm.src: Code refined.
1028    
1029    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1030    
1031            * HTML.pm.src: |</body>| treatement has been changed (HTML5
1032            revision 1348).  Note that I really don't know this makes
1033            any difference in the black-box behavior of the parser.
1034    
1035    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1036    
1037            * HTML.pm.src: New end-of-file token implementation (HTML5
1038            revision 1348).
1039    
1040    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1041    
1042            * HTML.pm.src: |applet| support (HTML5 revision 1347).
1043    
1044    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1045    
1046            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1047    
1048    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1049    
1050            * HTML.pm.src: Support for |<input>| in the "in select" insertion
1051            mode and support for the "in select in table" insertion mode (HTML5
1052            revision 1342).
1053    
1054    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1055    
1056            * HTML.pm.src: No foster parenting for <script> and <script>
1057            in non-tainted <table>s (HTML5 revision 1336).
1058    
1059    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1060    
1061            * HTML.pm.src: Ignore white space characters between <html>
1062            and <head> (HTML5 revision 1332).
1063    
1064    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1065    
1066            * HTML.pm.src: Treat <input type=hidden> as if it were a
1067            white space (HTML5 revision 1331).
1068    
1069    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1070    
1071            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1072            element (HTML5 revision 1330).
1073    
1074    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1075    
1076            * HTML.pm.src: <title> is always appended to the current
1077            element (HTML5 revision 1328).
1078    
1079    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1080    
1081            * HTML.pm.src: White space in tainted tables are moved
1082            into foster parents (HTML5 revision 1326).
1083    
1084    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1085    
1086            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1087            revision 1321).
1088    
1089    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1090    
1091            * HTML.pm.src: |</h/n/>| case code rearranged to align with
1092            the spec (HTML5 revision 1320).  Note that we finally complete
1093            all of HTML5 revision 1320 changes.
1094    
1095    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1096    
1097            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1098            tags (HTML5 revision 1320).
1099    
1100    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1101    
1102            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1103    
1104    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1105    
1106            * HTML.pm.src: A "generate implied end tags" code (t409.1)
1107            could not be reached so that it is now removed (HTML5 revision
1108            1320).
1109    
1110    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1111    
1112            * HTML.pm.src: Code for the case of |</div>| and so on are revised
1113            to align with new spec text (HTML5 revision 1320).
1114    
1115    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1116    
1117            * HTML.pm.src: Remove strange |if| condition; however, it should
1118            have had no harm in theory.
1119    
1120    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1121    
1122            * HTML.pm.src (_tree_construction_main): '</p>' in body
1123            case is split from other end tags for the preparation
1124            of implementing HTML revision 1320.
1125    
1126    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1127    
1128            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1129            revision 1320).
1130    
1131    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1132    
1133            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1134            and "div" (HTML5 revision 1318).  Add comments to where
1135            |form| pointer association codes should be inserted (HTML5 revision
1136            1319).
1137    
1138  2008-03-06  Wakaba  <wakaba@suika.fam.cx>  2008-03-06  Wakaba  <wakaba@suika.fam.cx>
1139    
1140          * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).          * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).

Legend:
Removed from v.1.161  
changed lines
  Added in v.1.349

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24