Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.260 by wakaba, Sat Jul 19 07:41:22 2008 UTC revision 1.333 by wakaba, Sun Sep 21 12:37:09 2008 UTC
# Line 1  Line 1 
1    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
2    
3            * ContentChecker.pm: |form=""| check support added.
4    
5    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
6    
7            * ContentChecker.pm: |contextmenu| validness is now checked using
8            |id| and |id_type| properties, and |menu| property is removed.
9    
10    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
11    
12            * ContentChecker.pm: Prepare for |form| |name| attribute's
13            duplication checking.
14    
15    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
16    
17            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
18            unsupported charset="" parameter value (HTML5 revision 2131).
19    
20    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
21    
22            * HTML.pm.src: Remaining places where U+000B was allowed as a space
23            character are fixed (cf. HTML5 revision 1738).
24    
25            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
26            space characters (HTML5 revision 1738).
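
The U+000B change in the entries above can be sketched as a membership test. This is an illustration in Python, not code from Whatpm; the names `HTML5_SPACE_CHARACTERS`, `is_space_character`, and `strip_space_characters` are hypothetical. It assumes the five space characters that remain after U+000B is dropped: U+0009, U+000A, U+000C, U+000D, and U+0020.

```python
# Hypothetical names; a sketch of the post-change definition, not Whatpm code.
# Space characters: TAB, LF, FF, CR, SPACE.  U+000B is deliberately absent.
HTML5_SPACE_CHARACTERS = frozenset("\u0009\u000A\u000C\u000D\u0020")


def is_space_character(ch):
    """Return True if ch is one of the five space characters."""
    return ch in HTML5_SPACE_CHARACTERS


def strip_space_characters(text):
    """Strip leading and trailing space characters, leaving U+000B intact."""
    return text.strip("".join(HTML5_SPACE_CHARACTERS))
```

Note that a generic whitespace test would not be equivalent here: Python's `str.isspace`, for example, still treats U+000B as whitespace, so an explicit set is the safer choice.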
27    
28    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
29    
30            * HTML.pm.src: The "anything else" case for the "after after body"
31            insertion mode was not updated to switch to the "in body"
32            insertion mode.  U+000B is no longer a space character for the
33            purpose of tree construction phase (HTML5 revision 1738).
34    
35    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
36    
37            * HTML.pm.src: U+000B is no longer a space character (HTML5
38            revision 1738).
39    
40    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
41    
42            * ContentType.pm: 0x0B is no longer a space character (HTML5
43            revision 1738).
44    
45            * HTML.pm.src: U+000B is no longer a space character for the
46            algorithm for extracting an encoding from a Content-Type (HTML5
47            revision 1738).
48    
49    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
50    
51            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
52    
53    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
54    
55            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
56    
57            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
58            matching.
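
One way sorting 'Prefix' values by length can ease matching is sketched below. This is an assumption about the intent, not the logic of mklangreg.pl or LangTag.pm: the entry does not say which direction the sort goes, so the sketch assumes longest first, which lets the most specific prefix be found by the first hit. The function names are hypothetical.

```python
# Hypothetical helper names; the sort direction (longest first) is an assumption.
def sort_prefixes(prefixes):
    """Order prefixes from longest to shortest (stable for equal lengths)."""
    return sorted(prefixes, key=len, reverse=True)


def first_matching_prefix(tag, prefixes):
    """Return the most specific prefix of tag (lowercased), or None."""
    tag = tag.lower()
    for prefix in sort_prefixes(prefixes):
        prefix = prefix.lower()
        # A prefix matches the whole tag or a leading run of whole subtags.
        if tag == prefix or tag.startswith(prefix + "-"):
            return prefix
    return None
```

With the list sorted this way, the loop can stop at the first match instead of collecting all matches and comparing them afterwards.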
59    
60    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
61    
62            * LangTag.pm: Warn for private use language subtags.  Error level
63            typos fixed.  Support for Suppress-Script field.
64    
65            * mklangreg.pl: Support for dumping of nested structure.
66    
67    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
68    
69            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
70            recommended case as per RFC 4646.
71    
72    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
73    
74            * LangTag.pm (check_rfc4646_langtag): New method.
75    
76    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
77    
78            * mklangreg.pl: New script.
79    
80            * Makefile: Updated for creation of the module for language subtag
81            registry.
82            
83    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
84    
85            * Makefile: WebIDL.html added.
86    
87            * WebIDL.pod: New documentation.
88    
89    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
90    
91            * WebIDL.pm: Checker's error types are redefined.
92    
93    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
94    
95            * WebIDL.pm: Parser's error types are redefined.  Some forward
96            compatible parsing bugs are fixed.  Some unreachable code is
97            commented out.
98    
99    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
100    
101            * WebIDL.pm: Support for the remaining extended attributes is
102            added.  It does not satisfy the definition that a forward
103            interface declaration has an extended attribute.  It seems that
104            unless explicitly allowed, multiple extended attributes with the
105            same name are not allowed, though it is not explicitly mentioned in
106            the spec.
107    
108    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
109    
110            * WebIDL.pm: Unescape extended attribute names and extended
111            attribute identifiers.  Preserve whether an extended attribute has
112            an argument list or not.  Support for extended attributes:
113            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
114            NameGetter, NameSetter, and Null.
115            (has_argument_list): New attribute.
116            (idl_text): Stringify the argument list, if any, even if it is
117            empty.
118    
119    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
120    
121            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
122            |{s_kwd}| in DATA_STATE as default.
123    
124    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
125    
126            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
127            by |{s_kwd}| in DATA_STATE.
128    
129    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
130    
131            * HTML.pm.src: Shorten keys.
132    
133    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
134    
135            * HTML.pm.src: Remove checking for control character, surrogate
136            pair, or noncharacter code points and non-Unicode code
137            points (they should be handled by Whatpm::Charset::UnicodeChecker).
138            (parse_char_stream): Support for the |$get_wrapper| argument and
139            character stream error handlers.
140    
141    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
142    
143            * ContentChecker.pm: Don't call |load_ns_module|
144            for null-namespace elements/attributes.
145    
146            * HTML.pm.src: Factor out $disallowed_control_chars
147            as a hash.
148    
149    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
150    
151            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
152            and |{next_char}| initializations are moved to initialization
153            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
154            with |parse_char_stream|.
155    
156    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
157    
158            * HTML.pm.src (parse_char_stream): Make |set_next_char|
159            invoke |manakai_read_until|, not only |read|, where
160            possible, to decrease the number of |read| method calls.
161    
162            * mkhtmlparser.pl: Related changes to the aforementioned
163            modification.
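
The idea of reading a run of characters per call, rather than one character per `read`, can be sketched as follows. This is not the actual `manakai_read_until` interface; the class `CountingReader`, the function `read_until`, and the `chunk_size` parameter are hypothetical and exist only to show why the number of `read` calls drops.

```python
import io


class CountingReader:
    """Wrap a string as a character stream and count read() calls."""

    def __init__(self, text):
        self._buf = io.StringIO(text)
        self.read_calls = 0

    def read(self, n):
        self.read_calls += 1
        return self._buf.read(n)


def read_until(reader, pending, stop_chars, chunk_size=64):
    """Return (run, pending): run holds the characters before the first stop
    character; pending holds what was read but not yet consumed."""
    run = []
    while True:
        if not pending:
            pending = reader.read(chunk_size)
            if not pending:  # end of stream
                return "".join(run), ""
        for i, ch in enumerate(pending):
            if ch in stop_chars:
                run.append(pending[:i])
                return "".join(run), pending[i:]
        run.append(pending)  # whole chunk is plain text; fetch more
        pending = ""
```

A tokenizer in its data state could, under these assumptions, pass the characters that start markup (such as `<` and `&`) as `stop_chars` and append the returned run to the current character token in one step.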
164    
165    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
166    
167            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
168            now reports character errors.
169    
170    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
171    
172            * HTML.pm.src: Non-white-space character tokens with leading
173            white space in the "before head" insertion mode were not
174            correctly handled.
175            (set_inner_html): Reimplemented using CharString decodehandle
176            class.  Support for $get_wrapper argument.  Support
177            for |{read_until}| feature.
178    
179    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
180    
181            * HTML.pm.src: Make a "bare ero" error for unknown
182            entities point to the "&" character.
183    
184    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
185    
186            * HTML.pm.src: It turns out that U+FFFD doesn't have to
187            be added to the list of excluded characters.
188    
189    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
190    
191            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
192            and |column| higher priority than those set by the
193            tokenizer's input handler.
194            ($self->{read_until}): Exclude U+FFFD (but this might
195            not be necessary, since now we do line/column fixup in
196            the character decode handle).
197    
198    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
199    
200            * HTML.pm.src: Use |{read_until}| where possible.
201    
202    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
203    
204            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
205            and |manakai_getc_until| to |manakai_read_until| to
206            reduce the number of string copies.
207    
208    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
209    
210            * HTML.pm.src (parse_char_string): Use newly created
211            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
212            standard feature to |open| a string as a filehandle,
213            since Perl's string filehandle does not seem to support the |ungetc|
214            method correctly.
215            (parse_char_stream): Define |{getc_until}| method.
216            (DATA_STATE): Experimental support for |getc_until| feature.
217    
218    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
219    
220            * HTML.pm.src: Check points added to newly added branches.
221    
222    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
223    
224            * HTML.pm.src: Remove |{char}|, which is no longer used.
225            Remove |{entity_in_attr}| and |{last_attribute_value_state}|,
226            which are replaced by |{prev_state}|.
227    
228            * mkhtmlparser.pl: Remove |{char}| feature.
229            Remove |!!!back-next-input-character;| macro.
230    
231    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
232    
233            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
234            entity related tokenizer states in favor of new states
235            implementing the consume character reference algorithm.
236    
237    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
238    
239            * HTML.pm.src: "Consume a character reference" algorithm is
240            now implemented as a tokenizer's state, rather than
241            a method, with minimum changes (more changes will
242            be made, in due course).  "Bogus comment state"'s inner
243            loop gets removed.
244    
245    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
246    
247            * HTML.pm.src: Move |PUBLIC| and |SYSTEM| keyword tokenizing
248            into their own tokenizer states.
249    
250    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
251    
252            * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
253            is split into three states.
254    
255    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
256    
257            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is split into
258            itself and the new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
259            the tokenizer no longer has to push back next input
260            characters in those states.
261    
262    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
263    
264            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| is split
265            into four states so that the tokenizer no longer has to push
266            back next input characters in that state.
267    
268    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
269    
270            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
271            which can be used to insert some wrapper between the character
272            stream handle and the tokenizer.  (It is currently not supported
273            for |set_inner_html| for |Element|s).
274    
275    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
276    
277            * HTML.pm.src: Ignore punctuation in charset names.
278    
279    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
280    
281            * ContentChecker.pm: Support for charset-layer error levels.
282    
283            * HTML.pm.src: Don't specify |text| argument for the
284            |chardecode:fallback| error, since it is not the encoding
285            being used alternatively.
286    
287    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
288    
289            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
290    
291    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
292    
293            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
294    
295    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
296    
297            * HTML.pm.src: Bug fix and sync with the spec with regard
298            to after after frameset insertion mode processing (HTML5
299            revision 1909).  Note that the implementation was wrong
300            per the old spec before the r1909 changes.
301    
302    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
303    
304            * HTMLTable.pm: scope=auto algorithm fix synced with the
305            spec (HTML5 revision 2093).
306            ($process_row): Algorithm step numbers synced with the
307            spec (HTML5 revision 2092).
308    
309    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
310    
311            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
312            revision 2094).
313    
314    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
315    
316            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
317    
318    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
319    
320            * HTML.pm.src: '"' and "'" at the end of attribute
321            name (after another attribute) now raise parse error (HTML5
322            revision 2123).  Empty unquoted attribute values are no
323            longer allowed (HTML5 revision 2122).
324    
325    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
326    
327            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
328            revision 2130).
329    
330    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
331    
332            * ContentChecker.pm: |xml:lang| attribute value must be the same
333            as |lang| attribute value for HTML elements (HTML5 revision 2062
334            and so on).
335    
336    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
337    
338            * ContentChecker.pm: Error level definition for |xml_id_error|
339            was missing.
340    
341            * URIChecker.pm: The end of the URL should be marked as the
342            error location for an empty path error.  The position
343            between the userinfo and the port components should be
344            marked as the error location for an empty host error.
345    
346    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
347    
348            * URIChecker.pm: For each error, set parameters indicating
349            where in the value the error occurs.  Report the unknown
350            address format error at the warning level, since address
351            formats are rarely added.  Path segments starting with "/.."
352            were misinterpreted as a dot-segment.
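
The dot-segment fix in the entry above comes down to comparing whole segments rather than prefixes. The following is a minimal sketch in Python, not the URIChecker.pm code; `dot_segments` is a hypothetical name. It relies on the RFC 3986 definition that only the complete segments "." and ".." are dot-segments.

```python
def dot_segments(path):
    """Return the dot-segments of a URI path, in order of appearance.

    A segment counts only if it is exactly "." or ".."; a segment that
    merely starts with dots, such as "..b", is an ordinary segment.
    """
    return [segment for segment in path.split("/") if segment in (".", "..")]
```

A prefix test such as "starts with `/..`" would flag `/a/..b/c`, which is the kind of misinterpretation the entry describes.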
353    
354    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
355    
356            * URIChecker.pm (check_iri_reference): Requires
357            |Message::DOM::DOMImplementation|.
358    
359    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
360    
361            * IMTChecker.pm: Updated for the new error reporting architecture.
362    
363            * ContentChecker.pm: Error levels for IMTs are added.
364    
365    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
366    
367            * H2H.pm (_shift_token): Support for unquoted HTML attribute
368            values.
369    
370    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
371    
372            * CacheManifest.pm: Support for new style of error
373            reports.
374    
375            * HTML.pm.src: Set line=1, column=1 to the document node.
376    
377    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
378    
379            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
380            and URL checkers.  Support for more error levels for bogus
381            language tag and URL "standards".
382    
383            * LangTag.pm, URIChecker.pm: Support for new style error
384            level reporting.
385    
386    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
387    
388            * ContentChecker.pm: Support for RDF/XML error levels.
389    
390            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
391            specifying.  Error types are revised.
392    
393    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
394    
395            * ContentChecker.pm: All error reporting method calls are
396            renewed.
397    
398    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
399    
400            * HTML.pm.src: All error type names and "text" parameters
401            are revised.  Use new style for "level" specification.
402    
403            * mkhtmlparser.pl: Use new style for "level" specification.
404    
405    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
406    
407            * WebIDL.pm (parse_char_string): Simplified error
408            reporting process for broken ignored valuetype definition.
409            (Valuetype idl_text): Support for special "DOMString" name.
410    
411    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
412    
413            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
414            terminal of the ScopedName is "DOMString", so that whether
415            the last part of the scoped name is "DOMString" or "_DOMString"
416            can be determined later.  It is necessary to determine whether
417            a |typedef| definition should be ignored or not.
418            (parse_char_string): Unescape the identifier of
419            exception members.
420            ($resolve): Return undef for builtin types and sequence<T>
421            types (we might not have to do this, however...).
422            (check): Support checking for Exceptions, Valuetypes,
423            and Typedefs.
424            ($serialize_type): Support for "DOMString::::" syntax.
425            (Typedef idl_text): Output Type as "DOMString" if it
426            is really "DOMString" (i.e. its internal representation
427            is "::DOMString::").
428    
429    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
430    
431            * WebIDL.pm ($resolve): New code, based on resolve code
432            for constant types in the |check| method.
433            (check): Support for checking of attributes, operations, and
434            arguments.
435            (Attribute/Operation idl_text): Exception names in getraises,
436            setraises, and raises clauses are serialized by |$serialize_type|
437            code.
438    
439    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
440    
441            * WebIDL.pm ($integer): The order of alternatives is changed to match
442            hexadecimal numbers (the original pattern, taken from the spec,
443            did not work for hexadecimal numbers, because after the "0" prefix
444            the [0-7]* part matches as an empty string, and therefore
445            the remaining "x..." part of a "0x..." integer literal is
446            left unmatched).
447            ($get_type): It now returns a string, not an array reference,
448            for regular types and |sequence| types (i.e. it in any case
449            returns a string).
450            ($get_next_token): The second item in the array that represents
451            an integer or float token is now a Perl number value, not the
452            original string representation of the number.
453            (check): Support for const value consistency checking.
454            No extended attribute is defined for constants.
455            (Node subclasses): Use simple strings rather than array references
456            for default data type values.
457            ($serialize_type): Type values are now simple strings.
458            (value): If the new attribute value is a false value, then
459            a FALSE value is set to the attribute.
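
The `$integer` alternation issue described at the top of this entry can be reproduced with any backtracking regex engine that tries alternatives left to right. The patterns below are hypothetical reconstructions shaped like the description, not copies of the spec grammar or of WebIDL.pm; Python's `re` module stands in for Perl's engine, which orders alternatives the same way.

```python
import re

# Assumed shape of the spec-order pattern: octal alternative first.
SPEC_ORDER = re.compile(r"-?0(?:[0-7]*|[Xx][0-9A-Fa-f]+)|-?[1-9][0-9]*")

# Reordered so that the hexadecimal alternative is tried first.
HEX_FIRST = re.compile(r"-?0(?:[Xx][0-9A-Fa-f]+|[0-7]*)|-?[1-9][0-9]*")


def matched_prefix(pattern, text):
    """Return the part of text matched at position 0, or None."""
    match = pattern.match(text)
    return match.group(0) if match else None
```

With the octal alternative first, `[0-7]*` succeeds on an empty string right after the leading "0", so the engine never tries the hexadecimal alternative and the "x..." part is left behind. Trying the hexadecimal alternative first avoids this without changing which octal or decimal literals match.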
460    
461    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
462    
463            * WebIDL.pm ($get_scoped_name): Scoped names are now stored
464            in their stringified format ("scoped name" as defined in the
465            spec).  Note that a future version of this module should not use
466            array references for type values and the |type_text| attribute
467            should be made obsolete.
468            (parse_char_string): Unescape attribute names.
469            (check): Support for checking of whether inherited interfaces
470            are actually defined or not.  Support for checking of whether
471            interface member identifiers are duplicated or not.
472            ($serialize_type): Scoped names are returned as is.  A future
473            version of this code should escape identifiers other than "DOMString",
474            otherwise the idl_text would be non-conforming.
475    
476    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
477    
478            * WebIDL.pm (parse_char_string): Set line/column numbers
479            to generated nodes.  Unescape identifiers.  Extended attributes
480            for Definition's were ignored.
481            (append_child): Set |parent_node| attribute.
482            (parent_node): New attribute.
483            (check): Support interface/exception members.  Support
484            extended attributes.  Support definition identifier uniqueness
485            constraint.
486            (qualified_name): New attribute.
487            (Interface/Exception idl_text): Extended attributes were
488            not prepended to the returned text.
489    
490    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
491    
492            * WebIDL.pm (parse_char_string): Set line/column numbers
493            to interface object experimentally.  s/shift/pop/g, since shift
494            would produce wrong results.  Support for interface forward
495            declarations was missing.  Broken interface declarations
496            with no block were not ignored entirely.
497            (Whatpm::WebIDL::Node): New abstract class.  This class
498            makes things easier.
499            (child_nodes): New attribute.  Unlike DOM's attribute with
500            the same name, this attribute returns a dead list of nodes for
501            simplicity.
502            (get_user_data, set_user_data): New methods.
503            (Module idl_text): A SPACE character should be inserted
504            before the |{| character.
505            (Interface idl_text): Support for interface forward declarations.
506            (is_forward_declaration): New attribute.
507    
508    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
509    
510            * WebIDL.pm (type_text): Better serializer.
511    
512    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
513    
514            * WebIDL.pm: Revise forward-compatible parsing so that
515            it can now handle broken extended attributes and the like.
516    
517    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
518    
519            * WebIDL.pm: Real support for extended attributes.
520            Support for extended attributes with arguments.
521    
522    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
523    
524            * WebIDL.pm: Support for |exception| syntax.
